一、查询某个字段重复
1
2
3
4
5
|
select * from User u where u.user_name in ( select u.user_name from User u group by u.user_name having count (*) > 1) |
二,删除表中某几个字段的重复
例:表中有条六条记录。 其中张三和王五 的记录有重复
TableA
1
2
3
4
5
6
7
8
9
10
11
|
id customer PhoneNo 001 张三 777777 002 李四 444444 003 王五 555555 004 张三 777777 005 张三 777777 006 王五 555555 如何写一个sql语句将TableA变成如下 001 张三 777777 002 李四 444444 003 王五 555555 |
测试环境
1
2
3
4
5
6
7
|
create table TableA ( id varchar (3),customer varchar (5),PhoneNo varchar (6)) insert into TableA select '001' , '张三' , '777777' union all select '002' , '李四' , '444444' union all select '003' , '王五' , '555555' union all select '004' , '张三' , '777777' union all select '005' , '张三' , '777777' union all select '006' , '王五' , '555555' |
结果
1
2
3
4
5
6
|
delete TableA from TableA Twhere exists( select 1fromtablea where customer=T.customer and phoneno=T.phoneno andid < tt.id ) |
总结
该方法适用于有一个字段为自增性,例如本例中的:id
1
2
3
4
5
6
|
delete 表名 from 表名 as Twhere exists( select 1from表名 where 字段A=T.字段A and 字段B=T.字段B,(....) and 自增列 < T.自增列 ) |
三,查询并删除重复记录的SQL语句
查询及删除重复记录的SQL语句
1、查找表中多余的重复记录,重复记录是根据单个字段(peopleId)来判断
1
2
|
select * from people where peopleId in ( select peopleId from people group by peopleId having count (peopleId) > 1) |
2、删除表中多余的重复记录,重复记录是根据单个字段(peopleId)来判断,只留有rowid最小的记录
1
2
3
|
delete from people where peopleId in ( select peopleId from people group by peopleId having count (peopleId) > 1) and rowid not in ( select min (rowid) from people group by peopleId having count (peopleId )>1) |
注:rowid为oracle自带不用该.....
3、查找表中多余的重复记录(多个字段)
1
2
|
select * from vitae a where (a.peopleId,a.seq) in ( select peopleId,seq from vitae group by peopleId,seq having count (*) > 1) |
4、删除表中多余的重复记录(多个字段),只留有rowid最小的记录
1
2
3
|
delete from vitae a where (a.peopleId,a.seq) in ( select peopleId,seq from vitae group by peopleId,seq having count (*) > 1) and rowid not in ( select min (rowid) from vitae group by peopleId,seq having count (*)>1) |
5、查找表中多余的重复记录(多个字段),不包含rowid最小的记录
1
2
3
|
select * from vitae a where (a.peopleId,a.seq) in ( select peopleId,seq from vitae group by peopleId,seq having count (*) > 1) and rowid not in ( select min (rowid) from vitae group by peopleId,seq having count (*)>1) |
(二)
比方说
在A表中存在一个字段“name”,
而且不同记录之间的“name”值有可能会相同,
现在就是需要查询出在该表中的各记录之间,“name”值存在重复的项;
1
|
Select Name , Count (*) From A Group By Name Having Count (*) > 1 |
如果还查性别也相同大则如下:
1
|
Select Name ,sex, Count (*) From A Group By Name ,sex Having Count (*) > 1 |
(三)
方法一
1
2
3
4
5
6
7
8
9
10
11
12
|
declare @ max integer ,@id integer declare cur_rows cursor local for select 主字段, count (*) from 表名 group by 主字段 having count (*) >; 1 open cur_rows fetch cur_rows into @id,@ max while @@fetch_status=0 begin select @ max = @ max -1 set rowcount @ max delete from 表名 where 主字段 = @id fetch cur_rows into @id,@ max end close cur_rows |
set rowcount 0 方法二
"重复记录"有两个意义上的重复记录,一是完全重复的记录,也即所有字段均重复的记录,二是部分关键字段重复的记录,比如Name字段重复,而其他字段不一定重复或都重复可以忽略。
1、对于第一种重复,比较容易解决,使用
select distinct * from tableName
就可以得到无重复记录的结果集。
如果该表需要删除重复的记录(重复记录保留1条),可以按以下方法删除
1
2
3
4
|
select distinct * into #Tmp from tableName drop table tableName select * into tableName from #Tmp drop table #Tmp |
发生这种重复的原因是表设计不周产生的,增加唯一索引列即可解决。
2、这类重复问题通常要求保留重复记录中的第一条记录,操作方法如下
假设有重复的字段为Name,Address,要求得到这两个字段唯一的结果集
1
2
3
|
select identity( int ,1,1) as autoID, * into #Tmp from tableName select min (autoID) as autoID into #Tmp2 from #Tmp group by Name ,autoID select * from #Tmp where autoID in ( select autoID from #tmp2) |
最后一个select即得到了Name,Address不重复的结果集(但多了一个autoID字段,实际写时可以写在select子句中省去此列)
(四)
查询重复
1
2
3
4
5
|
select * from tablename where id in ( select id from tablename group by id having count (id) > 1 ) |