[Help] How to remove duplicate records

cynthialam

2012-01-10

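Suppose the data looks like this:
Var1 Var2
a b
a c
b c
b a

In this data, the first and last records mean the same thing (the same pair of values, just in the opposite order), so they can be treated as duplicates. How do I delete one of them?
Any help appreciated!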

All replies

cynthialam

2012-1-10 10:46:32

Still no answers...?

桶桶nancy

2012-1-10 10:55:16

1. When the table has a primary key
a. A single unique column id (the primary key)
delete from table
where id not in
( select max(id) from table group by col1,col2,col3... )
The columns listed after group by are the ones that define a duplicate. If you list only col1,
then any rows with the same col1 value count as the same record.

b. A composite primary key
Assume col1+ ', '+col2+ ', '...col5 is the composite key
select * from table where col1+ ', '+col2+ ', '...col5 in ( select max(col1+ ', '+col2+ ', '...col5) from table
group by col1,col2,col3,col4 having count(*) > 1 )
The columns listed after group by are the ones that define a duplicate.
If you list only col1, then any rows with the same col1 value count as the same record.

or
select * from table where exists (select 1 from table x where table.col1 = x.col1 and
table.col2 = x.col2 group by x.col1,x.col2 having count(*) > 1)

c. Comparing all columns
select * into #aa from table group by id1,id2,....
delete from table
insert into table
select * from #aa

2. When the table has no primary key

a. Using a temporary table
select identity(int,1,1) as id,* into #temp from ta
delete from #temp
where id not in
( select max(id) from #temp group by col1,col2,col3... )
delete from ta
insert into ta(...)
select ..... from #temp

b. Altering the table structure (adding a unique column)
alter table table add newfield int identity(1,1)
delete from table
where newfield not in
( select min(newfield) from table group by <all columns except newfield> )

alter table table drop column newfield
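
To make the `max(id) ... group by` pattern concrete for the data in the original question, here is a minimal sketch in SAS PROC SQL. It is an illustration, not part of the reply above. The names `have`, `want`, `lo` and `hi` are made up, and the row number `id` stands in for the primary key. Because the question treats "a b" and "b a" as the same record, the grouping key is the pair in sorted order (`lo`, `hi`) rather than the raw columns.

```sas
/* Sample data from the question; id = row number plays the role of the primary key */
data have;
  input var1 $ var2 $;
  id = _n_;
  cards;
a b
a c
b c
b a
;
run;

proc sql;
  create table want as
  select id, var1, var2
  from have
  where id in
    /* one id per group: the largest id among rows that share the same unordered pair */
    (select max(id)
     from (select id,
                  case when var1 <= var2 then var1 else var2 end as lo,
                  case when var1 <= var2 then var2 else var1 end as hi
           from have)
     group by lo, hi)
  order by id;
quit;
```

Rows 1 and 4 fall into the same group (lo=a, hi=b), so `max(id)` returns only 4 for that group and row 1 is left out. Swapping `max` for `min` would keep the first row of each group instead of the last.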

桶桶nancy

2012-1-10 10:55:43

cynthialam

2012-1-10 13:35:05

Thanks~
But I don't fully follow it, especially the part about max...

shenliang_111

2012-1-10 14:34:37

mymine

2012-1-10 15:08:37

Here is a version with simple logic that deletes every row that has a duplicate

data a;
input var1 $ var2 $;
cards;
a b
a c
b c
b a
d c
e f
t g
c d
;
run;

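/* a1: key built as var1*var2, plus the original row number */
data a1;
set a;
var3=compress(var1||'*'||var2);
nn=_n_;
run;
/* a2: same rows, key built in the reverse order var2*var1 */
data a2;
set a;
var3=compress(var2||'*'||var1);
nn=_n_;
run;
/* stack both versions, so each original row now appears twice */
data aa;
set a1 a2;
run;

/* keep one row per key; a row with a mirrored twin loses one of its two copies */
proc sort data=aa out=aa nodupkey;
by var3;
run;
/* rows that still have both copies had no twin, so keep only those */
proc sql;
create table ab as
select distinct var1,var2
from aa group by nn
having n(var1)=2;
quit;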
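
A shorter way to get the same "drop every duplicated row" result might look like the sketch below. It swaps in a different technique from the post above: each row gets an order-independent key by sorting its two values with `call sortc`, and PROC SQL then keeps only the keys that occur once. The names `keyed`, `uniq_only`, `k1` and `k2` are made up for illustration, and the sketch assumes data set `a` from the post above, with `var1` and `var2` both of length 8.

```sas
/* Build an order-independent key: after call sortc, k1 <= k2 within each row */
data keyed;
  set a;
  length k1 k2 $8;      /* same length as var1/var2 */
  k1 = var1;
  k2 = var2;
  call sortc(k1, k2);   /* sorts the two copies, leaves var1/var2 untouched */
run;

/* Keep only rows whose unordered pair occurs exactly once */
proc sql;
  create table uniq_only as
  select var1, var2
  from keyed
  group by k1, k2
  having count(*) = 1;
quit;
```

With the sample data, "a b"/"b a" and "d c"/"c d" are mirrored pairs, so both members of each pair are dropped and only a c, b c, e f and t g remain.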

mymine

2012-1-11 14:27:41

A version that keeps one row from each set of duplicates

data a;
input var1 $ var2 $;
cards;
a b
a c
b c
b a
d c
e f
t g
c d
;
run;

data a1;
set a;
var3=compress(var1||'*'||var2);
nn=_n_;
run;
data a2;
set a;
var3=compress(var2||'*'||var1);
nn=_n_;
run;
data aa;
set a1 a2;
run;

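/* for every key, attach the largest and smallest row number that share it */
proc sql;
create table ab as
select distinct nn,var1,var2,max(nn) as n1,min(nn) as n2
from aa group by var3;
quit;

/* mirrored rows end up with the same (n1, n2) pair, so keep one row per pair */
proc sort data=ab out=ab nodupkey;
by n1 n2;
run;
/* restore the original row order */
proc sort data=ab;
by nn;
run;

/* drop the helper columns */
data ab;
set ab;
drop nn n1 n2;
run;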
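
For comparison, a more compact way to keep one row per unordered pair could be sketched as follows. This is an alternative technique, not the code from the post above: `call sortc` builds a sorted key for each row and `proc sort nodupkey` keeps the first row for each key. The names `keyed`, `dedup`, `keep_one`, `k1` and `k2` are made up for illustration, and the sketch assumes data set `a` from the post above, with `var1` and `var2` both of length 8.

```sas
/* Build an order-independent key and remember the original row order */
data keyed;
  set a;
  length k1 k2 $8;      /* same length as var1/var2 */
  k1 = var1;
  k2 = var2;
  call sortc(k1, k2);   /* after this call, k1 <= k2 within each row */
  nn = _n_;
run;

/* One row per unordered pair; with the default EQUALS option the
   earliest input row of each pair is the one that is kept */
proc sort data=keyed out=dedup nodupkey;
  by k1 k2;
run;

/* Put the surviving rows back in their original order */
proc sort data=dedup;
  by nn;
run;

/* Drop the helper columns */
data keep_one;
  set dedup;
  drop k1 k2 nn;
run;
```

With the sample data this removes "b a" (row 4) and "c d" (row 8), because "a b" and "d c" appear earlier in the input.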

sushe1527

2012-1-11 17:22:24

tj0412ymy

2012-1-11 17:56:47

cynthialam

2012-1-12 14:10:46

tj0412ymy posted on 2012-1-11 17:56

Many thanks~

