Delete all Duplicate Rows except for One in MySQL?

Problem description

How would I delete all duplicate data from a MySQL table?

For example, with the following data:

SELECT * FROM names;

+----+--------+
| id | name   |
+----+--------+
| 1  | google |
| 2  | yahoo  |
| 3  | msn    |
| 4  | google |
| 5  | google |
| 6  | yahoo  |
+----+--------+
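To reproduce the sample data, a minimal setup script might look like the following. The column types are assumptions, since the question shows only the column names and values.

```sql
-- Hypothetical schema: the question does not give the column types.
CREATE TABLE names (
    id   INT UNSIGNED NOT NULL AUTO_INCREMENT PRIMARY KEY,
    name VARCHAR(50)  NOT NULL
);

-- The six rows shown in the question.
INSERT INTO names (id, name) VALUES
    (1, 'google'),
    (2, 'yahoo'),
    (3, 'msn'),
    (4, 'google'),
    (5, 'google'),
    (6, 'yahoo');
```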

I would use SELECT DISTINCT name FROM names; if it were a SELECT query.

How would I do this with DELETE to only remove duplicates and keep just one record of each?

Recommended answer

Editor warning: This solution is computationally inefficient and may bring down your connection for a large table.

NB - You need to do this first on a test copy of your table!
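One way to make such a test copy is sketched below. The name names_test is hypothetical, and the sketch assumes the table is small enough to copy in full.

```sql
-- names_test is a hypothetical name for the scratch copy.
CREATE TABLE names_test LIKE names;          -- copies column definitions and indexes
INSERT INTO names_test SELECT * FROM names;  -- copies the rows

-- Sanity check: both counts should match before experimenting on the copy.
SELECT (SELECT COUNT(*) FROM names)      AS original_rows,
       (SELECT COUNT(*) FROM names_test) AS copied_rows;
```

Any DELETE can then be tried against names_test first by substituting the table name.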

When I did it, I found that unless I also included AND n1.id <> n2.id, it deleted every row in the table.

  1. If you want to keep the row with the lowest id value:

DELETE n1 FROM names n1, names n2 WHERE n1.id > n2.id AND n1.name = n2.name

  2. If you want to keep the row with the highest id value:

    DELETE n1 FROM names n1, names n2 WHERE n1.id < n2.id AND n1.name = n2.name
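Before running either DELETE, it may help to preview what the join matches. The sketch below turns the "keep the lowest id" variant into a SELECT, then shows for contrast what happens when there is no condition on id at all. It assumes the names table from the question and that name contains no NULL values.

```sql
-- Rows the "keep the lowest id" DELETE would remove:
-- each one has a same-name row with a smaller id.
SELECT DISTINCT n1.id, n1.name
FROM names n1
JOIN names n2
  ON n1.name = n2.name
 AND n1.id > n2.id
ORDER BY n1.id;

-- For contrast: with no id condition every row pairs with itself,
-- so this counts every row in the table.
SELECT COUNT(DISTINCT n1.id) AS rows_matched
FROM names n1
JOIN names n2
  ON n1.name = n2.name;
```

With the sample data, the first query returns ids 4, 5 and 6, and the second returns 6, which is why a DELETE that joins on name alone empties the table.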
    

I used this method in MySQL 5.1. Not sure about other versions.

Update: Since people Googling for removing duplicates end up here, note that although the OP's question is about DELETE, using INSERT and DISTINCT is much faster. For a database with 8 million rows, the query below took 13 minutes, while the DELETE ran for more than 2 hours without completing.

    INSERT INTO tempTableName(cellId,attributeId,entityRowId,value)
        SELECT DISTINCT cellId,attributeId,entityRowId,value
        FROM tableName;
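The INSERT above only fills the new table; the surrounding steps are not shown in the answer. A possible end-to-end sketch follows, assuming that tempTableName does not exist yet, that the four listed columns are the only ones that need a value, and using tableName_old as a hypothetical backup name.

```sql
-- Assumption: tempTableName does not exist yet and should mirror tableName.
CREATE TABLE tempTableName LIKE tableName;

-- The statement from the answer: copy one row per distinct combination.
INSERT INTO tempTableName (cellId, attributeId, entityRowId, value)
    SELECT DISTINCT cellId, attributeId, entityRowId, value
    FROM tableName;

-- Swap the tables in a single statement; tableName_old is a hypothetical backup name.
RENAME TABLE tableName     TO tableName_old,
             tempTableName TO tableName;

-- Once the result has been checked:
-- DROP TABLE tableName_old;
```

Keeping the old table until the result has been checked leaves a way back if the deduplicated copy turns out to be wrong.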
    
