MySQL: Deleting Duplicate Rows from a Table (While Keeping One Copy)

Original article

2020-06-04 00:32

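[Addendum] As KimZing pointed out, MySQL 5.7 will not pick an id for you by default: with ONLY_FULL_GROUP_BY enabled (the default since 5.7.5), selecting a bare, non-aggregated id next to a GROUP BY is rejected. The fix is to select MIN(id) in the query instead.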

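Today I ran into a problem: the same data appeared several times in one table. I needed to delete the extra rows while keeping exactly one copy of each.
Assume the table is named table_a, the two columns that together identify a row are c_1 and c_2, and data is an unrelated column.
The original data in the table looked like this:
[Screenshot: original table data]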
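
To follow along, a table of this shape can be set up as sketched below. Only the names table_a, id, c_1, c_2 and data come from the article; the column types and every sample value are assumptions made up for illustration, not the author's actual data.

```sql
-- Hypothetical schema: the column types are assumed, not taken from the article.
CREATE TABLE table_a (
    id   INT AUTO_INCREMENT PRIMARY KEY,
    c_1  VARCHAR(32) NOT NULL,
    c_2  VARCHAR(32) NOT NULL,
    data VARCHAR(64)
);

-- Made-up sample rows: ('a','1') and ('b','2') each appear twice, ('c','3') once.
INSERT INTO table_a (c_1, c_2, data) VALUES
    ('a', '1', 'first'),
    ('a', '1', 'second'),
    ('b', '2', 'third'),
    ('b', '2', 'fourth'),
    ('c', '3', 'fifth');
```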

First, find out which rows are duplicated by running the following SQL:

SELECT *
FROM (
    SELECT COUNT(*) AS num, c_1, c_2, MIN(id) AS id
    FROM table_a
    GROUP BY c_1, c_2
) AS e
WHERE e.num > 1;

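The result:
[Screenshot: query result]

The num column is the number of times each (c_1, c_2) combination occurs, so this query lists exactly the combinations that are duplicated. How do we remove the extra rows?
My approach: have the grouped query also return an id. Because the query groups by c_1 and c_2, MIN(id) yields exactly one id per group of duplicates, namely the lowest one. Deleting the rows with those ids removes one copy from each duplicated group. The SQL: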

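DELETE FROM table_a
WHERE id IN (
    -- The outer derived table e is required: MySQL refuses (error 1093) to
    -- select directly from the table that the DELETE is modifying.
    SELECT e.id
    FROM (
        SELECT COUNT(*) AS num, c_1, c_2, MIN(id) AS id
        FROM table_a
        GROUP BY c_1, c_2
    ) AS e
    WHERE e.num > 1
);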
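
Before running a DELETE like this, it can be worth previewing the rows it would remove. The query below is a minimal sketch of such a check, not part of the original article: it joins table_a to the same set of lowest ids per duplicated group, so the rows it returns should be the ones the DELETE targets.

```sql
-- Preview: full rows whose id is the lowest id in a duplicated (c_1, c_2) group.
SELECT t.*
FROM table_a AS t
JOIN (
    SELECT MIN(id) AS id
    FROM table_a
    GROUP BY c_1, c_2
    HAVING COUNT(*) > 1
) AS e ON e.id = t.id;
```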

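Update, 2018-01-20:
A simpler version occurred to me. HAVING filters the groups directly, so the COUNT(*) column no longer has to be selected:

DELETE FROM table_a
WHERE id IN (
    SELECT id
    FROM (
        -- MIN(id) rather than a bare id keeps this valid under ONLY_FULL_GROUP_BY
        SELECT MIN(id) AS id
        FROM table_a
        GROUP BY c_1, c_2
        HAVING COUNT(*) > 1
    ) AS e
);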

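Run it:
[Screenshot: statement output]

Two rows were deleted. Looking at the table again, the data is now:
[Screenshot: table data after the delete]

The duplicate rows have been removed.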

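What if a combination appears three or more times? Each run deletes only one row (the one with the lowest id) from every duplicated group, so simply run the same statement again until it reports 0 rows affected.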
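
As an alternative to repeated runs, a self-join DELETE can clear every extra copy in one pass. This is a sketch of a different technique from the one used in the article: it deletes each row for which another row with the same c_1 and c_2 and a higher id exists, so the copy with the highest id survives, which matches what repeated runs of the statement above leave behind.

```sql
-- Delete every row that has a duplicate with a larger id; the highest id per group remains.
DELETE t1
FROM table_a AS t1
JOIN table_a AS t2
  ON  t1.c_1 = t2.c_1
  AND t1.c_2 = t2.c_2
  AND t1.id  < t2.id;
```

Note that `=` never matches NULL, so rows where c_1 or c_2 is NULL are left alone by this sketch, whereas GROUP BY treats NULLs as one group. Swapping `=` for the NULL-safe `<=>` operator would cover that case.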

Finally, don't forget to add a unique index on these columns so the duplicates cannot come back.
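
A unique index over the two identifying columns might look like the sketch below. The index name uk_c1_c2 is a hypothetical choice, not from the article.

```sql
-- uk_c1_c2 is an assumed name; any unused index name works.
ALTER TABLE table_a
    ADD UNIQUE INDEX uk_c1_c2 (c_1, c_2);
```

The statement fails with a duplicate-entry error if any duplicated (c_1, c_2) pairs remain, so it also serves as a check that the cleanup is complete.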
