oracle刪除重複數據並至少保留一條（大數據操作）

原創

2020-07-06 02:39

1. 前言

有時候會遇到這樣的問題，一個表中有重複數據，並且數據量比較大，在一萬條以上。這個時候如果用delete刪除重複數據並保留一條的時候會非常之慢，數據庫直接卡死。

這個時候可以通過創建臨時表來進行重複數據的篩選，然後刪除原來的數據，把臨時的表數據再移到數據庫的表中。自己測試這種方法十幾萬的數據量不到1分鐘就可以完成。

2. 開始

2.1 創建臨時表

僞語法：

CREATE TABLE TEMP_DELETE_ORDER_INFO AS

(

查找重複的數據並只顯示一條

union all

查找不重複的數據

)

SQL 案例：


CREATE TABLE TEMP_DELETE_ORDER_INFO AS (
    --id重複的數據
    select T1.* from ORDER_INFO T1 where T1.id in
	    (select id from ORDER_INFO group by id having count(id) >1) 
        AND T1.ROWID NOT IN 
	    (SELECT MIN(ROWID) FROM ORDER_INFO GROUP BY id HAVING COUNT(*) > 1)

    --將數據連起來
    union all

    --id不重複的數據
    select T2.* from ORDER_INFO T2 where T2.id in
        (select id from ORDER_INFO group by id having count(id) =1)
);

2.2 清空原來的表

truncate TABLE ORDER_INFO;

2.3 將臨時表的數據導入到原來的表中

insert into ORDER_INFO  select * from TEMP_DELETE_ORDER_INFO;

2.4 刪除臨時表


DROP TABLE TEMP_DELETE_ORDER_INFO;

3. 結尾

創建臨時表查找重複數據和不重複數據需要根據自己的需求，我這個是根據id重複來查找的。

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

一場數據架構變革正在來臨

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null

2021-12-21 10:54:01

解讀數字化轉型下的數據安全：AI正在開闢新的可能性

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-19 14:03:54

雲原生數據庫企業Cockroach Labs再獲 2.73 億美元融資，估值高達50億美元

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-16 15:18:50

數千個數據庫、遍佈全國的物理機，京東物流全量上雲實錄 | 卓越技術團隊訪談錄

{"type":"doc","content":[{"type":"heading","attrs":{"align":null,"level":1}},{"type":"blockquote","content":[{"type":"pa

2021-12-16 10:38:55

前車之鑑：聊聊我在基礎設施中掉過的坑

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-14 13:33:55

洞察數據庫變革趨勢，亞馬遜雲科技正在憑藉這項技術改變着遊戲規則

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-10 16:53:54

MongoDB發佈第三季度財報，雲數據庫收入增長加速

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-09 15:33:57

MySQL探祕(四):InnoDB的磁盤文件及落盤機制

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

程序员历小冰

2021-12-08 12:33:52

Oracle 大佬離職，怒噴 MySQL “糟糕的數據庫”

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-07 19:58:57

Jellyfish：爲Uber最大的存儲系統提供更節省成本的數據分層

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

Mohammed Khatib

2021-12-06 10:33:48

企業需要什麼樣的數據庫，One Size Fits All？

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-03 18:19:01

這個重要開源項目全靠一位低調的“怪老頭”維護！他和比爾蓋茨一樣撐起了計算機世界

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-03 14:23:56

數據庫事務的三個元問題

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null

2021-12-03 10:33:52

一個 Babelfish ，看懂雲數據庫的發展方向

{"type":"doc","content":[{"type":"heading","attrs":{"align":null,"level":1}},{"type":"paragraph","attrs":{"indent":0,"nu

2021-12-01 18:43:50

數據庫內核雜談(二十一): 流處理系統簡介

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-11-24 10:38:57

24小時熱門文章

Wireshark 安裝+使用（一）

最新文章

最新評論文章