Apache Druid (version 0.15.0): A Practical Guide to Data Migration and Data Retention Rules (work in progress)

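1. Data migration (between two Druid clusters)

    Reference: https://blog.csdn.net/u012164361/article/details/87881926

   1.1 Copy the old cluster's segments into the target directory on the new cluster. My company has OSS, so I used it as intermediate storage: HDFS deep storage directory of the old Druid cluster ---> OSS ---> Druid deep storage directory on the new HDFS cluster.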
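
The copy in 1.1 could be scripted roughly as follows. This is only a sketch under assumptions: the post does not say which tool was used, so `hadoop distcp` is assumed here, along with a Hadoop installation whose Aliyun OSS connector is configured so that `oss://` paths resolve. The bucket and prefix in `OSS_STAGING` are hypothetical. The two HDFS prefixes are taken from the path-rewrite statement later in this post. The function only prints the commands so they can be reviewed first; the first would run against the old cluster and the second against the new one.

```shell
#!/usr/bin/env bash
# Deep storage locations taken from this post; OSS_STAGING is a hypothetical bucket/prefix.
OLD_DEEP_STORAGE="hdfs://10.80.230.158:9000/druid/segments"
NEW_DEEP_STORAGE="hdfs://nameservice1/user/hive/warehouse/imply_druid/segments"
OSS_STAGING="oss://my-bucket/druid-migration/segments"

# Print (do not run) the two copy commands for one dataSource.
print_copy_commands() {
  local ds="$1"
  echo "hadoop distcp ${OLD_DEEP_STORAGE}/${ds} ${OSS_STAGING}/${ds}"
  echo "hadoop distcp ${OSS_STAGING}/${ds} ${NEW_DEEP_STORAGE}/${ds}"
}

print_copy_commands behavior_pcpvuvprod
```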

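   1.2 Import the metadata of the old Druid cluster's dataSource (using mysqldump)

The old cluster contains many dataSources that are no longer needed, so only the required ones are synced, using a WHERE condition.

### Here only the rows whose dataSource is behavior_pcpvuvprod are exported, to be synced to the new Druid cluster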

mysqldump -uroot -pdruiddruid druid druid_segments --where="dataSource='behavior_pcpvuvprod'" > ./behavior_pcpvuvprod.sql

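Notes: 1. The exported .sql file contains statements that drop and recreate the druid_segments table. The new cluster's table already holds data, so I commented those statements out and imported only the rows.

           2. The deep storage directories of the old and new Druid clusters differ, so the paths can be rewritten either before importing the metadata or after importing it into the new MySQL database.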
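
Commenting out the table DDL by hand, as described in note 1, can also be done with a small filter. The sketch below is one possible approach, not what the post's author necessarily did: it assumes the dump has the layout shown below (a `DROP TABLE IF EXISTS` line, followed later by a line that restores `character_set_client` from `@saved_cs_client`) and prefixes that whole range with `-- `. The function name and the output file name are made up for illustration.

```shell
#!/usr/bin/env bash
# Comment out everything from the DROP TABLE statement through the line that
# restores character_set_client, so that only the INSERT statements run on import.
# MySQL needs whitespace after "--", hence the "-- " prefix.
comment_out_table_ddl() {
  sed '/^DROP TABLE IF EXISTS/,/character_set_client = @saved_cs_client/ s/^/-- /' "$1"
}

# Usage (the output file name is an example):
# comment_out_table_ddl ./behavior_pcpvuvprod.sql > ./behavior_pcpvuvprod.data-only.sql
```

Exporting with mysqldump's `--no-create-info` option may avoid the DDL in the first place; that variant was not tried in this post.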

-- MySQL dump 10.13  Distrib 5.6.43, for Linux (x86_64)
--
-- Host: localhost    Database: druid
-- ------------------------------------------------------
-- Server version	5.6.43

/*!40101 SET @OLD_CHARACTER_SET_CLIENT=@@CHARACTER_SET_CLIENT */;
/*!40101 SET @OLD_CHARACTER_SET_RESULTS=@@CHARACTER_SET_RESULTS */;
/*!40101 SET @OLD_COLLATION_CONNECTION=@@COLLATION_CONNECTION */;
/*!40101 SET NAMES utf8 */;
/*!40103 SET @OLD_TIME_ZONE=@@TIME_ZONE */;
/*!40103 SET TIME_ZONE='+00:00' */;
/*!40014 SET @OLD_UNIQUE_CHECKS=@@UNIQUE_CHECKS, UNIQUE_CHECKS=0 */;
/*!40014 SET @OLD_FOREIGN_KEY_CHECKS=@@FOREIGN_KEY_CHECKS, FOREIGN_KEY_CHECKS=0 */;
/*!40101 SET @OLD_SQL_MODE=@@SQL_MODE, SQL_MODE='NO_AUTO_VALUE_ON_ZERO' */;
/*!40111 SET @OLD_SQL_NOTES=@@SQL_NOTES, SQL_NOTES=0 */;

--
-- Table structure for table `druid_segments`
--

-- Commented out: the DROP TABLE and CREATE TABLE statements (MySQL requires a space after "--")
-- DROP TABLE IF EXISTS `druid_segments`;
-- /*!40101 SET @saved_cs_client     = @@character_set_client */;
-- /*!40101 SET character_set_client = utf8 */;
-- CREATE TABLE `druid_segments` (
--   `id` varchar(255) NOT NULL,
--   `dataSource` varchar(255) NOT NULL,
--   `created_date` varchar(255) NOT NULL,
--   `start` varchar(255) NOT NULL,
--   `end` varchar(255) NOT NULL,
--   `partitioned` tinyint(1) NOT NULL,
--   `version` varchar(255) NOT NULL,
--   `used` tinyint(1) NOT NULL,
--   `payload` longblob NOT NULL,
--   PRIMARY KEY (`id`),
--   KEY `idx_druid_segments_used` (`used`),
--   KEY `idx_druid_segments_datasource_used_end` (`dataSource`,`used`,`end`)
-- ) ENGINE=InnoDB DEFAULT CHARSET=utf8;
-- /*!40101 SET character_set_client = @saved_cs_client */;

--
-- Dumping data for table `druid_segments`
--
-- WHERE:  dataSource='behavior_pcpvuvprod'

LOCK TABLES `druid_segments` WRITE;
/*!40000 ALTER TABLE `druid_segments` DISABLE KEYS */;
INSERT INTO `druid_segments` VALUES
('behavior_pcpvuvprod_2019-05-01T00:00:00.000Z_2019-05-02T00:00:00.000Z_2019-05-07T08:06:33.155Z','behavior_pcpvuvprod','2019-05-07T08:08:27.880Z','2019-05-01T00:00:00.000Z','2019-05-02T00:00:00.000Z',1,'2019-05-07T08:06:33.155Z',1,'{\"dataSource\":\"behavior_pcpvuvprod\",\"interval\":\"2019-05-01T00:00:00.000Z/2019-05-02T00:00:00.000Z\",\"version\":\"2019-05-07T08:06:33.155Z\",\"loadSpec\":{\"type\":\"hdfs\",\"path\":\"hdfs://nameservice1/user/hive/warehouse/imply_druid/segments/behavior_pcpvuvprod/20190501T000000.000Z_20190502T000000.000Z/2019-05-07T08_06_33.155Z/0_index.zip\"},\"dimensions\":\"actionName,actionTimestamp,actionValue,eventPara,event_id,event_type,function1,function2,platform,requestPath,session_id,source,tracker_type,userAgent,user_id,user_mobile,user_name\",\"metrics\":\"\",\"shardSpec\":{\"type\":\"numbered\",\"partitionNum\":0,\"partitions\":1},\"binaryVersion\":9,\"size\":9605202,\"identifier\":\"behavior_pcpvuvprod_2019-05-01T00:00:00.000Z_2019-05-02T00:00:00.000Z_2019-05-07T08:06:33.155Z\"}'),
('behavior_pcpvuvprod_2019-05-02T00:00:00.000Z_2019-05-03T00:00:00.000Z_2019-05-07T07:58:35.440Z','behavior_pcpvuvprod','2019-05-07T08:00:18.293Z','2019-05-02T00:00:00.000Z','2019-05-03T00:00:00.000Z',1,'2019-05-07T07:58:35.440Z',1,'{\"dataSource\":\"behavior_pcpvuvprod\",\"interval\":\"2019-05-02T00:00:00.000Z/2019-05-03T00:00:00.000Z\",\"version\":\"2019-05-07T07:58:35.440Z\",\"loadSpec\":{\"type\":\"hdfs\",\"path\":\"hdfs://nameservice1/user/hive/warehouse/imply_druid/segments/behavior_pcpvuvprod/20190502T000000.000Z_20190503T000000.000Z/2019-05-07T07_58_35.440Z/0_index.zip\"},\"dimensions\":\"actionName,actionTimestamp,actionValue,eventPara,event_id,event_type,function1,function2,platform,requestPath,session_id,source,tracker_type,userAgent,user_id,user_mobile,user_name\",\"metrics\":\"\",\"shardSpec\":{\"type\":\"numbered\",\"partitionNum\":0,\"partitions\":1},\"binaryVersion\":9,\"size\":9813922,\"identifier\":\"behavior_pcpvuvprod_2019-05-02T00:00:00.000Z_2019-05-03T00:00:00.000Z_2019-05-07T07:58:35.440Z\"}'),
('behavior_pcpvuvprod_2019-05-03T00:00:00.000Z_2019-05-04T00:00:00.000Z_2019-05-07T07:52:54.668Z','behavior_pcpvuvprod','2019-05-07T07:54:41.174Z','2019-05-03T00:00:00.000Z','2019-05-04T00:00:00.000Z',1,'2019-05-07T07:52:54.668Z',1,'{\"dataSource\":\"behavior_pcpvuvprod\",\"interval\":\"2019-05-03T00:00:00.000Z/2019-05-04T00:00:00.000Z\",\"version\":\"2019-05-07T07:52:54.668Z\",\"loadSpec\":{\"type\":\"hdfs\",\"path\":\"hdfs://nameservice1/user/hive/warehouse/imply_druid/segments/behavior_pcpvuvprod/20190503T000000.000Z_20190504T000000.000Z/2019-05-07T07_52_54.668Z/0_index.zip\"},\"dimensions\":\"actionName,actionTimestamp,actionValue,eventPara,event_id,event_type,function1,function2,platform,requestPath,session_id,source,tracker_type,userAgent,user_id,user_mobile,user_name\",\"metrics\":\"\",\"shardSpec\":{\"type\":\"numbered\",\"partitionNum\":0,\"partitions\":1},\"binaryVersion\":9,\"size\":9789188,\"identifier\":\"behavior_pcpvuvprod_2019-05-03T00:00:00.000Z_2019-05-04T00:00:00.000Z_2019-05-07T07:52:54.668Z\"}'),
('behavior_pcpvuvprod_2019-05-04T00:00:00.000Z_2019-05-05T00:00:00.000Z_2019-05-07T07:49:21.548Z','behavior_pcpvuvprod','2019-05-07T07:51:10.438Z','2019-05-04T00:00:00.000Z','2019-05-05T00:00:00.000Z',1,'2019-05-07T07:49:21.548Z',1,'{\"dataSource\":\"behavior_pcpvuvprod\",\"interval\":\"2019-05-04T00:00:00.000Z/2019-05-05T00:00:00.000Z\",\"version\":\"2019-05-07T07:49:21.548Z\",\"loadSpec\":{\"type\":\"hdfs\",\"path\":\"hdfs://nameservice1/user/hive/warehouse/imply_druid/segments/behavior_pcpvuvprod/20190504T000000.000Z_20190505T000000.000Z/2019-05-07T07_49_21.548Z/0_index.zip\"},\"dimensions\":\"actionName,actionTimestamp,actionValue,eventPara,event_id,event_type,function1,function2,platform,requestPath,session_id,source,tracker_type,userAgent,user_id,user_mobile,user_name\",\"metrics\":\"\",\"shardSpec\":{\"type\":\"numbered\",\"partitionNum\":0,\"partitions\":1},\"binaryVersion\":9,\"size\":9355446,\"identifier\":\"behavior_pcpvuvprod_2019-05-04T00:00:00.000Z_2019-05-05T00:00:00.000Z_2019-05-07T07:49:21.548Z\"}'),
('behavior_pcpvuvprod_2019-05-05T00:00:00.000Z_2019-05-06T00:00:00.000Z_2019-05-07T07:46:21.542Z','behavior_pcpvuvprod','2019-05-07T07:47:41.696Z','2019-05-05T00:00:00.000Z','2019-05-06T00:00:00.000Z',1,'2019-05-07T07:46:21.542Z',1,'{\"dataSource\":\"behavior_pcpvuvprod\",\"interval\":\"2019-05-05T00:00:00.000Z/2019-05-06T00:00:00.000Z\",\"version\":\"2019-05-07T07:46:21.542Z\",\"loadSpec\":{\"type\":\"hdfs\",\"path\":\"hdfs://nameservice1/user/hive/warehouse/imply_druid/segments/behavior_pcpvuvprod/20190505T000000.000Z_20190506T000000.000Z/2019-05-07T07_46_21.542Z/0_index.zip\"},\"dimensions\":\"actionName,actionTimestamp,actionValue,eventPara,event_id,event_type,function1,function2,platform,requestPath,session_id,source,tracker_type,userAgent,user_id,user_mobile,user_name\",\"metrics\":\"\",\"shardSpec\":{\"type\":\"numbered\",\"partitionNum\":0,\"partitions\":1},\"binaryVersion\":9,\"size\":3871583,\"identifier\":\"behavior_pcpvuvprod_2019-05-05T00:00:00.000Z_2019-05-06T00:00:00.000Z_2019-05-07T07:46:21.542Z\"}'),
('behavior_pcpvuvprod_2019-05-06T00:00:00.000Z_2019-05-07T00:00:00.000Z_2019-05-07T07:42:24.945Z','behavior_pcpvuvprod','2019-05-07T07:43:53.176Z','2019-05-06T00:00:00.000Z','2019-05-07T00:00:00.000Z',1,'2019-05-07T07:42:24.945Z',1,'{\"dataSource\":\"behavior_pcpvuvprod\",\"interval\":\"2019-05-06T00:00:00.000Z/2019-05-07T00:00:00.000Z\",\"version\":\"2019-05-07T07:42:24.945Z\",\"loadSpec\":{\"type\":\"hdfs\",\"path\":\"hdfs://nameservice1/user/hive/warehouse/imply_druid/segments/behavior_pcpvuvprod/20190506T000000.000Z_20190507T000000.000Z/2019-05-07T07_42_24.945Z/0_index.zip\"},\"dimensions\":\"actionName,actionTimestamp,actionValue,eventPara,event_id,event_type,function1,function2,platform,requestPath,session_id,source,tracker_type,userAgent,user_id,user_mobile,user_name\",\"metrics\":\"\",\"shardSpec\":{\"type\":\"numbered\",\"partitionNum\":0,\"partitions\":1},\"binaryVersion\":9,\"size\":3237018,\"identifier\":\"behavior_pcpvuvprod_2019-05-06T00:00:00.000Z_2019-05-07T00:00:00.000Z_2019-05-07T07:42:24.945Z\"}');
/*!40000 ALTER TABLE `druid_segments` ENABLE KEYS */;
UNLOCK TABLES;
/*!40103 SET TIME_ZONE=@OLD_TIME_ZONE */;

/*!40101 SET SQL_MODE=@OLD_SQL_MODE */;
/*!40014 SET FOREIGN_KEY_CHECKS=@OLD_FOREIGN_KEY_CHECKS */;
/*!40014 SET UNIQUE_CHECKS=@OLD_UNIQUE_CHECKS */;
/*!40101 SET CHARACTER_SET_CLIENT=@OLD_CHARACTER_SET_CLIENT */;
/*!40101 SET CHARACTER_SET_RESULTS=@OLD_CHARACTER_SET_RESULTS */;
/*!40101 SET COLLATION_CONNECTION=@OLD_COLLATION_CONNECTION */;
/*!40111 SET SQL_NOTES=@OLD_SQL_NOTES */;

-- Dump completed on 2019-09-07 11:27:12

Log in to MySQL on the new cluster and import the metadata.
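
The import step is not shown in the original; one common way is to feed the edited dump to the `mysql` client. The sketch below assumes the same user (`root`) and database name (`druid`) as the mysqldump command above, which may differ on the new cluster. The function name is made up for illustration. Using `-p` without a value makes the client prompt for the password instead of exposing it on the command line.

```shell
#!/usr/bin/env bash
# Load a data-only dump into the metadata database of the new cluster.
# User "root" and database "druid" are assumed to match the mysqldump command.
import_segments_dump() {
  local dump_file="$1"
  mysql -uroot -p druid < "$dump_file"
}

# Usage:
# import_segments_dump ./behavior_pcpvuvprod.sql
```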

Then run the following SQL statement to rewrite the deep storage path in the newly imported metadata:

update druid_segments  set `payload` = replace(payload,'hdfs://10.80.230.158:9000/druid/','hdfs://nameservice1/user/hive/warehouse/imply_druid/');
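
As mentioned in note 2, the same rewrite can be applied to the dump file before it is imported, instead of running an UPDATE afterwards. The following is a sketch of that alternative using `sed`; the two prefixes are the ones from the UPDATE statement above, while the function name and the output file name are hypothetical.

```shell
#!/usr/bin/env bash
# Old prefix written as a regular expression (dots escaped); new prefix as plain text.
OLD_PREFIX_RE='hdfs://10\.80\.230\.158:9000/druid/'
NEW_PREFIX='hdfs://nameservice1/user/hive/warehouse/imply_druid/'

# Replace every occurrence of the old deep storage prefix in a dump file.
# '#' is used as the sed delimiter because both prefixes contain '/'.
rewrite_deep_storage_prefix() {
  sed "s#${OLD_PREFIX_RE}#${NEW_PREFIX}#g" "$1"
}

# Usage (the output file name is an example):
# rewrite_deep_storage_prefix ./behavior_pcpvuvprod.sql > ./behavior_pcpvuvprod.rewritten.sql
```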

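At this point the newly imported data should show up as loaded in the Druid web console.

2. Data retention rules

Official documentation: http://druid.apache.org/docs/latest/operations/rule-configuration.html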

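     1. dropByInterval

Set the rules for the dataSource.

A dropByInterval rule drops the segments that fall inside a given interval. For example, to drop the data from 2019-05-01 up to (but not including) 2019-06-01, use this interval: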

2019-05-01T00:00:00.000Z/2019-06-01T00:00:00.000Z
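
The original sets this rule through the web console. For reference, the equivalent rule can be written as JSON and posted to the Coordinator's rules endpoint. This is a sketch, not the author's procedure: the Coordinator address in `COORDINATOR` is hypothetical, the function name is made up, and the `curl` call is left commented out because posting replaces the dataSource's entire rule list.

```shell
#!/usr/bin/env bash
# Hypothetical Coordinator address; adjust to your cluster.
COORDINATOR="http://localhost:8081"

# Print a rule list containing a single dropByInterval rule for the given interval.
drop_by_interval_rules() {
  printf '[{"type":"dropByInterval","interval":"%s"}]\n' "$1"
}

drop_by_interval_rules "2019-05-01T00:00:00.000Z/2019-06-01T00:00:00.000Z"

# To apply (this replaces the existing rule list of the dataSource):
# curl -X POST -H 'Content-Type: application/json' \
#   -d "$(drop_by_interval_rules '2019-05-01T00:00:00.000Z/2019-06-01T00:00:00.000Z')" \
#   "${COORDINATOR}/druid/coordinator/v1/rules/behavior_pcpvuvprod"
```

Segments that match no rule in the dataSource's list fall through to the cluster's default rules.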

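     2. Keep only the most recent days of data

Here I keep only the last 7 days of data, so the loadByPeriod period is P7D.

Periods use ISO 8601 notation: P30D means 30 days and P1M means one month.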
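
Written as JSON, a "keep the last N days" policy is usually a loadByPeriod rule followed by a dropForever rule, since rules are evaluated in order and anything not loaded by the first rule has to be dropped by a later one. The sketch below assumes that pairing (the original only mentions loadByPeriod), and the tier name `_default_tier` and replica count of 2 are assumptions rather than values from the post. The function name is made up.

```shell
#!/usr/bin/env bash
# Print a rule list that loads segments within the given ISO 8601 period
# and drops everything older. The replica count defaults to 2 (an assumption).
keep_recent_rules() {
  local period="$1" replicas="${2:-2}"
  printf '[{"type":"loadByPeriod","period":"%s","tieredReplicants":{"_default_tier":%s}},{"type":"dropForever"}]\n' \
    "$period" "$replicas"
}

keep_recent_rules P7D
```

The output can be posted to the Coordinator endpoint `/druid/coordinator/v1/rules/{dataSourceName}` or entered rule by rule in the web console.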

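Dropping data through these rules does not delete it from deep storage, so it still takes up deep storage disk space. To delete the data completely, see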

https://blog.csdn.net/qq_34864753/article/details/102861322
