Hive中數據導入與導出

原創

2020-07-06 01:32

1 數據導入

1.1 向表中裝載數據（Load）

1．語法

hive> load data [local] inpath '/opt/module/datas/student.txt' [overwrite] | into table student [partition (partcol1=val1,…)];

（1）load data:表示加載數據

（2）local:表示從本地加載數據到hive表；否則從HDFS加載數據到hive表,可選項

（3）inpath:表示加載數據的路徑

（4）overwrite:表示覆蓋表中已有數據，否則表示追加，可選項

（5）into table:表示加載到哪張表

（6）student:表示具體的表

（7）partition:表示上傳到指定分區

2．實操案例

（1）創建一張表

hive (default)> create table student(id string, name string) row format delimited fields terminated by '\t';

（2）加載本地文件到hive表中

hive (default)> load data local inpath '/opt/module/datas/student.txt' into table default.student;

（3）加載HDFS文件到hive表中

#上傳文件到HDFS
hive (default)> dfs -put /opt/module/datas/student.txt /user/atguigu/hive;

#加載HDFS上數據到hive表
hive (default)> load data inpath '/user/atguigu/hive/student.txt' into table default.student;

（4）加載數據覆蓋表中已有的數據

#上傳文件到HDFS
hive (default)> dfs -put /opt/module/datas/student.txt /user/atguigu/hive;

#加載數據覆蓋表中已有的數據
hive (default)> load data inpath '/user/atguigu/hive/student.txt' overwrite into table default.student;

1.2 通過查詢語句向表中插入數據（Insert）

（1）創建一張分區表

hive (default)> create table student(id int, name string) partitioned by (month string) row format delimited fields terminated by '\t';

（2）基本插入數據

hive (default)> insert into table  student partition(month='201709') values(1,'wangwu');

（3）基本模式插入（根據單張表查詢結果）

hive (default)> insert overwrite table student partition(month='201708')

             select id, name from student where month='201709';

4．多插入模式（根據多張表查詢結果）

hive (default)> from student

              insert overwrite table student partition(month='201707')

              select id, name where month='201709'

              insert overwrite table student partition(month='201706')

              select id, name where month='201709';

1.3 查詢語句中創建表並加載數據（As Select）

根據查詢結果創建表（查詢的結果會添加到新創建的表中）

create table if not exists student3 as select id, name from student;

1.4 創建表時通過Location指定加載數據路徑

（1）創建表，並指定在hdfs上的位置

hive (default)> create table if not exists student5(

              id int, name string

              )

              row format delimited fields terminated by '\t'

              location '/user/hive/warehouse/student5';

（2）上傳數據到hdfs上

hive (default)> dfs -put /opt/module/datas/student.txt /user/hive/warehouse/student5;

（3）查詢數據

hive (default)> select * from student5;

1.5 Import數據到指定Hive表中

注意：先用export導出後，再將數據導入。

hive (default)> import table student2 partition(month='201709') from  '/user/hive/warehouse/export/student';

2 數據導出

2.1 Insert導出

（1）將查詢的結果導出到本地

hive (default)> insert overwrite local directory '/opt/module/datas/export/student'
            select * from student;

（2）將查詢的結果格式化導出到本地

hive(default)>insert overwrite local directory '/opt/module/datas/export/student1'

           ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'             select * from student;

（3）將查詢的結果導出到HDFS上(沒有local)

hive (default)> insert overwrite directory '/user/atguigu/student2'

             ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'

             select * from student;

2.2 Hadoop命令導出到本地

hive (default)> dfs -get /user/hive/warehouse/student/month=201709/000000_0

/opt/module/datas/export/student3.txt;

2.3 Hive Shell 命令導出

基本語法：（hive -f/-e 執行語句或者腳本 > file）

[atguigu@hadoop102 hive]$ bin/hive -e 'select * from default.student;' > /opt/module/datas/export/student4.txt;

2.4 Export導出到HDFS上

(defahiveult)> export table default.student to '/user/hive/warehouse/export/student';

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Hive中數據導入與導出

1 數據導入

1.1 向表中裝載數據（Load）

1.2 通過查詢語句向表中插入數據（Insert）

1.3 查詢語句中創建表並加載數據（As Select）

1.4 創建表時通過Location指定加載數據路徑

1.5 Import數據到指定Hive表中

2 數據導出

2.1 Insert導出

2.2 Hadoop命令導出到本地

2.3 Hive Shell 命令導出

2.4 Export導出到HDFS上

移位操作搞定兩數之商

如何基於surging跨網關跨語言進行緩存降級

2024合集

程序員天天 CURD，怎麼才能成長，職業發展的思考(2)

教你用Perl實現Smgp協議

如何通過前端表格控件在10分鐘內完成一張分組報表？

win11關閉自動檢測病毒刪文件

通用代碼生成器簡介

lightdb 單機模式下數據庫平移

千兆寬帶實際網速能到達多少？

Hive中數據導入與導出

Hive基本概念及運行原理

Hive DDL常見操作

Java集合List按日期升序或降序四種方法

Kafka的Rebalance機制可能造成的影響及解決方案

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結