hive的复合数据类型

时间:2022-12-20 13:04:08

hive中的复合数据类型

  • Array

array中的数据为相同类型,例如,假如array A中元素['a','b','c'],则A[1]的值为'b'

数据结构如下:
zhangsan beijing,shanghai,tianjin,hangzhou
wangwu shanghai,chengdu,wuhan,haerbin create table complex_array(name string,work_locations array<string>) row format delimited fields terminated by '\t' collection items terminated by ','; load data local inpath '/home/work/data/test1.txt' into table complex_array;
  • Map

可以通过["指定key名称"]访问

1,zhangsan,唱歌:非常喜欢-跳舞:喜欢-游泳:一般般
2,lisi,打游戏:非常喜欢-篮球:不喜欢 create table complex_map(id int,name string,hobby map<string,string>)
row format delimited
fields terminated by ','
collection items terminated by '-'
map keys terminated by ':'; load data local inpath '/root/hivedata/complex_map.txt' into table complex_map; 以普通字符串映射:
create table complex_map_s(id int,name string,hobby string)
row format delimited
fields terminated by ',';
  • Struct

集合元素可以类型不一样

例如c的类型为STRUCT{a INT; b INT},我们可以通过c.a来访问域a

1,zhou:30
2,yan:30
3,chen:20
4,li:80 create table complex_struct(id int, info struct<name:STRING, age:INT>)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
COLLECTION ITEMS TERMINATED BY ':'; load data local inpath '/home/work/data/test3.txt' into table complex_struct; select info.age from complex_struct;