for loop - How to partition a large Hive table with many categories
I want to partition a table in Hive so that every unique value in a column creates its own partition. There are ~250 partitions in a 4-billion-row table to loop over or run DISTINCT on. Here are my thoughts in code (which have not worked):
alter table mytable add if not exists
partition( mycolumn = distinct mycolumn);
Or is there some kind of loop in Hive? Would this require a UDF? A Hive answer is preferable if possible.
Thanks.
Just use dynamic partitions:
https://cwiki.apache.org/confluence/display/hive/tutorial#tutorial-dynamicpartitioninsert
It handles partition creation on the go, so no loop or UDF is needed.
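A minimal sketch of what a dynamic partition insert could look like for this case. The target table name `mytable_partitioned` and the columns `id` and `payload` are hypothetical placeholders, since the question does not show the real schema; only `mytable` and `mycolumn` come from the question. The limit values are assumptions picked to leave headroom for ~250 partitions and should be checked against your cluster's settings.

```sql
-- Let Hive create partitions from the values found in the data.
SET hive.exec.dynamic.partition = true;
-- nonstrict mode allows every partition column to be dynamic.
SET hive.exec.dynamic.partition.mode = nonstrict;
-- Assumed limits: raise them if ~250 partitions exceeds your current settings.
SET hive.exec.max.dynamic.partitions = 1000;
SET hive.exec.max.dynamic.partitions.pernode = 500;

-- Hypothetical target table: the non-partition columns of mytable,
-- with mycolumn moved into the PARTITIONED BY clause.
CREATE TABLE IF NOT EXISTS mytable_partitioned (
  id      BIGINT,
  payload STRING
)
PARTITIONED BY (mycolumn STRING);

-- The dynamic partition column must be the last column in the SELECT list.
-- Hive writes one partition per distinct value of mycolumn.
INSERT OVERWRITE TABLE mytable_partitioned PARTITION (mycolumn)
SELECT id, payload, mycolumn
FROM mytable;
```

Once the insert finishes, `SHOW PARTITIONS mytable_partitioned;` should list one entry per distinct value of `mycolumn`. If the job fails because too many files are open at once, adding `DISTRIBUTE BY mycolumn` to the SELECT is one commonly used option, at the cost of possible skew when some values are much more frequent than others.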