hadoop - Pig: Splitting a large file into multiple smaller files

June 15, 2013

I need to split the output part file generated by a Pig script into groups, each containing 1000 lines. These groups will then be posted to a web service for further processing. There is no relation between the records, so I cannot group the data on a specific field.

How can I do this in Pig?

If the split is not related to the data, why use Pig or MapReduce at all? Unless I have misunderstood the question, an alternative is to use the standard split program to split the data. For example:

cat part-* | split -d -l 1000 - result-
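To see what that one-liner produces, here is a minimal sketch that can be run locally. The sample data, the temporary directory, and the file sizes are made up for illustration; only the `cat ... | split` line comes from the answer. It assumes GNU coreutils, since `-d` (numeric suffixes) is a GNU `split` option.

```shell
# Hypothetical demo data: two small files standing in for Pig's part files.
workdir=$(mktemp -d)
cd "$workdir"
seq 1 1500 > part-00000      # 1500 lines
seq 1501 2500 > part-00001   # 1000 lines, 2500 in total

# Concatenate the parts and cut the stream into chunks of 1000 lines.
#   -d        use numeric suffixes (result-00, result-01, ...)
#   -l 1000   put at most 1000 lines in each output file
#   -         read from standard input
#   result-   prefix for the output file names
cat part-* | split -d -l 1000 - result-

# Show the line count of every chunk: two full chunks and a 500-line remainder.
wc -l result-*
```

Because the chunks are cut from the concatenated stream, a chunk can span two part files, which is fine here since the records are unrelated. If the part files are still on HDFS, `hadoop fs -cat` can feed the same pipe in place of `cat`.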
