hadoop - Pig: Splitting a large file into multiple smaller files


I need to split the output part file generated by a Pig script into groups, each containing 1000 lines. These groups will be posted to a web service for further processing. There is no relation between the records, so I cannot group the data on a specific field.

How can I do this in Pig?

If the split is not related to the data, why use Pig or MapReduce for it at all? As an alternative, you could use the standard split program to split the data, if I haven't misunderstood the question. For example:

cat part-* | split -d -l 1000 - result- 
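To see what that command produces, here is a minimal sketch run against made-up data. It assumes the part files are already on the local filesystem and that split accepts -d (numeric suffixes, as in GNU coreutils). The scratch directory and the generated part files are hypothetical stand-ins for real Pig output.

```shell
# Scratch directory holding two hypothetical part files (2500 lines in total).
workdir=$(mktemp -d)
seq 1 1500 > "$workdir/part-00000"
seq 1501 2500 > "$workdir/part-00001"

# Concatenate the part files and cut the stream into 1000-line chunks.
# "-" tells split to read from stdin; "result-" is the output name prefix.
cat "$workdir"/part-* | split -d -l 1000 - "$workdir/result-"

# List the chunks with their line counts; the last chunk holds the remainder.
wc -l "$workdir"/result-*
```

The chunks are named result-00, result-01 and so on, so a later step can loop over result-* and send each file to the web service in turn.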
