apache pig - Cassandra pig store -
i have cassandra table
create column family users comparator = utf8type , key_validation_class=utf8type , column_metadata = [ {column_name: full_name, validation_class: utf8type} {column_name: email, validation_class: utf8type} {column_name: state, validation_class: utf8type} {column_name: gender, validation_class: utf8type} {column_name: birth_year, validation_class: longtype} ];
and have pig script below
rows = load 'cassandra://hwh_tracking/users' using org.apache.cassandra.hadoop.pig.cassandrastorage(); emailgroup = group rows email; dump emailgroup;
now output
((email,r@gmail.com),{(rrr24,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr10,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr20,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr23,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr9,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr8,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr22,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr21,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{})}) ((email,zzzz@gmail.com),{(rrr0,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr6,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr7,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr3,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr1,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr5,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr4,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr2,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{})})
now want store output cassandra clue less how do.
any appreciated.
pigserver.registerquery("likes = filter rows vote_type.value eq 'like' , rating.value > 5;"); //pigserver.registerquery("dislikes_extras = filter rows vote_type.value eq 'dislike' , count(columns) > 0;"); pigserver.registerquery("store likes 'cassandra://pigtest/copyofsomeapp' using org.apache.cassandra.hadoop.pig.cassandrastorage();"); //pigserver.registerquery("store dislikes_extras 'cassandra://pigtest/copyofsomeapp' using org.apache.cassandra.hadoop.pig.cassandrastorage();");
Comments
Post a Comment