IT & Programming
Fixed Price: Less than $500   |  Posted: May 28, 2015  |  Ends: 13d, 23h  |   2 Proposals
I need someone to write a blog post titled: "Real world Hadoop - Implementing a left outer join in Cascading"

This is the 5th blog post in a series:   [obscured]  /2013/01/05/a-quick-guide-to-hadoop-map-reduce-frameworks.html#updates

I need you to implement the same thing in Cascading that I have implemented in Hive, Pig, Scoobi, and Map/Reduce:

===========
Given two (fake) datasets:
  • A set of user demographic information containing [id, email, language, location]
  • A set of item purchases, containing fields [transaction-id, product-id, user-id, purchase-amount, product-description]
Calculate the number of locations in which a product is purchased.
============

For example, here is the solution in HiveQL -   [obscured]  /2013/02/20/real-world-hadoop---implementing-a-left-outer-join-in-hive.html

The results should be the same.

So this task involves:
1. Fork my example code repository -   [obscured]  /rathboma/hadoop-framework-example...
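
To make the expected result concrete, the calculation described in this listing (join purchases to users, then count the distinct locations per product) can be sketched in plain Python. This is not Cascading code and not the client's implementation. The sample rows, the function name `locations_per_product`, the choice of purchases as the left side of the join, and the decision not to count purchases whose user is missing are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical sample data; field order follows the listing.
users = [
    # (id, email, language, location)
    (1, "a@example.com", "EN", "US"),
    (2, "b@example.com", "FR", "FR"),
    (3, "c@example.com", "EN", "GB"),
]
purchases = [
    # (transaction-id, product-id, user-id, purchase-amount, product-description)
    (100, "p1", 1, 10.0, "widget"),
    (101, "p1", 2, 12.0, "widget"),
    (102, "p1", 1, 10.0, "widget"),
    (103, "p2", 3, 5.0, "gadget"),
    (104, "p2", 9, 5.0, "gadget"),  # user 9 has no demographic row
]


def locations_per_product(purchases, users):
    """Count distinct purchase locations per product.

    Assumption: purchases are the left side of the outer join, so a
    purchase with an unknown user is kept but contributes no location.
    """
    location_by_user = {user[0]: user[3] for user in users}
    locations = defaultdict(set)
    for _txn_id, product_id, user_id, _amount, _description in purchases:
        # Left outer join: look up the user, tolerating a missing match.
        location = location_by_user.get(user_id)
        bucket = locations[product_id]  # product is listed even with no match
        if location is not None:
            bucket.add(location)
    return {product: len(found) for product, found in locations.items()}


if __name__ == "__main__":
    for product, count in sorted(locations_per_product(purchases, users).items()):
        print(product, count)
```

With the sample rows above, product `p1` is bought from two locations and `p2` from one; the purchase by the unknown user 9 is retained by the join but adds no location.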
Category: Data Engineering       
Skills: Technical Writing, Hadoop, Apache Hive, Cascading       

Client: r****oma  |  United States