Fixed Price: Less than $500
| Posted: May 28, 2015 | Ends: 13d, 23h |
I need someone to write a blog post titled: "Real world Hadoop - Implementing a left outer join in Cascading".

This is the 5th blog post in a series: [obscured] /2013/01/05/a-quick-guide-to-hadoop-map-reduce-frameworks.html#updates

I need you to implement the same thing in Cascading that I have implemented in Hive, Pig, Scoobi, and Map/Reduce:

===========
Given two (fake) datasets:

A set of user demographic information, containing fields [id, email, language, location]
A set of item purchases, containing fields [transaction-id, product-id, user-id, purchase-amount, product-description]

Calculate the number of locations in which a product is purchased.
============

For example, here is the solution in HiveQL - [obscured] /2013/02/20/real-world-hadoop---implementing-a-left-outer-join-in-hive.html

The results should be the same.

So this task involves:

1. Fork my example code repository - [obscured] /rathboma/hadoop-framework-example...
Category: Data Engineering
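
To make the expected result concrete, here is a minimal sketch in plain Python (not Cascading) of the computation described above. It assumes the join runs from purchases to users on user-id, and that a purchase with no matching user contributes no location, as COUNT(DISTINCT ...) would behave in SQL. The sample rows and the function name `locations_per_product` are hypothetical and are not taken from the example repository.

```python
from collections import defaultdict

# Hypothetical sample rows, for illustration only.
# users: (id, email, language, location)
users = [
    (1, "a@example.com", "EN", "US"),
    (2, "b@example.com", "EN", "GB"),
    (3, "c@example.com", "FR", "FR"),
]
# purchases: (transaction-id, product-id, user-id, purchase-amount, product-description)
purchases = [
    (1, 1, 1, 300, "a jumper"),
    (2, 1, 2, 300, "a jumper"),
    (3, 1, 2, 300, "a jumper"),
    (4, 2, 3, 100, "a rubber chicken"),
    (5, 1, 3, 300, "a jumper"),
    (6, 3, 9, 50, "a mug"),  # user 9 has no demographic record
]


def locations_per_product(purchases, users):
    """Left outer join purchases to users, then count distinct locations per product."""
    location_by_user = {user[0]: user[3] for user in users}
    locations = defaultdict(set)
    for _txn_id, product_id, user_id, _amount, _description in purchases:
        # Left outer join: keep the purchase even when no user matches;
        # the location is then None (the equivalent of a SQL NULL).
        locations[product_id].add(location_by_user.get(user_id))
    # Assumption: NULL locations are not counted, so drop None before counting.
    return {product: len(locs - {None}) for product, locs in locations.items()}


result = locations_per_product(purchases, users)
# result == {1: 3, 2: 1, 3: 0}
```

With these sample rows, product 1 is bought in three locations, product 2 in one, and product 3 in none, because its only purchase has no matching user. A Cascading implementation would be expected to produce the same counts for the same input.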