Hadoop Bay Area January 2010 User Group – Recap

Hi Hadoopers

Thanks, everyone, for joining us last night at Yahoo!'s Sunnyvale campus. There were close to 150 attendees, a nice way to start the meetings for 2010. I was happy to see familiar faces and many new ones. It was also great to see the thriving conversations and solution sharing.


For those of you who were unable to attend in person, the session details, slides and video recordings are posted below.

Bhupesh Bansal, Senior Engineer at LinkedIn, shared the details behind Project-Voldemort (a distributed key-value storage system based on the Amazon Dynamo project), including its challenges, performance, features and more. Bhupesh also reviewed the growing use of Hadoop for batch computing at LinkedIn: data store, workflows, ETL, prototyping and more.
Bhupesh is a member of LinkedIn's search and data platform team and an active committer for Project-Voldemort.

Chris Douglas, from Yahoo!'s Hadoop development team, discussed the collection and sorting of map outputs. Chris reviewed the history of this area and the improvements made for Hadoop 0.22, and how they affect efficiency, cluster utilization and job performance tuning.

As always, we are looking for exciting technologies and experiences you want to share.

Please email presentation requests to dekel at yahoo hyphen inc dot com.

See you all on Feb 17th, 2010.


Dekel Tankel

Director, Product Management, Cloud Computing