Hadoop2010: Data Apps & Infrastructure at LinkedIn

allowFullScreen='true' src='https://s.yimg.com/m/up/ypp/default/player.swf' flashvars='vid=21232270&autoPlay=0'>

iPod: Download high-resolution version

LinkedIn runs a number of large-scale Hadoop calculations to power its features — from computing similar profiles, jobs, and companies, to predicting People You May Know recommendations to help users find their professional connections. This talk covers how Hadoop fits into a production data cycle for a consumer-scale social network, including some of the technology, infrastructure, and algorithms for calculating tens of billions of predictions in a social graph.

Media Production by BAYCAT, a non-profit community media producer that educates and employs underserved youth and adults in the digital media arts.