Hadoop2010: Hive integration – HBase & RCFile

allowFullScreen='true' src='https://s.yimg.com/m/up/ypp/default/player.swf' flashvars='vid=20986764&autoPlay=0'>

iPod: Download high-resolution version

John Sichi and Yongqiang He of Facebook discuss Facebook's recent integration of two related projects in the Hadoop ecosystem: HBase and Hive. This integration gives powerful SQL query capabilities to HBase, and brings the potential for low-latency incremental data refresh to Hive. The talk will go over performance results from initial testing of the integration. Yongqiang will discuss RCFile, which is a columnar storage for Hive. It is already deployed within Facebook, which is in the process of converting old partitions to RCFile. Depending on the data layout, it has resulted in ~20% space savings.

Baycat logo Media Production by BAYCAT, a non-profit community media producer that educates and employs underserved youth and adults in the digital media arts.