By Francis Liu and Sumeet Singh
In 2009-2010, Yahoo! saw an unprecedented growth in the number of users coming onboard to its Apache Hadoop platform for their data processing and analytics needs. We attribute a majority of that success and increase in user base to the introduction of multi-tenancy, security, and partitioned namespaces in Hadoop.
Screen Shot 2013-06-07 at 1.23.42 PM
With Hadoop and its ecosystem components like Apache Pig and Apache Oozie getting popular at Yahoo!, we needed a solution to store mutable data and support random access to the stored data to complement the Apache Hadoop platform. Yahoo! had been using Apache HBase in isolated instances, most notably for the CORE personalization platform and for the web crawl cache at the time. However, the use of Apache HBase was limited to large projects that had the resources to operate dedicated HBase clusters.
In 2012, Yahoo! developed multi-tenancy in Apache HBase to cater to a growing number of use cases where HBase was an excellent fit as part of itsRead More »from Apache HBase at Yahoo! – Multi-tenancy at the Helm Again