Developer Network Home - Help

Hadoop and Distributed Computing at Yahoo!: April 2008 Archives

« March 2008 | Main

Grid Computing Archive

April 28, 2008

Hadoop 0.17 Preview

Apache Hadoop 0.17 is due for release any day now. Feature freeze for the release was on April 4th. The Hadoop dev community is currently actively fixing blocking issues discovered by users that have tried it out. This is a release we’re very excited about as it introduces many long awaited performance fixes to the platform. We’ve observed on the order of 30%(!) improvement in the runtime of some of the Hadoop benchmarks. As always, user feedback is invaluable and we urge folks to kick the tires on the release and help close it out. Here is a quick rundown of the important changes in the release.

HDFS

 

Map/Reduce

 

Sameer Paranjpye
Yahoo! Grid Computing Team

Posted by jzawodn at 2:13 PM | Comments (1) | TrackBack

April 25, 2008

VIM Color Syntax Highlighting for Pig

I joined the Yahoo! Research Engineering group a few weeks ago, and I was literally blown away with the possibilities that Hadoop and Pig open for me. Immediately, I wanted to hack up something good to say thank you to all smart people that build and support such a great software.

I am convinced that Pig deserves more respect from the major text editors, so I wrote a small vim script that adds syntax highlighting for Pig files.

pig in vim

You can download it from vm.org site.

To install, follow instructions on the web page, and don't forget to vote! :-)

Emacs version is coming up soon (yes, I use both vim *and* emacs). It will be my project for the upcoming Yahoo! Hack Day.

Sergiy Matusevych
Yahoo! Research Engineer

Posted by jzawodn at 10:15 AM | Comments (0) | TrackBack

April 18, 2008

Hadoop Summit Slides and Video Available

It's been a few weeks since the Hadoop Summit in Santa Clara, and we hope everyone had a good time and learned a lot. Feedback has been quite good so far, but don't be shy about sending us comments.

The Yahoo! Research team has assembled a single page containing links to all the presentation slides and video from both the Hadoop Summit and the Data Intensive Computing Symposium.

As a sample, here's the opening presentation that Doug and Eric gave:

Update: Videos are currently unavailable outside of Yahoo! We're working on the problem...

Posted by jzawodn at 1:46 PM | Comments (5) | TrackBack

Copyright © 2008 Yahoo! Inc. All rights reserved.

Privacy Policy - Terms of Service - Copyright Policy - Job Openings

d