Hadoop2010: Integration Patterns & Practices

allowFullScreen='true' src='https://s.yimg.com/m/up/ypp/default/player.swf' flashvars='vid=21232264&autoPlay=0'>

iPod: Download high-resolution version

Hadoop is a powerful platform for data analysis and processing, but many struggle to understand how it fits in with regard to existing infrastructure and systems. A series of common integration points, technologies, and patterns are defined and illustrated in this presentation. Eric Sammer looks at job initiation, sequencing and scheduling, data input from various sources (e.g., DBMS, messaging systems), and data output to various sinks (DBMS, messaging systems, caching systems). You will see how integration patterns and best practices can be applied to Hadoop and its related projects. This talk is focused on the suitability and architecture of these integration patterns. Care is taken to not duplicate talks on specific tools that are likely to be covered by other talks.

Baycat logo
Media Production by BAYCAT, a non-profit community media producer that educates and employs underserved youth and adults in the digital media arts.