allowFullScreen='true' src='http://d.yimg.com/m/up/ypp/default/player.swf' flashvars='vid=21232264&autoPlay=0'>
Hadoop is a powerful platform for data analysis and processing, but many struggle to understand how it fits in with regard to existing infrastructure and systems. A series of common integration points, technologies, and patterns are defined and illustrated in this presentation. Eric Sammer looks at job initiation, sequencing and scheduling, data input from various sources (e.g., DBMS, messaging systems), and data output to various sinks (DBMS, messaging systems, caching systems). You will see how integration patterns and best practices can be applied to Hadoop and its related projects. This talk is focused on the suitability and architecture of these integration patterns. Care is taken to not duplicate talks on specific tools that are likely to be covered by other talks.