ETL with Hadoop and MapReduce
Big Data is hot! And the magic word is Hadoop. But what is it? And more important: what can I do with it? And how can we use it in our traditional BI solutions?
Apache Hadoop is a collection of tools mainly used nowadays in the Big Data space. With Hadoop (and MapReduce, Pig and Hive) we can use it as part of our ETL process to get insight from our (un)structured data sources.
In this session we will cover these basics, but also introduce some internals of Hadoop.
The world of data is very dynamic; every week the world is producing ~1% more data than the week before. How to manage that amount of data and how to extract insights from it? Jan Pieter as a data consultant, a more breeder BI consultant, shifted his expertise to answers those questions. Jan Pieter has a great overview and hands-on experience of the possibilities in the modern world of data, e.g. Big Data solutions, BI Solutions, Database / Modern Datawarehouse Platforms and Self Service BI solutions. In addition to this knowledge, Jan Pieter has also experience with project management, pre-sales activities and customer advisory programs.
Jan Pieter is a MCITP, MCSA and speaks frequently about different subject related to Big Data and Microsoft BI.