ETL with Hadoop and MapReduce
Big Data is hot! And the magic word is Hadoop. But what is it? And more important: what can I do with it? And how can we use it in our traditional BI solutions?
Apache Hadoop is a collection of tools mainly used nowadays in the Big Data space. With Hadoop (and MapReduce, Pig and Hive) we can use it as part of our ETL process to get insight from our (un)structured data sources.
In this session we will cover these basics, but also introduce some internals of Hadoop.
The current world of Business Intelligence is very dynamic. Whether it's dashboards, scorecards (traditional BI) or Big Data, Jan Pieter has a good overview of all the various possibilities. Jan Pieter is the (technical) lead of Microsoft BI and Big Data at Inter Access, a Dutch consultancy firm.
Jan Pieter is a MCITP, MCSA and speaks frequently about Big Data and Microsoft BI.