The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Why Take This Course?
- Understand how Hadoop fits into the world (recognize the problems it solves)
- Understand the concepts of HDFS and MapReduce (find out how it solves the problems)
- Write MapReduce programs (see how we solve the problems)
- Practice solving problems on your own
What is "Big Data"? The dimensions of Big Data. Scaling problems. HDFS and the Hadoop ecosystem.
The basics of HDFS, MapReduce, and the Hadoop cluster.
Writing MapReduce programs to answer questions about data.
MapReduce design patterns.
Answering questions about big sales data and analyzing large website logs.
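To give a feel for the kind of program the lessons above build toward, here is a minimal sketch of the MapReduce pattern in plain Python, run locally rather than on a Hadoop cluster. The record layout (tab-separated store, item, amount), the sample data, and the function names `mapper`, `shuffle`, `reducer`, and `run_job` are all assumptions made for illustration; they are not taken from the course materials or from any Hadoop API.

```python
from collections import defaultdict

# Hypothetical tab-separated sales records: store, item, amount.
RECORDS = [
    "Austin\tBooks\t12.50",
    "Boston\tToys\t7.25",
    "Austin\tMusic\t3.00",
    "Boston\tBooks\t10.00",
]


def mapper(line):
    """Map step: emit a (store, amount) pair for each well-formed record."""
    fields = line.strip().split("\t")
    if len(fields) == 3:  # silently skip malformed lines
        store, _item, amount = fields
        yield store, float(amount)


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle-and-sort."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reducer(key, values):
    """Reduce step: sum all amounts seen for one store."""
    return key, sum(values)


def run_job(lines):
    """Chain map, shuffle, and reduce over an iterable of input lines."""
    pairs = (pair for line in lines for pair in mapper(line))
    return dict(reducer(key, values) for key, values in shuffle(pairs))


if __name__ == "__main__":
    for store, total in run_job(RECORDS).items():
        print(f"{store}\t{total:.2f}")
```

On a real cluster the framework handles the grouping step and runs many mappers and reducers in parallel across machines; the local `shuffle` function here only imitates that behavior so the data flow is visible in one place.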
Ian Wrigley and Sarah Sproehnle