The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Why Take This Course?
How Hadoop fits into the world (recognize the problems it solves)
Understand the concepts of HDFS and MapReduce (find out how it solves the problems)
Write MapReduce programs (see how we solve the problems)
Practice solving problems on your own
What is "Big Data"? The dimensions of Big Data. Scaling problems. HDFS and the Hadoop ecosystem.
The basics of HDFS, MapReduce and Hadoop cluster.
Writing MapReduce programs to answer questions about data.
MapReduce design patterns.
Answering questions about big sales data and analyzing large website logs.
Can anyone tell what the benefit end users usually forum moderators will get from the extracted information in the final project?? i.e, Benefits of
1-List of students per thread
2-Finding Top Ten Tags.
3- Student Timings i.e. at which they are most active
4-Co-relation between question and answer.
Jason Michael Cherry completed this course, spending 2 hours a week on it and found the course difficulty to be easy.
This is a solid introduction to Hadoop and MapReduce concepts. The assignments are a good exercise in getting familiar with the basics. There's a lot that this course doesn't cover, but it's enough to get your feet wet with Hadoop and MapReduce concepts.