The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
What is Big Data? The problems big data creates. How Apache Hadoop addresses these problems.
HDFS and MapReduce
Discover how HDFS distributes data over multiple computers. Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
Write your own MapReduce code.
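To give a feel for what "your own MapReduce code" can look like, here is a minimal sketch of the map, shuffle, and reduce steps as a word count that runs locally in plain Python. It is not the course's own code and it does not use Hadoop; the function names (`mapper`, `reducer`, `run_job`) and the sample sentences are assumptions made purely for illustration.

```python
from itertools import groupby
from operator import itemgetter


def mapper(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    for word in line.lower().split():
        yield word, 1


def reducer(key, values):
    """Sum all the counts that were emitted for a single key."""
    return key, sum(values)


def run_job(lines):
    """Simulate one MapReduce job on a single machine (illustrative only)."""
    # Map phase: each line is processed independently of the others,
    # which is what allows a real cluster to spread the work over many machines.
    pairs = [pair for line in lines for pair in mapper(line)]
    # Shuffle and sort phase: bring all pairs with the same key together.
    pairs.sort(key=itemgetter(0))
    # Reduce phase: one reducer call per distinct key.
    return dict(
        reducer(key, (count for _, count in group))
        for key, group in groupby(pairs, key=itemgetter(0))
    )


if __name__ == "__main__":
    # Made-up sample input, not course data.
    sample = ["big data needs big tools", "hadoop stores big data"]
    print(run_job(sample))
```

On a real cluster the framework performs the shuffle and sort between the two phases; the programmer typically supplies only the mapper and reducer logic.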
MapReduce Design Patterns
Use common patterns for MapReduce programs to analyze Udacity forum data.
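As one example of such a pattern, the sketch below shows a "top N" filter in plain Python: each mapper keeps only the N largest records in its own input split, and a single reducer merges those short candidate lists. It assumes the records are already aggregated `(tag, count)` pairs with each tag appearing in exactly one split; the function names and the sample counts are hypothetical and do not come from the course materials or the real forum data.

```python
import heapq


def top_n_mapper(records, n):
    """Return the n highest-count (tag, count) records in one input split."""
    return heapq.nlargest(n, records, key=lambda rec: rec[1])


def top_n_reducer(candidate_lists, n):
    """Merge the local candidates from all mappers and keep the overall top n."""
    merged = [rec for candidates in candidate_lists for rec in candidates]
    return heapq.nlargest(n, merged, key=lambda rec: rec[1])


if __name__ == "__main__":
    # Hypothetical, made-up splits of already-aggregated tag counts.
    splits = [
        [("hadoop", 40), ("python", 25), ("hdfs", 7)],
        [("mapreduce", 31), ("streaming", 12), ("combiner", 3)],
    ]
    local_candidates = [top_n_mapper(split, 2) for split in splits]
    print(top_n_reducer(local_candidates, 2))
```

The design choice here is that each mapper forwards at most N records, so the single reducer only has to look at N records per split rather than the whole dataset.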
Can anyone tell me what benefit end users (usually forum moderators) will get from the information extracted in the final project? That is, what are the benefits of:
1- List of students per thread
2- Top ten tags
3- Student timings, i.e. the hours at which they are most active
4- Correlation between question and answer
Jason Michael Cherry completed this course, spending 2 hours a week on it, and found the course difficulty to be easy.
This is a solid introduction to Hadoop and MapReduce concepts. The assignments are a good exercise in getting familiar with the basics. There's a lot that this course doesn't cover, but it's enough to get your feet wet with Hadoop and MapReduce concepts.