This course will help learn how the output of Hadoop is sorted, by default the output is sorted on the basis of input keys. We can tweak the key in such a way that the output is sorted according to values and not input keys.
Next you would be understanding Joins in Hadoop. Joins are not straight forward in Hadoop, a little complex logic is implemented to perform a join in Hadoop on gives datasets/files. This session will cover Map/Reduce side joins to show implementation of joins in Hadoop.
Further we understand the combiners wherein it will give you an understanding of combiners in Hadoop which basically help in reducing network congestion. This combiner function summarize the map output records with the same key and output of combiner will be sent over network to actual reduce task as input.
- Understanding what is Big Data
- Learning and programming using Hadoop
- To understand combiners in Hadoop
- Anyone who wants to learn a Big Data
- Basic Computer Knowledge
- Basic understanding of Core Java programming is must
- Experience of coding