Tuesday, March 20, 2018

Map Reduce Programming and Exercises

Very good references links :

Hadoop Definitive Guide :

http://javaarm.com/file/apache/Hadoop/books/Hadoop-The.Definitive.Guide_4.edition_a_Tom.White_April-2015.pdf



Good examples on  word count :




Hadoop operations :

https://data-flair.training/blogs/hadoop-hdfs-data-read-and-write-operations/


Orielly presentation

assets.en.oreilly.com/1/event/75/Introduction%20to%20Apache%20Hadoop%20Presentation.pdf



Topics:
·         About MapReduce and  Understanding block and input splits
·          MapReduce Data types
·          Understanding Writable
·          Data Flow in MapReduce Application
·          Understanding MapReduce problem on datasets
·          MapReduce and Functional Programming
·          Writing MapReduce Application
·          Understanding Mapper function
·          Understanding Reducer Function
·          Understanding Driver
·          Usage of Combiner
·          Understanding Partitioner
·          Usage of Distributed Cache
·          Passing the parameters to mapper and reducer
·          Analysing the Results
·          Log files
·          Input Formats and Output Formats
·          Counters, Skipping Bad and unwanted Records
·          Writing Join’s in MapReduce with 2 Input files. Join Types.
·          Execute MapReduce Job – Insights.
·          Exercise’s on MapReduce.
·          Job Scheduling: Type of Schedulers.















From 1  to  5  one split

       From  6  to  9 another split.
        








No comments:

Post a Comment

Hyderabad Trip - Best Places to visit

 Best Places to Visit  in Hyderabad 1.        1. Golconda Fort Maps Link :   https://www.google.com/maps/dir/Aparna+Serene+Park,+Masj...