File formats in Hadoop
1) TextInputFileFormat
2) KeyValueTextInputFileFormat
Key is always delimitter
Eg : key=praveen
value = 18
Eg: 2
Praveen,18,male,5000
Kumar,35,male,45000
Key=Praveen
Value=18,male,5000
3) NlineInputFileFormat
Scenario: I have 600 records = 2 blocks - 2 splits - 2 mappers
For every 200 records
200 records = 2 blocks - 3 splits - 3 mappers.
4) CombineFileInputFileFormat
200 files - each file 100kb - 200 splits - 200 mappers
No comments:
Post a Comment