Post Profile

Hadoop - InputSplit and RecordReader explained

If you have been reading about hadoop and hadoop programming, you would have heard about InputSplits and RecordReaders.  In this blog I will try to explain these and how we can manage our reads across splits.Input file used for this blog is below. Its size is 47 bytes. Why it is important to know th
read more


Related Posts

A clinical psychiatrist explains: Does marijuana affect your sleep?

News : The Raw Story

If you speak to someone who has suffered from insomnia at all as an adult, chances are good that person has either tried using marijuana, or cannabis, for sleep or has thought about it. This is reflected in the many variations of ca...

Pinterest explains how it runs a souped-up version of Hadoop

Technology : Gigaom

The social scrapbook and visual discovery site, which processes about a petabyte of data daily, uses the Hadoop-as-a-service startup Qubole to handle Hadoop jobs and Puppet for its data processing configuration. Pinterest explains h...

Cloudera Now Certifies Companies For Apache Hadoop; Launches Partner Program

Technology : TechCrunch: Enterprise

Cloudera, the startup that commercially distributes and services Apache Hadoop based data management software and services, is announcing a comprehensive partner program today for the companies that use its services. Hadoop is a Jav...

InputSplit indexing on Mapreduce

Programming / Windows Development : CodeProject

Create custom indexes for improving Mapreduce performance

Read CSV input from a text file and add integer values in each string read

Programming / Windows Development : CodeProject

Read CSV input from a text file, and add integer values in each string read (line by line). Incorporate Unit Tests to test the program's functionality using C#.


Copyright © 2016 Regator, LLC