Post Profile

Hadoop - Read multi-ine record of variable number of lines

Say we have a input files as in below figure, where each numeric key is followed by any number of lines of values.  In this case as our record length is not fixed we cannot rely on NLineInputFormat as in our previous blog.  To solve this problem we need to identify our record boundary which in our c
read more


Related Posts

Notable PHP package: Random Access File - PHP Classes

Programming / Web Development : Planet PHP

Notable PHP package: Random Access File By Manuel Lemos Database management systems use special techniques to find records of data very quickly. One of those techniques is to make each record of data in table have a fixed length, ev...

Democratizing big data — is Hadoop our only hope?

Technology : Gigaom

Is Hadoop our only hope for solving big data challenges? From scalability to fault tolerance, Hadoop does myriad things very well. Yet, Hadoop is not the solution to all big data problems and use cases. Several key issues remain, in...

Read Registry using C#

Programming / Windows Development : CodeProject

You can try the function below to read values from the Registry:myRegistry.Read("MY_KEY");where: input: the key name (string)output: the value of the key (string)using System;using Microsoft.Win32; public string Read(string KeyName)...

Checksum Calculation in nodejs

Programming / Windows Development : CodeProject

The checksum calculation is a one way process of mapping a large data set of variable length (e.g. message, file), to a smaller data set of a fixed length (hash). The length depends on a hashing algorithm.

Read CSV input from a text file and add integer values in each string read

Programming / Windows Development : CodeProject

Read CSV input from a text file, and add integer values in each string read (line by line). Incorporate Unit Tests to test the program's functionality using C#.


Copyright © 2016 Regator, LLC