Geeks With Blogs
Rahul Anand's Blog
If my mind can conceive it, and my heart can believe it, I know I can achieve it.
October 2015 Entries
Writing your first MapReduce program
Before we delve into the IDE and start writing code, let's understand a bit more about MapReduce. A MapReduce computation takes a set of input key/value pairs and produces a set of output key/value pairs. The user of the MapReduce library expresses the computation as two functions: Map and Reduce. Map takes an input pair and produces a set of intermediate key/value pairs. The MapReduce library groups together all intermediate values associated with the same intermediate key (k2) and passes them ......
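As a rough illustration of the model described in this excerpt, here is a minimal in-memory sketch in Python. It is not Hadoop code and does not use any Hadoop API: the names `map_fn`, `reduce_fn`, and `run_mapreduce` are hypothetical, and word counting is simply an assumed example problem. It only shows the three stages: Map emits intermediate pairs, the library groups values by intermediate key, and Reduce folds each group into output pairs.

```python
from collections import defaultdict


def map_fn(key, value):
    """Map: take one input pair (document name, text) and emit (word, 1) pairs."""
    return [(word.lower(), 1) for word in value.split()]


def reduce_fn(key, values):
    """Reduce: take one intermediate key and all of its values, emit (word, total)."""
    return [(key, sum(values))]


def run_mapreduce(inputs, map_fn, reduce_fn):
    """Toy single-process driver standing in for the MapReduce library."""
    # Group every intermediate value under its intermediate key (k2).
    groups = defaultdict(list)
    for k1, v1 in inputs:
        for k2, v2 in map_fn(k1, v1):
            groups[k2].append(v2)

    # Call Reduce once per intermediate key, in sorted key order.
    output = []
    for k2 in sorted(groups):
        output.extend(reduce_fn(k2, groups[k2]))
    return output


if __name__ == "__main__":
    docs = [("doc1", "the quick brown fox"), ("doc2", "the lazy dog")]
    for word, count in run_mapreduce(docs, map_fn, reduce_fn):
        print(word, count)
```

In a real cluster the grouping step is a distributed shuffle and sort across nodes; the dictionary here only stands in for that behaviour on a single machine.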

Posted On Monday, October 19, 2015 9:47 PM

Hadoop installation for Development
The best way to learn about Hadoop is to get your hands dirty writing and running real Hadoop programs. To do that, we first need a Hadoop installation on a local development box. Steps to install Hadoop:
1. Download and install Oracle VirtualBox
2. Download and install the Hortonworks Sandbox virtual appliance for VirtualBox
*Tip: If you get any error running Oracle VirtualBox, please ......

Posted On Monday, October 19, 2015 9:46 PM

Introduction to Hadoop
Apache Hadoop is a framework that supports distributed processing and distributed storage of very large data sets on clusters of commodity computers. Distributed processing is achieved through MapReduce, and distributed storage is achieved through HDFS (Hadoop Distributed File System). The Hadoop framework was built specifically for clusters of computers, so it is aware of the nodes and their network configuration, and it handles node, storage, and network failures. YARN (NextGen MapReduce) further ......

Posted On Monday, October 19, 2015 9:45 PM

Debugging - Remote JVM
Most Java IDEs, such as Eclipse or IntelliJ, let you set up a debug configuration that attaches to a remote JVM so you can debug library code. Let's walk through this configuration and understand how this debugging works. Coming from an MS Visual Studio background, I always missed the easy way of attaching to your code by choosing Debug -> "Attach to process" and selecting the target process that is actually executing the library. In Java this needs to be done in two steps. First ......

Posted On Monday, October 19, 2015 7:25 PM

Copyright © Rahul Anand