Apache Hadoop is an open-source software framework for distributed storage and distributed processing of very large data sets on computer
clusters built
from commodity hardware.
Hadoop can basically be
divided into two major components
1.
HDFS(Hadoop Distributed File
System)
2.
MapReduce
Basic Factors about Hadoop
-
Hadoop only runs on Linux
powered Operating Systems such as Ubuntu, RedHat.
-
Hadoop is written and
developed using Java.
As Hadoop is written in Java
doesn't mean that working with Hadoop requires you to be a master in Java.
But basic building blocks of
Java are required to be clear before you start working on Hadoop.
So here is the list of the
topics you should be familiar if you need to start working in Hadoop.
Objects and Classes
A class is nothing but a
blueprint or a template for creating different objects which defines its
properties and behaviors. Java class objects exhibit the properties and
behaviors defined by its class. A class can contain fields and methods to
describe the behavior of an object.
Arrays
Java provides a data
structure, the array, which stores a fixed-size sequential collection of
elements of the same type. An array is used to store a collection of data, but
it is often more useful to think of an array as a collection of variables of
the same type.
Control Flow Statements
- In Java, the statements
inside any source file are executed in an ascending order, i.e from top to
bottom. Control flow statements are commands that allow breaks in the execution
pattern. You can actually customize and execute particular blocks of code in
your source file - using control flow statements.
Inheritance
Inheritance in java is a mechanism in which one object acquires all the
properties and behaviors of parent object.
The idea behind inheritance in java is that you can create new classes
that are built upon existing classes. When you inherit from an existing class,
you can reuse methods and fields of parent class, and you can add new methods
and fields also.
Inheritance represents the IS-A relationship, also known as parent-child relationship.
Interfaces
It is similar to class. It is a collection of abstract methods. A class
implements an interface, thereby inheriting the abstract methods of the
interface. Along with abstract methods, an interface may also contain
constants, default methods, static methods, and nested types.
Asterix Solution with its 60 hours specialized Big Data Hadoop Training in Mumbai
Provides good opportunities for getting you placed as a Hadoop
Developer.
No comments:
Post a Comment