Big Data is one of the most sought after technology in the market in today’s day and age is Big Data.
Big data means big information, it collection large volume of data through traditional computing techniques Hadoop. Hadoop provides that ability to store the large scale data on HDFS process. Big Data is one of the driving employment contracting on the world. It is the most important structure for data to vast arrangements of information to implement, and it quickly transforming into most needed advancements in various Professionals like.
Core topics of BIG DATA HADOOP Online Course
Introduction to BigData and its Eco-system
– What is Big Data? – Hadoop – Big Data Overview – Hadoop – Big Data Solution – Hadoop – Introduction – Introduction to Hadoop Eco system
Introduction to HDFS and its Command-Line Interface
– Hadoop – HDFS Overview – Features of HDFS – HDFS Architecture – Hadoop – HDFS Operations
Environment Setup
Introduction to MapReduce (in Java) Core Java Basics
– Hadoop – MapReduce Overview – MapReduce Architecture – YARN Architecture InputFormat
InputSplits, RecordReader, Mapper, Reducer, Partitioner and Combiner
Basic MapReduce Programming ( word count execution in both local and cluster mode)
Advanced MapReduce Programming
– Use Custom Record reader – Use Custom Partitioner
Identity Mapper/Reducer, Distributed Cache, Arbitable Offset, Writable and Writable Comparable Interface
Data Ingestion in to Hadoop
– Apache Sqoop – Apache Flume
Introduction to Hive (Regular Hive + Hive on Tez)
– Hive Introduction – Hive Datatypes -Hive Managed Tables and External Tables – Hive DDL Operations – Hive DML operations – Hive Partitions and Bucketing – Hive Joins – Hive Built-in functions – Hive UDF’s – Use FileFormat like Avro, Sequence, RC and ORC Hive SERDE’s
Performance Tuning in Hive – Hive JDBC connectivity
Introduction to PIG
– PIG Introduction (About Pig , HIVE vs PIG) – PIG Datatypes – PIG operators – PIG UDF’s – Process dataset in PIG script using PiggyBank jars
Introductito Workflow Processing
– Apache Oozie – WordCount Program execution using Oozie workflow – Oozie workflow for sqoop , hive and pig
Introduction to Scala Programming Language and basics
Introduction to SPARK – Spark Introduction – Spark Architecture – Spark RDD’s
– Spark Components( Streaming and Spark SQL) -Programming in Spark + Spark Streaming
Introduction to Kafka
Introduction to NoSql Databases
– Hbase Introduction – Hbase Architecture – HBase Commands – Mongo Db