Big Data Hadoop Administration

Course Duration: 2 Months

1. Big Data Overview

  1. What is big data
  2. Discussion over Databases
  3. Databases vs. Hadoop
  4. Problems with large-scale data
  1. Why Hadoop
  2. Apache Hadoop Architecture
  3. Apache Hadoop workflow
  4. Apache Hadoop Components
  1. Basic Prerequisites for Hadoop
  2. Apache Hadoop Standalone Installation
  3. Apache Hadoop Management
  1. Understanding HDFS Architecture
  2. Understanding HDFS Management and Core Components
  3. HDFS Snapshot and Management
  4. Understanding FSImage and edit logs management
  1. Understanding Apache Hadoop NameNode
  2. Understanding Apache Hadoop DataNode
  3. Understanding Apache Secondary NameNode
  4. Apache Hadoop Backup & Management
  1. Hadoop Multi-Node Cluster Setup
  2. Hadoop Cluster Management
  3. Including and Excluding DataNodes in a Cluster
  1. YARN Service Introduction & Architecture
  2. YARN ResourceManager & NodeManager
  3. YARN Scheduling and Management
  1. Managing Hadoop from the CLI
  2. Managing Hadoop from the Web UI
  3. Managing Failover & Nodes
  1. Hive Architecture and Hadoop
  2. Hive Installation and management
  3. Data manipulation with Hive
  1. Pig Architecture
  2. Pig Installation and management
  3. Pig Latin script for Data Manipulation
  1. Configuration and Management of Sqoop
  2. Application Management using Flume
  3. Configuration and Management of HBase
  1. Deploying the NameNode in HA using ZooKeeper
  2. NameNode in Active and Standby Roles
  1. Cloudera Manager
  2. Apache Storm
  3. Ambari