Big Data Hadoop Administration:-
Course Duration: 2 Months
- What is big data
- Discussion over Databases
- Databases v/s Hadoop
- Problems with Large scale data
- Why Hadoop
- Apache Hadoop Architecture
- Apache Hadoop workflow
- Apache hadoop Component
- Basic Prerequisites for Hadoop
- Apache Hadoop Standalone Installation
- Apache Hadoop Management
- Understanding HDFS Architecture
- Understanding HDFS Management and Core Component
- HDFS Snapshot and Management
- Understanding FSImage and edit logs management
- Understanding Apache Hadoop Namenode
- Understanding Apache Hadoop Datanode
- Understanding Apache Secondary NameNode
- Apache Hadoop Backup & Management
- Hadoop multi Node Cluster setup
- Hadoop Cluster Management
- Include and Exclude Data node in cluster
- Yarn service Introduction & Architecture
- Yarn Resource manager & Node Manager
- Yarn Scheduling and Management
- Managing Hadoop Cli
- Managing Hadoop Web
- Managing Failover & Nodes
- Hive Architecture and Hadoop
- Hive Installation and management
- Data manipulation with Hive
- Pig Architecture
- Pig Installation and management
- Pig Latin script for Data Manipulation
- Configure and Management of Sqoop
- Application Management using Flume
- Configure Management of HBASE
- Deployment Namenode on HA using Zookeeper
- Namenode as a standby and active
- Cloudera manager
- Apache Strom
- Ambari