Course Description
What are the objectives of this course?
This Hadoop Admin training course will help you understand the basic and advanced concepts of Big Data and all of the technologies related to the Hadoop stack and components of the Hadoop Ecosystem.
What skills will you learn?
- Understand the fundamentals and characteristics of Big Data and various scalability options available to help organizations manage Big Data
- Master the concepts of the Hadoop framework, including architecture, the Hadoop distributed file system and deployment of Hadoop clusters using core or vendor specific distributions
- Use Cloudera manager for setup, deployment, maintenance and monitoring of Hadoop clusters
- Understand Hadoop Administration activities and computational frameworks for processing Big Data
- Work with Hadoop clients, nodes for clients and web interfaces like HUE to work with Hadoop Cluster
- Use cluster planning and tools for data ingestion into Hadoop clusters, and cluster monitoring activities
- Utilize Hadoop components within Hadoop ecosystem like Hive, HBase, Spark and Kafka
- Understand security implementation to secure data and clusters.
Who should take this course?
- Systems administrators and IT managers
- IT administrators and operators
- IT Systems Engineers
- Data Engineers and database administrators
- Data Analytics Administrators
- Cloud Systems Administrators
- Web Engineers
- Individuals who intend to design, deploy and maintain Hadoop clusters
What projects are included in this course?
Project 1
Scalability: Deploying Multiple Clusters
Your company wants to set up a new cluster and has procured new machines; however, setting up clusters on new machines will take time. Meanwhile, your company wants you to set up a new cluster on the same set of machines and start testing the new cluster’s working and applications.
Project 2
Working with Clusters
Demonstrate your understanding of the following tasks (give the steps):
- Enabling and disabling HA for namenode and resourcemanager in CDH
- Removing Hue service from your cluster, which has other services such as Hive, Hbase, HDFS, and YARN setup
- Adding a user and granting read access to your Cloudera cluster
- Changing replication and blocksize of your cluster
- Adding Hue as a service, logging in as user HUE, and downloading examples for Hive, Pig, job designer, and others
Project 3
Data Ingestion and Usage
Ingesting data from external structured databases into HDFS, working on data on HDFS by loading it into a data warehouse package like Hive, and using HiveQL for querying, analyzing, and loading data in another set of tables for further usage.
Your organization already has a large amount of data in an RDBMS and has now set up a Big Data practice. It is interested in moving data from the RDBMS into HDFS so that it can perform data analysis by using software packages such as Apache Hive. The organization would like to leverage the benefits of HDFS and features such as auto replication and fault tolerance that HDFS offers.
Project 4
Securing Data and Cluster
Protecting data stored in your Hadoop cluster by safeguarding it and backing it up.
Your organization would like to safeguard its data on multiple Hadoop clusters. The aim is to prevent data loss from accidental deletes and to make critical data available to users/applications even if one or more of these clusters is down.
Why learn Big Data Hadoop Administrator course?
The world is getting increasingly digital, and this means big data is here to stay. In fact, the importance of big data and data analytics is going to continue growing in the coming years. Choosing a career in the field of big data and analytics might just be the type of role that you have been trying to find to meet your career expectations.
Professionals who are working in this field can expect an impressive salary, with the median salary for data scientists being $116,000. Even those who are at the entry level will find high salaries, with average earnings of $92,000.
Price
Self-Paced Learning
- 180 days of access to high-quality, self-paced learning content designed by industry experts
Price
Online Classroom Flexi-Pass
90 days of access to 2+ instructor-led online training classes
- 180 days of access to high-quality, self-paced learning content designed by industry experts