Cloudera - Certified Administrator for Apache Hadoop (CCAH)

Duration

Duration:

Only 3 Days

Method

Method:

Classroom / Online / Hybrid

Next date

Next date:

24/6/2024 (Monday)

Overview

Get the skills you need to operate and manage Hadoop clusters. The Cloudera Certified Administrator for Apache Hadoop (CCAH) certification proves that you have this knowledge.

From installation and configuration, to load-balancing and tuning your cluster; this accelerated course covers everything 33% faster than traditional training.

What will you learn?

You'll get the knowledge you need to pass the CCAH exam. Through lectures and hands-on exercises, you'll cover the following topics:

  • The internals of YARN, MapReduce, and Hadoop Distributed File System (HDFS)
  • Determining the correct hardware and infrastructure for your cluster
  • Proper cluster configuration and deployment to integrate with the data centre
  • How to load data into the cluster from dynamically-generated files using Flume and from a Relational Database Management System (RDMS) using Sqoop
  • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster
  • Best practices for preparing and maintaining Apache Hadoop in production
  • Troubleshooting, diagnosing, tuning, and solving Hadoop issues

Note: this course will cover content and practical tests to cover preparation to the exam. Firebrand cannot deliver the exam at our centre. Students will be provided with an exam voucher to take the exam.

Benefits

Other accelerated training providers rely heavily on lecture and independent self-testing and study.

Effective technical instruction must be highly varied and interactive to keep attention levels high, promote camaraderie and teamwork between the students and instructor, and solidify knowledge through hands-on learning.

Firebrand Training provides instruction to meet every learning need:

  • Intensive group instruction
  • One-on-one instruction attention
  • Hands-on labs
  • Lab partner and group exercises
  • Question and answer drills
  • Independent study

Curriculum

The Case for Apache Hadoop

  • Why Hadoop?
  • Core Hadoop components
  • Fundamental concepts HDFS

HDFS Features

  • Writing and reading Files
  • NameNode memory considerations
  • Overview of HDFS cecurity
  • Using the Namenode Web UI
  • Using the Hadoop File Shell

Getting Data into HDFS

  • Ingesting Data from external sources with Flume
  • Ingesting Data from relational databases with Sqoop
  • REST Interfaces
  • Best practices for importing data

YARN and MapReduce

  • What Is MapReduce?
  • Basic MapReduce concepts
  • YARN cluster architecture
  • Resource allocation
  • Failure recovery
  • Using the YARN Web UI
  • MapReduce Version 1

Planning Your Hadoop Cluster

  • General planning considerations
  • Choosing the right hardware
  • Network considerations
  • Configuring nodes
  • Planning for cluster management

Hadoop Installation and Initial Configuration

  • Deployment Types
  • Installing Hadoop
  • Specifying the Hadoop configuration
  • Performing Initial HDFS configuration
  • Performing Initial YARN and MapReduce configuration
  • Hadoop Logging

Installing and Configuring Hive, Impala, and Pig

  • Hive
  • Impala
  • Pig

Hadoop Clients

  • What is a Hadoop Client?
  • Installing and configuring Hadoop Clients
  • Installing and configuring Hue
  • Hue authentication and authorization

Cloudera Manager

  • The cotivation for Cloudera Manager
  • Cloudera Manager features
  • Express and Enterprise versions
  • Cloudera Manager Topology
  • Installing Cloudera Manager
  • Installing Hadoop using Cloudera Manager
  • Performing basic administration tasks using Cloudera Manager

Advanced Cluster Configuration

  • Advanced configuration parameters
  • Configuring Hadoop Ports
  • Explicitly including and excluding hosts
  • Configuring HDFS for rack awareness
  • Configuring HDFS high availability

Hadoop Security

  • Why Hadoop security is important
  • Hadoop’s security system concepts
  • What Kerberos is and how it works
  • Securing a Hadoop Cluster with Kerberos

Managing and Scheduling Jobs

  • Managing running jobs
  • Scheduling hadoop jobs
  • Configuring the FairScheduler
  • Impala query scheduling

Cluster Maintenance

  • Checking HDFS Status
  • Copying data between clusters
  • Adding and removing cluster nodes
  • Rebalancing the cluster
  • Cluster upgrading

Cluster Monitoring and Troubleshooting

  • General system monitoring
  • Monitoring Hadoop clusters
  • Common troubleshooting Hadoop clusters
  • Common misconfigurations

Exam Track

As part of this accelerated course, you'll receive the following exam voucher:

  • Cloudera Certified Administrator for Apache Hadoop CCAH CDH 5 (CCA-500)

The exam consists of 60 questions and must be completed within 90 minutes. You must have a passing score of at least 70% to get your certification.

Note: this course will cover content and practical tests to cover preparation to the exam. Firebrand cannot deliver the exam at our centre. Delegates will be provided with an exam voucher to take the exam.

What's included

Included:

  • Official Cloudera courseware

Your accelerated course includes:

  • Accommodation *
  • Meals, unlimited snacks, beverages, tea and coffee *
  • On-site exams **
  • Exam vouchers **
  • Practice tests **
  • Certification Guarantee ***
  • Courseware
  • Up-to 12 hours of instructor-led training each day
  • 24-hour lab access
  • Digital courseware **
  • * For residential training only. Accommodation is included from the night before the course starts. This doesn't apply for online courses.
  • ** Some exceptions apply. Please refer to the Exam Track or speak with our experts
  • *** Pass first time or train again free as many times as it takes, unlimited for 1 year. Just pay for accommodation, exams, and incidental costs.

Prerequisites

This course is best suited to Systems Administrators and IT managers who have basic Linux experience. Prior knowledge of Apache Hadoop is not required.

Unsure whether you meet the prerequisites? Don’t worry. Your training consultant will discuss your background with you to understand if this course is right for you.

Reviews

Here's the Firebrand Training review section. Since 2001 we've trained exactly 134561 students and asked them all to review our Accelerated Learning. Currently, 96.41% have said Firebrand exceeded their expectations.

Read reviews from recent accelerated courses below or visit Firebrand Stories for written and video interviews from our alumni.


"Training was very good, explanation was very clear and teacher detailed a lot, so for a 3 day course and to have a first understanding of POWER BI is good."
Rosanna Seerattan Cruz, JTI. (19/3/2024 (Tuesday) to 21/3/2024 (Thursday))

"The instructor and the structure of the course were very clear."
EK, JTI. (19/3/2024 (Tuesday) to 21/3/2024 (Thursday))

"CEH is a very hard training, but it's doable thanks to the friendly employees at Firebrand and the accommodations."
Kas Ramjiawan, ITQM. (4/3/2024 (Monday) to 8/3/2024 (Friday))

"Heavy stuff! Long days and almost no time for some leisure or preparing for exam... I thought there was more hands-on training involved."
MR. (4/3/2024 (Monday) to 8/3/2024 (Friday))

"The course was well structured and concise with a knowledgeable and personable instructor. I will recommend Firebrand courses to all colleagues"
LT. (6/3/2024 (Wednesday) to 8/3/2024 (Friday))

Course Dates

Start

Finish

Status

Location

Book now

19/2/2024 (Monday)

21/2/2024 (Wednesday)

Finished - Leave feedback

-

 

24/6/2024 (Monday)

26/6/2024 (Wednesday)

Wait list

Nationwide

 

5/8/2024 (Monday)

7/8/2024 (Wednesday)

Limited availability

Nationwide

 

16/9/2024 (Monday)

18/9/2024 (Wednesday)

Open

Nationwide

 

28/10/2024 (Monday)

30/10/2024 (Wednesday)

Open

Nationwide

 

9/12/2024 (Monday)

11/12/2024 (Wednesday)

Open

Nationwide

 

Latest Reviews from our students