Cloudera Data Analyst Training - Bangalore

Cloudera Data Analyst Training - Bangalore


  • Cloudera data analyst

    Sale Date Ended

    INR 56000
    Sold Out

Invite friends

Contact Us


For Enquiries

Amit (m) +91 8826194414 Anubhav (m) +91 9718065717

Page Views : 709

About The Event

Programme and Course Overview

This three-day hands-on data analyst training course, focusing on Apache Pig and Hive and Cloudera Implala,
will teach you to apply traditional data analytics and business intelligence skills to Big Data.
Xebia, official Cloudera trainings partner, presents the tools participants need to access,
manipulate and analyse complex data sets using SQL and familiar scripting languages.

Apache Hive makes multi-structured data accessible to analysts, database administrators and others without
Java programming expertise. Apache Pig applies the fundamental of familiar scripting languages to the Hadoop
cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.

Trainer's Profile 

Kris Geusebroek is a developer with a passion for combining technologies to create new possibilities for people around him. Coming from a Java and Geographical Information Systems background and being a fan op open source software, he started working with distributed systems and graph databases in the last couple of years. He’s currently working on visualizing Big Data with the help of Hadoop and Neo4j.

Kris has a broad interest in programming practices and languages. He likes to work and communicate closely with the customer to achieve the best results.

Before joining Xebia, Kris worked for two other IT companies and started his working career in logistics at the Dutch Railway. He's fluent in Dutch and English and speaks passable German. He lives in Wilnis, The Netherlands, with his wife Gita and their kids Lieke and Bas.

Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:

The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop tools
Joining multiple data sets and analyzing disparate data with Pig;
Organizing data into tables, performing transformations, and simplifying complex queries with Hive;
Performing real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala;
How to pick the best analysis tool for a given task in Hadoop,


This course is best suited to data analysts, business analysts, developers, and administrators who have experience
with SQL and basic UNIX or Linux commands. Prior knowledge of Apache Hadoop is not required.


  • Introduction
  • Hadoop Fundamentals
  • Introduction to Pig
  • Basic Data Analysis with Pig
  • Processing Complex Data with Pig
  • Multi-Dataset Operations with Pig
  • Extending Pig
  • Pig Troubleshooting and Optimization
  • Introduction to Hive
  • Data Analysis with Hive
  • Hive Data Management
  • Text Processing with Hive
  • Hive Optimization
  • Extending Hive
  • Introduction to Impala
  • Analyzing Data with Impala
  • Choosing the Best Tool for the Job
  • Conclusion


Venue Map