Cloudera Data Analyst Training - Bangalore( 7-9, Sept 2014)

Cloudera Data Analyst Training - Bangalore( 7-9, Sept 2014)


  • Cloudera Data Analyst Training-Bangalore( September 07-09 2014)

    Sale Date Ended

    INR 56000
    Sold Out

Invite friends

Contact Us


For Enquiries

Amit (m) +91 8826194414

Page Views : 459

About The Event


Programme and Course Overview

This three-day hands-on data analyst training course, focusing on Apache Pig and Hive and Cloudera Implala,
will teach you to apply traditional data analytics and business intelligence skills to Big Data.
Xebia, official Cloudera trainings partner, presents the tools participants need to access,
manipulate and analyse complex data sets using SQL and familiar scripting languages.

Apache Hive makes multi-structured data accessible to analysts, database administrators and others without
Java programming expertise. Apache Pig applies the fundamental of familiar scripting languages to the Hadoop
cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.

Trainer's Profile 

Sunil Yadav

Sunil has multiple  years of experience in developing Java & JEE based applications. He has rich experience  in Enterprise search and Enterprise Content Management domain.

He  has worked for various clients in different domains like Healthcare, Publication, Insurance and Government. Since he has worked with the ECM and ES products and know how customer stores there data and what value they want to extract from structured and unstructured data, he looked for the alternatives like Hadoop and NoSql Databases for the different use cases client have. Sunil had mastered technologies like Hadoop , Pig, Hive, Sqoop, Oozie and used them in various projects .He has explored some the newest technologies in big data world like Apache Spark and Kafka..


Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:

The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop tools
Joining multiple data sets and analyzing disparate data with Pig;
Organizing data into tables, performing transformations, and simplifying complex queries with Hive;
Performing real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala;
How to pick the best analysis tool for a given task in Hadoop,


This course is best suited to data analysts, business analysts, developers, and administrators who have experience
with SQL and basic UNIX or Linux commands. Prior knowledge of Apache Hadoop is not required.


  • Introduction
  • Hadoop Fundamentals
  • Introduction to Pig
  • Basic Data Analysis with Pig
  • Processing Complex Data with Pig
  • Multi-Dataset Operations with Pig
  • Extending Pig
  • Pig Troubleshooting and Optimization
  • Introduction to Hive
  • Data Analysis with Hive
  • Hive Data Management
  • Text Processing with Hive
  • Hive Optimization
  • Extending Hive
  • Introduction to Impala
  • Analyzing Data with Impala
  • Choosing the Best Tool for the Job
  • Conclusion


Venue Map