Cloudera Data Analyst Training - Gurgaon; 13-16 Feb 2014

 

  • Early bird price

    Sale Date Ended

    INR 55800
    Sold Out
  • Regular price

    Sale Date Ended

    INR 62000
    Sold Out

About The Event

 

Programme and Course Overview

This four-day, hands-on data analyst training course focuses on Apache Pig, Apache Hive and Cloudera Impala,
and teaches you to apply traditional data analytics and business intelligence skills to Big Data.
Xebia, an official Cloudera training partner, presents the tools participants need to access,
manipulate and analyse complex data sets using SQL and familiar scripting languages.

Apache Hive makes multi-structured data accessible to analysts, database administrators and others without
Java programming expertise. Apache Pig applies the fundamentals of familiar scripting languages to the Hadoop
cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.

Trainer's Profile 

Sunil Yadav

Sunil has multiple years of experience developing Java and JEE based applications. He has extensive experience in the Enterprise Search and Enterprise Content Management domains.

He has worked for various clients in domains such as healthcare, publishing, insurance and government. Having worked with ECM and ES products, he knows how customers store their data and what value they want to extract from structured and unstructured data, which led him to explore alternatives such as Hadoop and NoSQL databases for his clients' different use cases. Sunil has mastered technologies such as Hadoop, Pig, Hive, Sqoop and Oozie and has used them in various projects. He has also explored some of the newest technologies in the big data world, such as Apache Spark and Kafka.

 

Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:

  • The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop tools
  • Joining multiple data sets and analyzing disparate data with Pig
  • Organizing data into tables, performing transformations, and simplifying complex queries with Hive
  • Performing real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala
  • How to pick the best analysis tool for a given task in Hadoop


Prerequisites:

This course is best suited to data analysts, business analysts, developers, and administrators who have experience
with SQL and basic UNIX or Linux commands. Prior knowledge of Apache Hadoop is not required.

Outline:

  • Introduction
  • Hadoop Fundamentals
  • Introduction to Pig
  • Basic Data Analysis with Pig
  • Processing Complex Data with Pig
  • Multi-Dataset Operations with Pig
  • Extending Pig
  • Pig Troubleshooting and Optimization
  • Introduction to Hive
  • Data Analysis with Hive
  • Hive Data Management
  • Text Processing with Hive
  • Hive Optimization
  • Extending Hive
  • Introduction to Impala
  • Analyzing Data with Impala
  • Choosing the Best Tool for the Job
  • Conclusion

 

Contact - Xebia: +91 98712 37360