Cloudera Data Analyst Training - Pune (7-9 August 2014)
EARLY BIRD DISCOUNT CODE - CDA7
Programme and Course Overview
This three-day hands-on data analyst training course, focusing on Apache Pig, Apache Hive and Cloudera Impala,
will teach you to apply traditional data analytics and business intelligence skills to Big Data.
Xebia, an official Cloudera training partner, presents the tools participants need to access,
manipulate and analyse complex data sets using SQL and familiar scripting languages.
Apache Hive makes multi-structured data accessible to analysts, database administrators and others without
Java programming expertise. Apache Pig applies the fundamentals of familiar scripting languages to the Hadoop
cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.
Sunil has multiple years of experience developing Java and JEE-based applications. He has rich experience in the Enterprise Search and Enterprise Content Management domains.
He has worked for various clients in domains such as healthcare, publishing, insurance and government. Having worked with ECM and enterprise search products, he knows how customers store their data and what value they want to extract from structured and unstructured data, which led him to explore alternatives such as Hadoop and NoSQL databases for his clients' different use cases. Sunil has mastered technologies such as Hadoop, Pig, Hive, Sqoop and Oozie and has used them in various projects. He has also explored some of the newest technologies in the big data world, such as Apache Spark and Kafka.
Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:
The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop tools;
Joining multiple data sets and analyzing disparate data with Pig;
Organizing data into tables, performing transformations, and simplifying complex queries with Hive;
Performing real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala;
How to pick the best analysis tool for a given task in Hadoop.
This course is best suited to data analysts, business analysts, developers, and administrators who have experience
with SQL and basic UNIX or Linux commands. Prior knowledge of Apache Hadoop is not required.