Sale Date Ended
Programme and Course Overview
This three-day hands-on data analyst training course, focusing on Apache Pig and Hive and Cloudera Implala,
will teach you to apply traditional data analytics and business intelligence skills to Big Data.
Xebia, official Cloudera trainings partner, presents the tools participants need to access,
manipulate and analyse complex data sets using SQL and familiar scripting languages.
Apache Hive makes multi-structured data accessible to analysts, database administrators and others without
Java programming expertise. Apache Pig applies the fundamental of familiar scripting languages to the Hadoop
cluster. Cloudera Impala enables real-time interactive analysis of the data stored in Hadoop via a native SQL environment.
Kris Geusebroek is a developer with a passion for combining technologies to create new possibilities for people around him. Coming from a Java and Geographical Information Systems background and being a fan op open source software, he started working with distributed systems and graph databases in the last couple of years. He’s currently working on visualizing Big Data with the help of Hadoop and Neo4j.
Kris has a broad interest in programming practices and languages. He likes to work and communicate closely with the customer to achieve the best results.
Before joining Xebia, Kris worked for two other IT companies and started his working career in logistics at the Dutch Railway. He's fluent in Dutch and English and speaks passable German. He lives in Wilnis, The Netherlands, with his wife Gita and their kids Lieke and Bas.
Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:
The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop tools
Joining multiple data sets and analyzing disparate data with Pig;
Organizing data into tables, performing transformations, and simplifying complex queries with Hive;
Performing real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala;
How to pick the best analysis tool for a given task in Hadoop,
This course is best suited to data analysts, business analysts, developers, and administrators who have experience
with SQL and basic UNIX or Linux commands. Prior knowledge of Apache Hadoop is not required.