Data Analysis with Apache Pig Training Course and Workshop in Bangalore, Mysore, Chennai, Hyderabad, Pune, Mumbai, Delhi, Noida, Gurgaon, Kolkata

This Data Analysis with Apache Pig training course covers how to use Pig as part of an ETL process in a Hadoop cluster. The course begins with manipulating semi-structured raw data files in Pig, using the Grunt shell and the Pig Latin programming language. Once the raw data has been transformed into structured tables, the tables are exported from Pig and imported into Hive.

By attending the Data Analysis with Apache Pig workshop, delegates will learn to:

Define Apache Pig
Describe how Apache Pig fits in the data pipeline
Understand data types in Apache Pig
Load data into Pig relations
Examine data and debug scripts
Use FOREACH ... GENERATE on data
Store data for use with other applications
Subset data with DISTINCT, FILTER, and SAMPLE
Combine data with JOIN, UNION, and GROUP
Manipulate data with ORDER, FLATTEN, and UDFs

Prerequisites for this course:

Basic Hadoop knowledge
Basic to intermediate Linux skills, including familiarity with command-line commands such as ls, cd, cp, and su
Familiarity with a high-level programming or query language such as Python or SQL

The Data Analysis with Apache Pig class is ideal for:

Data Analysts, Data Scientists and Developers

Welcome to Class

Course introduction

Apache Pig in the Hadoop Ecosystem

Define Apache Pig
Describe how Apache Pig fits in the data pipeline
Understand data types in Apache Pig

Extract, Transform, and Load Data with Apache Pig

Load data into Pig relations
Examine data and debug scripts
Use FOREACH ... GENERATE on data
Store data for use with other applications
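
For delegates coming from a Python background, the load, transform, and store steps in this module can be pictured with a rough plain-Python analogy. This is only a minimal sketch, not Pig Latin: the sample data, the field names (name, age, city), and the helper functions are all hypothetical and chosen purely for illustration.

```python
import csv
import io

# Hypothetical raw input: tab-separated name, age, city (made up for illustration).
RAW = "alice\t34\tPune\nbob\t27\tDelhi\n"

def load(text):
    """Rough analogue of LOAD with a schema: parse rows into typed tuples."""
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    return [(name, int(age), city) for name, age, city in reader]

def foreach_generate(relation):
    """Rough analogue of FOREACH ... GENERATE: project and derive fields per tuple."""
    return [(name.upper(), age + 1) for name, age, _city in relation]

def store(relation):
    """Rough analogue of STORE: serialize tuples back to tab-separated text."""
    out = io.StringIO()
    csv.writer(out, delimiter="\t", lineterminator="\n").writerows(relation)
    return out.getvalue()

people = load(RAW)
projected = foreach_generate(people)
stored = store(projected)
```

In Pig itself these steps run as jobs on the Hadoop cluster over files in distributed storage; the sketch only mirrors the shape of the data flow on an in-memory list.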

Manipulate Data with Apache Pig

Subset data with DISTINCT, FILTER, and SAMPLE
Combine data with JOIN, UNION, and GROUP
Manipulate data with ORDER, FLATTEN, and UDFs
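
The operators listed in this module behave much like familiar collection operations. The sketch below is a plain-Python analogy, assuming two small hypothetical relations (orders and cities) invented for illustration; it approximates what each operator computes and is not Pig Latin syntax.

```python
from collections import defaultdict

# Hypothetical relations, made up for illustration.
orders = [("alice", "book"), ("bob", "pen"), ("alice", "book"), ("carol", "ink")]
cities = [("alice", "Pune"), ("bob", "Delhi")]

# DISTINCT: drop duplicate tuples (this version keeps first-seen order;
# Pig makes no ordering guarantee here).
distinct_orders = list(dict.fromkeys(orders))

# FILTER: keep only the tuples that satisfy a predicate.
not_pens = [t for t in distinct_orders if t[1] != "pen"]

# GROUP: collect tuples that share a key into one bag per key.
grouped = defaultdict(list)
for t in distinct_orders:
    grouped[t[0]].append(t)

# JOIN (inner): pair tuples from both relations whose keys match.
joined = [o + c for o in distinct_orders for c in cities if o[0] == c[0]]

# ORDER: sort tuples by a chosen field (here, the item name).
ordered = sorted(distinct_orders, key=lambda t: t[1])

# FLATTEN: un-nest the grouped bags back into flat tuples.
flattened = [t for bag in grouped.values() for t in bag]
```

A user-defined function (UDF) plays the role that the lambda plays in the ORDER step: custom logic supplied by the developer and applied to each tuple.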

Encarta Labs Advantage

One-stop corporate training solution provider offering over 6,000 courses on a variety of subjects
All courses are delivered by industry veterans
Go from newbie to production-ready in a matter of a few days

Trained more than 50,000 corporate executives across the globe
All our training is conducted in workshop mode, with a focus on hands-on sessions
