Call : (+91) 968636 4243
Mail : info@EncartaLabs.com
EncartaLabs

Data Analysis with Apache Pig

( Duration: 1 Day )

This Data Analysis with Apache Pig training course covers how to use Pig as part of an ETL process in a Hadoop cluster. The course begins with manipulating semi-structured raw data files in Pig, and using the grunt shell and the Pig Latin programming language. Once the raw data has been manipulated into structured tables, they are exported from Pig and imported into Hive.

By attending Data Analysis with Apache Pig workshop, delegates will learn to:

  • Define Apache Pig
  • Describe how Apache Pig fits in the data pipeline
  • Understand data types in Apache Pig
  • Load data into Pig relations
  • Examine data and debug scripts
  • Use FOREACH ... GENERATE on data
  • Store data for use with other applications
  • Subset data with DISTINCT, FILTER, and SAMPLE
  • Combine data with JOIN, UNION, and GROUP
  • Manipulate data with ORDER, FLATTEN, and UDFs

  • Basic Hadoop knowledge
  • Basic to intermediate Linux skills including familiarity with command line options such as ls, cd, cp, and su
  • Familiarity with a functional high-level programming language such as Python or SQL

The Data Analysis with Apache Pig class is ideal for:

  • Data Analysts, Data Scientists and Developers

COURSE AGENDA

1

Welcome to Class

  • Course introduction
2

Apache Pig in the Hadoop Ecosystem

  • Define Apache Pig
  • Describe how Apache Pig fits in the data pipeline
  • Understand data types in Apache Pig
3

Extract, Transform, and Load Data with Apache Pig

  • Load data into Pig relations
  • Examine data and debug scripts
  • Use FOREACH ... GENERATE on data
  • Store data for use with other applications
4

Manipulate Data with Apache Pig

  • Subset data with DISTINCT, FILTER, and SAMPLE
  • Combine data with JOIN, UNION, and GROUP
  • Manipulate data with ORDER, FLATTEN, and UDFs

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 6,000 various courses on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting https://www.encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top
Notice
X