This Data Analysis with Apache Hive training course covers how to use Apache Hive to query structured data without writing MapReduce code. You learn how Apache Hive fits into the Hadoop ecosystem, how to create and load tables in Hive, and how to query data using the Hive Query Language (HiveQL). The course is best suited to data analysts and developers interested in the data pipeline, and to those familiar with SQL who want to work with data stored in HDFS.
By attending the Data Analysis with Apache Hive workshop, delegates will learn to:
- Define Apache Hive
- Explain Apache Hive use cases
- Describe how Apache Hive fits in the data pipeline
- Understand data types in Apache Hive
- Create databases and tables
- Partition and bucket data
- Load tables with data
- Alter and drop tables
- Query tables
- Manipulate tables
- Combine and store tables
Delegates should meet the following prerequisites:
- Basic Hadoop knowledge
- Beginner to intermediate Linux skills, including familiarity with common commands such as ls, cd, cp, and su
- Beginner to intermediate proficiency with SQL
The Data Analysis with Apache Hive class is ideal for:
- Data Analysts, Data Scientists and Developers