Call : (+91) 968636 4243
Mail : info@EncartaLabs.com
EncartaLabs

Data Warehousing on AWS

(Duration: 3 Days)

This Data Warehousing on AWS training course introduces concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS. The course demonstrates how to collect, store, and prepare data for the data warehouse by using AWS services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis, and Amazon S3. It also demonstrates how to use Amazon QuickSight to perform analysis on your data.

By attending the Data Warehousing on AWS workshop, delegates will learn to:

  • Discuss the core concepts of data warehousing, and the intersection between data warehousing and big data solutions
  • Launch an Amazon Redshift cluster and use the components, features, and functionality to implement a data warehouse in the cloud
  • Use other AWS data and analytic services, such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis, and Amazon S3, to contribute to the data warehousing solution
  • Architect the data warehouse
  • Identify performance issues, optimize queries, and tune the database for better performance
  • Use Amazon Redshift Spectrum to analyze data directly from an Amazon S3 bucket
  • Use Amazon QuickSight to perform data analysis and visualization tasks against the data warehouse

Delegates should meet the following prerequisites:

  • Attendance at an AWS Technical Essentials training, or equivalent practical experience
  • Familiarity with relational databases and database design concepts

The Data Warehousing on AWS class is ideal for:

  • Database Architects
  • Database Administrators
  • Database Developers
  • Data Analysts
  • Data Scientists

COURSE AGENDA

1. Introduction to Data Warehousing

  • Relational databases
  • Data warehousing concepts
  • The intersection of data warehousing and big data
  • Overview of data management in AWS

2. Introduction to Amazon Redshift

  • Conceptual overview
  • Real-world use cases

3. Launching clusters

  • Building the cluster
  • Connecting to the cluster
  • Controlling access
  • Database security
  • Load data

4. Designing the database schema

  • Schemas and data types
  • Columnar compression
  • Data distribution styles
  • Data sorting methods

5. Identifying data sources

  • Data sources overview
  • Amazon S3
  • Amazon DynamoDB
  • Amazon EMR
  • Amazon Kinesis Data Firehose
  • AWS Lambda Database Loader for Amazon Redshift

6. Loading data

  • Preparing data
  • Loading data using COPY
  • Maintaining tables
  • Concurrent write operations
  • Troubleshooting load issues

7. Writing queries and tuning for performance

  • Amazon Redshift SQL
  • User-Defined Functions (UDFs)
  • Factors that affect query performance
  • The EXPLAIN command and query plans
  • Workload Management (WLM)

8. Amazon Redshift Spectrum

  • Amazon Redshift Spectrum
  • Configuring data for Amazon Redshift Spectrum
  • Amazon Redshift Spectrum queries

9. Maintaining clusters

  • Audit logging
  • Performance monitoring
  • Events and notifications
  • Resizing clusters
  • Backing up and restoring clusters
  • Resource tagging, limits, and constraints

10. Analyzing and visualizing data

  • Power of visualizations
  • Building dashboards
  • Amazon QuickSight editions and features

Encarta Labs Advantage

  • One-stop corporate training solution provider offering over 6,000 courses on a variety of subjects
  • All courses are delivered by industry veterans
  • Go from newbie to production-ready in a matter of a few days
  • More than 50,000 corporate executives trained across the globe
  • All our training is conducted in workshop mode, with a focus on hands-on sessions

View our other course offerings by visiting https://www.encartalabs.com/course-catalogue-all.php

Contact us to deliver this course as a public/open-house workshop or as online training for a group of 10+ candidates.
