Call : (+91) 968636 4243
Mail : info@EncartaLabs.com
EncartaLabs

Apache HBase

( Duration: 3 Days )

Apache HBase is a NoSQL database that runs on Hadoop’s HDFS and is fully integrated with Hadoop. It is designed to handle large sets of data, which contain billions of columns and rows. It is an ideal choice for storing sparse and semi-structured data because it provides fault tolerance through replication, automatic failover, sharding, and load balancing. It also provides fast real-time lookups, in-memory caching, and server-side processing. Apache HBase can be accessed using various client APIs, including Java, Thrift, and REST.

This Apache HBase training course covers the HBase architecture and data models, install HBase, use the shell and client APIs to access the data.

By attending Apache HBase workshop, delegates will learn to:

  • Understand HBase Architecture and use HBase shell to directly manipulate HBase Tables
  • Design optimal HBase schemas for efficient data storage and recovery
  • Connect to HBase using the Java API
  • Understand how to bulk load data into HBase using MapReduce
  • Administer HBase cluster and Identify, resolve performance bottlenecks

  • Core Java knowledge

The Apache HBase class is ideal for:

  • Administrators and developers who need experience using Hbase.

COURSE AGENDA

1

Installation

  • Overview of HBase
  • HBase Requirements
  • HBase Software Requirements
  • HBase Filesystems
  • HBase Installation Modes
  • Installing HBase in Local Mode
  • Installing HBase in Fully Distributed Mode
  • Accessing the Web-Based Management Console
  • Using the HBase Shell
2

Architecture

  • HBase Components
  • HFiles and Regions
  • The Write-Ahead Log and MemStore
  • Compaction and Splits
  • Data Replication
  • Access HBase
  • Securing HBase
  • Hadoop’s MapReduce Integration with HBase
3

Data Modeling

  • HBase Schema Overview
  • Designing Tables
  • Designing Rowkeys for Tables
  • Versions, DataTypes, and Joins
  • Time to Live and Deleted Cells
4

Table Creation in the Shell

  • Creating Tables Using the Shell
  • Disabling, Enabling, and Dropping a Table
  • Altering a Table’s Properties
5

Data Management in the Shell

  • Adding Data to a Table
  • Using the scan and get Commands
  • Deleting Data from a Table
  • Using Counters
6

Insert Data Using Java Client API

  • Establishing a Connection
  • Creating Tables using the Client Java API
  • Creating a Put Class Instance
  • Adding Data using the add() Option
  • Using Timestamp with Put for Versioning
  • Using the get() and has() Method
7

Get Data Using Java Client API

  • Using the Get Class
  • Retrieving Columns using the Get Class
  • Retrieving Versions of Columns using the Get Class
  • Retrieving Specific Values from a Cell
  • Using List with the Get Class
8

Scan Data using Java Client API

  • Using Scan() to read an Entire Table
  • Scanning Rows Starting at a Specific Row or a Range
  • Using Constructors to Narrow Search Results
  • Using getScanner() Method
  • Using the ResultScanner Class
9

Delete and Update Data using Java Client

  • Updating Data
  • Deleting Data
10

Filters

  • Implementing Utility Filters
  • Implementing Comparison Filters
  • Implementing Custom Filters
11

Cluster Administration

  • Checking the Status of the HBase Instance
  • Listing the User Space Tables
  • Deleting Tables
  • Completing a Major Compaction Manually
  • Merging Adjoining Regions
  • Stopping and Decommissioning a RegionServer
  • Performing a Rolling Restart
  • Adding a New Node
  • Monitoring HBase
12

Snapshots and Backups

  • Taking a Snapshot
  • Using a Snapshot to Clone a Table
  • Exporting and Restoring Snapshots
  • Performing a Full Shutdown Backup
  • Performing a Backup on a Live Cluster
  • Performing a Restore
13

MapReduce

  • Using HBase as a Data Sink for MapReduce Jobs
  • Using HBase as a Data Source for MapReduce Jobs
  • Bulk Loading Data
  • Splitting Map Tasks When Sourcing an HBase Table
  • Accessing Other HBase Tables within a MapReduce Job

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 6,000 various courses on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting https://www.encartalabs.com/course-catalogue-all.php

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.

Top
Notice
X