Call : (+91) 968636 4243
Mail :

Apache Cassandra

Apache Cassandra moves away from the master-slave model and instead uses a peer-to-peer model. This means there is no single master but all the nodes are potentially masters. This makes the writes and reads extremely scalable and even allows nodes to function in cases of partition tolerance.

This Apache Cassandra - Administrator training course provides expertise about the core concepts of Cassandra, highly scalable data models and Cassandra architecture for proper administration of Big Data applications. The Apache Cassandra - Developer training course provides skills about data model queries, indepth CQL, data modelling, complex queries and about whiteboard in NoSQL.

In Apache Cassandra - Administrator workshop, delegates will learn to:
  • Benchmark read and write operations
  • Recognize different types of failure
  • Fix a failed or partially failed cluster
  • Identify potential performance bottlenecks
  • Perform live schema updates
  • Perform move operations
  • Plan and perform cluster-wide operations
  • Monitor compaction, repair, and hinted handoff
In Apache Cassandra - Developer workshop, delegates will learn to:
  • Architect and engineer Cassandra databases for competitive advantage
  • Model data in Cassandra based on query patterns
  • Access Cassandra databases using CQL and Java
  • Create a balance between read/write speed and data consistency
  • Integrate Cassandra with Hadoop, Pig and Hive
  • Implement commonly used Cassandra design patterns

Apache Cassandra - Administrator class is designed for Administrators with basic knowledge of databases.

Database Administrators, Data Analytics professionals, Data architects, Managers


Apache Cassandra - Administrator
(Duration : 2 Days)


The Write Path

  • Log Structured Storage
  • Memtables
  • Flushing

The Read Path

  • SSTables
  • Row Merging
  • Cache (Key, Row)
  • Compaction
  • Distributed Deletes
  • Memory Mapped Files
  • Evolving Applications

Introduction to CAP


Partition Tolerance

  • Data partitioners
  • Replication strategies
  • Snitches
  • Hinted Handoff


  • How Cassandra handles failure of one or more nodes
  • What to do in the face of failure

Aspects of consistency

  • Consistency
  • Coordinators
  • Read Repair
  • Phi Accrual
  • Hinted Handoff
  • Anti-entropy Service

Cassandra and the JVM


Monitoring Cassandra


How Cassandra works with the physical hardware

  • CPU
  • Disk
  • Network
  • Goals for Sizing

The different storage strategies (disk configurations) including

  • Specific concerns for cloud hosting
  • Logical and Physical disk configuration
  • Local disks vs. network mounted/shared drives


  • Cassandra
  • System
  • GC

Backup and Recovery

  • Backup
  • Recovery


  • Authentication
  • Authorization
  • Physical Security

Apache Cassandra - Developer
(Duration : 3 Days)


NoSQL Overview

  • Justifying non-relational data stores
  • Listing the categories of NoSQL Data Stores

Exploring Cassandra

  • Defining column family data stores
  • Surveying Cassandra
  • Dissecting the basic Cassandra architecture

Querying Cassandra

  • Defining Cassandra Query Language, CQL
  • Enumerating CQL data types
  • Manipulating data from the cqlsh interface

Leveraging Cassandra structures and types

  • Drawing comparisons with the relational model
  • Organizing data with keyspaces, tables and columns
  • Creating collections and counters

Modeling data based on queries

  • Designing tables around access patterns
  • Clustering with compound primary keys
  • Improving data distribution with composite partition Keys

Detailing tunable consistency

  • Identifying consistency levels
  • Selecting appropriate read and write consistency levels
  • Distinguishing consistency repair features

Balancing consistency and performance

  • Relating replication factor and consistency
  • Trading consistency for availability
  • Trading consistency for availability

Working with Cassandra collection types

  • Grouping elements in sets
  • Ordering elements in lists
  • Expressing relationships with maps
  • Nesting collections

Storing data for easy retrieval

  • Mapping data to tuples and user defined types
  • Investigating the frozen keyword
  • Applying the Valueless Columns Pattern
  • Strategic implementation of clustering columns

Controlling data life span

  • Expiring temporal data with time-to-live
  • Reviewing how tombstones achieve distributed deletes
  • Executing DELETEs and UPDATEs in the future

Constructing materialized views and time series

  • Modeling time series data
  • Enhancing queries with materialized views
  • Materialized views maintained in the application
  • Driving analytics from materialized views

Managing triggers

  • Creating triggers by implementing ITrigger
  • Attaching triggers to tables
  • Supporting materialized views with triggers

Querying Cassandra data with the Datastax Java Driver

  • Connecting to a Cassandra cluster
  • Running CQL through the Java Driver
  • Batching prepared statements
  • Paginating large queries

Persisting Java Objects with Kundera

  • Defining the Java Persistence Architecture, JPA
  • Configuring Kundera to work with Cassandra
  • Generating schemas automatically
  • Managing JPA transactions in Kundera

Leveraging built-in Cassandra connectors

  • Loading data into Hadoop MapReduce with the Cassandra InputFormat
  • Utilizing the Cassandra Loader to create Pig relations
  • Converting a Cassandra table to a Hive table with the Casssandra serializer/deserializer (SerDe)

Encarta Labs Advantage

  • One Stop Corporate Training Solution Providers for over 6,000 various courses on a variety of subjects
  • All courses are delivered by Industry Veterans
  • Get jumpstarted from newbie to production ready in a matter of few days
  • Trained more than 50,000 Corporate executives across the Globe
  • All our trainings are conducted in workshop mode with more focus on hands-on sessions

View our other course offerings by visiting

Contact us for delivering this course as a public/open-house workshop/online training for a group of 10+ candidates.