Cassandra Developer

Big Data & Analytics

Course Description

Cassandra Developer

In today’s times, businesses need to process a very large volume and variety of data. This generates a requirement for a highly available, low latency database. Cassandra solves this challenge by allowing high-speed reads and writes across a large and distributed system.

32 Hours

The Cassandra Developer Training courses educates the learner in

  • Architect and engineer NoSQL Cassandra databases
  • Model data in Cassandra
  • Accessing Cassandra using CQL and Java
  • Ensuring high read/write speeds while maintaining data consistency
  • Integrating Cassandra with Hadoop, Pig and Hive
  • Implementing commonly used Cassandra design patterns

Basic knowledge of Linux

Cassandra Fundamentals and Features

  • Distributed and Decentralized
  • Elastic Scalability
  • High Availability and Fault Tolerance
  • Tune-able Consistency
  • Row-Oriented
  • Schema-Free
  • High Performance

The Cassandra Data Model

  • The Relational Data Model
  • A Simple Introduction
  • Clusters
  • Keyspaces
  • Hands-on Session

Installation and Setup of Cassandra

  • Single Node Setup
  • Multi-Node Cluster Setup
  • Key Configurations for Cassandra
  • CLI and Hands-On with Cassandra

Cassandra Modeling

  • Cassandra (Column Family NoSQL DB)
  • Key Concepts
  • Key Space
  • Column Family
  • Column Family Options
  • Wide Rows
  • Skinny Row
  • Column Sorting
  • Super Columns
  • Counter Column Family
  • Composite Keys and Columns
  • Time To Live
  • Secondary Indexes in Cassandra
  • A comparison between Custom Indexes and Secondary Indexes
  • A comparison between Relational Modeling and Cassandra Modeling
  • Key Points while modeling a Cassandra Database
  • Patterns and Anti-Patterns in Cassandra Modeling

Cassandra Architecture & Intro to CQL

  • Anatomy of Reading operation in Cassandra
  • Anatomy of the Write operation in Cassandra
  • How is Deletes handled in Cassandra
  • System Keyspace
  • Peer to Peer Model Logical Data Model: Keyspace, Column Family/Table, Rows, Columns
  • Traditional Ring design vs. VNodes
  • Partitioners: Murmer3, Random (md5) and ByteOrdered
  • Gossip and Failure Detection
  • Anti-Entropy and Read Repair
  • Memtables, SSTables and Commit Log
  • Compaction fundamentals to reduce SSTable data files
  • Hinted Handoff
  • Compaction
  • Bloom Filters, Tombstones
  • Managers and Services
  • VNodes
  • Indexes and Caches
  • Coordinator node
  • Seed nodes
  • Write/Read consistency levels: Any, One, Two, Three, Quorum
  • Snitches: Dynamic snitching, Simple Snitch, Rack Inferring Snitch, Property File Snitch, Gossiping Property File Snitch
  • Routing Client requests
  • Nodetool commands: gossipinfo, cfstats, describing
  • YAML file fundamentals
  • Operations management web GUI
  • Stress testing Cassandra
  • CQL command fundamentals

Cassandra API

  • Key concepts for Reading and Write in Cassandra
  • Tunable Consistency
  • Simple Get, Multi-get Slice
  • Range and Slice
  • Slice Predicate
  • Delete
  • Hands-on CLI commands

Cassandra CQSHL

  • SQL over Cassandra
  • Composite Keys
  • Hands-on examples on CQL 3.0

Cassandra Clients

  • How to establish Client Connections
  • Thrift Client
  • Connection Pooling
  • Auto-discovery and Failover in Hector
  • Client with CQL

Cassandra Monitoring and Administration

  • Backup and Recovery methods
  • Tuning Cassandra
  • Balancing
  • Bootstrapping
  • Node Tools Commands
  • Upgrades
  • Monitoring critical metrics
  • Bulk Loading Data to Cassandra
  • Bulk Export of Data from Cassandra
  • Hands-on Examples for each of them

Cassandra Analytics Cluster

  • Cassandra Hadoop Integration

Cassandra Search Cluster

  • Integration of Solr with Cassandra
  • Search Query on Cassandra