Learning Apache Cassandra

In this course, you will learn to build an efficient, scalable, fault-tolerant, and highly available data layer into your applications by managing large amounts of data with Apache Cassandra.

Features include:
  • Self-paced with Lifetime Access
  • Certificate on Completion
  • Access on Android and iOS Apps

Description

Build an efficient, scalable, fault-tolerant, and highly available data layer into your applications by managing large amounts of data with Apache Cassandra.

Cassandra is a decentralized, fault-tolerant, scalable, and low-cost NoSQL database, which makes it a core component of many cloud computing systems. Recent versions have greatly improved its security features, making it suitable for use in enterprise systems.

In this tutorial, you’ll see how Cassandra overcomes the challenges that relational databases face under high scalability demands. You will become familiar with Cassandra terminology, its components, and their roles. Then you will learn how to create a multi-node Cassandra cluster, understand the responsibilities of each component, and see how data flows during database operations that demand speed, accuracy, and durability.

You will then see how Cassandra stores data in files on disk, how to optimize those files to improve performance, and how to monitor Cassandra database performance using logs and metrics.

We’ll demonstrate the factors that can affect the performance SLAs of a Cassandra database. Next, you will learn how to optimize the data model to provide performance guarantees and consistent performance SLAs over time. You’ll also learn how to build a data model in Cassandra and integrate the database with your application.

In the later sections, you’ll connect to Cassandra from Spark to read and write data. You’ll then learn how to process live streaming data with Spark and persist it in Cassandra for consumption by downstream systems.

By the end of the course, you’ll be able to build powerful, scalable Cassandra database layers for your applications. You’ll design rich schemas to capture the relationships between different data types and master the advanced features available in Cassandra.

About the Author

  • Tomasz Lelek is a Software Engineer and Co-Founder of InitLearn. He programs mostly in Java and Scala and is currently diving into Big Data technologies. Tomasz is passionate about everything associated with software development. He has spoken at conferences in Poland, including Confitura and JDD, and at the Krakow Scala User Group. He has also conducted a live coding session at the Geecon Conference and spoken at an international event in Dhaka. He is very enthusiastic and loves to share his knowledge.

Basic knowledge
  • This course is for anyone who wants to learn more about Apache Cassandra from the ground up and get a solid understanding of its workings

What will you learn
  • Understand the challenges of scaling up relational databases and how NoSQL databases like Cassandra overcome them
  • Understand the architecture of Apache Cassandra and how data is stored
  • Use the different components of Cassandra (the read path, write path, fault tolerance, replication, consistency model, and anti-entropy model) according to what your application needs
  • Start a Cassandra multi-node cluster and understand the role of each critical piece of the distributed system and their interplay
  • Learn the principles and methodologies for data modelling in Cassandra
  • Integrate the database with your application
  • Migrate existing data from relational databases
  • Learn how to process live streaming data with Spark and persist the data in Cassandra for consumption by a downstream system
Course Curriculum
Number of Lectures: 47
Total Duration: 03:12:55