Performance benchmark of DataStax Enterprise v6 (Cassandra)

Performance benchmark of DataStax Enterprise v6 (Cassandra)

This report is to measure the performance of DataStax Enterprise v6 (Cassandra) for two workloads in terms of latency and throughput.

Introduction

The goal of this report is to measure the performance of DataStax Enterprise v6 (Cassandra) in terms of latency and throughput. Cassandra is deployed by three types of cluster configuration. There are 4, 10, and 20 nodes. The evaluation was processed by using two workload types.

The first workload is the “_update heavy_” workload. This workload evaluates 50% of reading requests and 50% of writing requests by using Zipfian as a request distribution.

The second workload is the “_short ranges_”. The idea of this workload is to invoke 95% of scans and 5% of updates. The request distribution is the same — Zipfian.

As a tool for processing benchmark performance was Yahoo! Cloud Serving Benchmarking (YCSB). YCSB is a framework for evaluating the benchmark performance of the database under different workloads.

The data size is 1 KB records (10 fields, 100 bytes each, plus key). The number of records was chosen according to the size of the cluster. 50 million on a 4-node cluster, 100 million — 10-node cluster, 250 million records — 20-node cluster.


Environment

The following type of EC2 instances on AWS was chosen for deploying Cassandra cluster:

Image for post

The YCSB client was deployed on the compute-optimized instances by AWS:

Image for post

Cluster configuration

DataStax Enterprise (Cassandra) is a wide-column store NoSQL database management system, designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

The table below shows changes which were applied on Cassandra configuration: Image for post

performance cassandra big-data nosql data analysis

Bootstrap 5 Complete Course with Examples

Bootstrap 5 Tutorial - Bootstrap 5 Crash Course for Beginners

Nest.JS Tutorial for Beginners

Hello Vue 3: A First Look at Vue 3 and the Composition API

Building a simple Applications with Vue 3

Deno Crash Course: Explore Deno and Create a full REST API with Deno

How to Build a Real-time Chat App with Deno and WebSockets

Convert HTML to Markdown Online

HTML entity encoder decoder Online

Top Microsoft big data solutions Companies | Best Microsoft big data Developers

An extensively researched list of top microsoft big data analytics and solution with ratings & reviews to help find the best Microsoft big data solutions development companies around the world.

Role of Big Data in Healthcare - DZone Big Data

In this article, see the role of big data in healthcare and look at the new healthcare dynamics. Big Data is creating a revolution in healthcare, providing better outcomes while eliminating fraud and abuse, which contributes to a large percentage of healthcare costs.

Silly mistakes that can cost ‘Big’ in Big Data Analytics

‘Data is the new science. Big Data holds the key answers’ - Pat Gelsinger The biggest advantage that the enhancement of modern technology has brought

Big Data can be The ‘Big’ boon for The Modern Age Businesses

We need no rocket science in understanding that every business, irrespective of their size in the modern-day business world, needs data insights for its expansion. Big data analytics is essential when it comes to understanding the needs and wants of a significant section of the audience.

Data Lakes Are Not Just For Big Data - DZone Big Data

A data expert discusses the three different types of data lakes and how data lakes can be used with data sets not considered 'big data.'