Benchmarking Cassandra
Keywords:
NoSQL, Cassandra, YCSB (yahoo cloud service benchmark), workloadsAbstract
With the increasing need for the storage of unstructured data, the demand for NoSQL databases has grown. The most widely used NoSQL database is the column-based Cassandra. While the usage of Cassandra has expanded, evaluating its performance has become crucial for applications utilizing Cassandra at scale for storage. Furthermore, Cassandra is being applied to a diverse range of applications that significantly differ from traditional serving workloads. The rise of emerging cloud serving systems and the wide range of proposed applications, coupled with a lack of performance comparisons, makes it difficult to understand the trade-offs between systems and the workloads for which they are best suited. This work aims to benchmark Cassandra, with the goal of facilitating performance comparisons between different versions of Cassandra, while using YCSB to generate various workloads. We define a core set of benchmarks and report the results for Cassandra, evaluating it against various performance parameters.
References
B. Wang, S. Peng, X. Zhang, M. Bownes, R. Paton, and F. Golkarihagh, "Cassandra as Used by Facebook," December 15, 2010.
D. Featherston, "Cassandra: Principles and Application."
N. Ahamed Akheel, "Cassandra."
P. Bagade, A. Chandra, and A. Dhende, "Performance Monitoring Tool for NoSQL Column Oriented Distributed Database (Cassandra)."
P. Malik and A. Lakshman, "Cassandra - A Decentralized Structured Storage System," The 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS 09), October 2009.
Downloads
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.