What is Cassandra?

Apache Cassandra is a free and open-source, distributed, wide column store NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. (Source: Wikipedia)

Apache Cassandra is a high-performance, extremely scalable, fault-tolerant (i.e. no single point of failure), distributed post-relational database. It combines the data model of Google Bigtable with the distributed design of Amazon Dynamo to handle database workloads that traditional RDBMS products struggle to support.
- PART 5: In the previous blogs we examined a variety of database types and considered some of their appropriate domains and use cases. In this blog we look at actual NoSQL wide column store offerings from different vendors, to highlight some of the key differences between otherwise similar technologies. We also consider the differences in performance and the costs associated with running each database on premises or in the cloud, so that it becomes easier to recognize the database that best suits your needs.
- In this blog, we show how we created a DataStax Cassandra cluster on Oracle Cloud Infrastructure (OCI) using Terraform and benchmarked Oracle Cloud bare-metal machines running the Cassandra stress tool. 1. Steps to Create the Cluster: Download the Terraform project. Follow the guide to set up your environment. Edit the file env-vars and fill in all the relevant info for your OCI account. Edit variables.tf and select the shape of your OCI instance; we used DenseIO1.
- Setting up AWS EC2. Instance type: m4.xlarge; 4-node cluster, 2 nodes in each AZ. Storage: two EBS volumes (400 GB data volume, 150 GB log volume) and a 150 GB root volume (General Purpose SSD). OS: Amazon Linux AMI 2016.09.0 (HVM), SSD Volume Type, ami-b953f2da. The Amazon Linux AMI is an EBS-backed, AWS-supported image. The default image includes AWS command line tools, Python, Ruby, Perl, and Java. The repositories include Docker, PHP, MySQL, PostgreSQL, and other packages.
- Choice of Tool. New Relic (http://newrelic.com/plugins/3legs/113): Monitor Cassandra statistics using the 3legs plugin. Metrics include read and write latency (global and per host), cache statistics, pending compactions, flushes, and more. DataStax OpsCenter (http://www.datastax.com/documentation/opscenter/4.1/pdf/opscuserguide41.pdf): DataStax OpsCenter is a visual management and monitoring solution for Apache Cassandra and DataStax Enterprise. The DataStax agents are installed on the Real-time (Cassandra), Analytics (Hadoop), and Search (Solr) nodes. They use Java Management Extensions (JMX) to monitor and manage each node.
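
The monitoring tools in the last item read node statistics over JMX. As a rough illustration of what such a tool has to know, the sketch below builds the JMX service URL for each node and pairs it with a few metric MBean names. It is not taken from any of the posts listed here: the helper names (`jmx_service_url`, `monitoring_targets`) and the host addresses are hypothetical, port 7199 is Cassandra's default JMX port, and the MBean names follow the `org.apache.cassandra.metrics` naming used by recent Cassandra releases, so they should be checked against the version you actually run.

```python
# Minimal sketch: enumerate (JMX URL, metric, MBean name) targets per node.
# Helper names and host addresses are illustrative assumptions.

DEFAULT_JMX_PORT = 7199  # Cassandra's default JMX port


def jmx_service_url(host: str, port: int = DEFAULT_JMX_PORT) -> str:
    """Return the standard RMI-based JMX service URL for one node."""
    return f"service:jmx:rmi:///jndi/rmi://{host}:{port}/jmxrmi"


# MBean object names for some of the metrics mentioned above
# (assumed naming; verify against your Cassandra version).
METRIC_MBEANS = {
    "read_latency": "org.apache.cassandra.metrics:type=ClientRequest,scope=Read,name=Latency",
    "write_latency": "org.apache.cassandra.metrics:type=ClientRequest,scope=Write,name=Latency",
    "pending_compactions": "org.apache.cassandra.metrics:type=Compaction,name=PendingTasks",
}


def monitoring_targets(hosts):
    """Pair every node's JMX URL with every metric MBean name."""
    return [
        (jmx_service_url(host), metric, mbean)
        for host in hosts
        for metric, mbean in METRIC_MBEANS.items()
    ]


if __name__ == "__main__":
    # Hypothetical two-node cluster.
    for url, metric, mbean in monitoring_targets(["10.0.0.1", "10.0.0.2"]):
        print(f"{metric:20s} {url}  {mbean}")
```

The sketch only produces connection strings and object names; an actual JMX client (for example a Java program or a tool such as the agents described above) would still be needed to connect and read the attribute values.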