Apache Cassandra is an open source distributed database management system. It is a highly scalable, eventually consistent, distributed, structured key-value store. Cassandra brings together the distributed systems technologies from Dynamo and the data model from Google’s BigTable.
Cassandra is essentially a hybrid between a key-value and a row-oriented (or tabular) database.
Cassandra is in use at Netflix, eBay, Twitter, Urban Airship, Constant Contact, Reddit, Cisco, OpenX, Digg, CloudKick, Ooyala, and more companies.
- Provides a structured key-value store with tunable consistency.
- Uses the BigTable way of modeling.
- Eventual consistency.
- Automatically replication of data to multiple nodes for fault-tolerance. Replication across multiple data centers is supported. Failed nodes can be replaced with no downtime.
- Gossip protocol.
- Master-master way of serving read and write requests inspired by Amazon’s Dynamo.
- Synchronous or asynchronous replication for each update.
- Supports CQL (Cassandra Query Language), an SQL-alike alternative to the traditional RPC interface.
- Hadoop integration, with MapReduce support. There is support also for Apache Pig and Apache Hive.
|Read our complete collection of recommended free and open source software. The collection covers all categories of software.|