Apache Accumulo is a sorted, distributed key/value store. It is a robust, highly scalable, high performance data storage and retrieval system.
Apache Accumulo is based on Google’s BigTable design and is built on top of Apache Hadoop, Zookeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.
Accumulo operates over the Hadoop Distributed File System (HDFS), which is part of the popular Apache Hadoop project. Accumulo features automatic load-balancing and partitioning, data compression and fine-grained security labels. It has cell-level access labels and a server-side programming mechanisms.
Key Features
- Table Design and Configuration:
- Iterators.
- Cell Labels.
- Constraints.
- Sharding.
- Large Rows.
- Integrity/Availability:
- Master Fail Over.
- Write Ahead Log.
- Logical Time.
- Logical Time for Bulk Import.
- Fate.
- Scalable Master.
- Isolation.
- Performance:
- Relative Encoding.
- Native In-Memory Map.
- Scan Pipeline.
- Caching.
- Multi-Level Rfile Index.
- Testing:
- Mock.
- Functional Test.
- Scale Test.
- Random Walk Test.
- Client API:
- Batch Scanner.
- Batch Writer.
- Bulk Import.
- Map Reduce.
- Extensible Behaviors:
- Pluggable Balancer.
- Pluggable Memory Manager.
- Pluggable Logger Assignment Strategy.
- General Administration:
- Monitor Page.
- Tracing.
- Online Reconfiguration.
- Table Renaming.
- Internal Data Management:
- Locality Groups.
- Smart Compaction Algorithm.
- Merging Minor Compaction.
- On-demand Data Management:
- Compactions.
- Split Points.
- Tablet Merging.
- Table Cloning.
- Compact Range.
- Delete Range.
Website: accumulo.apache.org
Support: GitHub
Developer: The Apache Software Foundation
License: Apache License 2.0
Apache Accumulo is written in Java. Learn Java with our recommended free books and free tutorials.
Related Software
| Key Value Stores | |
|---|---|
| RocksDB | Persistent key-value store for flash and RAM Storage |
| Valkey | High-performance data structure server |
| Apache Cassandra | Distributed database management system |
| ScyllaDB | Real-time big data database |
| Apache Accumulo | Based on Google's BigTable design |
| ArangoDB | Native multi-model database |
| Aerospike CE | Real-time NoSQL database and key-value store |
| Berkeley DB | Family of open source, embeddable databases |
| LevelDB | Fast and lightweight key/value database library by Google |
| Garnet | Remote cache-store |
| KeyDB | High performance fork of Redis |
| Redict | Distributed key/value store |
| Project Voldemort | Distributed data store |
| Scalaris | Distributed transactional key-value store |
Read our verdict in the software roundup.
Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk. You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more. Know a useful open source Linux program that we haven’t covered yet? Let us know by completing this form. |

