Apache Accumulo
Apache Accumulo is a sorted, distributed key/value store. It
is a
robust, highly scalable, high performance data storage and retrieval
system.
Apache Accumulo is based on Google's BigTable design and is
built on top of Apache Hadoop, Zookeeper, and Thrift. Apache Accumulo
features a few novel improvements on the BigTable design in the form of
cell-based access control and a server-side programming mechanism that
can modify key/value pairs at various points in the data management
process.
Accumulo is written in Java and operates over the Hadoop
Distributed File System (HDFS), which is part of the popular Apache
Hadoop project. Accumulo features automatic load-balancing and
partitioning, data compression and fine-grained security labels.
It has cell-level access labels and a server-side programming
mechanisms.
Features include:
- Table Design and Configuration:
- Iterators
- Cell Labels
- Constraints
- Sharding
- Large Rows
- Integrity/Availability:
- Master Fail Over
- Write Ahead Log
- Logical Time
- Logical Time for Bulk Import
- Fate
- Scalable Master
- Isolation
- Performance:
- Relative Encoding
- Native In-Memory Map
- Scan Pipeline
- Caching
- Multi-Level Rfile Index
- Testing:
- Mock
- Functional Test
- Scale Test
- Random Walk Test
- Client API:
- Batch Scanner
- Batch Writer
- Bulk Import
- Map Reduce
- Extensible Behaviors:
- Pluggable Balancer
- Pluggable Memory Manager
- Pluggable Logger Assignment Strategy
- General Administration:
- Monitor Page
- Tracing
- Online Reconfiguration
- Table Renaming
- Internal Data Management:
- Locality Groups
- Smart Compaction Algorithm
- Merging Minor Compaction
- On-demand Data Management:
- Compactions
- Split Points
- Tablet Merging
- Table Cloning
- Compact Range
- Delete Range
Return
to Key Value Stores Home Page
Last Updated Sunday, April 14 2013 @ 08:24 AM EDT |