Druid is a column-oriented distributed data store

Druid is a high performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.

Although Druid incorporates architecture ideas from data warehouses such as column-oriented storage, Druid also incorporates designs from search systems and timeseries databases. Druid’s architecture is designed to handle many use cases that traditional data warehouses cannot.

This is free and open source software.

Key Features

Interactive Query Engine – uses scatter/gather for high speed queries with data preloaded into memory or local storage to avoid data movement and network latency.
Tiering & QoS – configurable tiering with quality of service enables the ideal price-performance for mixed workloads, guarantees priority, and avoids resource contention.
Optimized Data Format – ingested data is automatically columnarized, time-indexed, dictionary-encoded, bitmap-indexed, and type-aware compressed
Elastic Architecture – loosely coupled components for ingestion, queries, and orchestration combined with a deep storage layer enable easy & quick scale-up & scale-out.
True Stream Ingestion – a connector-free integration with streaming platforms enables query-on-arrival, high scalability, low latency, and guaranteed consistency.
Non-stop Reliability – automatic data services including continuous backup, automated recovery, and multi-node replication ensure high availability and durability.
Schema Auto-Discovery – automatically detect, define, and update column names and data types upon ingestion, providing the ease of schemaless and the performance of strongly typed schemas.
Flexible Joins Support – supports join operations during data ingestion and at query-time execution, with the fastest query performance when tables are pre-joined during ingestion.
SQL Support – use the familiar SQL API for end-to-end data operations across ingestion, transformation, and querying.

Website: druid.apache.org
Support: GitHub Code Repository
Developer: Apache Software Foundation
License: Apache License 2.0

Druid is written in Java. Learn Java with our recommended free books and free tutorials.

Related Software

Column-Oriented Databases
MariaDB ColumnStore	Uses a massively parallel distributed data architecture
DuckDB	In-process SQL OLAP database management system
Druid	High performance, real-time analytics database
Databend	Cloud data warehouse
ClickHouse	Real-time analytics database management system
InfluxDB Core	Scalable datastore for metrics, events, and real-time analytics
Doris	Modern data warehouse for real-time analytics
VictoriaMetrics	Scalable solution for monitoring and managing time series data
StarRocks	High-performance analytical database
MonetDB	High performance relational database system for analytics
Kudu	Distributed data storage engine
QuestDB	High-performance time-series database
Pinot	Real-time analytics platform
IoTDB	High-performance time-series database
GreptimeDB	Cloud-native database
CrateDB	Distributed SQL database management

Read our verdict in the software roundup.

Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.

Documents	Internet	Education
Audio	Video	Graphics
Admin	Desktop	Productivity
Science	Games	Security
Utilities	Coding	Finance
Web Apps	Other	Books

Google	Microsoft	Apple
Adobe	IBM	Autodesk
Oracle	Atlassian	Corel
Cisco	Intuit	SAS
Progress	Salesforce	Citrix