StarRocks is a query engine for sub-second, ad-hoc analytics both on and off the data lakehouse. It eliminates the need for denormalization and adapts to your use cases, without having to move your data or rewrite SQL.
This is free and open source software.
Key Features
- Native vectorized SQL engine: It adopts vectorization technology to make full use of the parallel computing power of CPU, achieving sub-second query returns in multi-dimensional analyses, which is 5 to 10 times faster than previous systems.
- Standard SQL: StarRocks supports ANSI SQL syntax (fully supported TPC-H and TPC-DS). It is also compatible with the MySQL protocol. Various clients and BI software can be used.
- Smart query optimization: The software can optimize complex queries through CBO (Cost Based Optimizer). With a better execution plan, the data analysis efficiency will be greatly improved.
- Real-time update: The updated model of the database can perform upsert/delete operations according to the primary key, and achieve efficient query while concurrent updates.
- Intelligent materialized view: The materialized view of StarRocks can be automatically updated during the data import and automatically selected when the query is executed.
- Querying data in data lakes directly: It allows direct access to data from Apache Hive, Apache Iceberg, Delta Lake and Apache Hudi without importing.
- Resource management: This feature allows the program to limit resource consumption for queries and implement isolation and efficient use of resources among tenants in the same cluster.
- Easy to maintain: Simple architecture makes it easy to deploy, maintain and scale out. StarRocks tunes its query plan agilely, balances the resources when the cluster is scaled in or out, and recovers the data replica under node failure automatically.
Website: github.com/StarRocks/StarRocks
Support:
Developer: LF Projects, LLC
License: Apache License 2.0
StarRocks is written in Java and C++. Learn Java with our recommended free books and free tutorials. Learn C++ with our recommended free books and free tutorials.
Related Software
| Column-Oriented Databases | |
|---|---|
| MariaDB ColumnStore | Uses a massively parallel distributed data architecture |
| DuckDB | In-process SQL OLAP database management system |
| Druid | High performance, real-time analytics database |
| Databend | Cloud data warehouse |
| ClickHouse | Real-time analytics database management system |
| InfluxDB Core | Scalable datastore for metrics, events, and real-time analytics |
| Doris | Modern data warehouse for real-time analytics |
| VictoriaMetrics | Scalable solution for monitoring and managing time series data |
| StarRocks | High-performance analytical database |
| MonetDB | High performance relational database system for analytics |
| Kudu | Distributed data storage engine |
| QuestDB | High-performance time-series database |
| Pinot | Real-time analytics platform |
| IoTDB | High-performance time-series database |
| GreptimeDB | Cloud-native database |
| CrateDB | Distributed SQL database management |
Read our verdict in the software roundup.
Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk. You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more. Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form. |

