The Pentaho BI Project is open source business intelligence software that provides enterprise reporting, analysis, dashboards, data mining, workflow, and ETL capabilities.
The BI Suite Community Edition is intended for business intelligence aficionados, open source software programmers, early adopters, and college students.
This software can be used as a full suite or as individual components that are accessible via web services.
Pentaho BI Suite users typically start with Pentaho Data Integration to prepare a data source, then use Metadata Editor to create a metadata layer for that data source, and then optionally use Schema Workbench to create a ROLAP schema. At that point, they are ready to create reports and analysis views.
Key Features
- Reporting – an advanced report creation tool.
- Design Studio – an Eclipse-based tool that enables users to hand-edit a report or analysis view xaction file.
- Aggregation Designer – a graphical tool that helps improve Mondrian cube efficiency.
- Metadata Editor – a tool that adds a custom metadata layer to an existing data source.
- Pentaho Data Integration – the Kettle extract, transform, and load (ETL) tool, used to access and prepare data sources for analysis, data mining, or reporting.
- Schema Workbench – a graphical tool to create ROLAP schemas for analysis.
- Analysis.
- Dashboards.
- Business Intelligence Platform.
- Data Mining.
- Guided analysis.
Website: www.pentaho.com
Support: Wiki, SourceForge Project Page
Developer: Hitachi Vantara
License: Pentaho Community Edition (CE): Apache license version 2.0; Pentaho Enterprise Edition (EE): Hitachi Commercial License

Pentaho is written in Java.
Related Software
| Business Intelligence Software | |
|---|---|
| Metabase | Business intelligence and analytics software |
| Grafana | Platform for monitoring and observability |
| Superset | Data visualization and data exploration platform |
| Pentaho | Enterprise reporting, analysis, dashboard, data mining, workflow |
| Redash | Explore, query, visualize, and share data |
| Knowage | (formerly SpagoBI) Flexible business intelligence suite |
| JasperReports | A widely used reporting engine |
| KNIME | Konstanz Information Miner |
| ReportServer | Modern and versatile business intelligence platform |
| Rill | Operational BI tool |
| BIRT Project | Eclipse-based reporting system |
| Gephi | Visualization and exploration software for all kinds of graphs and networks |
| RapidMiner | Data analysis, knowledge discovery, data mining, predictive analytics |
| Data Analysis Tools | |
|---|---|
| Hadoop | Distributed processing of large data sets across clusters of computers |
| Storm | Distributed and fault-tolerant realtime computation |
| Drill | Distributed system for interactive analysis of large-scale datasets |
| Flink | Framework and distributed processing engine |
| Spark | Unified analytics engine for large-scale data processing |
| Pentaho | Enterprise reporting, analysis, dashboard, data mining, workflow and more |
| HPCC Systems | Designed for the enterprise to resolve Big Data challenges |
| RapidMiner | Knowledge discovery in databases, machine learning, and data mining |

