The Pentaho BI Project is application software for enterprise reporting, analysis, dashboard, data mining, workflow and ETL capabilities.
Read more
The Pentaho BI Project is application software for enterprise reporting, analysis, dashboard, data mining, workflow and ETL capabilities.
Read more
Talend Open Studio is an Open Source project for data integration based on Eclipse RCP. It is an ETL (Extract, Transform, and Load) tool.
Read more
Knowage (formerly SpagoBI) is an open source flexible business intelligence suite. It meets the criteria of modern business intelligence.
Read more
ReportServer is a modern and versatile business intelligence platform. It integrates Jasper, Birt, Mondrian and Excel-based reporting.
Read more
KNIME is a coherent and comprehensive open source visual platform for data integration, processing, analysis, reporting and exploration.
Read more
RapidMiner (formerly known as YALE) is a flexible Java environment for knowledge discovery in databases, machine learning, and data mining.
Read more
The Business Intelligence and Reporting Tools Project is software that provides reporting and business intelligence capabilities.
Read more
NormCap is an OCR powered screen-capture tool to capture information instead of images. Free and open source software.
Read more
Frog is an intuitive text extraction tool for the GNOME desktop. Frog is free and open source software written in Python.
Read more
TextShot offers the ability to take a screenshot and copy to the clipboard the text content of the screenshot. It’s free and open source.
Read more
TextSnatcher is a simple front-end that lets you copy text from images. It uses the Tesseract OCR 4.x for the character recognition.
Read more
Tesseract runs from the command line. It can only process an image of a single column and create text from it.
Read more
OCRFeeder is a free open source software desktop OCR suite for the GNOME desktop environment. It features a GTK+ graphical user interface.
Read more
ocropy (referred to as OCRopus) is an OCR system written in Python, NumPy, and SciPy focusing on the use of large scale machine learning.
Read more
gscan2pdf is a graphical user interface to produce PDFs or DjVus from scanned documents. gscan2pdf is free and open source software.
Read more
gImageReader is a simple Gtk/Qt front-end to Tesseract, a popular optical character recognition engine. Free and open source.
Read more
linux-intelligent-ocr-solution (Lios) is a free and open source software for converting print into text using either a scanner or a camera.
Read more
hocr-tools is a set of tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results.
Read more
Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method. Free and open source software.
Read more
GOCR is an optical character recognition program. It reads images in many formats and outputs a text file.
Read more