OpenOCR is an open source toolkit for general OCR research and applications.
It focuses on text detection and recognition, formula and table recognition, and document parsing and understanding. The toolkit brings together a unified training and evaluation benchmark, practical OCR and document parsing systems, and implementations of methods from academic papers.
This is free and open source software.
Key Features
- General OCR toolkit for research and applications.
- Text detection and recognition.
- Formula and table recognition.
- Document parsing and understanding.
- Unified training and evaluation benchmark.
- OpenDoc-0.1B lightweight document parsing system.
- UniRec-0.1B unified text and formula recognition.
- Supports Chinese and English text and formula recognition.
- Server and mobile OCR models.
- Fine-tuning on custom datasets.
- ONNX model export for wider compatibility.
- Includes local, Hugging Face, and ModelScope demo options.
Website: github.com/Topdu/OpenOCR
Support:
Developer: OCR team from FVL Lab, Fudan University
License: Apache License 2.0
OpenOCR is written in Python. Learn Python with our recommended free books and free tutorials.
Related Software
| OCR Systems | |
|---|---|
| Tesseract | High quality neural net (LSTM) based OCR engine focused on line recognition |
| EasyOCR | OCR that reads natural scene text and dense text in documents |
| ocrs | Modern OCR engine |
| Surya | Multilingual document OCR toolkit with text recognition |
| ocropy | Open source document analysis and OCR system |
| Ocrad | OCR engine based on a feature extraction method |
| Cuneiform | OCR Engine to convert OCR documents into editable form |
| GOCR | Reads images in many formats |
Read our verdict in the software roundup.
Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk. You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more. Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form. |

