OpenOCR – toolkit for general OCR research and applications

OpenOCR is an open source toolkit for general OCR research and applications.

It focuses on text detection and recognition, formula and table recognition, and document parsing and understanding. The toolkit brings together a unified training and evaluation benchmark, practical OCR and document parsing systems, and implementations of methods from academic papers.

This is free and open source software.

Key Features

  • General OCR toolkit for research and applications.
  • Text detection and recognition.
  • Formula and table recognition.
  • Document parsing and understanding.
  • Unified training and evaluation benchmark.
  • OpenDoc-0.1B lightweight document parsing system.
  • UniRec-0.1B unified text and formula recognition.
  • Supports Chinese and English text and formula recognition.
  • Server and mobile OCR models.
  • Fine-tuning on custom datasets.
  • ONNX model export for wider compatibility.
  • Includes local, Hugging Face, and ModelScope demo options.

Website: github.com/Topdu/OpenOCR
Support:
Developer: OCR team from FVL Lab, Fudan University
License: Apache License 2.0

OpenOCR is written in Python. Learn Python with our recommended free books and free tutorials.


Related Software

OCR Systems
TesseractHigh quality neural net (LSTM) based OCR engine focused on line recognition
EasyOCROCR that reads natural scene text and dense text in documents
ocrsModern OCR engine
SuryaMultilingual document OCR toolkit with text recognition
ocropyOpen source document analysis and OCR system
OcradOCR engine based on a feature extraction method
CuneiformOCR Engine to convert OCR documents into editable form
GOCRReads images in many formats

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted