OCRFeeder – document layout analysis and optical character recognition system

OCRFeeder is a free open source software desktop OCR suite for the GNOME desktop environment. It converts paper documents to digital document files or makes them accessible to visually impaired users. OCRFeeder was created to allow users to easily convert document images (for example, a PNG image with text) into editable documents (for example, an ODT version with that text).

OCRFeeder will automatically outline its contents, distinguish between what is graphics and text and perform OCR over the latter. It generates multiple formats.

It features a complete GTK+ graphical user interface that allows the users to correct any unrecognized characters, defined or correct bounding boxes, set paragraph styles, clean the input images, import PDFs, save and load the project, export everything to multiple formats, etc.

Key Features

  • Simple graphical user interface.
  • Configurable.
  • Views: Zoom in / out, Normal Size, and Best Fit.
  • Import data from PDF or graphic files.
  • Grabs images direct from the scanner.
  • Unpaper image processor: Black filter, noise filter intensity, grey filter size.
  • Image Deskewer – deskewing an image makes it easier for the software to recognise the image. This option can be performed automatically each time an image is added.
  • Choose the language for the OCR engine.
  • Spellchecker.
  • Automatic recognition performs some complex operations.
  • Generates three document formats: ODT, HTML and Plain Text.
  • Supports OCR Engines: Cuneiform, Tesseract, GOCR, and Ocrad.
  • Supports: English, Czech, Danish, German, Spanish, French, Galician, Italian, Norwegian, Portuguese, Romanian, Slovenian, Swedish and Chinese.

Website: gitlab.gnome.org/GNOME/ocrfeeder
Support:
Developer: Igalia, SL
License: GNU General Public License v3.0

OCRFeeder

OCRFeeder is written in Python. Learn Python with our recommended free books and free tutorials.


Related Software

OCR Tools
OCRmyPDFAdds an OCR text layer to scanned PDFs using the unpaper utility
PaperworkSimplify the management of your paperwork
OCRFeederDesktop OCR suite featuring a complete GTK graphical user interface
gImageReaderSimple Gtk/Qt front-end to Tesseract
gscan2pdfGUI to produce PDFs or DjVus from scanned documents
lioslinux-intelligent-ocr-solution for converting print into text
hocr-toolsManipulate and evaluate hOCR format
SkanpageSimple scanning application optimized for multi-page document scanning
GOCRReads images in many formats
QuickSnipOCR and Google Lens search
ocropyOpen source document analysis and OCR system

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments