MITIE: MIT Information Extraction

MITIE (MIT Information Extraction) offers state-of-the-art information extraction tools.

It provides tools for named entity extraction and binary relation detection, as well as tools for training custom extractors and relation detectors.

The core MITIE software is written in C++, but bindings for several other programming languages, including Python, R, Java, C, and MATLAB, let users quickly integrate MITIE into their own applications.

MITIE is built on top of dlib, a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems. MITIE’s primary API is a C API.

Features include:

  • Uses several state-of-the-art techniques, including distributional word embeddings and Structural Support Vector Machines.
  • Several pre-trained models providing varying levels of support for English, Spanish, and German, trained using a variety of linguistic resources.
  • Comes with a basic streaming Named Entity Recognition (NER) tool. Its NER implementation is designed for bulk data processing at high speeds.
  • Can be compiled as a shared library.
  • Can be compiled using OpenBLAS.
  • Usable from a Python 2.7 program, from R, from a C program, from a C++ program, and from a Java program.
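
As a rough illustration of the Python route listed above, the sketch below runs named entity extraction on one sentence. It assumes the `mitie` Python module is importable and that a pre-trained English model sits at `MITIE-models/english/ner_model.dat`. That path, and the helper names `entity_texts` and `extract`, are illustrative assumptions rather than part of MITIE. The calls `named_entity_extractor`, `tokenize`, and `extract_entities` follow the example scripts shipped with MITIE, so check them against the version you have installed.

```python
import os


def entity_texts(tokens, entities):
    """Turn MITIE-style (token_range, tag, ...) tuples into (tag, text) pairs.

    Hypothetical helper: it only reads the first two fields of each entity,
    so it works whether or not a confidence score is included.
    """
    results = []
    for entity in entities:
        token_range, tag = entity[0], entity[1]
        words = [tokens[i] for i in token_range]
        # Some bindings return tokens as bytes; normalise everything to str.
        words = [w.decode("utf-8") if isinstance(w, bytes) else w for w in words]
        if isinstance(tag, bytes):
            tag = tag.decode("utf-8")
        results.append((tag, " ".join(words)))
    return results


def extract(text, model_path):
    """Load a model and return (tag, text) pairs for the entities in text."""
    # Imported lazily so entity_texts() can be used without MITIE installed.
    from mitie import named_entity_extractor, tokenize

    ner = named_entity_extractor(model_path)
    tokens = tokenize(text)
    return entity_texts(tokens, ner.extract_entities(tokens))


if __name__ == "__main__":
    # Assumed model location; adjust to wherever the models were unpacked.
    model = "MITIE-models/english/ner_model.dat"
    if os.path.exists(model):
        sentence = "Davis King develops dlib and MITIE."
        for tag, span in extract(sentence, model):
            print(tag + ": " + span)
    else:
        print("Model file not found: " + model)
```

Loading the model is the slow step, so a longer-running program would normally create the extractor once and reuse it across many documents rather than calling `extract` repeatedly as this sketch does.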

Support: mitie-trainer – an interactive, browser-based model training tool for MITIE
Developer: Davis E. King and contributors
License: Boost Software License
