Natural Language Processing

MITIE: MIT Information Extraction

MITIE: MIT Information Extraction offers state-of-the-art information extraction tools.

There are tools for performing named entity extraction and binary relation detection as well as tools for training custom extractors and relation detectors.

The core MITIE software is written in C++, but bindings for several other software languages including Python, R, Java, C, and MATLAB allow a user to quickly integrate MITIE into his/her own applications.

MITIE is built on top of dlib, a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems. MITIE’s primary API is a C API.

Key Features

  • Uses several state-of-the-art techniques including the use of distributional word embeddings and Structural Support Vector Machines.
  • Several pre-trained models providing varying levels of support for both English, Spanish, and German trained using a variety of linguistic resources.
  • Comes with a basic streaming Named Entity Recognition (NER) tool. Its NER implementation is designed for bulk data processing at high speeds.
  • Compile MITIE as a shared library.
  • Compile MITIE using OpenBLAS.
  • Use MITIE from a Python 2.7 program, from R, from a C program, from a C++ program, and from a Java program.

Website: github.com/mit-nlp/MITIE
Support: mitie-trainer – an interactive, browser-based model training tool for MITIE
Developer: Davis E. King and contributors
License: Boost Software License

Learn C++ with our recommended free books and free tutorials.


Related Software

C++ Natural Language Processing Tools
text2vecFramework with API for text analysis and natural language processing
MosesStatistical machine translation system
TiMBLTilburg Memory-Based Learner
MITIEMIT Information Extraction
MeTAModern C++ data sciences toolkit
Colibri CoreEfficient n-gram & skipgram modelling on text corpora
CRF++Yet Another CRF toolkit
BLLIP ParserStatistical natural language parser

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted