Apache clinical Text Analysis and Knowledge Extraction System (cTAKES) is an open-source natural language processing system for information extraction from electronic medical record clinical free-text.
Curated link list for practical natural language processing in Ruby.
topic modelling platform.
BLLIP Natural Language Parser (also known as the Charniak-Johnson parser).
Python bindings for the BLLIP Natural Language Parser (also known as the Charniak-Johnson parser). [Deprecated]
is a numerical processing library for Scala.
is a natural language processing library. [Deprecated]
provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA. [Deprecated]
Natural Language Processing in Clojure (opennlp).
The Classical Language Toolkit.
This project collects a number of core libraries for Natural Language Processing (NLP) developed in the University of Illinois' Cognitive Computation Group, for example illinois-core-utilities which provides a set of NLP-friendly data structures and a number of NLP-related utilities that support writing NLP applications, running experiments, etc, illinois-edison a library for feature extraction from illinois-core-utilities data structures and many other packages.
C++ library, command line tools, and Python binding for extracting and working with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way.
Add structure to unstructured text using a GUI.
Stanford provides a set of natural language analysis tools which can take raw English language text input and give the base forms of words.
A julia package providing a variety of loaders for various NLP corpora.
Retina: an API performing complex NLP operations (disambiguation, classification, streaming text filtering, etc...) as quickly and intuitively as the brain.
CRF++ - Open source implementation of Conditional Random Fields (CRFs) for segmenting/labeling sequential data & other Natural Language Processing tasks. [Deprecated]
is an implementation of Conditional Random Fields (CRFs) for labeling sequential data. [Deprecated]
A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
conversational AI library with many pretrained Russian NLP models.
Levenshtein and Hamming computation. [Deprecated]
Reading Wikipedia to answer open-domain questions.
Incremental update tool for UITableView and UICollectionView
fast implementation of edit distance.
Functions and data dependencies for loading various word
is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
Memory-based NLP suite developed for Dutch: PoS tagger, lemmatiser, dependency parser, NER, shallow parser, morphological analyzer.
Fuzzy String Matching in Python.
A Chinese segment base on Conditional Random Field.
In-memory n-gram index with compression. [Deprecated]
A native Go clean room implementation of the Porter Stemming algorithm. [Deprecated]
Rails-like inflection library for Clojure and ClojureScript.
Cortical.io's FREE NLP, Retina API Analysis Tool (written in JavaFX!) - See the Tutorial Video.
a python library for doing approximate and phonetic matching of strings.
Chinese Words Segmentation Utilities.
A PHP port of Python's jieba. Chinese text segmentation for natural language processing.
A Natural Language Processor in JS.
A Python package for Korean natural language processing.
Julia package for working with various human
C++ library for the FoLiA format
A tool kit for processing text using computational linguistics.
Another Chinese segmentation library. [Deprecated]
A Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
A modern C++ data sciences toolkit. [MIT] website
C, C++, and Python tools for named entity recognition and relation extraction
is a semantic parsing library for Scala with an easy-to-use DSL.
General language facilities for node.
Named-entity recognition using neural networks providing state-of-the-art-results
An library for Matlab.
Natural Language processing in the browser.
An NLP library built in node over Natural, with entity extraction, sentiment analysis, automatic language identify, and so more
The project provides software and resources for natural language processing. The project started at the Center for Computational Language and EducAtion Research, and is currently developed by the Center for Language and Information Research at Emory University. [Deprecated]
A leading platform for building Python programs to work with human language data.
Natural language Understanding Toolkit. [Deprecated]
a machine learning based toolkit for the processing of natural language text.
Golang implementation of the Paice/Husk Stemming Algorithm. [Deprecated]
A web mining module for the Python programming language. It has tools for natural language processing, machine learning, among others.
A better version of Jieba, developed by Peking University.
Multilingual text (NLP) processing toolkit.
Python Natural Language Processing Library. General purpose NLP library for Python. Also contains some specific modules for parsing common NLP formats, most notably for FoLiA, but also ARPA language models, Moses phrasetables, GIZA++ alignments.
Python package that implements a novel white-box machine learning model for text classification, called SS3. Since SS3 has the ability to visually explain its rationale, this package also comes with easy-to-use interactive visualizations tools (online demos).
Python interface for converting Penn Treebank trees to Stanford Dependencies.
Python binding to Frog, an NLP suite for Dutch. (pos tagging, lemmatisation, dependency parsing, NER)
Python binding to ucto (a unicode-aware rule-based tokenizer for various languages).
Python bindings for ZPar, a statistical part-of-speech-tagger, constiuency parser, and dependency parser for English.
A python framework to transform natural language questions to queries in a database query language.
turn natural language into structured data.
raspell is an interface binding for ruby. [Deprecated]
Extensible system for analyzing and manipulating natural language.
Text processing tools and wrappers (e.g. Vowpal Wabbit)
is a suite of machine learning and numerical computing libraries.