Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

prodigy-recipes

🍳 Recipes for the Prodigy, our fully scriptable annotation tool

108   410   410  

nlp-notebook

Implementations of common NLP tasks, including new word discovery, PyTorch-based word vectors, Chinese...

93   410   410  

awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, pre-trained...

14   408   408  

ThoughtSource

A central, open resource for data and tools related to chain-of-though...

28   407   407  

anlp19

Course repo for Applied Natural Language Processing (Spring 2019)

105   405   405  

ResourceBank_CV_NLP_MLOPS_2022

This repository offers a goldmine of materials for students of compute...

89   404   404  

nlpnet

A neural network architecture for NLP tasks, using cython for fast per...

105   403   403  

cookiecutter-spacy-fastapi

Cookiecutter API for creating Custom Skills for Azure Search using Pyt...

54   401   401  

tf-seq2seq

Sequence to sequence learning using TensorFlow.

111   392   392  

medaCy

🏥 Medical Text Mining and Information Extraction with spaCy

88   390   390  

clause

🏇 Chatbot, natural language understanding, semantic understanding

117   389   389  

pyswip

PySwip is a Python - SWI-Prolog bridge that enables querying SWI-Prolog in...

90   387   387  

beginner_nlp

A curated list of beginner resources in Natural Language Processing

83   383   383  

ArticutAPI

API of Articut, a Chinese word segmenter (with semantic part-of-speech tagging): word segmentation (「斷詞」, also called 「分詞」) is...

33   383   383  

trade-dst

Source code for transferable dialogue state generator (TRADE, Wu et al...

114   382   382  

nlp

[UNMAINTAINED] Extract values from strings and fill your structs with n...

35   381   381  

NLP_bahasa_resources

A Curated List of Dataset and Usable Library Resources for NLP in Baha...

114   378   378  

Awesome-Distributed-Deep-Learning

A curated list of awesome Distributed Deep Learning resources.

82   376   376  

pykakasi

Lightweight converter from Japanese Kana-kanji sentences into Kana-Rom...

49   371   371  

awesome-financial-nlp

Research on Natural Language Processing for the Financial Domain

55   367   367  

gcn-over-pruned-trees

Graph Convolution over Pruned Dependency Trees Improves Relation Extra...

71   366   366  

NLP101

NLP 101: a resource repository for Deep Learning and Natural Language...

58   366   366  

link-grammar

The CMU Link Grammar natural language parser

117   366   366  

Deep-Generative-Models-for-Natural-Language-Processing

DGMs for NLP. A roadmap.

32   364   364  

Matterport3DSimulator

AI Research Platform for Reinforcement Learning from Real Panoramic Im...

119   363   363  

DeCLUTR

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learn...

35   358   358  

SimBiber

A tool used by the MLNLP community to help shorten reference lists. A tool for simplifying bibtex wit...

28   358   358  

cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (...

80   357   357  

Machine-Learning-Notebooks

Machine Learning notebooks for refreshing concepts.

195   357   357  

machine-learning-resources

A curated list of awesome machine learning frameworks, libraries, cour...

122   355   355  

adam_qas

ADAM - A Question Answering System. Inspired by IBM Watson

108   354   354  

Artificial-Intelligence-And-Data-Science-Pro

Regularly Updated | Collection of the best Data Science and AI Materi...

147   354   354  

malaya

Natural Language Toolkit for bahasa Malaysia, https://malaya.readthed...

118   352   352  

FakeNewsCorpus

A dataset of millions of news articles scraped from a curated list of...

95   352   352  

tensorlayer-tricks

How to use TensorLayer

63   347   347  

NNDIAL

NNDial is an open source toolkit for building end-to-end trainable tas...

106   346   346  

displacy

💥 displaCy.js: An open-source NLP visualiser for the modern web

82   344   344  

tacred-relation

PyTorch implementation of the position-aware attention model for relat...

97   344   344  

nagisa

A Japanese tokenizer based on recurrent neural networks

19   339   339  

contextualSpellCheck

✔️Contextual word checker for better suggestions

48   339   339  

MedCAT

Medical Concept Annotation Tool

90   338   338  

Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, includ...

3   337   337  

Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

38   336   336  

airy

💬 Open Source App Framework to build streaming apps with real-time da...

46   335   335  

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and to...

51   334   334  

Dynamic-memory-networks-in-Theano

Implementation of Dynamic memory networks by Kumar et al. http://arxiv...

111   333   333  

AwesomeFakeNews

This repository contains recent research on fake news.

79   332   332  

low-resource-languages

Resources for conservation, development, and documentation of low reso...

58   332   332  

efaqa-corpus-zh

❤️Emotional First Aid Dataset, a corpus for psychological counseling Q&A and chatbots

54   332   332  

chakin

Simple downloader for pre-trained word vectors

48   331   331