Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

UnifiedSKG

[EMNLP 2022] A Unified Framework and Analysis for Structured Knowledge...

50   417   417  

dialogflow-javascript-client

JavaScript Web SDK for Dialogflow

174   414   414  

DL_Topics

List of DL topics and resources essential for cracking interviews

54   412   412  

USC-DS-RelationExtraction

Distantly Supervised Relation Extraction

112   412   412  

tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT와 같...

204   412   412  

bricks

Open-source natural language enrichments at your fingertips.

17   412   412  

adaptnlp

An easy to use Natural Language Processing library and framework for p...

39   411   411  

nlp

Selected Machine Learning algorithms for natural language processing a...

46   411   411  

nlp-notebook

NLP 领域常见任务的实现,包括新词发现、以及基于pytorch的词向量、中文文...

93   410   410  

nlpnet

A neural network architecture for NLP tasks, using cython for fast per...

104   409   409  

ThoughtSource

A central, open resource for data and tools related to chain-of-though...

28   407   407  

anlp19

Course repo for Applied Natural Language Processing (Spring 2019)

105   405   405  

ArticutAPI

API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中...

39   405   405  

ResourceBank_CV_NLP_MLOPS_2022

This repository offers a goldmine of materials for students of compute...

89   404   404  

clause

:horse_racing: 聊天机器人,自然语言理解,语义理解

119   403   403  

cookiecutter-spacy-fastapi

Cookiecutter API for creating Custom Skills for Azure Search using Pyt...

54   401   401  

tf-seq2seq

Sequence to sequence learning using TensorFlow.

109   390   390  

medaCy

:hospital: Medical Text Mining and Information Extraction with spaCy

88   390   390  

FakeNewsCorpus

A dataset of millions of news articles scraped from a curated list of...

97   389   389  

beginner_nlp

A curated list of beginner resources in Natural Language Processing

83   383   383  

trade-dst

Source code for transferable dialogue state generator (TRADE, Wu et al...

114   382   382  

nlp

[UNMANTEINED] Extract values from strings and fill your structs with n...

35   381   381  

NLP_bahasa_resources

A Curated List of Dataset and Usable Library Resources for NLP in Baha...

114   378   378  

Awesome-Distributed-Deep-Learning

A curated list of awesome Distributed Deep Learning resources.

82   376   376  

airy

💬 Open Source App Framework to build streaming apps with real-time d...

44   376   376  

pykakasi

Lightweight converter from Japanese Kana-kanji sentences into Kana-Rom...

49   371   371  

awesome-financial-nlp

Researches for Natural Language Processing for Financial Domain

55   367   367  

gcn-over-pruned-trees

Graph Convolution over Pruned Dependency Trees Improves Relation Extra...

71   366   366  

NLP101

NLP 101: a resource repository for Deep Learning and Natural Language...

58   366   366  

link-grammar

The CMU Link Grammar natural language parser

117   366   366  

Deep-Generative-Models-for-Natural-Language-Processing

DGMs for NLP. A roadmap.

32   364   364  

Matterport3DSimulator

AI Research Platform for Reinforcement Learning from Real Panoramic Im...

119   363   363  

DeCLUTR

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learn...

35   358   358  

SimBiber

MLNLP社区用来帮助缩短参考文献的工具。A tool for simplifying bibtex wit...

28   358   358  

cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (...

80   357   357  

Machine-Learning-Notebooks

Machine Learning notebooks for refreshing concepts.

195   357   357  

machine-learning-resources

A curated list of awesome machine learning frameworks, libraries, cour...

122   355   355  

adam_qas

ADAM - A Question Answering System. Inspired from IBM Watson

108   354   354  

Artificial-Intelligence-And-Data-Science-Pro

Regularly Updated | Collection of the best Data Science and AI Materi...

147   354   354  

malaya

Natural Language Toolkit for bahasa Malaysia, https://malaya.readthed...

118   352   352  

tensorlayer-tricks

How to use TensorLayer

62   346   346  

NNDIAL

NNDial is an open source toolkit for building end-to-end trainable tas...

106   346   346  

displacy

:boom: displaCy.js: An open-source NLP visualiser for the modern web

82   344   344  

tacred-relation

PyTorch implementation of the position-aware attention model for relat...

97   344   344  

pyss3

A Python package implementing a new interpretable machine learning mod...

44   340   340  

nagisa

A Japanese tokenizer based on recurrent neural networks

19   339   339  

contextualSpellCheck

✔️Contextual word checker for better suggestions

48   339   339  

MedCAT

Medical Concept Annotation Tool

90   338   338  

Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, includ...

3   337   337  

coursera-natural-language-processing-specialization

Programming assignments from all courses in the Coursera Natural Langu...

330   336   336