Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science concerned with how computers process, analyze, and generate human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence," which proposed a criterion of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

AI-Job-Notes

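Job-hunting guide for AI algorithm positions (covering preparation strategies, practice-problem guides, internal referrals, a list of AI companies, and other materials)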

652   5441   5441  

Deep-Learning-Interview-Book

Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing...

1133   5328   5328  

datascience

This repository is a compilation of free resources for learning Data S...

524   5102   5102  

nlpaug

Data augmentation for NLP

466   4516   4516  

practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated a...

1115   4462   4462  

ltp

Language Technology Platform

1019   4449   4449  

Data-science

Collection of useful data science topics along with articles, videos,...

1029   4076   4076  

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

715   4053   4053  

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment...

1117   3962   3962  

MatchZoo

Facilitating the design, comparison and sharing of deep text matching...

898   3853   3853  

arXivTimes

repository to research & share the machine learning articles

209   3787   3787  

libpostal

A C library for parsing/normalizing street addresses around the world....

397   3733   3733  

olivia

💁‍♀️Your new best friend powered by an artificial neural network

355   3686   3686  

textract

extract text from any document. no muss. no fuss.

533   3546   3546  

OpenPrompt

An Open-Source Framework for Prompt-Learning.

390   3530   3530  

zhihu

This repo contains the source code in my personal column (https://zhua...

2161   3440   3440  

spark-nlp

State of the Art Natural Language Processing

661   3321   3321  

vale

📝 A syntax-aware linter for prose built with speed and extensib...

121   3265   3265  

catalyst

Accelerated deep learning R&D

399   3150   3150  

marqo

Vector search for humans.

124   3143   3143  

lit

The Learning Interpretability Tool: Interactively analyze ML models to...

327   3085   3085  

nlp-roadmap

ROADMAP (Mind Map) and KEYWORD for students who have an interest in...

507   3076   3076  

prose

📖 A Golang library for text processing, including tokenization, p...

159   3000   3000  

nlp_tasks

Natural Language Processing Tasks and References

565   2998   2998  

DeepNLP-models-Pytorch

Pytorch implementations of various Deep NLP models in cs-224n(Stanford...

659   2954   2954  

MITIE

MITIE: library and tools for information extraction

539   2934   2934  

pyhanlp

Chinese word segmentation

789   2905   2905  

fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still i...

448   2866   2866  

thinc

🔮 A refreshing functional take on deep learning, compatible with your...

277   2833   2833  

machine-learning

Made to help machine learning beginners and those preparing for a study group: r...

864   2690   2690  

knockknock

🚪✊Knock Knock: Get notified when your training ends with only two ad...

225   2625   2625  

MTBook

《机器翻译:基础与模型》 by Xiao Tong and Zhu Jingbo - Machine Translation: Foundati...

828   2589   2589  

gluon-nlp

NLP made easy

540   2505   2505  

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Mode...

443   2475   2475  

TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data aug...

324   2409   2409  

texar

Toolkit for Machine Learning, Natural Language Processing, and Text Ge...

382   2353   2353  

FLAML

A fast library for AutoML and tuning. Join our Discord: https://discor...

356   2341   2341  

decaNLP

The Natural Language Decathlon: A Multitask Challenge for NLP

429   2301   2301  

DeepInterests

Deeply interesting

420   2290   2290  

Top-AI-Conferences-Paper-with-Code

MLNLP: This repository is a collection of AI top conferences papers (e...

593   2290   2290  

scattertext

Beautiful visualizations of how language differs among document types.

292   2285   2285  

argilla

✨Argilla: the open-source data curation platform for LLMs

213   2280   2280  

JioNLP

Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. A Chinese NLP Preprocess...

299   2244   2244  

PyTorch-NLP

Basic Utilities for PyTorch Natural Language Processing (NLP)

257   2217   2217  

lazynlp

Library to scrape and clean web pages to create massive datasets.

311   2179   2179  

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

361   2177   2177  

uda

Unsupervised Data Augmentation (UDA)

315   2130   2130  

turkce-yapay-zeka-kaynaklari

Carried out in Turkey: deep learning and machine learning (...

423   2124   2124  

practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real...

1637   2111   2111  

ML

A high-level machine learning and deep learning library for the PHP la...

187   2095   2095