Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and analyze human (natural) language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a test of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

Kashgari

Kashgari is a production-level NLP Transfer learning framework built o...

437   2392   2392  

text2vec

text2vec, text to vector. A text vector representation tool that converts text into vector matrices, implementing...

249   2384   2384  

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

377   2380   2380  

electra

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Gene...

349   2358   2358  

CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

404   2353   2353  

rasa_core

Rasa Core is now part of the Rasa repo: An open source machine learnin...

1010   2340   2340  

RL4LMs

A modular RL library to fine-tune language models to human preferences

200   2336   2336  

news-please

news-please - an integrated web crawler and information extractor for...

443   2310   2310  

scattertext

Beautiful visualizations of how language differs among document types.

289   2308   2308  

Medical_NLP

Medical NLP Competition, dataset, large models, paper

424   2301   2301  

awesome-chatgpt

🧠 A curated list of awesome ChatGPT resources, including libraries, S...

170   2283   2283  

mt-dnn

Multi-Task Deep Neural Networks for Natural Language Understanding

415   2253   2253  

awesome-sentence-embedding

A curated list of pretrained sentence and word embedding models

262   2248   2248  

Introduction-NLP

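Detailed notes on "Introduction to Natural Language Processing", the new book by the author of HanLP! An honest, high-quality work; the book is not...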

546   2240   2240  

Linly

Chinese-LLaMA and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese Ope...

180   2233   2233  

textacy

NLP, before and after spaCy

248   2230   2230  

PyTorch-NLP

Basic Utilities for PyTorch Natural Language Processing (NLP)

254   2222   2222  

bootcamp

Dealing with all unstructured data, such as reverse image search, audi...

639   2199   2199  

uda

Unsupervised Data Augmentation (UDA)

311   2196   2196  

lazynlp

Library to scrape and clean web pages to create massive datasets.

311   2193   2193  

pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase...

333   2188   2188  

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

257   2158   2158  

sparseml

Libraries for applying sparsification recipes to neural networks with...

157   2147   2147  

Information-Extraction-Chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation E...

820   2112   2112  

sru

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

305   2101   2101  

ABSA-PyTorch

Aspect Based Sentiment Analysis, PyTorch Implementations. Aspect-based...

529   2084   2084  

kcws

Deep Learning Chinese Word Segment

643   2080   2080  

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeN...

297   2048   2048  

ecco

Explain, analyze, and visualize NLP language models. Ecco creates inte...

174   2044   2044  

PyTorchNLPBook

Code and data accompanying Natural Language Processing with PyTorch pu...

813   2043   2043  

The-NLP-Pandect

A comprehensive reference for all topics related to Natural Language P...

283   2021   2021  

chatgpt.js

🤖 A powerful, open source client-side JavaScript library for ChatGPT

159   1984   1984  

HarvestText

Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, key...

314   1978   1978  

beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, eval...

218   1953   1953  

DeepLearningForNLPInPytorch

An IPython Notebook tutorial on deep learning for natural language pro...

461   1941   1941  

sling

SLING - A natural language frame semantics parser

266   1931   1931  

docspell

Assist in organizing your piles of documents, resulting from scanners,...

150   1903   1903  

named_entity_recognition

Chinese named entity recognition (with concrete implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF...

512   1853   1853  

DeepLearn

Implementation of research papers on Deep Learning + NLP + CV in Python...

350   1828   1828  

spago

Self-contained Machine Learning and Natural Language Processing librar...

88   1819   1819  

alpaca_eval

An automatic evaluator for instruction-following language models. Huma...

283   1817   1817  

awesome-bert

BERT NLP papers, applications and GitHub resources, including the new...

350   1814   1814  

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

102   1801   1801  

NLP-Models-Tensorflow

Gathers machine learning and TensorFlow deep learning models for NLP p...

723   1787   1787  

BERT-NER-Pytorch

Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)

398   1785   1785  

NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers

457   1785   1785  

Awesome-pytorch-list-CNVersion

Awesome-pytorch-list translation work in progress......

403   1780   1780  

spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library

306   1775   1775  

kaggle-CrowdFlower

1st Place Solution for CrowdFlower Product Search Results Relevance Co...

657   1770   1770  

transfer-learning-conv-ai

🦄 State-of-the-Art Conversational AI with Transfer Learning

433   1754   1754