Most popular NLP repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers process and analyze human (natural) language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have achieved state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

text_classification

all kinds of text classification models and more with deep learning

2567   7930   7930  

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

835   7594   7594  

stanza

Stanford NLP Python library for tokenization, sentence segmentation, N...

909   7562   7562  

Awesome-Chinese-NLP

A curated list of resources for Chinese NLP

1697   7338   7338  

NeMo

NeMo: a toolkit for conversational AI

1664   7206   7206  

WantWords

An open-source online reverse dictionary.

624   7091   7091  

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

1725   7030   7030  

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2884   6930   6930  

DeepPavlov

An open source library for deep learning end-to-end dialog systems and...

1165   6917   6917  

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

803   6633   6633  

mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

1301   6601   6601  

ansj_seg

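Ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed surpass the open-source version of ICT. Chinese word segmentation, ...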

2317   6510   6510  

nlp.js

An NLP library for building bots, with entity extraction, sentiment an...

632   6486   6486  

nlp-recipes

Natural Language Processing Best Practices & Examples

915   6424   6424  

TensorFlow-2.x-Tutorials

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN,...

2229   6384   6384  

tensorflow_cookbook

Code for Tensorflow Machine Learning Cookbook

2405   6263   6263  

smile

Statistical Machine Intelligence & Learning Engine

1144   6228   6228  

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understandi...

1168   6182   6182  

courses

This repository is a curated collection of links to various courses an...

562   6182   6182  

TagUI

Free RPA tool by AI Singapore

619   5978   5978  

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

312   5970   5970  

ERNIE

Official implementations for various pre-training models of ERNIE-fami...

1252   5898   5898  

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

349   5722   5722  

BERT-pytorch

Google AI 2018 BERT pytorch implementation

1231   5578   5578  

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval a...

506   5430   5430  

flashtext

Extract Keywords from sentence or Replace keywords in sentences.

608   5417   5417  

Bard-API

The unofficial python package that returns response of Google Bard thr...

516   5251   5251  

ltp

Language Technology Platform

1057   5163   5163  

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, base...

523   4909   4909  

machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks o...

822   4887   4887  

sensitive-word

👮‍♂️ The sensitive word tool for Java. (Sensitive words / prohibited words / illegal words / profanity. ...

653   4793   4793  

Synonyms

🌿 Chinese synonyms: a toolkit for chatbots and intelligent question answering

894   4768   4768  

OpenPrompt

An Open-Source Framework for Prompt-Learning.

476   4699   4699  

argilla

Argilla is a collaboration tool for AI engineers and domain experts to...

445   4625   4625  

nlpaug

Data augmentation for NLP

470   4600   4600  

practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated a...

1092   4550   4550  

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment...

1183   4546   4546  

libpostal

A C library for parsing/normalizing street addresses around the world....

450   4532   4532  

argos-translate

Open-source offline translation library written in Python

317   4332   4332  

ml-road

Machine Learning Resources, Practice and Research

1598   4325   4325  

d2l-pytorch

This project reproduces the book Dive Into Deep Learning (https://d2l....

1238   4319   4319  

llm-foundry

LLM training code for Databricks foundation models

573   4297   4297  

franc

Natural language detection

181   4289   4289  

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language model...

384   4178   4178  

Awesome-ChatGPT

A collection of ChatGPT learning resources, continuously updated......

389   4163   4163  

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Cra...

288   4109   4109  

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Constr...

726   4062   4062  

spark-nlp

State of the Art Natural Language Processing

729   4023   4023  

donut

Official Implementation of OCR-free Document Understanding Transformer...

302   3986   3986  

snips-nlu

Snips Python library to extract meaning from text

512   3917   3917