Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

808   7289   7289  

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Product...

610   7207   7207  

NeMo

NeMo: a toolkit for conversational AI

1664   7206   7206  

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

1725   7030   7030  

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2894   6923   6923  

DeepPavlov

An open source library for deep learning end-to-end dialog systems and...

1160   6846   6846  

WantWords

An open-source online reverse dictionary.

587   6654   6654  

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

803   6633   6633  

mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

1287   6574   6574  

ansj_seg

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人...

2317   6510   6510  

nlp-recipes

Natural Language Processing Best Practices & Examples

917   6405   6405  

TensorFlow-2.x-Tutorials

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN,...

2231   6380   6380  

nlp.js

An NLP library for building bots, with entity extraction, sentiment an...

620   6363   6363  

tensorflow_cookbook

Code for Tensorflow Machine Learning Cookbook

2408   6249   6249  

smile

Statistical Machine Intelligence & Learning Engine

1138   6157   6157  

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understandi...

1181   6059   6059  

courses

This repository is a curated collection of links to various courses an...

554   5949   5949  

ERNIE

Official implementations for various pre-training models of ERNIE-fami...

1252   5898   5898  

BERT-pytorch

Google AI 2018 BERT pytorch implementation

1231   5578   5578  

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

339   5477   5477  

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Exampl...

374   5461   5461  

flashtext

Extract Keywords from sentence or Replace keywords in sentences.

608   5417   5417  

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

285   5332   5332  

ltp

Language Technology Platform

1048   5075   5075  

TagUI

Free RPA tool by AI Singapore

554   4980   4980  

machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks o...

795   4802   4802  

sensitive-word

👮‍♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基...

653   4793   4793  

Synonyms

:herb: 中文近义词:聊天机器人,智能问答工具包

894   4768   4768  

nlpaug

Data augmentation for NLP

466   4537   4537  

practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated a...

1115   4462   4462  

argilla

Argilla is a collaboration tool for AI engineers and domain experts to...

420   4422   4422  

argos-translate

Open-source offline translation library written in Python

317   4332   4332  

libpostal

A C library for parsing/normalizing street addresses around the world....

432   4214   4214  

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language model...

384   4178   4178  

Awesome-ChatGPT

ChatGPT资料汇总学习,持续更新......

385   4129   4129  

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Cra...

288   4109   4109  

d2l-pytorch

This project reproduces the book Dive Into Deep Learning (https://d2l....

1225   4058   4058  

ml-road

Machine Learning Resources, Practice and Research

1509   4021   4021  

donut

Official Implementation of OCR-free Document Understanding Transformer...

302   3986   3986  

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment...

1117   3962   3962  

snips-nlu

Snips Python library to extract meaning from text

512   3917   3917  

franc

Natural language detection

196   3873   3873  

Dive-into-DL-TensorFlow2.0

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改...

826   3638   3638  

text

Models, data loaders and abstractions for language processing, powered...

812   3532   3532  

OpenPrompt

An Open-Source Framework for Prompt-Learning.

390   3530   3530  

Bard-API

The unofficial python package that returns response of Google Bard thr...

453   3523   3523  

ml-workspace

🛠 All-in-one web-based IDE specialized for machine learning and data s...

451   3496   3496  

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based...

268   3461   3461  

course-nlp

A Code-First Introduction to NLP course

1480   3442   3442  

picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

433   3337   3337