Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

829   7531   7531  

Awesome-Chinese-NLP

A curated list of resources for Chinese NLP 中文自然语言处理相关资料

1697   7338   7338  

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Product...

610   7207   7207  

NeMo

NeMo: a toolkit for conversational AI

1664   7206   7206  

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

1725   7030   7030  

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2885   6929   6929  

DeepPavlov

An open source library for deep learning end-to-end dialog systems and...

1165   6917   6917  

WantWords

An open-source online reverse dictionary.

587   6654   6654  

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

803   6633   6633  

mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

1287   6574   6574  

ansj_seg

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人...

2317   6510   6510  

nlp-recipes

Natural Language Processing Best Practices & Examples

915   6424   6424  

TensorFlow-2.x-Tutorials

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN,...

2229   6384   6384  

nlp.js

An NLP library for building bots, with entity extraction, sentiment an...

620   6363   6363  

tensorflow_cookbook

Code for Tensorflow Machine Learning Cookbook

2405   6263   6263  

smile

Statistical Machine Intelligence & Learning Engine

1144   6228   6228  

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understandi...

1168   6182   6182  

courses

This repository is a curated collection of links to various courses an...

557   6133   6133  

TagUI

Free RPA tool by AI Singapore

619   5978   5978  

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

312   5970   5970  

ERNIE

Official implementations for various pre-training models of ERNIE-fami...

1252   5898   5898  

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

349   5722   5722  

BERT-pytorch

Google AI 2018 BERT pytorch implementation

1231   5578   5578  

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Exampl...

374   5461   5461  

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval a...

506   5430   5430  

flashtext

Extract Keywords from sentence or Replace keywords in sentences.

608   5417   5417  

ltp

Language Technology Platform

1048   5075   5075  

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, base...

523   4909   4909  

machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks o...

822   4887   4887  

sensitive-word

👮‍♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基...

653   4793   4793  

Synonyms

:herb: 中文近义词:聊天机器人,智能问答工具包

894   4768   4768  

OpenPrompt

An Open-Source Framework for Prompt-Learning.

475   4692   4692  

nlpaug

Data augmentation for NLP

466   4537   4537  

practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated a...

1115   4462   4462  

argilla

Argilla is a collaboration tool for AI engineers and domain experts to...

420   4422   4422  

argos-translate

Open-source offline translation library written in Python

317   4332   4332  

ml-road

Machine Learning Resources, Practice and Research

1598   4325   4325  

d2l-pytorch

This project reproduces the book Dive Into Deep Learning (https://d2l....

1238   4319   4319  

llm-foundry

LLM training code for Databricks foundation models

573   4297   4297  

libpostal

A C library for parsing/normalizing street addresses around the world....

432   4214   4214  

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language model...

384   4178   4178  

Awesome-ChatGPT

ChatGPT资料汇总学习,持续更新......

389   4163   4163  

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Cra...

288   4109   4109  

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Constr...

726   4062   4062  

spark-nlp

State of the Art Natural Language Processing

729   4019   4019  

donut

Official Implementation of OCR-free Document Understanding Transformer...

302   3986   3986  

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment...

1117   3962   3962  

snips-nlu

Snips Python library to extract meaning from text

512   3917   3917  

franc

Natural language detection

196   3873   3873  

Dive-into-DL-TensorFlow2.0

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改...

820   3816   3816