Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

models

Officially maintained, supported by PaddlePaddle, including CV, NLP, S...

2900   6922   6922  

DeepPavlov

An open source library for deep learning end-to-end dialog systems and...

1157   6808   6808  

FinGPT

Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥...

1640   6770   6770  

WantWords

An open-source online reverse dictionary.

587   6654   6654  

mycroft-core

Mycroft Core, the Mycroft Artificial Intelligence platform.

1286   6561   6561  

ansj_seg

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人...

2317   6510   6510  

TensorFlow-2.x-Tutorials

TensorFlow 2.x version's Tutorials and Examples, including CNN, RNN,...

2229   6375   6375  

nlp.js

An NLP library for building bots, with entity extraction, sentiment an...

620   6363   6363  

tensorflow_cookbook

Code for Tensorflow Machine Learning Cookbook

2410   6245   6245  

nlp-recipes

Natural Language Processing Best Practices & Examples

898   6159   6159  

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understandi...

1181   6059   6059  

ERNIE

Official implementations for various pre-training models of ERNIE-fami...

1252   5898   5898  

smile

Statistical Machine Intelligence & Learning Engine

1118   5764   5764  

BERT-pytorch

Google AI 2018 BERT pytorch implementation

1231   5578   5578  

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Exampl...

374   5461   5461  

flashtext

Extract Keywords from sentence or Replace keywords in sentences.

608   5417   5417  

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

285   5332   5332  

TagUI

Free RPA tool by AI Singapore

554   4980   4980  

sensitive-word

👮‍♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基...

653   4793   4793  

Synonyms

:herb: 中文近义词:聊天机器人,智能问答工具包

894   4768   4768  

nlpaug

Data augmentation for NLP

466   4516   4516  

practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated a...

1115   4462   4462  

ltp

Language Technology Platform

1019   4449   4449  

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

557   4446   4446  

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

149   4361   4361  

machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks o...

667   4194   4194  

Awesome-ChatGPT

ChatGPT资料汇总学习,持续更新......

385   4129   4129  

d2l-pytorch

This project reproduces the book Dive Into Deep Learning (https://d2l....

1225   4058   4058  

donut

Official Implementation of OCR-free Document Understanding Transformer...

302   3986   3986  

pytorch-sentiment-analysis

Tutorials on getting started with PyTorch and TorchText for sentiment...

1117   3962   3962  

aim

Aim 💫 — An easy-to-use & supercharged open-source AI metadata tracker...

243   3925   3925  

snips-nlu

Snips Python library to extract meaning from text

511   3908   3908  

franc

Natural language detection

196   3873   3873  

libpostal

A C library for parsing/normalizing street addresses around the world....

397   3733   3733  

Dive-into-DL-TensorFlow2.0

本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改...

826   3638   3638  

OpenPrompt

An Open-Source Framework for Prompt-Learning.

390   3530   3530  

Bard-API

The unofficial python package that returns response of Google Bard thr...

453   3523   3523  

ml-workspace

🛠 All-in-one web-based IDE specialized for machine learning and data s...

457   3476   3476  

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language model...

314   3398   3398  

text

Models, data loaders and abstractions for language processing, powered...

811   3330   3330  

course-nlp

A Code-First Introduction to NLP course

1471   3328   3328  

spark-nlp

State of the Art Natural Language Processing

661   3321   3321  

picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

430   3320   3320  

CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

548   3294   3294  

vale

:pencil: A syntax-aware linter for prose built with speed and extensib...

121   3265   3265  

Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], Llama-7B tuned with C...

299   3256   3256  

sumy

Module for automatic summarization of text documents and HTML pages.

512   3196   3196  

courses

This repository is a curated collection of links to various courses an...

262   3181   3181  

nlp-roadmap

ROADMAP(Mind Map) and KEYWORD for students those who have interest in...

507   3076   3076  

prose

:book: A Golang library for text processing, including tokenization, p...

159   3000   3000