Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Basic4AI

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

77   476   476  

Legal-Text-Analytics

A list of selected resources, methods, and tools dedicated to Legal Te...

97   476   476  

awesome-bangla

A collection of tools, datasets and resources on Bangla computing

187   474   474  

chinese_dictionary

同义词表,反义词表,否定词表

197   473   473  

pubmed_parser

:clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDL...

147   472   472  

Text-Classification-Models-Pytorch

Implementation of State-of-the-art Text Classification Models in Pytor...

134   471   471  

PFL-Non-IID

Personalized federated learning simulation platform with non-IID and u...

124   471   471  

LMaaS-Papers

Awesome papers on Language-Model-as-a-Service (LMaaS)

30   470   470  

transformers-bloom-inference

Fast Inference Solutions for BLOOM

91   470   470  

kaggle-HomeDepot

3rd Place Solution for HomeDepot Product Search Results Relevance Comp...

210   467   467  

sacremoses

Python port of Moses tokenizer, truecaser and normalizer

55   467   467  

PaperRobot

Code for PaperRobot: Incremental Draft Generation of Scientific Ideas

136   466   466  

Giveme5W1H

Extraction of the journalistic five W and one H questions (5W1H) from...

86   466   466  

oie-resources

A curated list of Open Information Extraction (OIE) resources: papers,...

55   465   465  

pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Lan...

64   464   464  

Jiayan

甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言...

66   464   464  

caiss

一款简单好用的 跨平台/多语言的 相似向量/相似词/相似句 高性能检索引擎。...

61   463   463  

small-text

Active Learning for Text Classification in Python

48   463   463  

cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules inc...

147   461   461  

CS224n-winter-together

an Open Course Platform for Stanford CS224n (2020 Winter)

156   461   461  

searchGPT

Grounded search engine (i.e. with source reference) based on LLM / Cha...

48   461   461  

beto

BETO - Spanish version of the BERT model

62   460   460  

whatlies

Toolkit to help understand "what lies" in word embeddings. Also benchm...

51   460   460  

TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feed...

56   459   459  

tianchi_nl2sql

追一科技首届中文NL2SQL挑战赛决赛第3名方案+代码

143   458   458  

examples

Jina examples and demos to help you get started

152   453   453  

ConvoKit

ConvoKit is a toolkit for extracting conversational features and analy...

107   452   452  

hierarchical-attention-networks

Document classification with Hierarchical Attention Networks in Tensor...

148   451   451  

bert-embedding

🔡 Token level embeddings from BERT model on mxnet and gluonnlp

68   450   450  

cope

A modern IDE for writing classical Chinese poetry 格律诗编辑程序

48   449   449  

node-question-answering

Fast and production-ready question answering in Node.js

48   448   448  

Styleformer

A Neural Language Style Transfer framework to transfer natural languag...

63   448   448  

PromptKG

PromptKG Family: a Gallery of Prompt Learning & KG-related research wo...

52   448   448  

codequestion

🔎 Semantic search for developers

47   446   446  

Prompt4ReasoningPapers

Repository for the ACL2023 paper "Reasoning with Language Model Prompt...

37   440   440  

awesome-arabic

A curated list of awesome projects and dev/design resources for suppor...

93   437   437  

pytorch-bert-crf-ner

KoBERT와 CRF로 만든 한국어 개체명인식기 (BERT+CRF based Named Entity R...

104   437   437  

Transformers.jl

Julia Implementation of Transformer models

54   437   437  

happy-transformer

A package built on top of Hugging Face's transformers library that mak...

56   436   436  

deep_learning_NLP

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectur...

105   433   433  

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

66   432   432  

nlquery

Natural Language Engine on WikiData

76   429   429  

DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transf...

124   427   427  

nlp-papers-with-arxiv

Statistics and accepted paper list of NLP conferences with arXiv link

53   426   426  

autoprompt

AutoPrompt: Automatic Prompt Construction for Masked Language Models.

63   426   426  

indonlu

The first-ever vast natural language processing benchmark for Indonesi...

174   423   423  

FinBERT

A Pretrained BERT Model for Financial Communications. https://arxiv.or...

108   422   422  

epidemic-sentence-pair

天池 疫情相似句对判定大赛 线上第一名方案

73   421   421  

wego

Word Embeddings (e.g. Word2Vec) in Go!

37   418   418  

keytotext

Keywords to Sentences

59   417   417