Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B

144   531   531  

headlines

Automatically generate headlines to short articles

152   527   527  

text_summurization_abstractive_methods

Multiple implementations for abstractive text summurization , using go...

219   526   526  

poplar

A web-based annotation tool for natural language processing (NLP)

140   524   524  

awesome-tensorflow-2

👉 Tensorflow 2.x resources such as tutorial, blog, code and videos

102   523   523  

awesome-arabic

A curated list of awesome projects and dev/design resources for suppor...

97   521   521  

Deep-Semantic-Similarity-Model

My Keras implementation of the Deep Semantic Similarity Model (DSSM)/C...

186   520   520  

EventExtractionPapers

A list of NLP resources focused on event extraction task

90   519   519  

bigbird

Transformers for Longer Sequences

96   516   516  

Goopt

🔍 Search Engine for a Procedural Simulation of the Web with GPT-3.

37   516   516  

awesome-sentiment-analysis

Repository with all what is necessary for sentiment analysis and relat...

103   514   514  

OmniNet

Official Pytorch implementation of "OmniNet: A unified architecture fo...

58   512   512  

prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

51   511   511  

LightAutoML

Fast and customizable framework for automatic ML model creation (AutoM...

31   510   510  

datasets-server

Lightweight web API for visualizing and exploring all types of dataset...

37   509   509  

python-stanford-corenlp

Python interface to CoreNLP using a bidirectional server-client interf...

110   509   509  

machine-learning-articles

Monthly Series - Top 10 Machine Learning Articles

40   508   508  

llm_training_handbook

An open collection of methodologies to help with successful training o...

43   508   508  

CPM-Live

Live Training for Open-source Big Models

39   506   506  

MedCAT

Medical Concept Annotation Tool

112   505   505  

awesome-data-annotation

A list of tools for annotating data, managing annotations, etc.

60   504   504  

BertSimilarity

Computing similarity of two sentences with google's BERT algorithm。利...

69   504   504  

oie-resources

A curated list of Open Information Extraction (OIE) resources: papers,...

59   499   499  

pytorch-bert-crf-ner

KoBERT와 CRF로 만든 한국어 개체명인식기 (BERT+CRF based Named Entity R...

109   499   499  

attn2d

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequenc...

75   497   497  

prodigy-recipes

🍳 Recipes for the Prodigy, our fully scriptable annotation tool

115   496   496  

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and to...

66   492   492  

Event-Extraction

基于法律裁判文书的事件抽取及其应用,包括数据的分词、词性标注、命名实体...

129   491   491  

matchbox

Write PyTorch code at the level of individual examples, then run it ef...

25   488   488  

code_search

Code For Medium Article: "How To Create Natural Language Semantic Sear...

137   488   488  

allennlp-models

Officially supported AllenNLP models

158   488   488  

tomotopy

Python package of Tomoto, the Topic Modeling Tool

57   487   487  

chinese_text_cnn

TextCNN Pytorch实现 中文文本分类 情感分析

109   487   487  

subreddit-analyzer

A comprehensive Data and Text Mining workflow for submissions and comm...

43   485   485  

mexican-government-report

Text Mining on the 2019 Mexican Government Report, covering from extra...

85   481   481  

detecting-fake-text

Giant Language Model Test Room

112   480   480  

Deta_Parser

快速中文分词分析word segmentation

97   479   479  

large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for tr...

21   479   479  

php-text-analysis

PHP Text Analysis is a library for performing Information Retrieval (I...

83   477   477  

fastT5

⚡ boost inference speed of T5 models by 5x & reduce the model size by...

63   477   477  

pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Lan...

68   476   476  

cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules inc...

144   476   476  

Basic4AI

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

77   476   476  

Legal-Text-Analytics

A list of selected resources, methods, and tools dedicated to Legal Te...

97   476   476  

awesome-bangla

A collection of tools, datasets and resources on Bangla computing

187   474   474  

clifs

Contrastive Language-Image Forensic Search allows free text searching...

52   474   474  

cope

A modern IDE for writing classical Chinese poetry 格律诗编辑程序

48   473   473  

chinese_dictionary

同义词表,反义词表,否定词表

197   473   473  

pubmed_parser

:clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDL...

147   472   472  

Text-Classification-Models-Pytorch

Implementation of State-of-the-art Text Classification Models in Pytor...

134   471   471