Most popular natural-language-processing repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

zamia-ai

Free and open source A.I. system based on Python, TensorFlow and Prolo...

26   167   167  

PersonaPaper

This is a repository for sharing papers in the field of persona-based...

11   167   167  

transformer-abstractive-summarization

Abstractive Text Summarization using Transformer

48   166   166  

small-doge

Doge Family of Small Language Models

13   166   166  

LaMP

Codes for papers on Large Language Models Personalization (LaMP)

9   166   166  

nlp-startups

국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록

17   166   166  

sling

SLING - A natural language frame semantics parser

11   166   166  

DiscoBERT

Code for paper "Discourse-Aware Neural Extractive Text Summarization"...

30   165   165  

Pre-modern_Chinese_corpus_dataset

近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人...

18   165   165  

prenlp

Preprocessing Library for Natural Language Processing

12   164   164  

spacyfishing

A spaCy wrapper of Entity-Fishing (component) for named entity disambi...

6   164   164  

TwitterScraper

Scrape a User's Twitter data! Bypass the 3,200 tweet API limit for a U...

17   163   163  

AI-Conference-Info

Extensive acceptance rates and information of main AI conferences

6   163   163  

words_counted

A Ruby natural language processor.

28   163   163  

MT-DNN

Multi-Task Deep Neural Networks for Natural Language Understanding

28   163   163  

KoSentenceBERT-ETRI

Sentence Embeddings using Siamese ETRI KoBERT-Networks

24   163   163  

pythonrouge

Python wrapper for evaluating summarization quality by ROUGE package

34   162   162  

spacy-udpipe

spaCy + UDPipe

10   162   162  

TreebankPreprocessing

Python scripts preprocessing Penn Treebank and Chinese Treebank

42   162   162  

RBERT

Implementation of BERT in R

19   161   161  

easse

Easier Automatic Sentence Simplification Evaluation

39   161   161  

Dual-Contrastive-Learning

Code for our paper "Dual Contrastive Learning: Text Classification via...

30   161   161  

byt5-geotagging

Confidence and Byt5 - based geotagging model predicting coordinates fr...

21   161   161  

diffusion-nlp-paper-arxiv

Auto get diffusion nlp papers in Axriv. More papers Information can be...

9   161   161  

pythorch-text-classification

对豆瓣影评进行文本分类情感分析,利用爬虫豆瓣爬取评论,进行数据清洗,分...

9   161   161  

postagga

A Library to parse natural language in pure Clojure and ClojureScript

16   160   160  

DeepLearning_NLP

基于深度学习的自然语言处理库

40   159   159  

gitagpt

Gita GPT A personal productivity assistant (RAG), a platform of AI cha...

40   159   159  

parsinlu

A comprehensive suite of high-level NLP tasks for Persian language

23   158   158  

awesome-ai-services

An overview of the AI-as-a-service landscape

22   158   158  

minicons

Utility for behavioral and representational analyses of Language Model...

38   157   157  

mtdata

A tool that locates, downloads, and extracts machine translation corpo...

25   156   156  

nl4dv

A python toolkit to create Visualizations (Vis) using natural language...

26   156   156  

augmenty

Augmenty is an augmentation library based on spaCy for augmenting text...

11   156   156  

lingo

package lingo provides the data structures and algorithms required for...

15   156   156  

fake-news

Building a fake news detector from initial ideation to model deploymen...

62   155   155  

awesome-AI-tutorials-surveys

A professional list of Tutorials and Surveys on DL, ML, DM, CV, NLP, S...

18   155   155  

sluice-networks

Code for Sluice networks: Learning what to share between loosely relat...

35   154   154  

Deep-Lyrics

Lyrics Generator aka Character-level Language Modeling with Multi-laye...

27   153   153  

Awesome-Mixup

[Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv....

11   153   153  

AttrPrompt

[NeurIPS 2023] This is the code for the paper `Large Language Model as...

13   153   153  

KnowledgeCircuits

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

10   153   153  

Subspace-Tuning

A generalized framework for subspace tuning methods in parameter effic...

5   153   153  

Ask2Transformers

A Framework for Textual Entailment based Zero Shot text classification

13   152   152  

anchoring-ai

An open-source no-code tool for teams to collaborate on building, eval...

31   152   152  

paper-survey

📚 Survey of previous research and related works on machine learning (...

12   152   152  

negapoji

Japanese negative positive classification.日本語文書のネガポジを判定。

74   151   151  

NeuSum

Code for the ACL 2018 paper "Neural Document Summarization by Jointly...

33   150   150  

gpt-paper-title-generator

Generating paper titles (and more!) with GPT trained on data scraped f...

32   150   150  

clicr

Machine reading comprehension on clinical case reports

40   149   149