Most popular natural-language-processing repositories and open-source projects

Natural language processing (NLP) is a field of computer science and linguistics concerned with how computers process and analyze human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a criterion of intelligence now called the Turing test. More recent techniques, such as deep learning, have achieved state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

odin-slides

This is an advanced Python tool that empowers you to effortlessly draf...

20   136   136  

awesome-machine-learning

:book: List of some awesome university courses for Machine Learning!...

47   136   136  

NLP

Natural Language Processing For Everyone

100   136   136  

nested-ner-tacl2020-transformers

Implementation of Nested Named Entity Recognition using BERT

24   136   136  

awesome-papers

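A collection of papers from top journals and conferences in machine learning, deep learning, natural language processing, and computer vision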

35   134   134  

sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., se...

12   134   134  

BLINK_Benchmark

This repo contains evaluation code for the paper "BLINK: Multimodal La...

7   134   134  

awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refresh...

11   134   134  

nlp_workshop_odsc_europe20

Extensive tutorials for the Advanced NLP Workshop in Open Data Science...

63   133   133  

spring

SPRING is a seq2seq model for Text-to-AMR and AMR-to-Text (AAAI2021).

28   133   133  

nlp-with-pytorch

For the source code of "Natural Language Processing with PyTorch" (Hanbit Media, 2021)...

64   132   132  

natural-language-preprocessings

Some recipes of natural language pre-processing

26   132   132  

Bible_Text_GCN

PyTorch implementation of "Graph Convolutional Networks for Text Class...

34   132   132  

bert_nli

A Natural Language Inference (NLI) model based on Transformers (BERT a...

27   132   132  

spf

Cornell Semantic Parsing Framework

13   131   131  

xlnet_extension_tf

XLNet Extension in TensorFlow

26   131   131  

Distill-BERT-Textgen

Research code for ACL 2020 paper: "Distilling Knowledge Learned in BER...

21   131   131  

awesome-bert-japanese

📝 A list of pre-trained BERT models for Japanese with word/subword to...

7   131   131  

wink-nlp-utils

NLP Functions for amplifying negations, managing elisions, creating ng...

11   131   131  

ml-classify-text-js

Machine learning based text classification in JavaScript using n-grams...

11   131   131  

HumanPrompt

A framework for human-readable prompt-based method with large language...

9   131   131  

Knowledge-Conflicts-Survey

[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge...

6   131   131  

awesome-llm-os

A curated list of awesome resources, tools, research papers, and proje...

8   130   130  

FinBERT-QA

Financial Domain Question Answering with pre-trained BERT Language Mod...

28   129   129  

crossnorm-selfnorm

CrossNorm and SelfNorm for Generalization under Distribution Shifts, I...

7   129   129  

Spider2-V

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automatin...

7   129   129  

awesome-ai4lam

A list of awesome AI in libraries, archives, and museum collections fr...

11   129   129  

LSR

PyTorch Implementation of our ACL 2020 Paper "Reasoning with Latent S...

21   129   129  

klue-transformers-tutorial

HuggingFace Transformers tutorial using KLUE data

16   129   129  

keita

My personal toolkit for PyTorch development.

11   128   128  

NLPCC-WordSeg-Weibo

NLPCC 2016 Weibo word segmentation evaluation project

45   128   128  

cotk

Conversational Toolkit. An Open-Source Toolkit for Fast Development an...

26   128   128  

phrase-at-scale

Detect common phrases in large amounts of text using a data-driven app...

45   128   128  

dialogue-understanding

This repository contains PyTorch implementation for the baseline model...

21   128   128  

lingfeat

[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction...

16   128   128  

mlmm-evaluation

Multilingual Large Language Models Evaluation Benchmark

18   128   128  

chat-to-your-database

Chat to your database with AI. An experimental app to test the abiliti...

25   128   128  

awesome-artificial-intelligence-research

A curated list of Artificial Intelligence (AI) Research, tracks the cu...

14   127   127  

FlexPrefill

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse At...

7   127   127  

LLM-GenAI-Transformers-Notebooks

A repository containing all the LLM notebooks with tutorial and proje...

26   127   127  

awesome-active-learning-New

Active Learning Awesome Paper

2   127   127  

cmrc2019

A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CM...

33   127   127  

spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy

55   126   126  

papernotes

My personal notes and surveys on DL, CV and NLP papers.

6   126   126  

Chinese_NLU_by_using_RASA_NLU

Use RASA NLU to build a Chinese natural language understanding (NLU) system

32   126   126  

chatbot-samples

🤖 Chatbot samples, custom chatbots, chatbot corpus import and export

43   126   126  

optimum-transformers

Accelerated NLP pipelines for fast inference on CPU and GPU. Built wit...

8   126   126  

contrastive-active-learning

Code for the EMNLP 2021 Paper "Active Learning by Acquiring Contrastiv...

13   126   126  

gptsh

GPT.sh is a CLI tool built with Node.js and powered by OpenAI's GPT-3....

13   126   126  

DeepAligned-Clustering

Discovering New Intents with Deep Aligned Clustering (AAAI 2021)

18   125   125