Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.

Repositories (1431)

portuguese-bert neuralmind-ai Python

Portuguese pre-trained BERT models

868
transformers-tutorials abhimishra91 Jupyter Notebook

GitHub repo with tutorials on fine-tuning transformers for different NLP tasks

864
Natural-Language-Processing-Specialization amanjeetsahu Jupyter Notebook

This repo contains my coursework, assignments, and slides for the Natural Language Processing Specialization by deeplearning.ai on Coursera

857
DL-NLP-Readings 26hzhang TeX

My Reading Lists of Deep Learning and Natural Language Processing

856
language-detection patrickschur PHP

A language detection library for PHP. Detects the language from a given text string.

853
spacy-streamlit explosion Python

👑 spaCy building blocks and visualizers for Streamlit apps

853
PIXIU The-FinAI Jupyter Notebook

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and eva...

851
catalyst curiosity-ai C#

🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box supp...

843
alpaca_farm tatsu-lab Python

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

842
hate-speech-and-offensive-language t-davidson Jupyter Notebook

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

839
DataAug4NLP styfeng

Collection of papers and resources for data augmentation for NLP.

833
causal-text-papers causaltext

Curated research at the intersection of causal inference and natural language processing.

815
lingua pemistahl Kotlin

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

806
OCTIS MIND-Lab Python

OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track)

802
Opus-MT Helsinki-NLP Python

Open neural machine translation models and web services

802
trankit nlp-uoregon Python

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

795
papermage allenai Python

Library supporting NLP and CV research on scientific papers

794
AI-Notes wx-chevalier Jupyter Notebook

📚 [.md & .ipynb] Series of Artificial Intelligence & Deep Learning, including Mathematics Fundamentals, Python Practices, NLP Application, etc....

775
OpenAttack thunlp Python

An Open-Source Package for Textual Adversarial Attack.

773
awesome-persian-nlp-ir mhbashari

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

770
AI-Job-Recommend amusi

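Job postings for artificial intelligence positions (including machine learning, deep learning, computer vision, and natural language processing) at companies in China, covering full-time roles, internships, and campus recruitment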

768
Instruction-Tuning-Papers SinclairCoder

Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).

767
LightMem zjunlp Python

[ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation

762
Failed-ML kennethleungty

Compilation of high-profile real-world examples of failed machine learning projects

750
DNABERT jerryji1993 Python

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

750
keras-attention datalogue Python

Visualizing RNNs using the attention mechanism

749
texar-pytorch asyml Python

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: ht...

747
FewRel thunlp Python

A Large-Scale Few-Shot Relation Extraction Dataset

747
spacy-stanza explosion Python

💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy

747
efaqa-corpus-zh chatopera Python

❤️ Emotional First Aid Dataset: a corpus of psychological counseling Q&A and chatbot dialogue

747
primeqa primeqa Python

The prime repository for state-of-the-art Multilingual Question Answering research and development.

739
Awesome-Papers-Autonomous-Agent lafmdp

A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.

739
COMET Unbabel Python

A Neural Framework for MT Evaluation

739
PromptKG zjunlp Python

PromptKG Family: a Gallery of Prompt Learning & KG-related research works, toolkits, and paper-list.

734
talisman Yomguithereal JavaScript

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

726
SqueezeLLM SqueezeAILab Python

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

718
deeplearning-guide sannykim

An evolving guide to learning Deep Learning effectively.

716
acl-anthology acl-org Python

Data and software for building the ACL Anthology.

709
Courses- salimt Jupyter Notebook

Answers for Quizzes & Assignments that I have taken

706
AI-Job-Resume amusi

Resume templates for AI algorithm engineer positions

704
chat Decalogue Python

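A chatbot based on natural language understanding and machine learning, supporting concurrent multi-user sessions and custom multi-turn dialogue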

701
Books JiashuWu

My book list

701
advanced-machine-learning-engineer-roadmap-2024 farukalamai

A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine lear...

698
SkyChat-Chinese-Chatbot-GPT3 SkyWorkAIGC C#

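SkyChat is a chatbot project based on a Chinese GPT-3 API. Like ChatGPT, it can handle tasks such as human-machine chat, question answering, Chinese-English translation, couplet matching, and classical poetry writing.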

694
azooKey azooKey Swift

azooKey is an open-source Japanese keyboard for iPhone and iPad, written in Swift and powered by its own kana-kanji conversion engine. It provides liv...

692
Me_Bot Spandan-Madan Jupyter Notebook

Build a bot that speaks like you!

687
SkillNet zjunlp Python

Create, Evaluate, and Connect AI Skills

687
Matterport3DSimulator peteanderson80 C++

AI Research Platform for Reinforcement Learning from Real Panoramic Images.

686
CS224n hankcs Python

CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017

684
SpeedTorch Santosh-Gupta Python

Library for faster pinned CPU <-> GPU transfer in PyTorch

682