Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1402)

relik
relik SapienzaNLP Python

Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)

446
NeuroNLP2
NeuroNLP2 XuezheMax Python

Deep neural models for core NLP tasks (Pytorch version)

441
medaCy
medaCy NLPatVCU Python

:hospital: Medical Text Mining and Information Extraction with spaCy

439
Deep-Learning-NLP
Deep-Learning-NLP astorfi Python

:satellite: Organized Resources for Deep Learning in Natural Language Processing

436
cmrc2018
cmrc2018 ymcui Python

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

436
pykakasi
pykakasi miurahr Python

Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman.

434
machine-learning-resources
machine-learning-resources datascienceid

A curated list of awesome machine learning frameworks, libraries, courses, books and many more.

432
inseq
inseq inseq-team Python

Interpretability for sequence generation models 🐛 🔍

432
nlp-papers-with-arxiv
nlp-papers-with-arxiv roomylee Jupyter Notebook

Statistics and accepted paper list of NLP conferences with arXiv link

431
Awesome-Distributed-Deep-Learning
Awesome-Distributed-Deep-Learning bharathgs

A curated list of awesome Distributed Deep Learning resources.

431
awesome-financial-nlp
awesome-financial-nlp icoxfog417

Researches for Natural Language Processing for Financial Domain

426
textaugment
textaugment dsfsi Python

TextAugment: Text Augmentation Library

425
ChineseBLUE
ChineseBLUE alibaba-research Python

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

423
low-resource-languages
low-resource-languages RichardLitt TeX

Resources for conservation, development, and documentation of low resource (human) languages.

422
ResourceBank_CV_NLP_MLOPS_2022
ResourceBank_CV_NLP_MLOPS_2022 ashishpatel26 Jupyter Notebook

This repository offers a goldmine of materials for students of computer vision, natural language processing, and machine learning operations.

421
USC-DS-RelationExtraction
USC-DS-RelationExtraction INK-USC C++

Distantly Supervised Relation Extraction

420
whichlang
whichlang quickwit-oss Rust

A blazingly fast and lightweight language detection library for Rust

417
contextualSpellCheck
contextualSpellCheck R1j1t Python

✔️Contextual word checker for better suggestions (not actively maintained)

417
NLP-Natural-Language-Processing
NLP-Natural-Language-Processing ElizaLo Jupyter Notebook

Projects and useful articles / links

416
ArticutAPI
ArticutAPI Droidtown Python

API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,...

413
MedQuAD
MedQuAD abachaa

Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites

412
dialogflow-javascript-client
dialogflow-javascript-client dialogflow TypeScript

JavaScript Web SDK for Dialogflow

411
adaptnlp
adaptnlp Novetta Jupyter Notebook

An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.

411
edgar-crawler
edgar-crawler lefterisloukas Python

The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean stru...

411
anlp19
anlp19 dbamman Jupyter Notebook

Course repo for Applied Natural Language Processing (Spring 2019)

408
nlpnet
nlpnet erickrf Python

A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.

408
clause
clause chatopera C++

:horse_racing: 聊天机器人,自然语言理解,语义理解

407
nagisa
nagisa taishi-i Python

A Japanese tokenizer based on recurrent neural networks

402
link-grammar
link-grammar opencog C

The CMU Link Grammar natural language parser

399
FakeNewsCorpus
FakeNewsCorpus several27

A dataset of millions of news articles scraped from a curated list of data sources.

398
awesome-python
awesome-python dylanhogg

🐍 Hand-picked awesome Python libraries and frameworks, organised by category

398
airy
airy airyhq Java

💬 Open Source App Framework to build streaming apps with real-time data - 💎 Build real-time data pipelines and make real-time data universally acc...

395
customizable-gpt-chatbot
customizable-gpt-chatbot shamspias Python

A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Levera...

394
NLP101
NLP101 Huffon

NLP 101: a resource repository for Deep Learning and Natural Language Processing

392
trade-dst
trade-dst jasonwu0731 Python

Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743

392
Deep-Generative-Models-for-Natural-Language-Processing
Deep-Generative-Models-for-Natural-Language-Processing FranxYao

DGMs for NLP. A roadmap.

391
tf-seq2seq
tf-seq2seq jayparks Python

Sequence to sequence learning using TensorFlow.

389
nlp
nlp shixzie Go

[UNMANTEINED] Extract values from strings and fill your structs with nlp.

389
HugNLP
HugNLP HugAILab Python

CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊

389
dynalang
dynalang jlin816 Python

Code for "Learning to Model the World with Language." ICML 2024 Oral.

388
pycantonese
pycantonese jacksonllee Python

Cantonese Linguistics and NLP

388
OmniEvent
OmniEvent THU-KEG Python

A comprehensive, unified and modular event extraction toolkit.

387
korean-hate-speech
korean-hate-speech kocohub

Korean HateSpeech Dataset

386
awesome-bioie
awesome-bioie caufieldjh

🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)

385
beginner_nlp
beginner_nlp gutfeeling

A curated list of beginner resources in Natural Language Processing

384
scriptum
scriptum robotroutine JavaScript

No-Frills Functional Programming Lib Augmenting Javascript/Node.js

383
zshot
zshot IBM Python

Zero and Few shot named entity & relationships recognition

383
DeCLUTR
DeCLUTR JohnGiorgi Python

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue...

380
FinNLP-Progress
FinNLP-Progress YangLinyi

NLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datas...

374
gcn-over-pruned-trees
gcn-over-pruned-trees qipeng Python

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)

373