Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Deep neural models for core NLP tasks (Pytorch version)
:hospital: Medical Text Mining and Information Extraction with spaCy
:satellite: Organized Resources for Deep Learning in Natural Language Processing
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman.
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Interpretability for sequence generation models 🐛 🔍
Statistics and accepted paper list of NLP conferences with arXiv link
A curated list of awesome Distributed Deep Learning resources.
Researches for Natural Language Processing for Financial Domain
TextAugment: Text Augmentation Library
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Resources for conservation, development, and documentation of low resource (human) languages.
This repository offers a goldmine of materials for students of computer vision, natural language processing, and machine learning operations.
Distantly Supervised Relation Extraction
A blazingly fast and lightweight language detection library for Rust
✔️Contextual word checker for better suggestions (not actively maintained)
Projects and useful articles / links
API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,...
Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites
JavaScript Web SDK for Dialogflow
An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean stru...
Course repo for Applied Natural Language Processing (Spring 2019)
A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.
:horse_racing: 聊天机器人,自然语言理解,语义理解
A Japanese tokenizer based on recurrent neural networks
The CMU Link Grammar natural language parser
A dataset of millions of news articles scraped from a curated list of data sources.
🐍 Hand-picked awesome Python libraries and frameworks, organised by category
💬 Open Source App Framework to build streaming apps with real-time data - 💎 Build real-time data pipelines and make real-time data universally acc...
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Levera...
NLP 101: a resource repository for Deep Learning and Natural Language Processing
Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743
DGMs for NLP. A roadmap.
Sequence to sequence learning using TensorFlow.
[UNMANTEINED] Extract values from strings and fill your structs with nlp.
CIKM2023 Best Demo Paper Award. HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊
Code for "Learning to Model the World with Language." ICML 2024 Oral.
Cantonese Linguistics and NLP
A comprehensive, unified and modular event extraction toolkit.
Korean HateSpeech Dataset
🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)
A curated list of beginner resources in Natural Language Processing
No-Frills Functional Programming Lib Augmenting Javascript/Node.js
Zero and Few shot named entity & relationships recognition
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue...
NLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datas...
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)