Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article proposing a measure of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Repositories (1261)

PURE
PURE princeton-nlp Python

NAACL'2021: A Frustratingly Easy Approach for Entity and Relation Extraction https://arxiv.org/abs/2010.12812

712
mordecai
mordecai openeventdata Python

Full text geoparsing as a Python library

709
sequence-labeling-BiLSTM-CRF
sequence-labeling-BiLSTM-CRF scofield7419 JavaScript

A BiLSTM-CRF model implementation in TensorFlow for sequence labeling tasks.

706
chat
chat Decalogue Python

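A chatbot based on natural language understanding and machine learning, supporting concurrent multi-user access and custom multi-turn dialogue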

700
THUOCL
THUOCL thunlp

THUOCL (THU Open Chinese Lexicon), a Chinese lexicon

694
bookcorpus
bookcorpus soskek Python

Crawl BookCorpus

694
Python-ai-assistant
Python-ai-assistant ggeop Python

Python AI assistant 🧠

694
albert_pytorch
albert_pytorch lonePatient Python

A Lite BERT for Self-Supervised Learning of Language Representations

690
WeCron
WeCron polyrabbit JavaScript

✔️ Scheduled reminders on WeChat - Cron on WeChat

689
PatrickStar
PatrickStar Tencent Python

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.

686
SpeedTorch
SpeedTorch Santosh-Gupta Python

Library for faster pinned CPU <-> GPU transfer in PyTorch

685
pypostal
pypostal openvenues C

Python bindings to libpostal for fast international address parsing/normalization

685
magpie
magpie inspirehep Python

Deep neural network framework for multi-label text classification

683
DNABERT
DNABERT jerryji1993 Python

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

682
MacBERT
MacBERT ymcui

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

678
nboost
nboost koursaros-ai Python

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms...

675
Annotated-Semantic-Relationships-Datasets
Annotated-Semantic-Relationships-Datasets davidsbatista

A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)

671
meta
meta meta-toolkit C++

A Modern C++ Data Sciences Toolkit

670
Octopii
Octopii redhuntlabs Python

An AI-powered Personally Identifiable Information (PII) scanner.

668
whatlanggo
whatlanggo abadojack Go

Natural language detection library for Go

664
Awesome-Korean-NLP
Awesome-Korean-NLP datanada

A curated list of resources for NLP (Natural Language Processing) for Korean

661
detoxify
detoxify unitaryai Python

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For ac...

659
tensor_parallel
tensor_parallel BlackSamorez Python

Automatically split your PyTorch models on multiple GPUs for training & inference

658
Deeplearning.ai-Natural-Language-Processing-Specialization
Deeplearning.ai-Natural-Language-Processing-Specialization ibrahimjelliti Jupyter Notebook

This repository contains my full work and notes on Coursera's NLP Specialization (Natural Language Processing) taught by the instructor Younes Bensoud...

655
Med-ChatGLM
Med-ChatGLM SCIR-HI Python

Repo for Chinese Medical ChatGLM: instruction fine-tuning of ChatGLM on Chinese medical knowledge

655
obsidian-ava
obsidian-ava different-ai TypeScript

Quickly format your notes with ChatGPT in Obsidian

654
griptape
griptape griptape-ai Python

Python framework for AI workflows and pipelines with chain of thought reasoning, external tools, and memory.

649
seqGAN
seqGAN suragnair Python

A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)

647
mynlp
mynlp mayabot Java

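A production-grade, high-performance, modular, extensible Chinese NLP toolkit (Chinese word segmentation, averaged perceptron, fastText, pinyin, new word discovery, segmentation error correction, BM25, person name recognition, named entities, custom dictionaries).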

646
COMET
COMET Unbabel Python

A Neural Framework for MT Evaluation

643
nlprule
nlprule bminixhofer Rust

A fast, low-resource Natural Language Processing and Text Correction library written in Rust.

641
VnCoreNLP
VnCoreNLP vncorenlp Java

A Vietnamese natural language processing toolkit (NAACL 2018)

635
word_forms
word_forms gutfeeling Python

Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate", etc.

634
ekphrasis
ekphrasis cbaziotis Python

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word norm...

634
homer
homer wyounas Python

Homer, a text analyser in Python, can help make your text clearer, simpler, and more useful for your readers.

634
nlpia
nlpia totalgood HTML

Examples and libraries for "Natural Language Processing in Action" book

631
RocketQA
RocketQA PaddlePaddle Python

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

625
lexpredict-lexnlp
lexpredict-lexnlp LexPredict Jupyter Notebook

LexNLP by LexPredict

621
small-text
small-text webis-de Python

Active Learning for Text Classification in Python

621
BotLibre
BotLibre BotLibre Java

An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.

620
KoELECTRA
KoELECTRA monologg Python

Pretrained ELECTRA Model for Korean

620
cdQA
cdQA cdqa-suite Python

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.

616
babyai
babyai mila-iqia Python

BabyAI platform. A testbed for training agents to understand and execute language commands.

614
graphbrain
graphbrain graphbrain Python

Language, Knowledge, Cognition

614
Chinese_models_for_SpaCy
Chinese_models_for_SpaCy howl-anderson Jupyter Notebook

Models for SpaCy that support Chinese

612
SmoothNLP
SmoothNLP smoothnlp Java

An NLP Toolset With A Focus on Explainable Inference

612
indonlu
indonlu IndoNLP Jupyter Notebook

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models,...

612
Blackstone
Blackstone ICLRandD Python

⚫ A spaCy pipeline and model for NLP on unstructured legal text.

611
poetry
poetry sheepzh Python

A curated corpus of modern Chinese poetry: 3,489 poets, 81.7K poems, 15.43M characters. Continuously expanding...

610
botonic
botonic hubtype TypeScript

Build chatbots and conversational experiences using React

602