Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence," which proposed a measure of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Repositories (1261)

NER-pytorch
NER-pytorch ZhixiuYe Python

LSTM+CRF NER

295
ark-nlp
ark-nlp xiangking Python

A private nlp coding package, which quickly implements the SOTA solutions.

295
deepsegment
deepsegment notAI-tech Python

A sentence segmenter that actually works!

294
komputation
komputation sekwiatkowski Kotlin

Komputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.

293
yargy
yargy natasha Python

Rule-based facts extraction for Russian language

292
rc-cnn-dailymail
rc-cnn-dailymail danqi Python

CNN/Daily Mail Reading Comprehension Task

291
transfer-nlp
transfer-nlp feedly Python

NLP library designed for reproducible experimentation management

291
nlp-data-augmentation
nlp-data-augmentation quincyliang

Data Augmentation for NLP.

291
cargo-spellcheck
cargo-spellcheck drahnr Rust

Checks all your documentation for spelling and grammar mistakes with hunspell and an nlprule-based checker for grammar

291
question_generator
question_generator AMontgomerie Python

An NLP system for generating reading comprehension questions

291
BOND
BOND cliang1453 Python

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision

291
hanlp-lucene-plugin
hanlp-lucene-plugin hankcs Java

HanLP Chinese word segmentation plugin for Lucene, supporting Lucene-based systems including Solr

289
text-classification
text-classification javedsha Jupyter Notebook

Machine Learning and NLP: Text Classification using python, scikit-learn and NLTK

289
kss
kss hyunwoongko Python

Kss: A Toolkit for Korean sentence segmentation

289
NSC
NSC thunlp Python

Neural Sentiment Classification

288
Kevinpro-NLP-demo
Kevinpro-NLP-demo Ricardokevins Python

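All NLP you Need Here. Currently contains PyTorch implementations of 15 NLP demos (much of the code is borrowed from other open-source projects; it started as a personal project and was later open-sourced)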

288
SAPConversationalAI
SAPConversationalAI SAP-archive

✨ 🤖 🤖 Build your own conversational bot on our Collaborative Bot Platform! 🤖🤖 ✨

287
Text-Classification
Text-Classification renjunxiang Python

A natural language processing project whose goal is to classify text.

287
Kiwi
Kiwi bab2min C++

Kiwi (an intelligent Korean morphological analyzer)

286
languagecrunch
languagecrunch artpar Python

LanguageCrunch NLP server docker image

285
RNNSharp
RNNSharp zhongkaifu C#

RNNSharp is a toolkit for deep recurrent neural networks, which are widely used for many different kinds of tasks, such as sequence labeling, sequence-to-...

285
textnets
textnets jboynyc Python

Text analysis with networks.

285
similarities
similarities shibing624 Python

Similarities: a toolkit for similarity calculation and semantic search. Supports semantic similarity calculation and matching search for both text and images, and works out of the box.

284
behemoth
behemoth DigitalPebble Java

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

283
pyate
pyate kevinlu1248 HTML

PYthon Automated Term Extraction

283
discopy
discopy discopy Python

The Python toolkit for computing with string diagrams.

283
multifit
multifit n-waves Jupyter Notebook

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761

282
Customer-Chatbot
Customer-Chatbot WenRichard Python

Chinese intelligent customer-service chatbot demo with two parts, chit-chat and professional Q&A, and support for custom components...

282
pixel
pixel xplip Python

Research code for pixel-based encoders of language (PIXEL)

282
Multi-Type-TD-TSR
Multi-Type-TD-TSR Psarpei Jupyter Notebook

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition

280
snow-owl
snow-owl b2ihealthcare Java

Snow Owl Terminology Server - a production-ready, scalable, FHIR Terminology Service compliant server that supports SNOMED CT International and...

280
xk-time
xk-time xkzhangsan Java

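xk-time is a tool for time conversion, time calculation, time formatting, time parsing, calendars, time cron expressions, time NLP, and more. It uses Java 8 (JSR-310), is thread-safe and easy to use, with more than 70 commonly used date...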

279
NLP-Vietnamese-progress
NLP-Vietnamese-progress undertheseanlp

Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most commo...

279
shared_colab_notebooks
shared_colab_notebooks mrm8488 Jupyter Notebook

A Repo to store the Google Colaboratory Notebooks that I have created and shared

279
Data-Science-EBooks
Data-Science-EBooks data-science-projects-and-resources

Data Science E-books, Interview Resources and Cheat-sheets

278
Web-Database-Analytics
Web-Database-Analytics tirthajyoti Jupyter Notebook

Web scraping and related analytics using Python tools

277
fancy-nlp
fancy-nlp boat-group Python

NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.

276
Text-Summarization-Repo
Text-Summarization-Repo uoneway

A repository that organizes the main research topics in text summarization, must-read papers, and available models and data, together with recommended resources.

276
nlp-tutorial
nlp-tutorial bonzanini Jupyter Notebook

Tutorial: Natural Language Processing in Python

275
PyTorch-Batch-Attention-Seq2seq
PyTorch-Batch-Attention-Seq2seq AuCson Python

PyTorch implementation of batched bi-RNN encoder and attention-decoder.

275
Taisite-Platform
Taisite-Platform amazingTest Vue

The most powerful API testing platform

275
THUTag
THUTag thunlp Java

A Package of Keyphrase Extraction and Social Tag Suggestion

273
gobbli
gobbli RTIInternational Python

Deep learning with text doesn't have to be scary.

272
AHANLP
AHANLP jsksxs360 Java

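AHA natural language processing package, providing services including word segmentation, dependency parsing, semantic role labeling, automatic summarization, semantic similarity calculation, LDA topic prediction, and word clouds.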

272
open-semantic-etl
open-semantic-etl opensemanticsearch Python

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity R...

271
browser-ml-inference
browser-ml-inference jobergum Jupyter Notebook

Edge Inference in Browser with Transformer NLP model

271
pytorch-question-answering
pytorch-question-answering kushalj001 Jupyter Notebook

Important paper implementations for Question Answering using PyTorch

270
extreme-bert
extreme-bert extreme-bert Python

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A...

269
awesome-semantic-search
awesome-semantic-search Agrover112

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

269
BRIO
BRIO yixinL7 Python

ACL 2022: BRIO: Bringing Order to Abstractive Summarization

268