Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1261)

tokenizers
tokenizers ropensci R

Fast, Consistent Tokenization of Natural Language Text

185
sandbox-topically
sandbox-topically cohere-ai Jupyter Notebook

Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.

185
uniem
uniem wangyuxinwhy Python

unified embedding model

184
Recurrent-Convolutional-Neural-Network-Text-Classifier
Recurrent-Convolutional-Neural-Network-Text-Classifier airalcorn2 Python

My (slightly modified) Keras implementation of the Recurrent Convolutional Neural Network (RCNN) described here: http://www.aaai.org/ocs/index.php/AAA...

183
Guyu
Guyu lipiji Python

Chinese GPT2: pre-training and fine-tuning framework for text generation

183
coreference-resolution
coreference-resolution shayneobrien Perl

Efficient and clean PyTorch reimplementation of "End-to-end Neural Coreference Resolution" (Lee et al., EMNLP 2017).

183
dna2vec
dna2vec pnpnpn Python

dna2vec: Consistent vector representations of variable-length k-mers

183
spacy-universal-sentence-encoder
spacy-universal-sentence-encoder MartinoMensio Python

Google USE (Universal Sentence Encoder) for spaCy

183
hntitlenator
hntitlenator victorqribeiro JavaScript

Test your HN title against a neural network

182
gpt4-playground
gpt4-playground Nashex TypeScript

Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.

182
multihead-siamese-nets
multihead-siamese-nets tlatkowski Jupyter Notebook

Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.

182
R-NET-in-Keras
R-NET-in-Keras YerevaNN Python

Open R-NET (hy` առնետ 🐁) implementation and detailed analysis: https://git.io/vd8dx

181
spacymoji
spacymoji explosion Python

💙 Emoji handling and meta data for spaCy with custom extension attributes

181
FinBERT
FinBERT psnonis C++

BERT for Finance : UC Berkeley MIDS w266 Final Project

181
2017-CCF-BDCI-AIJudge
2017-CCF-BDCI-AIJudge ShawnyXiao Jupyter Notebook

2017-CCF-BDCI-让AI当法官(初赛):7th/415 (Top 1.68%)

180
sentence-splitter
sentence-splitter mediacloud Python

Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.

180
ParseLawDocuments
ParseLawDocuments FanhuaandLuomu Python

对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、案件聚类、法律条文推荐等(试验目前基于婚姻类案件,可扩展至其它领域)。

179
MultiModalStory-demo
MultiModalStory-demo EdenBD Python

FairyTailor: Multimodal Generative Framework for Storytelling

179
python_autocomplete
python_autocomplete labmlai Jupyter Notebook

Use Transformers and LSTMs to learn Python source code

178
KR-BERT
KR-BERT snunlp Python

KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch

178
Python_Natural_Language_Processing
Python_Natural_Language_Processing milaan9 Jupyter Notebook

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP...

178
Magento-Chatbot
Magento-Chatbot blopa PHP

Magento Chatbot Integration with Telegram, Messenger, Whatsapp, WeChat, Skype and wit.ai.

177
xatkit
xatkit xatkit-bot-platform

The simplest way to build all types of smart chatbots and digital assistants

177
A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling
A-Hierarchical-Latent-Structure-for-Variational-Conversation-Modeling ctr4si Python

PyTorch Implementation of "A Hierarchical Latent Structure for Variational Conversation Modeling" (NAACL 2018 Oral)

176
nlp_research
nlp_research zhufz Python

NLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/序列标注/文本生成 四大任务

176
AI-Competition-Collections
AI-Competition-Collections SWHL Python

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)

176
ML
ML sherlcok314159 Jupyter Notebook

此仓库将介绍Deep Learning 所需要的基础知识以及NLP方面的模型原理到项目实操 : )

176
text-detector
text-detector s3nh Python

Tool which allow you to detect and translate text.

175
APE
APE Attempto Prolog

Parser for Attempto Controlled English (ACE)

175
ner-slot_filling
ner-slot_filling GaoQ1 Python

中文自然语言的实体抽取和意图识别(Natural Language Understanding),可选Bi-LSTM + CRF 或者 IDCNN + CRF

175
awesome-generative-information-retrieval
awesome-generative-information-retrieval gabriben
175
AI-NLP-Paper-Readings
AI-NLP-Paper-Readings zhongpeixiang

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

175
easy-bert
easy-bert robrua Java

A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)

175
LiveActionMap
LiveActionMap kinshukdua Python

An attempt to map the areas with active conflict in Ukraine using twitter data and NLP.

175
ruijin_round2
ruijin_round2 beader Jupyter Notebook

瑞金医院MMC人工智能辅助构建知识图谱大赛复赛

174
awesome-data-science-viz
awesome-data-science-viz quantmind

:boom: :chart_with_upwards_trend: A curated list of data science, analysis and visualization tools

174
pymetamap
pymetamap AnthonyMRios Python

Python wraper for MetaMap

174
pytorch-acnn-model
pytorch-acnn-model dgai91 Python

code of Relation Classification via Multi-Level Attention CNNs

172
keras-xlnet
keras-xlnet CyberZHG Python

Implementation of XLNet that can load pretrained checkpoints

171
Hash-Embeddings
Hash-Embeddings YannDubs Python

PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.

171
ML-DL-scripts
ML-DL-scripts Diyago Jupyter Notebook

The repository provides usefull python scripts for ML and data analysis

171
NLP-pretrained-model
NLP-pretrained-model balavenkatesh3322

A collection of Natural language processing pre-trained models.

170
RobBERT
RobBERT iPieter Jupyter Notebook

A Dutch RoBERTa-based language model

170
SolrTextTagger
SolrTextTagger OpenSextant Java

A text tagger based on Lucene / Solr, using FST technology

169
VDCNN
VDCNN cjiang2 Python

Implementation of Very Deep Convolutional Neural Network for Text Classification

169
All4NLP
All4NLP hscspring Jupyter Notebook

All For NLP, especially Chinese.

169
monkeylearn-python
monkeylearn-python monkeylearn Python

Official Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.

169
turkish-morphology
turkish-morphology google-research Python

A two-level morphological analyzer for Turkish.

169
wefe
wefe dccuchile Python

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings...

169
genie-toolkit
genie-toolkit stanford-oval TypeScript

The Genie open source kit for voice assistant (formerly known as Almond)

169