Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Fast, Consistent Tokenization of Natural Language Text
Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.
unified embedding model
My (slightly modified) Keras implementation of the Recurrent Convolutional Neural Network (RCNN) described here: http://www.aaai.org/ocs/index.php/AAA...
Chinese GPT2: pre-training and fine-tuning framework for text generation
Efficient and clean PyTorch reimplementation of "End-to-end Neural Coreference Resolution" (Lee et al., EMNLP 2017).
dna2vec: Consistent vector representations of variable-length k-mers
Google USE (Universal Sentence Encoder) for spaCy
Test your HN title against a neural network
Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.
Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
Open R-NET (hy` առնետ 🐁) implementation and detailed analysis: https://git.io/vd8dx
💙 Emoji handling and meta data for spaCy with custom extension attributes
BERT for Finance : UC Berkeley MIDS w266 Final Project
2017-CCF-BDCI-让AI当法官(初赛):7th/415 (Top 1.68%)
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、案件聚类、法律条文推荐等(试验目前基于婚姻类案件,可扩展至其它领域)。
FairyTailor: Multimodal Generative Framework for Storytelling
Use Transformers and LSTMs to learn Python source code
KoRean based BERT pre-trained models (KR-BERT) for Tensorflow and PyTorch
This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP...
Magento Chatbot Integration with Telegram, Messenger, Whatsapp, WeChat, Skype and wit.ai.
The simplest way to build all types of smart chatbots and digital assistants
PyTorch Implementation of "A Hierarchical Latent Structure for Variational Conversation Modeling" (NAACL 2018 Oral)
NLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/序列标注/文本生成 四大任务
AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)
此仓库将介绍Deep Learning 所需要的基础知识以及NLP方面的模型原理到项目实操 : )
Tool which allow you to detect and translate text.
Parser for Attempto Controlled English (ACE)
中文自然语言的实体抽取和意图识别(Natural Language Understanding),可选Bi-LSTM + CRF 或者 IDCNN + CRF
This is my reading list for my PhD in AI, NLP, Deep Learning and more.
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
An attempt to map the areas with active conflict in Ukraine using twitter data and NLP.
瑞金医院MMC人工智能辅助构建知识图谱大赛复赛
:boom: :chart_with_upwards_trend: A curated list of data science, analysis and visualization tools
Python wraper for MetaMap
code of Relation Classification via Multi-Level Attention CNNs
Implementation of XLNet that can load pretrained checkpoints
PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
The repository provides usefull python scripts for ML and data analysis
A collection of Natural language processing pre-trained models.
A Dutch RoBERTa-based language model
A text tagger based on Lucene / Solr, using FST technology
Implementation of Very Deep Convolutional Neural Network for Text Classification
All For NLP, especially Chinese.
Official Python client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Python apps.
A two-level morphological analyzer for Turkish.
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings...
The Genie open source kit for voice assistant (formerly known as Almond)