Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text
Tool for interactive embeddings visualization
Cybertron: the home planet of the Transformers in Go
PyContinual (An Easy and Extendible Framework for Continual Learning)
This repository contains code related to natural language processing using the Python scripting language. All the code is related to my book entitle...
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
RASA chatbot use case boilerplate
⚡️ Reality OS for Creators
Sentiment analysis library for the Russian language
Meandering In Networks of Entities to Reach Verisimilar Answers
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested Python dictionary.
Repository for Project Insight: NLP as a Service
Fast and Portable Character String Processing in R (with the Unicode ICU)
ByteNet for character-level language modelling
NaturalCC: An Open-Source Toolkit for Code Intelligence
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/boo...
On-device LLM Inference Powered by X-Bit Quantization
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks...
Mishkal is Arabic text vocalization software
Default English stopword lists from many different sources
This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.
Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/MLOps
A list of contrastive learning papers
Text2Text Language Modeling Toolkit
Code samples for the Goodreads datasets
Research and Materials on Hardware implementation of Transformer Model
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
An sklearn wrapper for Google's BERT model
This repository features assignments and projects from the iNeuron full-stack data science course, providing valuable resources for learners to enhanc...
A personal semantic search engine capable of surfacing relevant bookmarks, journal entries, notes, blogs, contacts, and more, built on an efficient do...
AI system design guide for engineers building production AI systems and evals.
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
An NLP system for generating reading comprehension questions
Recent Deep Learning papers in NLU and RL
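All NLP you Need Here. Currently includes PyTorch implementations of 15 NLP demos (much of the code is borrowed from other open-source projects; it started as a personal project and was later open-sourced)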
Course notes for Data Science related topics, prepared in LaTeX
Pre-Trained Models for ToD-BERT
LDA topic modeling for node.js
AI chatbot using Python, TensorFlow, and natural language processing (NLP) alongside TFLearn
[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
An Extensible Continual Learning Framework Focused on Language Models (LMs)
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
ACL 2019: Incorporating Syntactic and Semantic Information in Word Embeddings using Graph Convolutional Networks
A list of Indonesian NLP resources.
Extract knowledge from all information sources using GPT and other language models. Index information sources and run Q&A sessions over them.
Automatically collects diffusion NLP papers from arXiv. More information on the papers can be found in another repository, "Diffusion-LM-Papers".
The TensorFlow code for the ACL 2018 paper "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"
Interpretable data visualizations for understanding how texts differ at the word level
Web scraping and related analytics using Python tools