Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.

Repositories (1261)

TextBrewer airaria Python

A PyTorch-based knowledge distillation toolkit for natural language processing

1.4k
Chinese-ELECTRA ymcui Python

Pre-trained Chinese ELECTRA models

1.4k
QA-Survey-CN BDBC-KG-NLP

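A summary of research on and applications of intelligent question answering, compiled by the natural language processing research team at Beihang University's Advanced Innovation Center for Big Data. Covers knowledge-graph-based question answering (KBQA), text-based question answering systems (TextQA), ...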

1.4k
projects explosion Python

🪐 End-to-end NLP workflows from prototype to production

1.4k
spacy-transformers explosion Python

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

1.4k
vvedenie-mashinnoe-obuchenie demidovakatya

📝 A collection of resources on machine learning

1.4k
Dragonfire DragonComputer Python

The open-source virtual assistant for Ubuntu-based Linux distributions

1.4k
nlg-eval Maluuba Python

Evaluation code for various unsupervised automated metrics for Natural Language Generation.

1.4k
nlp-tutorial lyeoni Jupyter Notebook

A list of NLP (Natural Language Processing) tutorials

1.4k
paperai neuml Python

📄 🤖 Semantic search and workflows for medical/scientific papers

1.4k
transformers-interpret cdpierse Jupyter Notebook

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

1.4k
jieba-php fukuball PHP

"Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese...

1.4k
MNBVC esbatmop

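MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also various niche subcultures and even...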

1.3k
search-index fergiemcdowall JavaScript

A persistent, network resilient, full text search library for the browser and Node.js

1.3k
TurboTransformers Tencent C++

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.

1.3k
konlpy konlpy Python

Python package for Korean natural language processing.

1.3k
rpaframework robocorp Python

Collection of open-source libraries and tools for Robotic Process Automation (RPA), designed to be used with both Robot Framework and Python

1.3k
duckling_old facebookarchive Clojure

Deprecated in favor of https://github.com/facebook/duckling

1.3k
tribuo oracle Java

Tribuo - A Java machine learning library

1.3k
scibert allenai Python

A BERT model for scientific text.

1.3k
tika-python chrismattmann Python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

1.3k
obsei obsei Python

Obsei is a low-code, AI-powered automation tool. It can be used in various business flows like social listening, AI-based alerting, brand image analysi...

1.3k
spacy-llm explosion Python

🦙 Integrating LLMs into structured NLP pipelines

1.3k
basaran hyperonym Python

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-bas...

1.3k
nlp_overview omarsar CSS

Overview of Modern Deep Learning Techniques Applied to Natural Language Processing

1.3k
wink-nlp winkjs JavaScript

Developer friendly Natural Language Processing ✨

1.3k
lingua-go pemistahl Go

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

1.3k
gnes gnes-ai Python

GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural networks.

1.3k
textrank summanlp Python

TextRank implementation for Python 3.

1.3k
natasha natasha Python

Solves basic Russian NLP tasks, API for lower level Natasha projects

1.3k
ktrain amaiya Jupyter Notebook

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

1.3k
text_gcn yao8839836 Python

Graph Convolutional Networks for Text Classification. AAAI 2019

1.3k
opennlp apache Java

Apache OpenNLP

1.3k
detext linkedin Python

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

1.2k
zemberek-nlp ahmetaa Java

NLP tools for Turkish.

1.2k
one-pixel-attack-keras Hyperparticle Jupyter Notebook

Keras implementation of "One pixel attack for fooling deep neural networks" using differential evolution on Cifar10 and ImageNet

1.2k
bpemb bheinzerling Python

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

1.2k
awesome-relation-extraction roomylee

📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (NLP).

1.2k
fastText_multilingual babylonhealth Jupyter Notebook

Multilingual word vectors in 78 languages

1.2k
hmtl huggingface Python

🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP

1.2k
Repo-2017 RubensZimbres Python

My first Python repo, with code for Machine Learning, NLP and Deep Learning with Keras and Theano

1.2k
transformers_tasks HarderThenHarder Jupyter Notebook

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

1.2k
nlp-in-practice kavgan Jupyter Notebook

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word...

1.2k
PPLM uber-research Python

Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.

1.2k
Deep-Learning-Experiments roatienza Jupyter Notebook

Videos, notes and experiments to understand deep learning

1.1k
KoBERT SKTBrain Jupyter Notebook

Korean BERT pre-trained cased (KoBERT)

1.1k
question_generation patil-suraj Jupyter Notebook

Neural question generation using transformers

1.1k
awesome-transformer-nlp cedrickchee

A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

1.1k
datumbox-framework datumbox Java

Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applicati...

1.1k
nlp-library mihail911

Curated collection of papers for the NLP practitioner 📖👩‍🔬

1.1k