Topic

natural-language-processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Repositories (1431)

stanza
stanza stanfordnlp Python

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

7.8k
WantWords
WantWords thunlp JavaScript

An open-source online reverse dictionary.

7.1k
models
models PaddlePaddle Python

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

6.9k
awesome-multimodal-ml
awesome-multimodal-ml pliang279

Reading list for research topics in multimodal machine learning

6.9k
mycroft-core
mycroft-core MycroftAI Python

Mycroft Core, the Mycroft Artificial Intelligence platform.

6.6k
nlp.js
nlp.js axa-group JavaScript

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more

6.6k
big-AGI
big-AGI enricoros TypeScript

AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-t...

6.5k
ML-Course-Notes
ML-Course-Notes dair-ai

🎓 Sharing machine learning course / lecture notes.

6.4k
nlp-recipes
nlp-recipes microsoft Python

Natural Language Processing Best Practices & Examples

6.4k
courses
courses SkalskiP Python

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

6.4k
awesome-self-supervised-learning
awesome-self-supervised-learning jason718

A curated list of awesome self-supervised methods

6.4k
AI-Job-Notes
AI-Job-Notes amusi

AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)

6.1k
ai-deadlines
ai-deadlines paperswithcode JavaScript

:alarm_clock: AI conference deadline countdowns

6k
ERNIE
ERNIE PaddlePaddle Python

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understan...

5.9k
trafilatura
trafilatura adbar Python

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

5.7k
Baichuan-7B
Baichuan-7B baichuan-inc Python

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

5.7k
vale
vale vale-cli Go

:pencil: A markup-aware linter for prose built with speed and extensibility in mind.

5.4k
ltp
ltp HIT-SCIR Python

Language Technology Platform

5.2k
datascience
datascience sreeharierk

This repository is a compilation of free resources for learning Data Science.

5.2k
marqo
marqo marqo-ai Python

Ecommerce Search and Discovery - marqo.ai

5k
argilla
argilla argilla-io Python

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4.9k
flash-linear-attention
flash-linear-attention fla-org Python

🚀 Efficient implementations for emerging model architectures

4.9k
OpenPrompt
OpenPrompt thunlp Python

An Open-Source Framework for Prompt-Learning.

4.9k
libpostal
libpostal openvenues C

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

4.8k
nlpaug
nlpaug makcedward Jupyter Notebook

Data augmentation for NLP

4.7k
pytorch-sentiment-analysis
pytorch-sentiment-analysis bentrevett Jupyter Notebook

Tutorials on getting started with PyTorch and TorchText for sentiment analysis.

4.6k
autotrain-advanced
autotrain-advanced huggingface Python

🤗 AutoTrain Advanced

4.6k
practical-pytorch
practical-pytorch spro Jupyter Notebook

Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained

4.5k
textract
textract deanmalmgren HTML

extract text from any document. no muss. no fuss.

4.5k
LLMBook-zh.github.io
LLMBook-zh.github.io LLMBook-zh Python

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

4.4k
oasis
oasis camel-ai Python

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

4.3k
FLAML
FLAML microsoft Jupyter Notebook

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

4.3k
cs230-code-examples
cs230-code-examples cs230-stanford Python

Code examples in pyTorch and Tensorflow for CS230

4.2k
Data-science
Data-science CodeCutTech Jupyter Notebook

Collection of useful data science topics along with articles, videos, and code

4.2k
spark-nlp
spark-nlp JohnSnowLabs Scala

State of the Art Natural Language Processing

4.1k
Baichuan2
Baichuan2 baichuan-inc Python

A series of large language models developed by Baichuan Intelligent Technology

4.1k
LLM-RL-Visualized
LLM-RL-Visualized changyeyu Python

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

4.1k
arXivTimes
arXivTimes arXivTimes

repository to research & share the machine learning articles

3.9k
MatchZoo
MatchZoo NTMC-Community Python

Facilitating the design, comparison and sharing of deep text matching models.

3.9k
JioNLP
JioNLP dongrixinyu Python

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

3.8k
olivia
olivia olivia-ai Go

💁‍♀️Your new best friend powered by an artificial neural network

3.7k
AI-Engineer-Headquarters
AI-Engineer-Headquarters hemansnation Jupyter Notebook

A collection of scientific methods, processes, algorithms, and systems to build stories & models.

3.7k
lit
lit PAIR-code TypeScript

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

3.7k
AIGC-Interview-Book
AIGC-Interview-Book WeThinkIn

【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、LLM大模型、AI Agent、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、大数...

3.5k
huggingface_hub
huggingface_hub huggingface Python

The official Python client for the Hugging Face Hub.

3.5k
zhihu
zhihu NELSONZHAO Jupyter Notebook

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Langua...

3.5k
ml-course
ml-course girafe-ai Jupyter Notebook

Open Machine Learning course

3.5k
TextAttack
TextAttack QData Python

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master...

3.4k
catalyst
catalyst catalyst-team Python

Accelerated deep learning R&D

3.4k
mlops-course
mlops-course GokuMohandas Jupyter Notebook

Learn how to design, develop, deploy and iterate on production-grade ML applications.

3.3k