Topic

nlp

Natural language processing (NLP) is a field of computer science that studies how computers process, understand, and generate human language. In 1950, Alan Turing published an article proposing a test of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

Repositories (1462)

relik
relik SapienzaNLP Python

Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)

497
large_language_model_training_playbook
large_language_model_training_playbook huggingface Python

An open collection of implementation tips, tricks and resources for training large language models

496
CS224n-winter-together
CS224n-winter-together xixiaoyao JavaScript

An open course platform for Stanford CS224n (2020 Winter)

495
sacremoses
sacremoses hplt-project Python

Python port of Moses tokenizer, truecaser and normalizer

494
Question-Generation
Question-Generation KristiyanVachev Jupyter Notebook

Generating multiple choice questions from text using Machine Learning.

493
Styleformer
Styleformer PrithivirajDamodaran Python

A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/...

493
autocorrect
autocorrect filyp Python

Spelling corrector in Python

493
Machine-Learning-Roadmap
Machine-Learning-Roadmap shanmukh05

A roadmap for getting started with Machine Learning

493
KcBERT
KcBERT Beomi

🤗 Pretrained BERT model, WordPiece tokenizer, and dataset trained on Korean comments

492
code_search
code_search hamelsmu Jupyter Notebook

Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"

491
Text-Classification-Models-Pytorch
Text-Classification-Models-Pytorch AnubhavGupta3377 Python

Implementation of state-of-the-art text classification models in PyTorch

491
ai-agents
ai-agents huangjia2019 Jupyter Notebook

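Introductory examples for building LLM-based AI agents. Companion to the 异步图书 book "Large Model Application Development: Hands-On AI Agents" (《大模型应用开发 动手做AI Agent》) - These are some very simple introductory examples, focused on guiding beginners, ...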

490
llm-analysis
llm-analysis cli99 Python

Latency and Memory Analysis of Transformer Models for Training and Inference

486
matchbox
matchbox salesforce Python

Write PyTorch code at the level of individual examples, then run it efficiently on minibatches.

485
nlp-tutorial
nlp-tutorial shibing624 Jupyter Notebook

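Natural language processing (NLP) tutorial covering word vectors, lexical analysis, pretrained language models, text classification, semantic text matching, information extraction, translation, and dialogue.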

484
clifs
clifs johanmodin JavaScript

Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP

483
PaperRobot
PaperRobot EagleW Python

Code for PaperRobot: Incremental Draft Generation of Scientific Ideas

482
infomate.club
infomate.club vas3k Python

RSS feed aggregator with collections and NLP article summarization

482
cogcomp-nlp
cogcomp-nlp CogComp Java

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extract...

480
step_into_llm
step_into_llm mindspore-lab Jupyter Notebook

MindSpore online courses: Step into LLM

480
Deta_Parser
Deta_Parser yaoguangluo Java

Fast Chinese word segmentation and analysis

478
mexican-government-report
mexican-government-report PhantomInsights Python

Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.

476
pynlpl
pynlpl proycon Python

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common...

476
SimCTG
SimCTG yxuansu Python

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

476
cope
cope LingDong- JavaScript

A modern IDE for writing classical Chinese poetry (regulated-verse editor)

475
whatlies
whatlies koaning Python

Toolkit to help understand "what lies" in word embeddings. Also benchmarking!

473
relora
relora Guitaricet Jupyter Notebook

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

472
nlp
nlp james-bowman Go

Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang

471
IE-Survey
IE-Survey BDBC-KG-NLP

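A survey of the information extraction field by the natural language processing research team at Beihang University's Advanced Innovation Center for Big Data. Covers subtasks such as entity recognition, relation extraction, and attribute extraction; for each subtask, academia and industry are separately...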

470
kss
kss hyunwoongko Python

KSS: Korean String processing Suite

470
clipsai
clipsai ClipsAI Python

Clips AI is an open-source Python library that automatically converts long videos into clips.

470
edenai-apis
edenai-apis edenai Python

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

469
SpanMarkerNER
SpanMarkerNER tomaarsen Jupyter Notebook

SpanMarker for Named Entity Recognition

469
Advanced_RAG
Advanced_RAG NisaarAgharia Jupyter Notebook

Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using LangChain, OpenAI GPTs, Meta Llama 3, and agents.

469
hierarchical-attention-networks
hierarchical-attention-networks ematvey Python

Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is currently unmaintained, issues will probably not be ad...

468
kaggle-HomeDepot
kaggle-HomeDepot ChenglongChen Python

3rd Place Solution for HomeDepot Product Search Results Relevance Competition on Kaggle.

467
node-question-answering
node-question-answering huggingface TypeScript

Fast and production-ready question answering in Node.js

466
deepseek-php-client
deepseek-php-client deepseek-php PHP

⚡️ A robust, developer-friendly, and community-driven PHP client that provides a clean, extensible interface for integrating with the DeepSeek AI...

466
Deep-Learning-Specialization-Coursera
Deep-Learning-Specialization-Coursera abdur75648 Jupyter Notebook

This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes...

463
rulm
rulm IlyaGusev Jupyter Notebook

Language modeling and instruction tuning for Russian

463
examples
examples jina-ai Python

Jina examples and demos to help you get started

462
Visual-Chinese-LLaMA-Alpaca
Visual-Chinese-LLaMA-Alpaca airaria Python

Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)

460
flash-tokenizer
flash-tokenizer NLPOptimize C++

Efficient and optimized tokenizer engine for LLM inference serving

459
coref
coref mandarjoshi90 Python

BERT for Coreference Resolution

455
awesome-python
awesome-python dylanhogg

🐍 Hand-picked awesome Python libraries and frameworks, organised by category

454
AdaSeq
AdaSeq modelscope Python

AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models

453
machine-learning-exams
machine-learning-exams fatosmorina

This repository contains links to machine learning exams, homework assignments, and exercises that can help you test your understanding.

452
bert-embedding
bert-embedding imgarylai Python

🔡 Token level embeddings from BERT model on mxnet and gluonnlp

451
keytotext
keytotext gagan3012 Jupyter Notebook

Keywords to Sentences

451
cntext
cntext hiDaDeng Python

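cntext is a Chinese text analysis Python library designed for empirical social science research. Beyond traditional word-frequency statistics and sentiment analysis, it also supports advanced features such as word-embedding training and semantic projection, helping research...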

450