Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
An open collection of implementation tips, tricks and resources for training large language models
an Open Course Platform for Stanford CS224n (2020 Winter)
Python port of Moses tokenizer, truecaser and normalizer
Generating multiple choice questions from text using Machine Learning.
A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/...
Spelling corrector in python
A roadmap for getting started with Machine Learning
🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋
Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
Implementation of State-of-the-art Text Classification Models in Pytorch
Introductory examples for building LLM-based AI agents. 异步图书:《大模型应用开发 动手做AI Agent》 - 这是一些非常简单的入门示例,重在引导新手入门,目...
Latency and Memory Analysis of Transformer Models for Training and Inference
Write PyTorch code at the level of individual examples, then run it efficiently on minibatches.
自然语言处理(NLP)教程,包括:词向量,词法分析,预训练语言模型,文本分类,文本语义匹配,信息抽取,翻译,对话。
Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP
Code for PaperRobot: Incremental Draft Generation of Scientific Ideas
RSS feed aggregator with collections and NLP article summarization
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extract...
MindSpore online courses: Step into LLM
快速中文分词分析word segmentation
Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common...
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
A modern IDE for writing classical Chinese poetry 格律诗编辑程序
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
北京航空航天大学大数据高精尖中心自然语言处理研究团队对信息抽取领域的调研。包括实体识别,关系抽取,属性抽取等子任务,每类子任务分别对学术界和工业界进行...
KSS: Korean String processing Suite
Clips AI is an open-source Python library that automatically converts long videos into clips.
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
SpanMarker for Named Entity Recognition
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,Agents.
Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is currently unmaintained, issues will probably not be ad...
3rd Place Solution for HomeDepot Product Search Results Relevance Competition on Kaggle.
Fast and production-ready question answering in Node.js
⚡️ A robust and developer-friendly, and community-driven PHP Client that provides a clean, extensible interface for integrating with the DeepSeek AI...
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes...
Language modeling and instruction tuning for Russian
Jina examples and demos to help you get started
多模态中文LLaMA&Alpaca大语言模型(VisualCLA)
EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING
BERT for Coreference Resolution
🐍 Hand-picked awesome Python libraries and frameworks, organised by category
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
This repository contains links to machine learning exams, homework assignments, and exercises that can help you test your understanding.
🔡 Token level embeddings from BERT model on mxnet and gluonnlp
Keywords to Sentences
cntext 是一个专为社会科学实证研究设计的中文文本分析 Python 库。它不仅提供传统的词频统计和情感分析,还支持词嵌入训练、语义投影计算等高级功能,帮助研究...