Most popular nlp repositories and open source projects

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

dkpro-core

Collection of software components for natural language processing (NLP...

65   199   199  

node-postal

NodeJS bindings to libpostal for fast international address parsing/no...

33   199   199  

displacy-ent

:boom: displaCy-ent.js: An open-source named entity visualiser for the...

40   198   198  

eudex

A blazingly fast phonetic reduction/hashing algorithm.

12   198   198  

cadmium

Natural Language Processing (NLP) library for Crystal

16   198   198  

SyferText

A privacy preserving NLP framework

49   198   198  

exercises_thushv_dot_com

Here lies all the exercises I implement and share in my website

158   198   198  

KcELECTRA

🤗 Korean Comments ELECTRA: 한국어 댓글로 학습한 ELECTRA 모델

22   198   198  

bi-lstm-crf

A PyTorch implementation of the BI-LSTM-CRF model.

49   197   197  

wrench

WRENCH: Weak supeRvision bENCHmark

27   197   197  

datastories-semeval2017-task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4:...

63   197   197  

tensorflow-ml-nlp

텐서플로우와 머신러닝으로 시작하는 자연어처리(로지스틱회귀부터 트랜스...

110   197   197  

THUCTC

An Efficient Chinese Text Classifier

64   196   196  

twitterDataMining

Twitter数据挖掘及其可视化

68   196   196  

spacy-js

🎀 JavaScript API for spaCy with Python REST API

23   196   196  

akaza

Yet another Japanese IME for IBus/Linux

6   196   196  

UABSA-SyMux

Codes for the IJCAI2022 paper: Inheriting the Wisdom of Predecessors:...

2   196   196  

open-semantic-entity-search-api

Open Source REST API for named entity extraction, named entity linking...

34   195   195  

cedille-ai

✒️ Cedille is a large French language model (6B), released under an op...

10   195   195  

arXivNotes

IssuesにNLP(自然言語処理)に関連するの論文を読んだまとめを書いていま...

8   195   195  

textvec

Text vectorization tool to outperform TFIDF for classification tasks

26   194   194  

ROUGE-2.0

ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N...

39   194   194  

denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Inde...

29   194   194  

paper-reading

比做算法的懂工程落地,比做工程的懂算法模型。

34   194   194  

financial-news-dataset

Reuters and Bloomberg

92   193   193  

Getting-Started-with-Google-BERT

Build and train state-of-the-art natural language processing models us...

74   193   193  

TagEditor

🏖TagEditor - Annotation tool for spaCy

12   193   193  

KoBigBird

🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)

19   193   193  

pdfanno

Linguistic Annotation and Visualization Tool for PDF Documents

58   192   192  

tmtoolkit

Text Mining and Topic Modeling Toolkit for Python with parallel proces...

28   191   191  

files2rouge

Calculating ROUGE score between two files (line-by-line)

52   191   191  

NLPre

Python library for Natural Language Preprocessing (NLPre)

35   191   191  

Java-Deep-Learning-Cookbook

Code for Java Deep Learning Cookbook

43   190   190  

nlp-de-cero-a-cien

Curso práctico: NLP de cero a cien 🤗

90   190   190  

jupyterlab-prodigy

🧬 A JupyterLab extension for annotating data with Prodigy

23   189   189  

mongolian-nlp

Useful resources for Mongolian NLP

47   189   189  

awesome-community-curated-nlp

Community Curated NLP List

35   189   189  

practical-torchtext

A set of tutorials for torchtext

57   188   188  

guri-vr

https://gurivr.com

42   188   188  

abydos

Abydos NLP/IR library for Python

39   188   188  

embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from vari...

29   187   187  

Turkish-Word2Vec

Pre-trained Word2Vec Model for Turkish

29   187   187  

slovnet

Deep Learning based NLP modeling for Russian language

19   187   187  

compling_nlp_hse_course

Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ

76   187   187  

machine-learning-exams

This repository contains links to machine learning exams, homework ass...

45   187   187  

syntok

Text tokenization and sentence segmentation (segtok v2)

34   186   186  

hawking

A Natural Language Date Time Parser that Extract date and time from te...

13   186   186  

varbook

适合中文程序员的变量命名助手,NLP+翻译,规范变量命名,定制化变量命名规...

38   186   186  

acl-papers

paper summary of Association for Computational Linguistics

10   186   186  

tokenizers

Fast, Consistent Tokenization of Natural Language Text

25   185   185