OpenCorpus

madhav1k

A multilingual compilation of open-source textual corpora across major and minor world languages, curated for accessibility and linguistic research. Includes links and metadata for publicly available, CC-licensed, machine-readable datasets.

Stars: 1
Forks: 0
Watchers: 1
License: CC0-1.0
Cost to Build: $2.1K
Market Value: $800

Growth over time: 1 data point (2025-07-25), tracking stars, forks, and watchers.

How to clone OpenCorpus

Clone via HTTPS

git clone https://github.com/madhav1k/OpenCorpus.git

Clone via SSH

git clone git@github.com:madhav1k/OpenCorpus.git

Download ZIP

Download master.zip
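
For scripted setups, the ZIP download can also be fetched without a browser. The following is a minimal sketch, not part of the project itself: it assumes GitHub's usual branch-archive URL pattern and that the default branch is `master`, as the download name suggests. The helper names `archive_url` and `download_and_extract` are hypothetical.

```python
import urllib.request
import zipfile
from pathlib import Path


def archive_url(owner: str, repo: str, branch: str = "master") -> str:
    """Build a branch-archive URL (assumes GitHub's usual URL pattern)."""
    return f"https://github.com/{owner}/{repo}/archive/refs/heads/{branch}.zip"


def download_and_extract(url: str, dest: Path) -> Path:
    """Download the ZIP at `url` into `dest` and unpack it there."""
    dest.mkdir(parents=True, exist_ok=True)
    zip_path = dest / "archive.zip"
    # Fetch the archive to a local file, then extract its contents.
    urllib.request.urlretrieve(url, str(zip_path))
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(dest)
    return dest


# Example usage (performs a network request, so it is left commented out):
# download_and_extract(archive_url("madhav1k", "OpenCorpus"), Path("OpenCorpus-download"))
```

Unlike `git clone`, this gives a snapshot of the files without the commit history, which may be enough when only the dataset links and metadata are needed.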

Found an issue?

Report bugs or request features on the OpenCorpus issue tracker:

Open GitHub Issues