kreuzberg

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use it via CLI, REST API, or MCP server.

async python ruby php csharp elixir java node golang rust


7.9k Stars

431 Forks

7.9k Watchers

Rust Language

other License

100 SrcLog Score

Cost to Build

$38.58M

Market Value

$249.08M


Growth over time (stars, forks, watchers): 14 data points, 2025-08-30 → 2026-04-26


How to clone kreuzberg

Clone via HTTPS

git clone https://github.com/kreuzberg-dev/kreuzberg.git

Clone via SSH

git clone git@github.com:kreuzberg-dev/kreuzberg.git

Download ZIP

Download master.zip
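
The three download options above follow GitHub's usual URL patterns, so they can be derived from the owner and repository name. The sketch below is illustrative only: `RepoRef` and its methods are hypothetical helpers, not part of kreuzberg, and the `master` branch name is assumed from the "master.zip" label.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RepoRef:
    """Hypothetical helper that builds GitHub URLs for one repository."""

    owner: str
    name: str
    branch: str = "master"  # assumption: taken from the "master.zip" label

    def https_url(self) -> str:
        # URL used by "git clone" over HTTPS
        return f"https://github.com/{self.owner}/{self.name}.git"

    def ssh_url(self) -> str:
        # URL used by "git clone" over SSH (requires an SSH key on GitHub)
        return f"git@github.com:{self.owner}/{self.name}.git"

    def zip_url(self) -> str:
        # Archive of a single branch, no git history included
        return (
            f"https://github.com/{self.owner}/{self.name}"
            f"/archive/refs/heads/{self.branch}.zip"
        )


repo = RepoRef("kreuzberg-dev", "kreuzberg")
print("git clone", repo.https_url())
print("git clone", repo.ssh_url())
print("download ", repo.zip_url())
```

The ZIP archive is the quickest way to read the source, while a clone is needed to follow history or contribute changes.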

Found an issue?

Report bugs or request features on the kreuzberg issue tracker: https://github.com/kreuzberg-dev/kreuzberg/issues

Similar to kreuzberg

tensorflow awesome-python system-design-primer flask thefuck free-programming-books-zh_CN cli django requests keras ansible scikit-learn scrapy TensorFlow-Examples certbot pytorch python-patterns tornado face_recognition core pandas CNTK python-guide reddit wechat_jump_game interactive-coding-challenges compose data-science-ipython-notebooks ipython pipenv