Transforms complex documents such as PDFs into LLM-ready Markdown/JSON for your agentic workflows.
A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
MinerU-HTML: An SLM-powered HTML main-content extractor that outputs clean HTML bodies. Well suited to deep research agents, RAG applications, and training data generation.