officeParser

harshankur

A robust, strictly-typed Node.js and Browser library for parsing office files (docx, pptx, xlsx, odt, odp, ods, pdf, rtf). It produces a clean, hierarchical Abstract Syntax Tree (AST) with rich metadata, text formatting, and full attachment support.
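To make the idea of a hierarchical AST more concrete, here is a minimal sketch of how such a tree could be modeled and flattened back into plain text. The `AstNode` and `FormatFlags` types and the `collectText` helper are hypothetical names invented for this illustration; they are not taken from officeParser, whose real node shapes and API should be checked in the project's own documentation.

```typescript
// Hypothetical node shape, for illustration only.
// These are NOT officeParser's actual types.
type FormatFlags = { bold?: boolean; italic?: boolean };

interface AstNode {
  type: string;             // e.g. "document", "paragraph", "text"
  text?: string;            // set on leaf nodes only
  formatting?: FormatFlags; // optional text formatting metadata
  children?: AstNode[];     // nested nodes form the hierarchy
}

// Depth-first walk: leaf text is concatenated within a node,
// and the top-level children of a "document" are joined by newlines.
function collectText(node: AstNode): string {
  if (node.text !== undefined) return node.text;
  const parts = (node.children ?? []).map(collectText);
  return node.type === "document" ? parts.join("\n") : parts.join("");
}

const sample: AstNode = {
  type: "document",
  children: [
    {
      type: "paragraph",
      children: [
        { type: "text", text: "Hello, " },
        { type: "text", text: "world", formatting: { bold: true } },
      ],
    },
    {
      type: "paragraph",
      children: [{ type: "text", text: "Second paragraph." }],
    },
  ],
};

const flattened = collectText(sample);
console.log(flattened);
```

Keeping formatting as metadata on the nodes, rather than baking it into the text, is what lets a consumer choose between a plain-text view like this one and a richer rendering of the same tree.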

393 Stars
45 Forks
393 Watchers
Language: Rich Text Format
License: MIT
SrcLog Score: 100
Cost to Build: $1.24M
Market Value: $5.33M

Growth over time: 8 data points, 2026-04-07 to 2026-04-23 (Stars, Forks, Watchers)

How to clone officeParser

Clone via HTTPS

git clone https://github.com/harshankur/officeParser.git

Clone via SSH

git clone git@github.com:harshankur/officeParser.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the officeParser issue tracker:

Open GitHub Issues