Convert (almost) every document to Markdown

Microsoft has released its own document parser for LLM use!
.
.
Introducing MarkItDown, a 100% open-source, one-stop solution for effortlessly converting any file to Markdown—perfect for text analysis, indexing, and more!

Here’s what makes it special:

↳ Converts PDF, Word, Excel, PPT, images, audio to markdown
↳ Extracts EXIF, OCR, and transcripts automatically
↳ Available via CLI, Python API, or Docker
↳ Offers LLM-based image descriptions
↳ Supports batch conversions

https://github.com/microsoft/markitdown

“Technology is best when it brings people together.” – Matt Mullenweg

Comments

Sign In or Register to comment.