Convert To Markdown — AI-Ready Document Conversion
Transform Excel (.xlsx, .xls, .xlsm), Word (.docx, .dotx, .dotm), PowerPoint (.pptx), and PDF files into clean Markdown, JSON, or HTML — with enterprise-grade security through zero-storage architecture
Why?
The Problem
|
Our Solution
|
Conversion Capabilities
From | To | Features | Use Case |
---|---|---|---|
Excel.xlsx .xls .xlsm (Yes, we support macro-enabled XLSM!) |
JSON | • Sheet filtering • Empty row removal • Token estimation • Formula resolution |
Data pipelines, API integration |
Excel (.xlsx, .xls, .xlsm) Including XLSM files |
Markdown | • Table formatting • Multi-line cells • YAML metadata • Preserves formulas |
Documentation, Knowledge bases |
Markdown | • Table detection • Heading recognition • Label formatting |
Content migration, RAG systems |
|
Word.docx, .dotx, .dotm |
HTML | • Clean semantic HTML • Style preservation • No bloat |
Web publishing, CMS import |
Word.docx, .dotx, .dotm |
Markdown | • Full formatting • Lists & tables • Blockquotes |
Technical docs, GitHub wikis |
PowerPoint.pptx, .ppsx, .potx |
Markdown | • Slide content extraction • Speaker notes • Structured output |
Presentations, Training materials |
Try It - Demo
Experience the conversion quality yourself with these interactive examples:
Note: The demo allows you to upload and convert real files using our production API. File size limit is 5MB for demo usage.
Excel with Formulas → Clean JSON
Upload Your Excel File
Drop your .xlsx, .xls, or .xlsm file here
Max 5MB • Your file is never stored
Sample Input:
Product | Q1 | Q2 | Total
-----------|-------|-------|-------
Widget A | 1250 | 1580 | =B2+C2
Widget B | 980 | 1200 | =B3+C3
Instant Output
{ "content": { "Sheet1": [ { "Product": "Widget A", "Q1": 1250, "Q2": 1580, "Total": 2830 // ✅ Formula resolved! }, { "Product": "Widget B", "Q1": 980, "Q2": 1200, "Total": 2180 // ✅ Formula resolved! } ] }, "statistics": { "fileSize": { "bytes": 8421, "KB": 8.2 }, "sheets": 1, "cellCount": 12, "estimatedTokens": 145, "processingTime": "782ms" } }
PDF with Tables → Structured Markdown
Upload Your PDF
Drop your PDF file here
Text-based PDFs work best
Sample Input (PDF):
QUARTERLY SALES REPORT
Region Q1 Sales Q2 Sales
North $125,000 $158,000
South $98,000 $120,000
East $145,000 $167,000
Clean Markdown Output
--- title: quarterly-sales-report date: 2025-07-23 type: document source: PDF pages: 1 ---QUARTERLY SALES REPORT
| Region | Q1 Sales | Q2 Sales |
|--------|----------|----------|
| North | $125,000 | $158,000 |
| South | $98,000 | $120,000 |
| East | $145,000 | $167,000 |
Metadata:
- Estimated tokens: 287
- Tables detected: 1
- Processing time: 523ms
Word (.docx, .dotx, .dotm) Document → Semantic HTML**
Upload Your DOCX
Drop your .docx file here
Preserves all formatting
Sample Input (Word .docx/.dotx/.dotm):
Product Launch Plan
Executive Summary
Our new product will revolutionize...
Key Features:
• Advanced AI integration
• Real-time analytics
• Enterprise security
Clean HTML Output
{ "html": "<h1>Product Launch Plan</h1>\n <h2>Executive Summary</h2>\n <p>Our new product will revolutionize...</p>\n <h3>Key Features:</h3>\n <ul>\n <li>Advanced AI integration</li>\n <li>Real-time analytics</li>\n <li>Enterprise security</li>\n </ul>", "statistics": { "headings": 3, "paragraphs": 1, "lists": 1, "estimatedTokens": 178, "processingTime": "341ms" } }
PowerPoint Presentation → Structured Markdown
Upload Your PowerPoint
Drop your .pptx file here
Preserves slides & speaker notes
Sample Input (PowerPoint .pptx):
Slide 1: Company Overview
- Founded in 2020
- 500+ employees
- Global presence
[Speaker Notes: Emphasize growth trajectory]
Slide 2: Financial Results
Revenue: $50M → $125M
Growth: 150% YoY
Structured Markdown Output
--- title: Company Overview date: 2025-07-25 type: markdown source: pptx slideCount: 2 totalImages: 0 hasSpeakerNotes: true ---Company Overview
Slide 1: Company Overview
- Founded in 2020
- 500+ employees
- Global presence
Speaker Notes: Emphasize growth trajectory
Slide 2: Financial Results
Revenue: $50M → $125M
Growth: 150% YoY
Metadata:
- Slides: 2
- Speaker notes: Yes
- Processing time: 892ms
Security Note: These demos show the exact output quality you'll receive. In production, files are processed in memory and instantly deleted — never stored.
Ready to Convert Your Documents?
Join hundreds of developers using our API for reliable document conversion
Get Started for $10/month →Test API key included • No credit card required to try
Real-World Use Cases
Documentation Migration
Transform your entire Word-based documentation into a modern Markdown knowledge base
Before: Scattered Word docs
📁 documentation/
├── 📄 API_Guide_v1.docx
├── 📄 API_Guide_v2_FINAL.docx
├── 📄 Setup_Instructions.doc
└── 📄 UserManual_OLD.docx
After: Organized Markdown wiki
📁 docs/
├── 📝 api-reference.md
├── 📝 getting-started.md
├── 📝 user-guide.md
└── 📝 changelog.md
- Clean Markdown with preserved formatting
- Automatic table of contents generation
- GitHub/GitLab ready documentation
- Search-friendly structured content
Claude Code Integration
Turn your business documents into AI-ready knowledge that Claude can analyze and reason about ([perfect for our $10/month Pro plan](./pricing.md))
// Upload your business spreadsheet POST /v1/convert/excel-to-json // File: Q4_Sales_Analysis.xlsx// Get AI-ready structured data
{
"sales_analysis": {
"summary": {
"total_revenue": 1250000,
"growth_rate": 0.23,
"top_products": [...]
},
"estimatedTokens": 3200
}
}// Now Claude can:
✓ Analyze trends and patterns
✓ Generate insights and reports
✓ Answer questions about your data
✓ Create visualizations
Excel → JSON for data analysis
Word → Markdown for editing
PDF → Structured content
Programmatic workflows
No-Code Platform Integration
Perfect for Bubble.io, Zapier, Make.com - simple API, predictable pricing, zero complexity ([see detailed Bubble.io guide](./api.md#for-bubbleio-users))
Bubble.io API Connector Setup
That's it! You're ready to convert documents
Why it's perfect for no-code:
- $10/month flat - No surprises
- 5-minute setup - Copy & paste
- Test API keys provided - Start testing immediately
- Simple responses - Easy to parse
Common use cases:
- Client document portals
- Excel report processors
- PDF search systems
- Data import workflows
Security First: Zero-Storage Architecture
Your documents are never stored. Period.
Unlike other services that keep your files for "processing" or "analytics", we built Convert To Markdown with a zero-storage architecture:
Upload | Process | Delete |
---|---|---|
File streams to memory | Converted in RAM | 🗑️ Instantly purged |
No disk writes | No temporary files | No data retention |
Why This Matters
- Healthcare: Patient records stay confidential
- Finance: Sensitive financial data remains secure
- Legal: Confidential documents aren't retained
- Research: Proprietary data stays protected
Enterprise-Ready
Performance | Security | Scalability | Reliability |
---|---|---|---|
< 3s processing | Zero-storage design | Auto-scaling | 99.9% uptime |
Streaming uploads | Memory-only processing | Concurrent requests | Error recovery |
5MB file limit | No data retention | Cloud Functions | Graceful degradation |
Enterprise On-Premise Option
Need complete control? Deploy Convert To Markdown in your own Google Cloud account.
Your infrastructure - Runs entirely in your GCP project
Your data - Never leaves your cloud environment
Your compliance - Meet strict regulatory requirements
Your customization - Modify and extend as needed
Contact us for Enterprise On-Premise deployment → | View enterprise pricing →
API Reference
Looking for detailed documentation? Visit our comprehensive API guide for in-depth explanations, code examples, and troubleshooting tips. Need help with pricing? Check our pricing page.
Endpoints
Endpoint | Method | Input | Output |
---|---|---|---|
/v1/convert/excel-to-json |
POST | Excel file (.xlsx, .xls, .xlsm) | JSON |
/v1/convert/excel-to-markdown |
POST | Excel file (.xlsx, .xls, .xlsm) | Markdown |
/v1/convert/pdf-to-markdown |
POST | PDF file | Markdown |
/v1/convert/word-to-html |
POST | Word (.docx, .dotx, .dotm) file | HTML |
/v1/convert/word-to-markdown |
POST | Word (.docx, .dotx, .dotm) file | Markdown |
/v1/convert/ppt-to-markdown |
POST | PowerPoint (.pptx) file | Markdown |
Common Parameters
file
(required): The document to convertsheetPrefix
(optional): Filter Excel (.xlsx, .xls, .xlsm) sheets by prefix
Response Format
{
"content": "...", // Converted content
"statistics": {
"fileSize": { /* bytes, KB, MB */ },
"processingTime": 1250, // milliseconds
"estimatedTokens": 2500, // For LLM usage
// Format-specific stats...
}
}
What Makes Us Different
AI-OptimizedToken counting and metadata specifically designed for LLM consumption |
Lightning FastStream processing and cloud scaling handle files in seconds |
Zero ConfigNo complex setup — just POST your file and get results |
Smart ProcessingAutomatically cleans data, detects tables, and preserves structure |
Getting Help & Contributing
Found a Bug? Have a Feature Request?
We welcome your feedback and contributions!
Report a Bug | Request a Feature | View All Issues
Questions? Contact us at lindsay@knowcode.tech
Get Started Today
Ready to transform your document workflow?
View Pricing → API Documentation → Security → Technical Details → Get Started →
No setup fees • Cancel anytime • Generous limits
Convert To Markdown — Built with for developers who need clean, structured data from messy documents