Last updated: Jul 25, 2025, 10:08 AM UTC

Convert To Markdown — AI-Ready Document Conversion

Transform Excel (.xlsx, .xls, .xlsm), Word (.docx, .dotx, .dotm), PowerPoint (.pptx), and PDF files into clean Markdown, JSON, or HTML — with enterprise-grade security through zero-storage architecture

npm version
npm downloads
Deploy Status
API
Cloud Functions
License: MIT
Knowcode

View API Docs →   |   Pricing →   |   Get Started →

Sign Up - $10/month

Why?

The Problem

  • Security risks — Services store your sensitive documents indefinitely
  • Complex, unpredictable pricing — Per-document fees, hidden costs, usage spikes (see our simple pricing)
  • Excel (.xlsx, .xls, .xlsm) formulas stay as text=SUM(A1:A10) instead of the actual value
  • Tables get mangled — PDF tables become unreadable text blocks
  • No AI optimization — Missing metadata, token counts, and proper chunking
  • Complex integrations — Heavy SDKs and multi-step processes

Our Solution

  • Zero-storage architecture — Files processed in memory, never saved (see security →)
  • $10/month flat ratePredictable pricing, no surprises, generous limits
  • Smart data processing — Resolves formula values and cleans empty rows
  • Intelligent formatting — Auto-detects tables and preserves structure
  • LLM-ready output — YAML metadata, token estimates, clean Markdown
  • Simple API — One endpoint, zero configuration needed (API docs)

Conversion Capabilities

From To Features Use Case
Excel
.xlsx .xls .xlsm
(Yes, we support macro-enabled XLSM!)
JSON • Sheet filtering
• Empty row removal
• Token estimation
• Formula resolution
Data pipelines,
API integration
Excel (.xlsx, .xls, .xlsm)
Including XLSM files
Markdown • Table formatting
• Multi-line cells
• YAML metadata
• Preserves formulas
Documentation,
Knowledge bases
PDF Markdown • Table detection
• Heading recognition
• Label formatting
Content migration,
RAG systems
Word
.docx, .dotx, .dotm
HTML • Clean semantic HTML
• Style preservation
• No bloat
Web publishing,
CMS import
Word
.docx, .dotx, .dotm
Markdown • Full formatting
• Lists & tables
• Blockquotes
Technical docs,
GitHub wikis
PowerPoint
.pptx, .ppsx, .potx
Markdown • Slide content extraction
• Speaker notes
• Structured output
Presentations,
Training materials

Try It - Demo

Experience the conversion quality yourself with these interactive examples:

Note: The demo allows you to upload and convert real files using our production API. File size limit is 5MB for demo usage.

Excel with Formulas → Clean JSON

Upload Your Excel File

Drop your .xlsx, .xls, or .xlsm file here

Max 5MB • Your file is never stored

Sample Input:

Product    | Q1    | Q2    | Total
-----------|-------|-------|-------
Widget A   | 1250  | 1580  | =B2+C2
Widget B   | 980   | 1200  | =B3+C3

Instant Output

{
  "content": {
    "Sheet1": [
      {
        "Product": "Widget A",
        "Q1": 1250,
        "Q2": 1580,
        "Total": 2830 // ✅ Formula resolved!
      },
      {
        "Product": "Widget B",
        "Q1": 980,
        "Q2": 1200,
        "Total": 2180 // ✅ Formula resolved!
      }
    ]
  },
  "statistics": {
    "fileSize": { "bytes": 8421, "KB": 8.2 },
    "sheets": 1,
    "cellCount": 12,
    "estimatedTokens": 145,
    "processingTime": "782ms"
  }
}

PDF with Tables → Structured Markdown

Upload Your PDF

Drop your PDF file here

Text-based PDFs work best

Sample Input (PDF):

QUARTERLY SALES REPORT

Region     Q1 Sales    Q2 Sales
North      $125,000    $158,000
South      $98,000     $120,000
East       $145,000    $167,000

Clean Markdown Output

---
title: quarterly-sales-report
date: 2025-07-23
type: document
source: PDF
pages: 1
---

QUARTERLY SALES REPORT

| Region | Q1 Sales | Q2 Sales |
|--------|----------|----------|
| North | $125,000 | $158,000 |
| South | $98,000 | $120,000 |
| East | $145,000 | $167,000 |


Metadata:

  • Estimated tokens: 287
  • Tables detected: 1
  • Processing time: 523ms

Word (.docx, .dotx, .dotm) Document → Semantic HTML**

Upload Your DOCX

Drop your .docx file here

Preserves all formatting

Sample Input (Word .docx/.dotx/.dotm):

Product Launch Plan

Executive Summary
Our new product will revolutionize...

Key Features:
• Advanced AI integration
• Real-time analytics
• Enterprise security

Clean HTML Output

{
  "html": "<h1>Product Launch Plan</h1>\n
<h2>Executive Summary</h2>\n
<p>Our new product will revolutionize...</p>\n
<h3>Key Features:</h3>\n
<ul>\n
  <li>Advanced AI integration</li>\n
  <li>Real-time analytics</li>\n
  <li>Enterprise security</li>\n
</ul>",
  "statistics": {
    "headings": 3,
    "paragraphs": 1,
    "lists": 1,
    "estimatedTokens": 178,
    "processingTime": "341ms"
  }
}

PowerPoint Presentation → Structured Markdown

Upload Your PowerPoint

Drop your .pptx file here

Preserves slides & speaker notes

Sample Input (PowerPoint .pptx):

Slide 1: Company Overview
- Founded in 2020
- 500+ employees
- Global presence

[Speaker Notes: Emphasize growth trajectory]

Slide 2: Financial Results
Revenue: $50M → $125M
Growth: 150% YoY

Structured Markdown Output

---
title: Company Overview
date: 2025-07-25
type: markdown
source: pptx
slideCount: 2
totalImages: 0
hasSpeakerNotes: true
---

Company Overview

Slide 1: Company Overview

  • Founded in 2020
  • 500+ employees
  • Global presence

Speaker Notes: Emphasize growth trajectory


Slide 2: Financial Results

Revenue: $50M → $125M
Growth: 150% YoY


Metadata:

  • Slides: 2
  • Speaker notes: Yes
  • Processing time: 892ms

Security Note: These demos show the exact output quality you'll receive. In production, files are processed in memory and instantly deleted — never stored.

Ready to Convert Your Documents?

Join hundreds of developers using our API for reliable document conversion

Get Started for $10/month →

Test API key included • No credit card required to try


Real-World Use Cases

Documentation Migration

Transform your entire Word-based documentation into a modern Markdown knowledge base

Before: Scattered Word docs

📁 documentation/
├── 📄 API_Guide_v1.docx
├── 📄 API_Guide_v2_FINAL.docx
├── 📄 Setup_Instructions.doc
└── 📄 UserManual_OLD.docx

After: Organized Markdown wiki

📁 docs/
├── 📝 api-reference.md
├── 📝 getting-started.md
├── 📝 user-guide.md
└── 📝 changelog.md
What you get:
  • Clean Markdown with preserved formatting
  • Automatic table of contents generation
  • GitHub/GitLab ready documentation
  • Search-friendly structured content

Claude Code Integration

Turn your business documents into AI-ready knowledge that Claude can analyze and reason about ([perfect for our $10/month Pro plan](./pricing.md))

// Upload your business spreadsheet
POST /v1/convert/excel-to-json
// File: Q4_Sales_Analysis.xlsx

// Get AI-ready structured data
{
"sales_analysis": {
"summary": {
"total_revenue": 1250000,
"growth_rate": 0.23,
"top_products": [...]
},
"estimatedTokens": 3200
}
}

// Now Claude can:
Analyze trends and patterns
Generate insights and reports
Answer questions about your data
Create visualizations

Business Intelligence
Excel → JSON for data analysis
Document Processing
Word → Markdown for editing
Knowledge Base
PDF → Structured content
Automation
Programmatic workflows

No-Code Platform Integration

Perfect for Bubble.io, Zapier, Make.com - simple API, predictable pricing, zero complexity ([see detailed Bubble.io guide](./api.md#for-bubbleio-users))

Bubble.io API Connector Setup

API Name: Convert to Markdown
Authentication: X-API-Key header
Endpoint: https://convert-api-qpg64cvnga-uk.a.run.app/v1/convert/excel-to-json
Method: POST

That's it! You're ready to convert documents

Why it's perfect for no-code:
  • $10/month flat - No surprises
  • 5-minute setup - Copy & paste
  • Test API keys provided - Start testing immediately
  • Simple responses - Easy to parse
Common use cases:
  • Client document portals
  • Excel report processors
  • PDF search systems
  • Data import workflows

Security First: Zero-Storage Architecture

Your documents are never stored. Period.

Unlike other services that keep your files for "processing" or "analytics", we built Convert To Markdown with a zero-storage architecture:

Upload Process Delete
File streams to memory Converted in RAM 🗑️ Instantly purged
No disk writes No temporary files No data retention

Read our full security documentation →

Why This Matters

  • Healthcare: Patient records stay confidential
  • Finance: Sensitive financial data remains secure
  • Legal: Confidential documents aren't retained
  • Research: Proprietary data stays protected

Enterprise-Ready

Performance Security Scalability Reliability
< 3s processing Zero-storage design Auto-scaling 99.9% uptime
Streaming uploads Memory-only processing Concurrent requests Error recovery
5MB file limit No data retention Cloud Functions Graceful degradation

Enterprise On-Premise Option

Need complete control? Deploy Convert To Markdown in your own Google Cloud account.

Your infrastructure - Runs entirely in your GCP project
Your data - Never leaves your cloud environment
Your compliance - Meet strict regulatory requirements
Your customization - Modify and extend as needed

Contact us for Enterprise On-Premise deployment → | View enterprise pricing →


API Reference

Looking for detailed documentation? Visit our comprehensive API guide for in-depth explanations, code examples, and troubleshooting tips. Need help with pricing? Check our pricing page.

Endpoints

Endpoint Method Input Output
/v1/convert/excel-to-json POST Excel file (.xlsx, .xls, .xlsm) JSON
/v1/convert/excel-to-markdown POST Excel file (.xlsx, .xls, .xlsm) Markdown
/v1/convert/pdf-to-markdown POST PDF file Markdown
/v1/convert/word-to-html POST Word (.docx, .dotx, .dotm) file HTML
/v1/convert/word-to-markdown POST Word (.docx, .dotx, .dotm) file Markdown
/v1/convert/ppt-to-markdown POST PowerPoint (.pptx) file Markdown

Common Parameters

  • file (required): The document to convert
  • sheetPrefix (optional): Filter Excel (.xlsx, .xls, .xlsm) sheets by prefix

Response Format

{
  "content": "...",           // Converted content
  "statistics": {
    "fileSize": { /* bytes, KB, MB */ },
    "processingTime": 1250,   // milliseconds
    "estimatedTokens": 2500,  // For LLM usage
    // Format-specific stats...
  }
}

What Makes Us Different

AI-Optimized

Token counting and metadata specifically designed for LLM consumption

Lightning Fast

Stream processing and cloud scaling handle files in seconds

Zero Config

No complex setup — just POST your file and get results

Smart Processing

Automatically cleans data, detects tables, and preserves structure


Getting Help & Contributing

Found a Bug? Have a Feature Request?

We welcome your feedback and contributions!

Report a Bug | Request a Feature | View All Issues

Questions? Contact us at lindsay@knowcode.tech


Get Started Today

Ready to transform your document workflow?


View Pricing →     API Documentation →     Security →     Technical Details →     Get Started →


Start Converting Documents - Only $10/month

No setup fees • Cancel anytime • Generous limits



Convert To Markdown — Built with for developers who need clean, structured data from messy documents