Supported file formats
ComPDF AI document parsing and extraction support the following file formats:
Office documents
| Format | Extension |
|---|---|
| Word | .doc, .docx |
| Excel | .xls, .xlsx |
| PowerPoint | .ppt, .pptx |
Images
| Format | Extension |
|---|---|
| PNG | .png |
| JPEG | .jpg, .jpeg |
| GIF | .gif |
| BMP | .bmp |
| TIFF | .tiff, .tif |
| WebP | .webp |
Text & markup
| Format | Extension |
|---|---|
| CSV | .csv |
| Plain text | .txt |
| RTF | .rtf |
| HTML | .html, .htm |
| MHTML | .mhtml, .mht |
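
A client can check a file's extension against the three tables above before uploading. The following is a minimal sketch, not part of any ComPDF SDK: the names `SUPPORTED_EXTENSIONS` and `format_category` are hypothetical, and the category labels are chosen here for illustration only.

```python
from pathlib import Path
from typing import Optional

# Extensions copied from the tables above, grouped by table.
# The category labels ("office", "image", "text") are illustrative only.
SUPPORTED_EXTENSIONS = {
    "office": {".doc", ".docx", ".xls", ".xlsx", ".ppt", ".pptx"},
    "image": {".png", ".jpg", ".jpeg", ".gif", ".bmp", ".tiff", ".tif", ".webp"},
    "text": {".csv", ".txt", ".rtf", ".html", ".htm", ".mhtml", ".mht"},
}


def format_category(filename: str) -> Optional[str]:
    """Return the table a file's extension belongs to, or None if it is not listed."""
    suffix = Path(filename).suffix.lower()  # compare case-insensitively
    for category, extensions in SUPPORTED_EXTENSIONS.items():
        if suffix in extensions:
            return category
    return None
```

For example, `format_category("report.DOCX")` returns `"office"`, while a file with an unlisted extension such as `archive.zip` returns `None`.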
Recommendations
- For scanned documents or images, enable OCR-related options.
- For long structured documents, prefer `mode=layout`.
- For handwriting or free-form scanned pages, try `mode=vision`.
- For large files, use async or presigned request modes.
- If the API returns `06001`, the page quota is exhausted; purchase a page bundle.
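
The recommendations above can be encoded as a small helper that picks options from a few facts about the input. This is only a sketch under stated assumptions: the function names, the dictionary keys (`ocr`, `request_mode`), the 20 MB size threshold, and the rule that handwriting takes precedence over layout are all hypothetical. Only the `mode` values `layout` and `vision` and the error code `06001` come from the text.

```python
from typing import Dict, Optional

QUOTA_EXHAUSTED_CODE = "06001"  # error code named in the recommendations

# Assumed cutoff for a "large" file; the documentation gives no number.
LARGE_FILE_BYTES = 20 * 1024 * 1024


def suggest_options(
    is_scanned: bool = False,
    is_long_structured: bool = False,
    is_handwritten: bool = False,
    size_bytes: int = 0,
) -> Dict[str, object]:
    """Map document traits to suggested options (key names are hypothetical)."""
    options: Dict[str, object] = {}
    if is_scanned or is_handwritten:
        options["ocr"] = True  # stands in for "OCR-related options"
    if is_handwritten:
        options["mode"] = "vision"  # assumed to win over layout when both apply
    elif is_long_structured:
        options["mode"] = "layout"
    if size_bytes > LARGE_FILE_BYTES:
        options["request_mode"] = "async"  # a presigned mode is the alternative
    return options


def explain_error(code: str) -> Optional[str]:
    """Return advice for the one error code described above, else None."""
    if code == QUOTA_EXHAUSTED_CODE:
        return "Page quota exhausted; purchase a page bundle."
    return None
```

A caller would translate the returned dictionary into whatever request parameters the API actually expects; the mapping shown here is illustrative, not the service's real schema.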