Skip to content
ComPDF

Document extraction guide

This guide documents the extraction API with both supported modes: vision and layout.

Guide structure

  1. Modes
  2. Extract schema
  3. Response structure

Modes at a glance

ModeBest forNotes
visionHandwritten forms, free-layout scans, image-heavy pagesVision-language extraction mode
layoutStable business documents such as invoices, orders, and contractsStructured extraction mode with optional grounding
  • Start with Modes to understand the differences and selection advice for the two modes
  • Then read Extract schema to learn how to write extract schema
  • Finally read Response structure to understand how the returned results carry field values and grounding information