
Intelligent Document Extraction

Extract key fields and table information from documents. Supported file formats: PDF, JPG, JPEG, PNG. Supported downloadable result types: JSON, TXT, Excel, CSV.


Home

From the homepage, you can directly access the Intelligent Document Extraction feature. The homepage is the central hub of ComPDF AI's navigation, enabling quick access to commonly used modules.


Single Document Extraction

Run intelligent document extraction on a single document.
Select the file you want to extract from. Once the upload succeeds, choose an extraction mode: Specified Field Extraction or Extract All Key Information in the Document.


Click Extract Now and wait a moment for the Smart Document Extraction results to appear.


Batch Document Extraction

Extract from up to 30 documents in one batch.
Click the Bulk button above to switch to batch document extraction. Upload the files you want to extract from, set the fixed fields to extract, then click Extract Now and wait a moment for the Smart Document Extraction results to appear. Two kinds of fixed fields can be set:

  • Text Field: Name of the key information field to extract.
  • Table Field: Name of the table header to extract. Custom table headers are supported.
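
The batch limits stated in this guide (up to 30 documents; PDF, JPG, JPEG, or PNG files) can be checked locally before uploading. The following is a minimal sketch, not part of ComPDF AI: the function name `check_batch` and the constants are hypothetical, and treating file extensions as case-insensitive is an assumption.

```python
from pathlib import Path

# Limits taken from this guide; all names here are illustrative only.
SUPPORTED_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png"}
MAX_BATCH_SIZE = 30


def check_batch(paths):
    """Return a list of problems found in a planned batch (empty if none)."""
    problems = []
    if len(paths) > MAX_BATCH_SIZE:
        problems.append(
            f"{len(paths)} files selected; the batch limit is {MAX_BATCH_SIZE}"
        )
    for p in paths:
        # Assumption: extensions are compared case-insensitively.
        if Path(p).suffix.lower() not in SUPPORTED_EXTENSIONS:
            problems.append(f"unsupported file format: {p}")
    return problems
```

For example, `check_batch(["invoice.pdf", "scan.docx"])` reports one problem for `scan.docx`, while a list of supported files within the limit returns an empty list.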


Introduction to Field Extraction Templates

This feature lets you save predefined field extraction settings so you can reuse them later. Two templates are provided by default: invoice and order.

After making changes, click the Save as Template button below to generate a new field extraction template.


Note: You can save a maximum of 5 new templates, and template names must be unique.

Introduction to Adding Prompts to Extracted Fields

Adding a prompt to an extracted field can improve extraction accuracy.

Click the field to which you want to add a prompt; a hint input box pops up. Enter the prompt and click the OK button.

If you save a template in which a field has a prompt, that prompt is saved in the template as well.
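
The template rules described in this guide (fields carry an optional prompt, at most 5 new templates, unique names) can be pictured as a small data model. This is only an illustrative sketch: `ExtractionField`, `Template`, and `TemplateStore` are hypothetical names, not a ComPDF AI API, and the sketch tracks user-saved templates only, leaving the two default templates out.

```python
from dataclasses import dataclass, field

MAX_CUSTOM_TEMPLATES = 5  # limit stated in the note on saving templates


@dataclass
class ExtractionField:
    name: str           # key field name or table header to extract
    kind: str = "text"  # "text" or "table"
    prompt: str = ""    # optional prompt, stored together with the field


@dataclass
class Template:
    name: str
    fields: list = field(default_factory=list)


class TemplateStore:
    """Hypothetical local model of the template rules (not a ComPDF API)."""

    def __init__(self):
        self._custom = {}

    def save_as_template(self, template):
        if template.name in self._custom:
            raise ValueError(f"template name already exists: {template.name}")
        if len(self._custom) >= MAX_CUSTOM_TEMPLATES:
            raise ValueError("at most 5 new templates can be saved")
        # The prompt travels with each field, so it is saved with the template.
        self._custom[template.name] = template

    def get(self, name):
        return self._custom[name]
```

Because the prompt is an attribute of the field itself, saving the template keeps the prompt without any extra step, which mirrors the behavior described above.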


Introduction to the Extracted JSON Fields

JSON Content Explanation

Parameter Name    Data Type    Description
code              String       Error code; "200" means success
message           String       Error message
data              Object       Returned result
+details          Object       Key information extraction result
++Page-index      Object       Extraction result for the corresponding page number
+++key            String       Extracted key information field, as key:value
+++tables         Array        Extracted key information tables, as tables: [ [table1], [table2] ]

JSON structure example:

{
    "code": "200",
    "msg": "success",
    "data": {
        "details": {
            "Page-1": {
                "Order Date": "xxx",
                "Order #": "xxx",
                "Quote#": "xxx",
                "Your estimated delivery date is": "xxx",
                "tables": null
            }
        }
    }
}
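
A downloaded JSON result with this structure can be split into per-page key fields and tables with a few lines of Python. The sketch below is a minimal example under stated assumptions: the function name `parse_extraction_result` is hypothetical, a `null` value for `tables` is treated as "no tables", and the status text is read from either `msg` (as in the example) or `message` (as in the parameter table).

```python
import json


def parse_extraction_result(raw):
    """Split an extraction result into per-page key fields and tables."""
    result = json.loads(raw)
    if result.get("code") != "200":
        # The parameter table names this field "message"; the example uses "msg".
        reason = result.get("msg") or result.get("message") or "unknown error"
        raise ValueError(f"extraction failed ({result.get('code')}): {reason}")

    pages = {}
    details = (result.get("data") or {}).get("details") or {}
    for page_name, content in details.items():
        content = dict(content or {})
        # Assumption: null means the page has no extracted tables.
        tables = content.pop("tables", None) or []
        pages[page_name] = {"fields": content, "tables": tables}
    return pages
```

Applied to the example above, the function returns a single entry for "Page-1" whose `fields` hold the four order values and whose `tables` list is empty.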