Recognize Content with OCR (x64 architecture only)

 

ComPDFKit Conversion SDK provides OCR functionality when converting PDF documents to Word, Excel, PPT, TXT, HTML, and RTF. Follow the steps below to use OCR in your project.

 

         - Add ComDocumentAIKit.dll from the Lib folder to the project as a reference.

 

         - Add ComDocumentAINative.dll, DocumentAI.dll, onnxruntime.dll, and paddle2onnx.dll from the x64 folder to the project, and set the Copy to Output Directory property of each file to Copy if newer.

 

         - Set the option parameter options.IsAllowOCR to true.
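
The first two steps can also be written directly in the project file instead of through the Visual Studio UI. The fragment below is a minimal sketch, not the SDK's official template: it assumes a .csproj project in which the Lib and x64 folders sit next to the project file (both relative paths are assumptions), and it is meant to be merged into the existing Project element. Visual Studio stores the "Copy if newer" setting as PreserveNewest.

```xml
<!-- Sketch only: the Lib\ and x64\ relative paths are assumptions about your layout. -->
<PropertyGroup>
  <!-- OCR supports the x64 architecture only, so build for x64. -->
  <PlatformTarget>x64</PlatformTarget>
</PropertyGroup>

<ItemGroup>
  <!-- Step 1: reference ComDocumentAIKit.dll from the Lib folder. -->
  <Reference Include="ComDocumentAIKit">
    <HintPath>Lib\ComDocumentAIKit.dll</HintPath>
  </Reference>
</ItemGroup>

<ItemGroup>
  <!-- Step 2: include the four libraries and copy them to the output directory if newer. -->
  <None Include="x64\ComDocumentAINative.dll">
    <CopyToOutputDirectory>PreserveNewest</CopyToOutputDirectory>
  </None>
  <None Include="x64\DocumentAI.dll">
    <CopyToOutputDirectory>PreserveNewest</CopyToOutputDirectory>
  </None>
  <None Include="x64\onnxruntime.dll">
    <CopyToOutputDirectory>PreserveNewest</CopyToOutputDirectory>
  </None>
  <None Include="x64\paddle2onnx.dll">
    <CopyToOutputDirectory>PreserveNewest</CopyToOutputDirectory>
  </None>
</ItemGroup>
```

Written this way, the files are copied with their relative path preserved, so they land in an x64 subfolder of the output directory, which is the same result as including them from an x64 folder inside the project and choosing Copy if newer in the Properties window. After building, it is worth checking that all four files appear in the build output.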

 

Languages supported by ComPDFKit Conversion SDK OCR:

| Script / Notes | Language (Native Name) | Language (English Name) |
| --- | --- | --- |
| Latn; American | English | English |
| Latn; Canadian | Français canadien | French |
| Hans/Hant | 中文简体 | Chinese (Simplified) |
| Hans/Hant | 中文繁体 | Chinese (Traditional) |
| Jpan | 日本語 | Japanese |
| Kore | 한국어 | Korean |
| Latn | Deutsch | German |
| Latn | Српски (латиница) | Serbian (Latin) |
| Latn | Occitan, lenga d'òc, provençal | Occitan |
| Latn | Dansk | Danish |
| Latn | Italiano | Italian |
| Latn; European | Español | Spanish |
| Latn; European | Português (Portugal) | Portuguese |
| Latn | Te reo Māori | Maori |
| Latn | Bahasa Melayu | Malay |
| Latn | Malti | Maltese |
| Latn | Nederlands | Dutch |
| Latn; Bokmål | Norsk | Norwegian |
| Latn | Polski | Polish |
| Latn | Română | Romanian |
| Latn | Slovenčina | Slovak |
| Latn | Slovenščina | Slovenian |
| Latn | shqip | Albanian |
| Latn | Svenska | Swedish |
| Latn | Swahili | Swahili |
| Latn | Wikang Tagalog | Tagalog |
| Latn | Türkçe | Turkish |
| Latn | oʻzbekcha | Uzbek |
| Latn | Tiếng Việt | Vietnamese |
| Latn | Afrikaans | Afrikaans |
| Latn | Azərbaycan | Azerbaijani |
| Latn | Bosanski | Bosnian |
| Latn | Čeština | Czech |
| Latn | Cymraeg | Welsh |
| Latn | Eesti keel | Estonian |
| Latn | Gaeilge | Irish |
| Latn | Hrvatski | Croatian |
| Latn | Magyar | Hungarian |
| Latn | Bahasa Indonesia | Indonesian |
| Latn | Íslenska | Icelandic |
| Latn | Kurdî | Kurdish |
| Latn | Lietuvių | Lithuanian |
| Latn | Latviešu | Latvian |