Skip to content
ComPDF

OCR Language Codes

The ocrRecognitionLang parameter specifies the OCR recognition language using standard language codes. Defaults to auto (automatic detection).

Common Languages

CodeLanguage
autoAuto Detect
zh-HansSimplified Chinese
zh-HantTraditional Chinese
enEnglish
koKorean
jaJapanese

Latin Script

You can pass a specific language code or use latin for general Latin script recognition.

CodeLanguage
latinLatin (General)
frFrench
deGerman
esSpanish
ptPortuguese
itItalian
nlDutch
svSwedish
plPolish
csCzech
roRomanian
huHungarian
fiFinnish
daDanish
noNorwegian
trTurkish
viVietnamese
idIndonesian
msMalay
hrCroatian
skSlovak
slSlovenian
afAfrikaans
sqAlbanian
caCatalan
etEstonian
lvLatvian
ltLithuanian
isIcelandic
filFilipino

Devanagari

CodeLanguage
devanagariDevanagari (General)
hiHindi
mrMarathi
neNepali
saSanskrit

Cyrillic

CodeLanguage
cyrillicCyrillic (General)
kkKazakh
kyKyrgyz
mnMongolian

East Slavic

CodeLanguage
eslavEast Slavic (General)
ruRussian
ukUkrainian
beBelarusian
bgBulgarian

Arabic Script

CodeLanguage
arabicArabic (General)
arArabic
faPersian
urUrdu

Other Languages

CodeLanguage
tamil / taTamil
telugu / teTelugu
kannada / knKannada
thai / thThai
greek / elGreek

Legacy Format Compatibility

The following legacy format codes are still supported but we recommend migrating to standard codes:

LegacyStandard Code
AUTOauto
CHINESEzh-Hans
CHINESE_TRADzh-Hant
ENGLISHen
KOREANko
JAPANESEja
LATINlatin
DEVANAGARIdevanagari
CYRILLICcyrillic
ARABICarabic
TAMILtamil
TELUGUtelugu
KANNADAkannada
THAIthai
GREEKgreek
ESLAVeslav