ComPDFKit Conversion SDK 1.8.0: Table Recognition

By ComPDFKit | 2023 Jul 27
Release Data Extraction Conversion SDK

We are pleased to announce that ComPDFKit Conversion SDK 1.8.0 for Windows, iOS, Android, and Server is now released! In this version, the OCR function supports table recognition and optimizes the OCR text recognition rate. PDF to HTML optimizes the HTML file structure, so that the size of the converted HTML file is greatly reduced.



Table Recognition in OCR



CPDFConvertWordOptions wordOptions = new CPDFConvertWordOptions();
wordOptions.IsAllowOCR = false;



CPDFConvertWordOptions *options = [[CPDFConvertWordOptions alloc] init]
[options setIsAllowOCR:YES];

If you want to know other programming platforms and more about table recognition in OCR, please check out our OCR guides.






string resPath = "***";
string inputFilePath = "***";
string outputFolderPath = "***";
string outputFileName = "***";

CPDFConverterHTML converter = CPDFConvertFactroy.CreateConverter(CPDFConvertType.CPDFConvertTypeHtml, inputFilePath) as CPDFConverterHTML;

CPDFConvertHTMLOptions htmlOptions = new CPDFConvertHTMLOptions();
htmlOptions.PageAndNavigationPaneOpts = PageAndNavigationPaneOptions.SinglePageNavigationByBookmarks;
htmlOptions.IsAllowOCR = false;
htmlOptions.IsContainAnnotations = true;
htmlOptions.IsContainImages = true;

int pageCount = converter.GetPagesCount();
int[] pageArray = new int[pageCount];
for (int i = 0; i < pageArray.Length; i++)
    pageArray[i] = i + 1;

ConvertError error = ConvertError.ERR_UNKNOWN;
converter.Convert(outputFolderPath, ref outputFileName, htmlOptions, pageArray, ref error, getPorgress);

If you want to know other programming platforms and more about PDF to HTML, please check out our PDF to HTML guides.





Overall, we believe this update will take your experience with ComPDFKit to a whole new level. And we will continue to optimize our functions to provide a better user experience for every user. You are welcome to contact us to try ComPDFKit and give us feedback.