
When converting PDFs to or from other formats, or even when printing, text loss happens frequently. For example, half the text may disappear after converting to Word, or fonts may turn into empty boxes when you email the file to a colleague. Don't worry—once you understand the causes, most issues have simple solutions.
This article will walk you through the common reasons for text loss, one by one, along with their fixes. And if you need a tool that prevents the vast majority of text loss issues, try ComPDF—it's designed to preserve both text and formatting, saving you from the hassle of manual adjustments.
PDF Missing Text When Converting PDF to Other Format (Causes & Solutions)
Cause 1: Fonts and Encoding Problems
- Missing ToUnicode table – The conversion tool knows what shape the character is (a glyph) but has no idea which letter or number it maps to. Result: gibberish or blank spaces.
- Type 3 fonts – These use custom encoding that standard converters can’t read. Workaround: treat the page as a scanned image and run OCR.
- Font not embedded + no encoding table – If the original PDF creator didn’t embed the font and left out the encoding info, you can’t extract the text reliably. Honestly, there’s no fix from your end—you need to go back to the source file.
Cause 2: Object Structure and Layers
- Form XObjects – Sometimes text lives inside a mini embedded PDF within the main PDF. Basic converters ignore these completely. Solution: use a professional tool like PDF Reader Pro or Acrobat Pro.
- Form field content – Fillable form fields sit in a separate annotation layer. When you export to Word, that layer often gets dropped. Flatten the PDF first (when printing to PDF, choose “Document and Markups” or use a flattening tool).
- Image‑only PDFs (scanned docs) – There’s no text data at all. You must run OCR. For mixed English/Chinese documents, use an OCR engine—ComPDF, ABBYY FineReader, or Acrobat Pro.
Cause 3: Permissions and Tool Limitations
- DRM or encryption – Some PDFs block text extraction. If you have the password, remove the restriction. If not, you’re looking at manual retyping or screenshots.
- Free online converters – They cut corners to stay fast and free. Complex layouts often lose text. For important documents, skip the web tools. A solid alternative is the converter Demo of ComPDF – it handles complex fonts and layered content better than most free tools.
PDF Missing Text When Converting Office Docs to PDF (Causes & Solutions)
Cause 1: Fonts (Public Enemy)
- Fonts not embedded – Your computer has the fancy font; your coworker’s doesn’t. They see squares. Fix: when you save as PDF in Word, click “Options” and check “Embed all fonts”.
- Only subset embedded – Office sometimes embeds only the characters you actually used. Then you edit the PDF and type a new letter that wasn’t in the original document—that new character disappears on save. Embed the full font, or make sure your original text already includes all possible characters you might need later.
- Font licensing blocks embedding – Some commercial fonts don’t allow embedding. Switch to a freely embeddable font like Noto Sans CJK or Microsoft YaHei before generating the PDF.
- PowerPoint special note – PowerPoint does not embed fonts by default. Go to File → Options → Save, then check “Embed fonts in the file” and choose “Embed all characters”.
Cause 2: Hidden, Covered, or Invisible Text
- Hidden text attribute – In Word, text marked as “Hidden” won’t make it to the PDF. Select the text, press Ctrl+D, and uncheck “Hidden”.
- White text / zero opacity – White text on a white background, or text with 100% transparency. It’s technically there but invisible. Check font color and transparency settings.
- Layer overlap – A shape with a solid fill sitting on top of your text. Right‑click the shape → Send to Back, or move the text to the front.
- Text box overflow – Text that doesn’t fit inside a fixed‑size text box gets cut off. Either resize the box or convert the content to regular paragraph text.
Cause 3: Text Wrapping and Layout Engine Quirks
- Tight or through wrapping – These wrapping styles around images or shapes can cause following paragraphs to vanish during conversion. Change wrapping to “Top and Bottom” or turn the content into normal paragraphs.
- In‑line with text images – Sometimes the paragraph behind an in‑line image disappears. Or if an image and text share the same paragraph, the whole thing drops out. Change the image wrapping style to something other than “In Line with Text” (e.g., Square).
- Corrupted styles – Styles imported from an old template can break. When you export to PDF, entire paragraphs may evaporate. Create a new document, paste special as unformatted text, and reapply clean styles.
Cause 4: PowerPoint‑Specific Merging Problems
- Text inside combined shapes – If you group several shapes and one of them contains text, that text sometimes doesn’t export. Ungroup before exporting (right‑click → Group → Ungroup).
- Missing master elements – Headers and footers placed on the slide master often get lost. Make a copy of the slide and paste the master text directly onto each slide.
- Deeply nested groups – PowerPoint can handle two or three levels of grouping, but beyond that, text may randomly fail to appear in the PDF. Avoid deep nesting, and ungroup before export.
Cause 5: Excel‑Specific Merging Problems
- Print area set too small – Anything outside the defined print area won’t be included in the PDF. Clear the print area, then reset it to cover all your data.
- Scaling and margins – Wrong scaling or oversized margins can cut off rows and columns. Use “Custom Scaling” and check Print Preview before exporting.
- Row height too low – Fixed row height cuts off multi‑line text. Set row height to “AutoFit” so everything shows.
Cause 6: Version / Tool Bugs and Simple Mistakes
- Word + Acrobat version mismatch – Random paragraphs disappear for no obvious reason. Change your method: try “Save As PDF”, “Microsoft Print to PDF”, or “Create PDF from File” inside Acrobat.
- Legacy form fields – Old‑style form fields live in a separate layer. If the flattening step fails, the content vanishes. Use content controls instead, or generate the PDF through Acrobat’s own PDF maker.
- User errors – Selecting the wrong page range, saving as the ancient
.docformat instead of.docx, or using advanced layout without testing on another machine. Slow down. Double‑check. Convert to.docxfirst. Always test before sending.
Other Cases and Solutions Where Text Disappears
Text Missing When Printing a PDF
- Printer driver font issues – Printers don’t directly use your system fonts. If the driver can’t handle a particular encoding (common with Chinese text), those characters won’t print. Workaround: in the print dialog, check “Print as Image”.
- Transparency flattening gone wrong – The printer’s RIP (raster image processor) processes transparent objects in the wrong order and accidentally erases text on top. Before printing, flatten transparency manually in Acrobat (Advanced → Print Production → Flattener Preview).
- Unflattened form fields – Fillable form content often doesn’t print. Flatten the form before printing, or in the print dialog choose “Document and Markups”.
- Low‑resolution image compression – If your text is actually an image (e.g., from a scanned document), some printers use a fast, low‑quality mode that makes text unreadable. Switch to “High Quality” or “Best” print mode.
Text Missing When Opening or Viewing a PDF
- Missing fonts or incomplete glyph sets – This is the most common cause. The creator didn’t embed the font, and your system doesn’t have it installed. Blocks appear where text should be. Prevention: always embed fonts when creating PDFs. For rare or ancient characters, use a complete font like Noto Sans CJK (65,000+ glyphs).
- PDF viewer rendering bugs – Some PDF viewers have occasional issues with minority languages, while professional PDF readers work fine. Download the PDF and open it in professional readers like Adobe Reader, PDF Reader Pro, and more.
- Math or specialty fonts rendered white – The text is actually there, but the viewer paints it white on a white background. Switch to a different viewer to confirm.
- Layered content (Optional Content Groups) – Someone put text on a layer that’s hidden by default. Open the Layers panel (usually in the left sidebar) and turn on any hidden layers.
- Encoding mismatch – Very old PDFs may use a non‑Unicode encoding. If your system’s “Language for non‑Unicode programs” doesn’t match, you’ll see garbage characters. Change that setting to Chinese, but a better fix is to regenerate the PDF from the source using Unicode.
- Private Use Area (PUA) characters – Some niche fonts map characters to custom code points that only make sense with that specific font. Anyone without the exact same font sees blanks. Avoid PUA fonts for documents you’ll share. If you must use them, make sure recipients install the same font.
Text Missing When Editing or Annotating a PDF
- Annotation truncation – Adding comments in Outlook’s or Edge’s built-in PDF viewer sometimes cuts off long Chinese sentences. Keep annotations short, or use a dedicated tool like Acrobat or Foxit.
- Editing introduces characters outside the font subset – The PDF only embedded a subset of characters (say, the 200 Chinese characters used in the original document). You open it in Acrobat, type a 201st character, and save. That new character disappears. Use Acrobat Pro’s “Add Font” feature to expand the subset, or go back to the original Office document and regenerate the PDF.
- Windows 11 save bug – Editing a PDF and then using “Microsoft Print to PDF” to save your changes can drop text. Avoid the Microsoft PDF generator in these situations. Use Acrobat’s own save function instead.
Summary — A Quick Checklist
That’s a lot of scenarios, I know. Here’s a simple checklist that prevents about 80% of missing‑text problems, especially with Chinese content. Before creating a PDF (Office → PDF):
-
Embed all fonts (or at least the full font, not just a subset)
-
Check for hidden text, text box overflow, and shapes covering your text
-
In PowerPoint: ungroup combined shapes before export
-
In Excel: clear print area and check scaling
-
Prefer “Save As PDF” first. If that fails, try “Print → Microsoft Print to PDF”
Before converting a PDF to another format:
-
Know whether you have a text‑based PDF or a scanned image (needs OCR)
-
Flatten forms and annotations
-
Use Acrobat Pro or ComPDF for complex layouts
Before sharing a PDF with others:
-
Test on a different computer that doesn’t have your fonts
-
Verify that text shows correctly in both a browser and other PDF Reader
-
For critical documents, include screenshots as a backup
When you run into missing text:
-
First rule out white text, transparency, and hidden layers
-
Try a different viewer or a different conversion method
-
Go back to the source document whenever possible
-
For OCR: use a high‑resolution scan and an engine that supports your language (Chinese, Japanese, Korean, etc.)