Search in PDF

aras · Post by **aras** » Mon Oct 24, 2011 1:12 pm

Hi all,
My client's requirement is PDF search (non-English) in the Search module of his e-learning website. When I extract the contents of a PDF for indexing, some of the characters are dropped during extraction (I see empty spaces in those places when I view the indexed contents in Luke). I am getting this problem for languages like Tamil and Hindi.
The client is adamant that he wants PDF search.
What is the solution for this? Please give me some guidelines or point me in the right direction.
Thanks and Regards,

aras

Claudiu (Softland) · Post by **Claudiu (Softland)** » Mon Oct 24, 2011 5:55 pm

Hello,
Unfortunately, the PDF format by default supports only Latin characters. Other characters are added to the PDF as embedded CID font subsets with Unicode (ToUnicode) CMaps. To extract all the characters correctly, you need a text search module that can read this type of text from PDF files.
Thank you for understanding.
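
To illustrate what "reading text through a Unicode CMap" involves, here is a minimal Python sketch. It is not how doPDF or any particular extractor works. The CMap fragment, the glyph codes and the function names `parse_bfchar` and `decode_hex_string` are all made up for the example, and it assumes fixed two-byte codes and only handles `bfchar` entries (real CMaps also use `bfrange`). Codes with no mapping come out as U+FFFD, which is roughly the kind of gap you would then see in the indexed text.

```python
import re

# Hypothetical ToUnicode CMap fragment; in a real PDF this comes from the
# font's /ToUnicode stream, and the codes depend on the embedded font subset.
SAMPLE_CMAP = """
/CIDInit /ProcSet findresource begin
begincmap
1 begincodespacerange
<0000> <FFFF>
endcodespacerange
3 beginbfchar
<0001> <0BA4>
<0002> <0BAE>
<0003> <0BBF>
endbfchar
endcmap
"""


def parse_bfchar(cmap_text):
    """Return {glyph code: Unicode string} from the bfchar sections of a CMap."""
    mapping = {}
    for section in re.findall(r"beginbfchar(.*?)endbfchar", cmap_text, re.S):
        for src, dst in re.findall(r"<([0-9A-Fa-f]+)>\s*<([0-9A-Fa-f]+)>", section):
            # Destination values are UTF-16BE code units written in hex.
            mapping[int(src, 16)] = bytes.fromhex(dst).decode("utf-16-be")
    return mapping


def decode_hex_string(hex_string, mapping, code_bytes=2):
    """Decode a PDF hex string of fixed-width glyph codes.

    Assumes every code is `code_bytes` wide; unmapped codes become U+FFFD.
    """
    raw = bytes.fromhex(hex_string)
    chars = []
    for i in range(0, len(raw), code_bytes):
        code = int.from_bytes(raw[i:i + code_bytes], "big")
        chars.append(mapping.get(code, "\ufffd"))
    return "".join(chars)


mapping = parse_bfchar(SAMPLE_CMAP)
# Codes 0001-0003 map to Tamil characters; 0004 has no entry in the CMap.
text = decode_hex_string("0001000200030004", mapping)
print(text)
```

An extractor that ignores the CMap, or a PDF whose font has an incomplete one, has no way to turn those glyph codes back into Tamil or Hindi text, so it is worth checking both the extraction library and how the PDFs were produced.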
