Hello,
In the target of PDF outputs, there is a check box 'Generate tagged PDF' in the 'PDF Options' tab.
When uploading a PDF file with this check box NOT selected to a MS SharePoint Online (SPO) site, it seems that SPO does not correctly crawl and index the text inside the PDF. The result of this is that when you search the SPO site on text strings that appear inside the PDF, the PDF is not returned in the search results.
However, when the ‘Generate tagged PDF’ check box IS selected in the PDF target, the search behaviour is again back to normal: After uploading the PDF to SPO, the PDF is returned in the search results when expected!
According to Flare’s help, PDF tagging “gives the file a structure similar to that of the source XHTML documents. This structure is necessary for certain accessibility applications, including screen readers”.
Some googling around shows that tagged PDFs are supposed to help with screen readers for users with disabilities. But did not find any information that the check box may impact the searchability of the PDF text in SPO..
I am wondering whether anyone has similar experiences with this check box?
Thanks & best regards,
Stijn Vanderwee
'Generate tagged PDF' for PDFs uploaded SharePoint Online
-
- Jr. Propeller Head
- Posts: 3
- Joined: Mon Dec 10, 2018 4:36 am
-
- Propeller Head
- Posts: 12
- Joined: Wed Mar 30, 2016 1:59 am
Re: 'Generate tagged PDF' for PDFs uploaded SharePoint Onlin
Hello,
I was curious and tried that, and I can confirm that.
Without checking the mentioned box, full text search in SPO does not work for the PDF created with Flare.
regards
Christian Wagner
I was curious and tried that, and I can confirm that.
Without checking the mentioned box, full text search in SPO does not work for the PDF created with Flare.
regards
Christian Wagner