Tuesday, May 22, 2012

Indexing Pdf files Search results for pdf files in Sharepoint 2010

As Adobe is not a Microsoft product. There are some limitations on Indexing and getting the search results for pdf files. 
 If you add PDF as a file type for SharePoint Search, you will get the results however the icon will not be displayed. You can see that only the file attributes are indexed. 
In order to index the content of pdf files with SharePoint 2010 you need to install an ifilter
 Adobe provides a free pdf ifilter that works for most pdf files and there are a couple other vendors offering ifilter that you might want to try if Adobe’s doesn’t work. Adobe has an ifilter for free that supports most pdf documents. You may have the adobe ifilter installed on your home or work pc as it is also delivered with Adobe Acrobat Reader.  For those not supported, a vended ifilter from Foxit is usually the best choice. Links provided.
 1.  Download the pdf ifilter from Adobe here
Note :  You need to install the 64 bit version of Ifilter 9 listed above. Other versions will not work
 2.  Install the ifilter on the App/Index Server
Download the Adobe PDF icon (select the smaller icon, at time of latest update, it’s called 16 x 16) currently available from http://www.adobe.com/misc/linking.html#pdficon
             Give the icon a name or keep the default name: pdficonsmall.gif
             Save the icon (or copy to) C:\Program Files\Common Files\Microsoft Shared\Web Server Extensions\14\TEMPLATE\IMAGES
3. Edit the DOCICON.XML file to include the PDF icon
o             In Windows Explorer, navigate to C:\Program Files\Common Files\Microsoft Shared\Web Server Extensions\14\TEMPLATE\XML
o             Edit the DOCICON.XML file (open with Notepad )
o             Ignore the section <ByProgID> and scroll down to the <ByExtension> section of the file
o             Within the <ByExtension> section, insert a <Mapping Key=”pdf” Value=”pdficonsmall.gif” /> attribute. The easiest way is to copy an existing one and then making the required changes  
5) Open the Search Service Application in Central Administration and choose the File Types menu on the left. Add pdf as an extension
 6) Restart the Search Service Application
Start => run => services.msc => find “SharePoint Server Search 14” right click Restart
Or You can run the command given below from the command prompt
net stop osearch14
net start osearch14

7) Do a full crawl
Note: You must do a full crawl for changes to take effect and the documents in the index to be updated.
You might find in some blogs mentioning the Registry change however it is not required .
Link to Download the Foxit Ifilter: http://www.foxitsoftware.com/products/ifilter
In some cases. I have seen that we might have to reinstall Foxit ifilter after an Upgrade of patches.

No comments:

Post a Comment

Users cannot see the checked out files in the folder/ library

I Came across a Issue today wherein the user opened a ticket for the below issue Issue : Users cannot see the checked out files in the fo...