
Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Version 8 of Adobe Reader brings some significant architectural changes (for the better, of course), including a built-in IFilter for indexing PDF documents. Previously the Adobe IFilter was available only as a separate download. This change in architecture broke the ability to search PDF documents from within MOSS 2007, although the PDF filter works fine with WDS 3.0. Many consultants therefore recommend using v.6 of the Adobe IFilter to index PDF documents through MOSS 2007, and v.8 of Adobe Reader to index PDF documents through WDS 3.0 or higher. But what if we want to index PDF documents using both WDS and MOSS 2007? Here's how you can use MOSS 2007 with Adobe Reader v.8, the version that WDS currently works with :)

1. Download and install Adobe Reader v.8.

2. Add the pdf extension to the list of crawled file types:

Start -> Programs -> Microsoft Office Server -> SharePoint 3.0 Central Administration -> <Name of Shared Services Provider> -> Search Settings -> File Types -> New File Type (add the extension pdf here)

3. Modify the following Registry keys by changing their "Default" value to the new CLSID of the Adobe IFilter:  {E8978DA6-047F-4E3D-9C78-CDBE46041603}

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf

Default --> {E8978DA6-047F-4E3D-9C78-CDBE46041603}

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf

Default --> {E8978DA6-047F-4E3D-9C78-CDBE46041603}

4. Add the installation directory of Adobe Reader v.8 to the system path. For example, if the Reader is installed under "D:\Program Files\Adobe", add "D:\Program Files\Adobe\Reader 8.0\Reader" to the system path as follows:

--> Right-click My Computer -> Properties -> Advanced -> Environment Variables -> Path (under System Variables) -> Edit -> (add "D:\Program Files\Adobe\Reader 8.0\Reader").

This tells the Adobe IFilter where to find its dependent DLLs.

 

5. Recycle the search service:

> net stop osearch

> net start osearch

 

6. Voilà! Now we can crawl and search PDF documents with the v.8 Reader.
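Steps 3 and 4 can also be prepared with a small script. What follows is a minimal Python sketch, not part of the original procedure: it builds the text of a .reg file that sets the default value of the two keys from step 3, and it shows a helper that appends the Reader directory to a PATH string only when it is missing. The function names `build_reg_file` and `append_to_path` are hypothetical, and the D:\ directory is just the example path used in step 4. The script only produces text; the .reg file would still have to be saved and imported, and the PATH change applied through the System dialog.

```python
# Sketch only: generates text, does not touch the registry or the environment.
ADOBE_V8_CLSID = "{E8978DA6-047F-4E3D-9C78-CDBE46041603}"

# The two keys named in step 3.
FILTER_KEYS = [
    r"HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0"
    r"\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf",
    r"HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0"
    r"\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf",
]


def build_reg_file(keys=FILTER_KEYS, clsid=ADOBE_V8_CLSID):
    """Return .reg file text that sets each key's default value to clsid."""
    lines = ["Windows Registry Editor Version 5.00", ""]
    for key in keys:
        lines.append("[" + key + "]")
        lines.append('@="' + clsid + '"')  # '@' denotes the (Default) value
        lines.append("")
    return "\r\n".join(lines)


def append_to_path(path_value, directory):
    """Append directory to a ';'-separated PATH string unless already present."""
    entries = [e for e in path_value.split(";") if e]
    wanted = directory.rstrip("\\").lower()
    if any(e.rstrip("\\").lower() == wanted for e in entries):
        return path_value
    return ";".join(entries + [directory])


if __name__ == "__main__":
    print(build_reg_file())
    print(append_to_path(r"C:\WINDOWS\system32;C:\WINDOWS",
                         r"D:\Program Files\Adobe\Reader 8.0\Reader"))
```

Comparing PATH entries case-insensitively and ignoring a trailing backslash keeps the helper from adding the same directory twice.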

Published Thursday, March 29, 2007 5:12 AM by Deb Haldar

Comments

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, March 29, 2007 6:17 AM by dmurillo

Do you know if Adobe has an IFilter implementation for 64-bit OS's (XP, 2K3, Vista)?

# Adobe PDF I-Filter for SharePoint 2007

Thursday, March 29, 2007 2:13 PM by Igor Macori

Thanks to a timely tip from Claudio, and although the title of the post might raise excitement

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, March 29, 2007 2:24 PM by Deb Haldar

Adobe is working on getting a 64 bit version of IFilter out. We don't have any ETA at this point though.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, March 29, 2007 2:38 PM by Marco van Schagen

Would there be info available on how to develop 64 bit versions? I'd like to adjust our (3rd party) DWG IFilter when possible.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, March 29, 2007 3:15 PM by Deb Haldar

The procedure is exactly the same if you're using Visual Studio. However, your document parsing libraries might have different requirements for 32 and 64 bit. For example, if you use interop to call unmanaged code, the layout (offsets) of the PROPSPEC structures might need to be modified. In the end, the requirements vary between implementations.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, March 30, 2007 11:47 AM by David Gilbert

I followed all the steps, but this registry entry does not exist:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Search\1.0\Filters\AcroExch.Document

The other two were there.

Ideas?

Thanks

Dave

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Saturday, March 31, 2007 7:21 AM by Mike Walsh

What's "WDS 3.0" ?

and more importantly will this method also work with WSS 3.0 or only with MOSS 2007 ?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Saturday, March 31, 2007 2:23 PM by Deb Haldar

Dave, the key should exist. I tried it on the following config on a couple of machines.

OS: Win2k3 SP2

MOSS 2007 standalone config.

However, I'll double check and update the article if necessary.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Saturday, March 31, 2007 2:49 PM by Deb Haldar

Mike, WDS refers to Windows Desktop Search.

I haven't personally tried it on WSS, but the same strategy should work. After installing WSS, just replace the old CLSID of the PDF IFilter with the new one, then try to index the PDF files. It's also very likely that you'd need to add the Adobe installation directory to the system path.

# MOSS and Indexing PDF Documents

Thursday, April 05, 2007 5:20 PM by Steve Caravajal's Ramblings

Definitely worth a read... Indexing pdf documents with Adobe Reader v.8 and MOSS 2007 The version 8 of

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, April 05, 2007 8:18 PM by Robert Bradford

Any contact at Adobe to get on a mailing list or something for the availability of the 64-bit iFilter? We talked to 6 folks at Adobe today, all of whom had (a) never heard of iFilters and (b) never heard of MOSS, including their management. The perils of being a semi-early adopter of a new technology, I suppose. Thanks in advance for any guidance.

Cheers - RHB

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, April 05, 2007 8:54 PM by Deb Haldar

Robert, our management is talking to Adobe to expedite the release. Unfortunately, staff members (such as myself) do not have contact info for Adobe folks in charge of ifilters.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Monday, April 09, 2007 6:12 PM by Susan C

This registry entry does not exist on my server either:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Search\1.0\Filters\AcroExch.Document

The other two were there.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Monday, April 09, 2007 9:12 PM by Deb Haldar

Susan, David, thank you for pointing this out. The machine I used for debugging had some weird installations on it.

I've corrected the article accordingly.

# Sample Data for Capacity Planning

Friday, April 20, 2007 8:50 PM by Joel Oleson's SharePoint Land

I've been through many conversations that ended with... Ok, if we only knew how large the files were

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Sunday, April 29, 2007 11:49 PM by Danny

Is the new iFilter in Adobe Acrobat Professional 8.0 as well? Meaning, must I download the Reader even though I have Professional installed to get the new 8.0 iFilter in order to search with WDS?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Monday, April 30, 2007 1:50 PM by Deb Haldar

Danny, the IFilter ships with both the Reader and Professional. However, note that the filter in Professional (and Reader) 8.0 only works with WDS 3.0 and up.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, May 09, 2007 3:26 PM by Tom

What is WDS??

When is a 64-bit Adobe PDF IFilter coming out??

Thank you, Tom

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, May 09, 2007 10:54 PM by Deb Haldar

WDS refers to Windows Desktop Search.

Tom, you can find a 64 bit PDF IFilter from FOXIT software. See the blog entry under:

http://blogs.msdn.com/ifilter/archive/2007/05/10/long-awaited-64-bit-pdf-ifilter-finally-available.aspx

Thanks,

Deb.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, May 23, 2007 10:42 AM by Peter De Meester

I tried the approach on a WSS v3, and it works. Thanks for the CLSID-tip!!

Peter

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, May 29, 2007 9:51 AM by Jens

Hi, I have installed Adobe Acrobat 8 on the server and followed steps 2 to 5, but the PDF icon is not visible and the PDF content is still not searchable. Do I have to start a full crawl before I can search? Or is there a way to check the installation?

Thanks in advance,

Jens

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, May 30, 2007 6:52 PM by Deb Haldar

Jens, you'll need a full crawl for the PDF contents to be indexed.

regards,

Deb.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, June 08, 2007 2:53 PM by Michael

I have followed the instructions above and then crawled our Corporate Intranet hr page (not migrated to MOSS 2007 yet).  The results of all the PDF docs fail as follows:

The filtering process could not load the item. This is possibly caused by an unrecognized item format or item corruption.

I have checked the PATH, the reg entries etc. I have even rebooted the server, deleted the index and recrawled. Thoughts on how to troubleshoot?

I have a single MOSS server but the DB is on another box (SQL 2000 for now).

Thanks.

Michael

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Saturday, June 09, 2007 1:22 PM by Deb Haldar

Michael,

From the failure message it seems like the search process could not pick up the filter.

Usually in cases like this, the easiest thing to do is to use ifilttst/filtdump and try to filter a PDF file from the command line.

If that does not work, try the following:

->Recycle the search service

->Launch regmon and set a filter on *pdf*

->Launch filemon and set a filter on *pdf*

-> Start the crawl and try to see which registry keys/Files are being accessed. This should give you a clear idea of the COM registry calls made as well as if dependent DLLs are failing to load with V.8 reader.

regards,

Deb.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Saturday, June 09, 2007 1:25 PM by Deb Haldar

Also, folks, we recently ran some perf tests on the FOXIT pdf ifilter vs the adobe pdf filter and the former provided much better results. So my recommendation at this point is to switch to FOXIT ifilter instead of using the complex workarounds with adobe ifilter.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Monday, June 11, 2007 3:07 PM by Michael

Deb -

Thanks for the tip. Using the ifilter explorer from Foxit's site, I saw that for some reason my Adobe 8.0 filter wasn't registering properly. At your suggestion, I tried Foxit's PDF filter, made the registry change, and voilà, my filter worked.

Thanks a bunch!

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, June 22, 2007 7:49 AM by evald

Hi !

I have SP (SharePoint 2007) installed on Windows Server 2003 64-bit edition.

The database (SQL Server 2005) is on another machine running Windows Server 2003 32-bit edition.

Both the OS and SQL Server are up to date.

Full-text search is working, but not for PDF documents; they are not being indexed.

I have added the file type in SharePoint and installed the IFilter 6.0 from Adobe on both servers, but it is not working.

Any ideas?

thanks.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, June 22, 2007 1:58 PM by Deb Haldar

If your MOSS 2007 is 64 bit, you cannot use the adobe filter as it is a 32 bit binary and our 64 bit filtering process cannot load a 32 bit binary.

In that case, please use the 64 bit FOXIT ifilter. (See blog post on FOXIT).

Thanks,

Deb.
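The 32-bit versus 64-bit mismatch described in this reply can be checked on the filter DLL itself. As a rough sketch, assuming only the standard PE file layout (the function names `pe_machine` and `pe_machine_of_file` are made up for illustration), the following Python reads the machine field from a DLL's header. A 64-bit filtering process would need the x64 value.

```python
import struct

# Machine values from the PE/COFF file header.
MACHINE_NAMES = {0x014C: "x86 (32-bit)", 0x8664: "x64 (64-bit)"}


def pe_machine(data):
    """Return a readable machine type for the bytes of a PE file (DLL or EXE)."""
    if data[:2] != b"MZ":
        raise ValueError("not a PE file: missing MZ signature")
    # Offset 0x3C of the DOS header holds the file offset of the PE signature.
    (pe_offset,) = struct.unpack_from("<I", data, 0x3C)
    if data[pe_offset:pe_offset + 4] != b"PE\0\0":
        raise ValueError("not a PE file: missing PE signature")
    # The 2-byte Machine field follows the 4-byte signature.
    (machine,) = struct.unpack_from("<H", data, pe_offset + 4)
    return MACHINE_NAMES.get(machine, hex(machine))


def pe_machine_of_file(path):
    """Read only the start of the file; assumes the headers fit in 4 KiB."""
    with open(path, "rb") as handle:
        return pe_machine(handle.read(4096))
```

It could be pointed at whichever IFilter DLL is registered on the indexer, for example `pe_machine_of_file(r"D:\Program Files\Adobe\Reader 8.0\Reader\AcroRdIF.dll")`, where that exact path is an assumption.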

# Post Sharepoint 2007 / WSS 3.0 ToDos: Enable PDF in search and display

Tuesday, July 10, 2007 5:05 PM by FG

Fix PDF in search of WSS 3.0 and MOSS 2007 - Install Acrobat Reader 8 on the SharePoint web server -

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, July 13, 2007 10:59 AM by Carl

Great article. However, if you start a new SharePoint service, one step is missing to make the icon show up.

This step is described in this article:

http://support.microsoft.com/kb/832809

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, August 01, 2007 5:01 PM by Gus

I followed the steps and I can search PDFs that have already been OCR'd. When I run paper through my copier and create a PDF file, all I get is an image which is not searchable. Should this filter perform the OCR on the image when it is indexed and make the file searchable?

postmaster@momentumcc.com

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Monday, August 13, 2007 10:22 AM by pbruss

Hello,

I have W2K3 EE, SQL2005 Std, MOSS 2007, Adobe Reader 8.1. Tried these steps (numerous times) and the Crawl logs always tell me:

'crawled (the filtering process could not process this item. This might be because you do not have the latest file filter for this type of item. install the corresponding filter and retry your crawl.'

Before I rip out Adobe 8.1 and revert to an earlier version, are there any other areas you can think of that would point towards the problem? Thanks much.

pat

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, August 31, 2007 11:41 AM by Victor S

As much as MOSS is being deployed all over the world right now, I am surprised at how little information is available on this subject.

There needs to be an official Knowledge Base article from Microsoft that documents the EXACT steps for getting the IFilter installed.

I am still not clear on the best route to take with a 32 bit MOSS deployment: IFilter 6.0, or Reader 8?

This was a pain in 2003 and is still a pain in 2007. Come on, people...

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, September 05, 2007 10:49 PM by Victor S

I spoke with Lawrence Liu during SPUG in Midtown Manhattan and he was not aware that this is an option. Let's get a KB out there shall we?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Friday, September 07, 2007 4:06 AM by Deb Haldar

Victor, please make a request to MS Product Management (Lawrence's team) and once we receive the request from them, we can start the process.

The other alternative is to contact MS product support and file the request with them.

Thanks,

Deb.

# Corrections to 3rd step

Wednesday, September 12, 2007 3:56 PM by Alex Kuzmin

Hi, Deb!

In the 3rd step, changing the default value for the key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\.pdf is not required for MOSS; it's only needed for WSS.

WBR, Alex.

# MOSS Search: Tool, IFilter, provider, ...: some good resources to know (part 2)

Friday, September 14, 2007 4:50 AM by The Mit's Blog

Now that search is working well, you need the right tools to make good use of it, don't you


# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, October 31, 2007 11:39 AM by Doug Kenrick

Does anyone know if the issue of PDF's not showing up in search after using the method above is still an open issue? I have MOSS on W2K3 R2 Standard, SQL 2K5 backend, followed the above instructions to index PDF files in MOSS, docs show up in the crawl log but not in search, any suggestions?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, November 07, 2007 11:16 AM by John Young

Tried using the PDF iFilter (as of V8.1) through a simple test program, and the "text" portion of the PDF document is not returned, although attributes are. We are using the GetChunk method of the IFilter interface. This works fine with iFilter 6.0 (but is problematic, as we all know). Operating on XP Prof (without WDS).

Any ideas?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, November 07, 2007 11:22 AM by Marco van Schagen

John,

You need to use GetText as well. In general this will return the bulk of the text. You may need a loop that issues a series of GetText calls to receive all of the contents.

I am talking general IFilter specs as I have built the DWG IFilter, not PDF.

Good luck.
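The calling pattern described in this reply, an outer loop over chunks and an inner loop over GetText, can be sketched as follows. This is a Python simulation against a made-up `FakeFilter` class, not real COM code and not the Adobe filter itself. The status names mirror the IFilter return codes FILTER_E_END_OF_CHUNKS and FILTER_E_NO_MORE_TEXT, and the chunk data is invented.

```python
# Simulation only: FakeFilter stands in for a COM IFilter object.
S_OK = "S_OK"
FILTER_E_END_OF_CHUNKS = "FILTER_E_END_OF_CHUNKS"
FILTER_E_NO_MORE_TEXT = "FILTER_E_NO_MORE_TEXT"
CHUNK_TEXT, CHUNK_VALUE = "CHUNK_TEXT", "CHUNK_VALUE"


class FakeFilter:
    """Hands out chunks; text chunks are returned a few characters at a time."""

    def __init__(self, chunks, buffer_size=8):
        self._chunks = list(chunks)  # [(chunk_type, payload), ...]
        self._buffer_size = buffer_size
        self._remaining = ""

    def get_chunk(self):
        if not self._chunks:
            return FILTER_E_END_OF_CHUNKS, None
        chunk_type, payload = self._chunks.pop(0)
        self._remaining = payload if chunk_type == CHUNK_TEXT else ""
        return S_OK, chunk_type

    def get_text(self):
        if not self._remaining:
            return FILTER_E_NO_MORE_TEXT, ""
        piece = self._remaining[:self._buffer_size]
        self._remaining = self._remaining[self._buffer_size:]
        return S_OK, piece


def extract_text(ifilter):
    """Outer loop over GetChunk, inner loop over GetText until no more text."""
    parts = []
    while True:
        status, chunk_type = ifilter.get_chunk()
        if status == FILTER_E_END_OF_CHUNKS:
            break
        if chunk_type != CHUNK_TEXT:
            continue  # value chunks are read with GetValue in the real interface; skipped here
        while True:
            status, piece = ifilter.get_text()
            if status == FILTER_E_NO_MORE_TEXT:
                break
            parts.append(piece)
    return "".join(parts)
```

A caller that stops after the first GetChunk, or that never calls GetText on text chunks, would see only attributes, which matches the symptom reported above.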

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, November 21, 2007 7:41 AM by jerome

Hi all

Do you have any solution for installing the new PDF IFilter 8 on a WSS v3 farm on W2K3 64 bit?

I have tried the solution described on the Adobe Labs site, but it doesn't work.

If someone has successfully installed it, please let me know.

regards

jerome

# Indexing TIFs with OCR

Friday, November 30, 2007 7:42 AM by mosslive.de

Some things just need time to mature, so first a quick bit of background (summarized instructions

# Indexing TIF files using OCR

Friday, November 30, 2007 7:49 AM by mosslive.de

A couple of months ago I migrated a friend's SPS 2003 to MOSS 2007. On that day I realized that MS skipped

# Indexing pdf documents with Adobe Reader v.8 and MOSS 2007 on 64 bit

Friday, January 18, 2008 3:50 PM by wes@pronet-inc.com

I was able to get the pdf's to index on a 64 bit server following this article: http://labs.adobe.com/wiki/index.php/PDF_iFilter_8_-_64-bit_Support

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, January 22, 2008 6:20 AM by Pwa

One thing is still not clear for me : on an installation with MOSS and SQL on 2 separate servers, what should be installed on each machine ?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, January 22, 2008 5:11 PM by Kumu

Is there any document that tells me how to install adobe 8 filter on SQL server 2005? Thanks

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, January 22, 2008 11:51 PM by Deb Haldar

Pwa, you should install the Adobe Reader (which contains the iFilter) on the indexer (MOSS) box.

Thanks,

Deb.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, January 29, 2008 5:41 PM by Matt

I have tried all of the above steps but I am only getting 1 pdf showing up when I do a search that should be showing at least 5.  This is on WSS 3.0.  Has anyone seen this happen?

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, February 12, 2008 6:29 PM by f00

thought I would post a link to adobe's instructions for 64 bit ifilter:

http://labs.adobe.com/wiki/index.php/PDF_iFilter_8_-_64-bit_Support

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, February 20, 2008 8:04 AM by Tom Vervoort

Thank you, this was exactly the information I was looking for. I tried some different 'solutions' I found on the net, but none of them worked. After reading this article, PDF indexing was setup in less than 2 minutes. :-)

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, March 13, 2008 2:14 AM by Prashanth

We have installed Adobe iFilter v6.0 on our MOSS 2007 server. However, we are not able to index and search on PDF metadata such as Title, Author, Subject & Keywords. When we tried the Foxit iFilter, it was able to search on Title & Author, but not on Subject & Keyword. When we contacted Foxit Support, they informed us that they supported only Title & Author in the current release, and that Subject & Keyword would be supported only in the next release, due in the first half of 2008. Does the Adobe iFilter support metadata search? Any pointers would be useful.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Wednesday, March 26, 2008 4:22 PM by Eric

Thanks f00! That worked perfectly.

The only thing not in Adobe's article is how to add the pdf icon to SharePoint...still looking for that.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Monday, March 31, 2008 5:07 PM by Thant

Hi

Can anybody help me on this?

I am testing Microsoft SharePoint Server 2007, particularly in the document management area.

My setup has everything (SharePoint Server and SQL Server) installed on one PC.

I've installed the Adobe IFilter ver 6.0 and configured the registry and the SharePoint search settings as Deb Haldar suggested.

See the link below.

http://blogs.msdn.com/ifilter/archive/2007/03/29/indexing-pdf-documents-with-adobe-reader-v-8-and-moss-2007.aspx

Unfortunately, my PDF file search doesn't work.

All the other file types, such as Doc, Excel and text files, work fine; only PDF files do not.

Although I can see those PDF files in the crawl log, their content is not returned on my search results page.

I have been stuck on this problem for over two days and would greatly appreciate any input.

Thank you so much in advance.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Thursday, April 03, 2008 6:52 AM by MaBerY

I was having the same problems using the IFilter supplied with Acrobat 8.1 on one of our two farms.

I got the first one working and upgraded to SP1 afterwards.

The second one was a vanilla install of MOSS 2007 SP1, and I tried to install the IFilter supplied with Acrobat Reader 8.1.

It did not work. I tried running the IFilter on the command line with ifilttst.exe, and that worked OK: filtering a PDF file left an output log file with keywords.

In the crawl log I found entries showing that the PDF files were found but that the PDF IFilter failed to start.

I was about to give up and move back to the 6.0 IFilter, but I did some final checks in the registry.

For some reason, SP1 seems to contain a registry entry for the 6.0 Adobe IFilter GUID:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\Filters\.pdf

It listed a default of {4C904448-74A9-11D0-AF6E-00C04FD8DC02}, which is a 6.0 IFilter value.

I changed that to {E8978DA6-047F-4E3D-9C78-CDBE46041603}, restarted the search service and did a full crawl. Then it started working.

After googling the 6.0 GUID I found an MS KB article describing the same symptoms for the Foxit IFilter, but it also applies to the Acrobat 8.1 filter:

http://support.microsoft.com/kb/944447

I would recommend checking the Citeknet IFilter Explorer at http://www.citeknet.com/Products/IFilters/IFilterExplorer/tabid/62/Default.aspx
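A quick way to spot the leftover 6.0 registration described in this comment is to compare each filter key's default value against the two GUIDs. The sketch below is illustrative Python; `classify_filter_keys` is a hypothetical helper that works on a plain dictionary of key path to default value, so gathering those values from the registry (for example with regedit) is left to the reader.

```python
# GUIDs quoted in the comment above; the first is the one it identifies as the 6.0 filter.
ADOBE_V6_CLSID = "{4C904448-74A9-11D0-AF6E-00C04FD8DC02}"
ADOBE_V8_CLSID = "{E8978DA6-047F-4E3D-9C78-CDBE46041603}"


def classify_filter_keys(defaults):
    """Map each registry key path to 'v8', 'v6 (stale)' or 'other'."""
    report = {}
    for key, value in defaults.items():
        guid = (value or "").strip().upper()  # tolerate missing values and case
        if guid == ADOBE_V8_CLSID:
            report[key] = "v8"
        elif guid == ADOBE_V6_CLSID:
            report[key] = "v6 (stale)"
        else:
            report[key] = "other"
    return report
```

Any key reported as stale would be a candidate for the same change the comment describes, followed by a restart of the search service and a full crawl.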

# Not able to search inside pdf

Tuesday, April 15, 2008 8:55 AM by Anil

I tried your steps, but I am still not able to search inside PDFs.

# re: Indexing pdf documents with Adobe Reader v.8 and MOSS 2007

Tuesday, May 06, 2008 1:23 AM by Javed

I am facing the same problem as John. I have the Adobe 8 IFilter, named AcroRdIF.dll.

I am able to load the filter using LoadIFilter, and then I called Init(), which succeeded. But when I call GetChunk(), it does not initialize the STAT_CHUNK structure and returns FILTER_E_END_OF_CHUNKS.

Any ideas, please?
