
Crawling Sites Protected by Forms or Cookie-based Authentication with Enterprise Search in SharePoint Server 2007


Learn how to use Enterprise Search to crawl sites that are protected by forms or cookie-based authentication, and walk through creating a sample custom security trimmer for trimming the returned search results…

Read the full MSDN article here

Related resources.

Read the KB article here

Download the required Hotfix here

Download a required tool here

Published Thursday, October 04, 2007 12:45 PM by enterprisesearch

Comments

# Techy News Blog » Crawling Sites Protected by Forms Authentication with Enterprise Search in SharePoint Server 2007

# re: Crawling Sites Protected by Forms or Cookie-based Authentication with Enterprise Search in SharePoint Server 2007

Sunday, October 07, 2007 11:41 PM by Sushrut

Hi,

I have some questions about MOSS Search, specifically about crawling and indexing. It would be great to have your input on these:

-Is there a hard limit of one IFilter per file extension? For example, on our MOSS Search server, the ‘.txt’ IFilter seems to be implemented in ‘tquery.dll’. Can I have an additional IFilter, say ‘MyTxtFilter.dll’, that chains with this default filter, so that the filter daemon calls both when indexing?

-If chaining a second IFilter for the same extension is not possible, is there a way to have my custom code execute alongside the default IFilters? If so, how? I am specifically looking for ways to access chunks at crawl time and assign the text or values inside those chunks to my custom crawled properties.

-If we can have only one IFilter per file extension, can I replace the default shipped IFilter with my custom IFilter (i.e. replace tquery.dll with MyTxtFilter.dll)? I am assuming that, since we are not touching the protocol handlers, my custom IFilter will receive chunks with text or values for all the content that the indexer crawls (and has protocol handlers registered for). What behavior will I see after each crawl ends, apart from seeing only those crawled properties that MyTxtFilter.dll creates?

-Say that on a MOSS server I have two search SSPs, SSP1 and SSP2, configured to crawl the exact same content. Can I map crawled properties from SSP2 to managed properties in SSP1? (SSP1 would therefore have managed properties that are mapped to crawled properties from both SSP1 and SSP2.) I cannot see the crawled properties of other SSPs on the mapping page. Can it be done programmatically? Are there any caveats that I need to be careful about?

Rgds,

Sushrut.
