
Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 1 of 2

This first of two guest blog entries, written by Neil Hodgkinson, a Microsoft Premier Field Engineer based in the UK, will cover the "why we did it" aspect of the open source faceted search solution for MOSS 2007 and MSS 2008 that has been released on CodePlex at http://www.codeplex.com/facetedsearch. The second guest blog entry, scheduled to be posted within a couple of weeks, will be written by Leonid Lyublinski, a Microsoft Consultant based in Ohio, USA, and will cover the "how we did it" aspect of the solution.

<Lawrence />

 

Background

Metadata is information gathered about the resources that users are trying to locate, in addition to the resources themselves. Classically, it is defined as information about information; more precisely, it is structured information about resources. For companies with large data libraries or repositories of corporate information, this metadata is often much more than a simple hierarchical set of subject labels. Typically, the metadata has several facets -- that is, multiple attributes assigned to the resource being indexed.

Examples of faceted metadata include:

  • Music catalog: songs have attributes such as artist, title, length, genre, date.
  • Company white pages: directory of people with names, department, role, region.
  • Recipes: cuisine, main ingredients, cooking style, holiday.
  • Travel site: articles have authors, dates, places, prices.
  • Regulatory documents: product and part codes, machine types, expiration dates.
  • Image collection: artist, date, style, type of image, major colors, theme.

In all of these cases, there is no single way to provide navigation for everyone because users have disparate needs. One person might want to look through all the albums created by one band; others might be more interested in particular musical genres or instruments.

With traditional parametric searching techniques, users are expected to provide one or more parameters that describe the object being searched for. The drawback of this approach is that requiring the user to choose parameters up front can exclude valid results when the search criteria are too confining.

An alternative to parametric searching is full text search. While valid in its own right, this approach loses a certain degree of refinement. To a full text search engine, the fact that a recipe contains a particular ingredient is irrelevant, because the context in which the ingredient is used has not been preserved.

Faceted Metadata Search Solution

A good solution to these problems involves exposing the facets in dynamic taxonomies so that the user can see all of the refinement options at any time. The user can easily switch between a search-based approach and metadata browsing, using familiar terminology while recognizing the organization and vocabulary of the data.

Key features for metadata search include:

  • Displaying aspects of the current results set in multiple categorization schemes.
  • Showing only categories that contain results, so there are no dead ends (links leading to empty lists).
  • Displaying a count of the contents of each category, which lets the user know what size of result set to expect if they choose that facet.
  • Generating groupings on the fly, such as by size, price, or date.
  • Drilling down by facet, so a record enthusiast could choose genre, artist, title, and year in turn.
  • Adding special facets within categories -- e.g. a Yellow Pages site would want to show cuisine and location for restaurant listings but not plumbers.
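
To make these features concrete, here is a minimal Python sketch of facet counting, drill-down, and an on-the-fly grouping over an in-memory result set. It is an illustration only and is not taken from the CodePlex solution, which runs against the MOSS/MSS search index. The sample data and the names `SONGS`, `facet_counts`, `drill_down`, and `with_decade` are all hypothetical.

```python
from collections import Counter

# Hypothetical sample result set: each record maps facet names to values.
SONGS = [
    {"title": "Song A", "artist": "Band X", "genre": "Rock", "year": 1999},
    {"title": "Song B", "artist": "Band X", "genre": "Pop", "year": 2001},
    {"title": "Song C", "artist": "Band Y", "genre": "Rock", "year": 2001},
]


def facet_counts(results, facet_names):
    """Count how many results fall under each value of each facet.

    Only values that occur in the current results are counted, so a
    category with zero hits never shows up (no dead ends).
    """
    counts = {name: Counter() for name in facet_names}
    for record in results:
        for name in facet_names:
            if name in record:
                counts[name][record[name]] += 1
    return counts


def drill_down(results, facet_name, value):
    """Narrow the result set to records whose facet has the chosen value."""
    return [r for r in results if r.get(facet_name) == value]


def with_decade(record):
    """Return a copy of the record with a 'decade' facet derived from 'year'.

    This stands in for a grouping generated on the fly (size, price, date).
    """
    enriched = dict(record)
    enriched["decade"] = f"{record['year'] // 10 * 10}s"
    return enriched
```

Calling `facet_counts(SONGS, ["artist", "genre"])` gives the counts to display next to each refinement link. After `drill_down(SONGS, "genre", "Rock")`, recomputing the counts on the narrowed list leaves out "Pop" entirely, which is the no-dead-ends behavior described above.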

Implementing Faceted Search in MOSS 2007 and MSS 2008

The solution started in June 2007 as a field research project for one of Microsoft's customers. Leonid Lyublinski, a Microsoft Consultant, delivered the architectural design and development of a Faceted Search solution as an add-on to MOSS 2007 and MSS 2008. The initial version was released with an open source license at http://www.codeplex.com/facetedsearch and has been very well received. A second major version was released just last week and includes the following features:

  • Support for all content sources: BDC, file shares, web sites, and SharePoint lists.
  • Asynchronous processing based on flexible number of facets.
  • Support of choice, lookup, and lookup with multiple selection fields.
  • Sorting of facet by name, hits, and max.
  • Configurable display name, icon per facet.
  • Adjustable facet exclusion based on wildcard match.
  • Client-side collapse/expand option.
  • Crop with tooltip for cropped values and quick info for the Facet.
  • Customizable styles consistent with SharePoint.
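
Two of the display features in this list, sorting facet values and cropping long values with a tooltip, can be sketched in a few lines of Python. This is a rough illustration under assumed behavior, not the web part's actual code: the function names, the tie-breaking rule, and the default crop length are hypothetical, and the "max" sort option is left out because the post does not define it.

```python
def sort_facet_values(counts, by="hits"):
    """Order (value, hit_count) pairs for display.

    by="name" sorts alphabetically, ignoring case.
    by="hits" sorts by descending hit count, then alphabetically.
    """
    items = list(counts.items())
    if by == "name":
        return sorted(items, key=lambda kv: kv[0].lower())
    if by == "hits":
        return sorted(items, key=lambda kv: (-kv[1], kv[0].lower()))
    raise ValueError(f"unsupported sort order: {by!r}")


def crop_label(value, max_len=20):
    """Return (display_text, tooltip) for a facet value.

    Values that fit are shown unchanged with no tooltip. Longer values
    are cut to max_len characters including a trailing "...", and the
    full value is returned as the tooltip text.
    """
    if len(value) <= max_len:
        return value, None
    return value[: max_len - 3] + "...", value
```

For example, `sort_facet_values({"Rock": 2, "Pop": 5, "Jazz": 2}, by="hits")` puts "Pop" first and then breaks the tie between "Jazz" and "Rock" alphabetically.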

Here are screenshots of a couple of example implementations:
[Two screenshots of example implementations]

Another major version of Faceted Search is scheduled for release within the next few weeks, and it will encapsulate foundational changes in the design and code that will provide a balance between search accuracy and performance. Key enhancements will include:

  • Multi-threaded processing. The first thread runs synchronously for up to 500 facets, while the second thread runs asynchronously against up to ~30,000 facets.
  • Client-side refresh (not AJAX) that updates only the facets web part, without a page refresh.
  • Web part connections to pass facet settings to the breadcrumbs.
  • Extended facet schema that now supports:
    • Facet icons -- a default icon per facet name, complemented by an icon per facet value.
    • Friendly names for facet values.
    • Exclusions -- a facet can be excluded when its values match a pattern.
    • Built-in wildcard matching, especially useful for exclusions.
    • Improved search syntax, with added support for sentences and quoted phrases.
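
As an illustration of wildcard-based exclusions, the Python sketch below filters facet values against a list of patterns using the standard library's `fnmatch` module. It makes several assumptions that the post does not confirm: the pattern syntax (shell-style `*` and `?`), case-insensitive matching, and dropping individual matching values rather than the whole facet. The function name `apply_exclusions` is hypothetical.

```python
from fnmatch import fnmatchcase


def apply_exclusions(counts, patterns):
    """Drop facet values that match any of the wildcard patterns.

    counts maps facet value -> hit count. Matching is case-insensitive
    here (an assumption): both the value and the pattern are lowercased
    before being compared with shell-style wildcards.
    """
    lowered = [p.lower() for p in patterns]
    return {
        value: hits
        for value, hits in counts.items()
        if not any(fnmatchcase(value.lower(), p) for p in lowered)
    }
```

With the pattern list `["draft*"]`, the values "Draft - Q1" and "draft notes" would be hidden while "Final" is kept.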

This new version will also include numerous bug fixes and be complemented by updated documentation for installation, configuration, and styling. It will be first demonstrated by Leonid and me at the Office Developer Conference 2008 in San Jose, California on February 10-13 and then released on CodePlex shortly thereafter.

 

Neil Hodgkinson, Microsoft PFE

Published Wednesday, January 30, 2008 3:57 AM by sptblog

Comments

# MSDN Blog Postings » 2008 » January » 30

Wednesday, January 30, 2008 8:00 AM by MSDN Blog Postings » 2008 » January » 30

# Faceted Search for MOSS and Search Server 2008

Faceted Search for MOSS and Search Server 2008

Wednesday, January 30, 2008 9:15 AM by harbar.net

# re: Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 1 of 2

Why were WSS 3.0 and MSSX discriminated against by not making this app available to users of WSS 3.0 and MSSX??

Wednesday, January 30, 2008 2:12 PM by Tom

# re: Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 1 of 2

Response to - Why were WSS 3.0 and MSSX discriminated against by not making this app available to users of WSS 3.0 and MSSX??

You will be able to use the faceted search capability with MSS Express and in fact the presentation at ODC will show MSS Express configured for facets.

The key element of the search index that makes facets so powerful is the gathered METADATA. Since WSS search alone doesn't provide a mechanism for creating and editing managed properties, the concept of using WSS and facets has not been considered here.

Wednesday, January 30, 2008 7:19 PM by Neil Hodgkinson

# Does MS already use this person search service in the company?

Hello, Neil Hodgkinson.

I'm interested in this person search service and I need your help, because our company is looking for a social networking solution.

I'd like to know whether MS already uses this person search service within the company.

If you have that information, or best-practice examples such as PDF or PPT files about a use case of the person search service at MS,

please send it to me. (nepsdev@hanmail.net)

Or, if you don't have the data or can't send it to me, please describe in a blog comment the reason why you couldn't send it.

Please include whether MS uses this person search service or not.

I'm waiting for your reply and I need your help.

Thursday, January 31, 2008 7:59 PM by Justine

# re: Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 1 of 2

Online demo of the SharePoint Faceted Web Parts can be seen here...

http://www.wssdemo.com/search/Pages/Results.aspx?k=web%20part

Friday, February 01, 2008 10:33 PM by Ian Morrish

# re: Does MS already use this person search service in the company?

YES. I am one of the users. Have a look at the Whitepaper "Creating Effective Enterprise Portals by Using SharePoint Server 2007"

http://technet.microsoft.com/en-us/library/cc184928.aspx

It covers how SharePoint has been deployed at Microsoft and includes this section...

Improved Ability to Find Employees and Subject Matter Contacts

Office SharePoint Server 2007 includes the ability to search for employees by using the People tab in Search Center. In an organization the size of Microsoft, collaboration between teams is very important. For example, at Microsoft, someone on the Windows Vista® team who is working on authentication may need to locate someone on the Windows Server® team who is working on the Active Directory® directory service.

Employees are located based on any of the typical employee attributes (such as name, office, department, or job title) in addition to expertise (such as developer, technical writer, or government sales). The ability to locate other employees quickly and based on meaningful criteria makes collaboration on a project more efficient. Employee information is imported from other sources into the profiles in Office SharePoint Server 2007. Then, the search services index the information and make it available.

Friday, February 01, 2008 11:54 PM by Ian Morrish

# Participation, Diversity, and Change

“Our goal here is to evolve and to hopefully provide information that makes it easier for people using OSS and Microsoft software in the real world.” Bill Hilf wrote this in an April 2006 blog entitled “Who Would Have Guessed?” just one week after Port25

Tuesday, February 05, 2008 11:37 PM by Port 25

# re: Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 1 of 2

Nice article Neil. Good to see a Brit giving it large and proud.

Friday, February 08, 2008 12:51 PM by Graham Chastney

# Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 2 of 2

Background This is part 2 of the blog series that was started about a month ago with Neil Hodgkinson's

# SharePoint Kaffeetasse #50

Wow - already the 50th edition of the Kaffeetasse .... Customization project: Building a news workbench on

Monday, March 31, 2008 3:55 AM by SharePoint, SharePoint and stuff

# The Ultimate List of SharePoint AddOns and Tools for Web 2.0, Enterprise 2.0 And Social Networking Features

How to bring Web 2.0/Enterprise 2.0 and Social Networking into the SharePoint-Platform? This is one of

Thursday, May 22, 2008 4:45 PM by SharePoint, SharePoint and stuff

# The Ultimate List of SharePoint AddOns and Tools for Web 2.0, Enterprise 2.0 And Social Networking Features

How to get Web 2.0/Enterprise 2.0 and Social Networking into the SharePoint-Platform? This is one of

Thursday, May 22, 2008 4:50 PM by SharePoint, SharePoint and stuff

# The Ultimate List of SharePoint AddOns and Tools for Web 2.0, Enterprise 2.0 And Social Networking Features

How to get Web 2.0/Enterprise 2.0 and Social Networking into the SharePoint-Platform? This is one of

Thursday, May 22, 2008 5:07 PM by Mirrored Blogs

# SharePoint and Web 2.0, Enterprise

Web2.0/Enterprise2.0/Social Software A. Free Tools, AddOns and Solutions 1. Community Kit: Enhanced Blog

Monday, June 09, 2008 7:56 AM by SharePoint, SharePoint and stuff

# The Ultimate List of SharePoint AddOns and Tools for Web 2.0, Enterprise 2.0 And Social Networking Features

How to get Web 2.0/Enterprise 2.0 and Social Networking into the SharePoint-Platform? This is one of

Thursday, September 04, 2008 2:43 PM by Michael Greth MVP SharePoint Blog

# Faceted Search for MOSS and Search Server 2008

Faceted Search for MOSS and Search Server 2008

Tuesday, December 30, 2008 1:23 PM by harbar.net

# The Best Free SharePoint Downloads

This post will continue to be maintained to contain the best free utilities and downloads available for

Friday, January 23, 2009 12:33 PM by Eli Robillard's World of Blog.

# re: Open Source Faceted Search for MOSS 2007 and Microsoft Search Server 2008 - Part 1 of 2

hi,

If I have a customized web part, how can I use the faceted web part? Where does the web part get its data from?

Thanks,

Sunday, February 22, 2009 4:20 AM by aviran24

# SharePoint and Web 2.0 Enterprise

Web2.0/Enterprise2.0/Social Software A. Free Tools, AddOns and Solutions 1. Community Kit: Enhanced Blog

Thursday, June 18, 2009 6:47 AM by Michael Greth [SharePoint MVP]
