Welcome to MSDN Blogs Sign in | Join | Help

Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Relevance is about how closely the search results match what the user wanted to find.

 

To improve the search results that MOSS Search returns, we need to understand how search results are ranked:

 

SharePoint performs two types of ranking, dynamic ranking and static ranking. Dynamic ranking, is something that happens on the Query Servers and depends on query and term matching whereas static ranking occurs at index time. Static ranking is query independent and is computed at index time. Lets dive deeper into each of these:

 

Dynamic Ranking:

This looks at the content or property values for a content item such as:

 

 Anchor Text

This evaluates the text that describes a target. E.g. <A href=http://portal/site> Company Name Enterprise Gateway Portal</A>

  •  Search harvests anchor text from HTML anchor elements, WSS Link Lists, SPS 2003 listings, Word/Excel/PowerPoint 2007 (files using Open Office XML File Formats)
  • Any other File Types handled by installed 3rd Party iFilter  components

Property Weighting

Property weighting infers that matches on a specific property value can be more relevant than other property values or in document’s body.

  • MOSS 2007 automatically enhances / extraction of metadata
  •  MOSS 2007 automatic tuning
  •  Index time implementation (occurs on index server)
    • Weight is part of property definition
    • Managed properties considered in ranking (weights can only be changed through object model); New Relevance Object Model in Microsoft.Office.Server.Search.Administration Namespace
      • Configure Managed Property (managedProperty.Weight = newWeight;) or set ranking parameter on predefined documents (RankingParameter.Value)

string strURL = "http://<SiteName>";

SearchContext srchContext;

using (SPSite site = new SPSite("http://yourSiteName"))

{

     srchContext = SearchContext.GetContext(site);

}

 

Ranking ranking = new ranking(srchContext));

 

foreach (RankingParameter param in ranking.RankingParameters)

{

    RankingParameter lookedup = ranking.RankingParameters[param.Name];

    Console.WriteLine(lookedup.Name + ": " + lookedup.Value);

}

      • Unmanaged properties NOT considered in ranking.

Title Extraction

Title is a very important property of ranking and are often wrong (e.g. “Slide 1”, or “Word Template Name”) MOSS 2007 has an intelligent way of overcoming this problem. What is does, is use a text extraction algorithm that generates a shadow title. How does it find a shadow title if one does not exist? It uses the headings inside your document. These are normally displayed using text formatting such as Heading 1 or Heading 2. 

 

Please note that this only works for Office file types, another words, the Office IFilter that MOSS 2007 search uses to pick up this information. 

 

URL Matching

Name of a website is normally a common type of query. MOSS Search matches site name to URL equivalent. 

 

Static Ranking

This describes the ranking that is not impacted by the content or property values for a content item.

 

File Type Biasing

In most search scenarios, certain file types are more relevant than others. This effects the MOSS Search relevance calculation ranks.

 

  • Order of relevancy: HTML Web pages, PowerPoint presentations, Word documents, XML Files, Excel Spreadsheets, Plain Text files, List Items
  • See Object Model : RankingParameter.Value
  •  IMPORTANT: You cannot add and/or remove File Types

Automatic Language Detection

Foreign language results are less relevant than results in user’s language

  • Index time: documents are tagged with their likely language.
  • Query time: MOSS Search determines users language via browsers headers (Accept-Language).
    • Advanced Search: User can override this default behaviour using different language.
    • Exception: ENGLISH is always considered as relevant as user’s language.

Click Distance from authoritative pages

NOTE: the difference between Click Distance and URL Depth. Click distance is not based on URL depth but rather on the path the user takes through pages to get to information.

 

Authoritative Pages (Configured in SharePoint Central Administration):

 

  • Sites linked to authoritative pages have higher relevant score.
  • Click distance can be improved by configuring authoritative pages in search admin. This effectively “bumps up” the a “X number of clicks site” to a one click site.
  • 3 levels of importance and is maintained by an administrator.
  • Pages linked to authoritative pages are MORE relevant than pages that and is adjusted until rank of all pages is influenced by its “click distance” to authoritative pages.
  • Administrators CAN demote relevance of sites. 

 URL Depth

Items with shorter urls are more relevant than items placed in longer URLs; E.g.  http://msw/ vs http://portal/divisionalsite/ProjectSite1/MeetingSite/ .Short URLS are like prime real estate and organisations tend to allocate them to the most important content.

 

Relevance Metrics

 

·         Precision@N: Avg. No. Of relevant documents in top 5, 10,etc.

·         Mean Average Precision: Avg. Precision from N-1 to R

·         Reciprical Rank: 1/rank of the top relevant document

·         Normalized Discounted Cumulative Gain (NDCG) : Represents ratio of current ranking to ideal

 

User’s Perceived Relevance

 

·         Summarization and Highlighting : Query-dependant summarization and highlighting of hits within summary.

·         Duplicate removal: Near duplicates documents are detected across index and removed at query time; can be disabled by admin

·         Best Bets: Best Bets promotion IS NO LONGER PART OF ranking algorithm

·         Did you mean? : Index informed spell checker; Only available for English, Spanish, French, (not sure of last language).

 

 

Optimization

 

·         First crawl your content J

·         Manage authoritative pages and demoted sites carefully

·         Mine query logs to identify keywords

·         Review list of descriptions, keywords, and best bets periodically as content prioritization can change over time

·         Use admin object model CAREFULLY to change weight given to properties

·         Features in ranking formula can also be added using object model to personalize ranking criterias:

o   http://msdn2.microsoft.com/en-us/library/microsoft.office.server.search.administration.ranking.rankingparameters.aspx

Published Monday, February 26, 2007 8:06 PM by Brian Wilson
Filed under: ,

Comment Notification

If you would like to receive an email when updates are made to this post, please register here

Subscribe to this post's comments using RSS

Comments

# WSS FAQ additions and changes LII - 23rd - 29th April 2007

Sunday, April 29, 2007 1:56 AM by Mike Walsh's WSS and more

# Mike Walsh's WSS and more - WSS FAQ additions and changes LII - 23rd - 29th April 2007

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Sunday, June 17, 2007 9:49 PM by Carl

Thanks Brian. Am looking at implementing MOSS 2007 as replacement for a legacy search engine and this article was really helpful.

Cheers

Carl

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Friday, November 23, 2007 8:22 AM by Michael

How SharePoint select text for HightlightSummary ?

When page conains word in other form, for example ran (search query: run) it returns some unrelated text, i think some text from beggining of page, because i get navigation very ofthen in that case very often.

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Wednesday, January 16, 2008 11:43 AM by Gaurav

This seems to be a very informative and rare article...kudos to the author for such a gud article.....

I have a ques though..........suppose Instead of ranking by Relevance and modified date as provided by MOSS 2007 OOB.....I need to rank by say author name,title etc....how to do that.......

Thanks

Gaurav

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Thursday, February 07, 2008 8:16 AM by Mike

I have noticed that File Biasing does not include PDFs.  What does it mean for PDF relevancy?  Will a PDF be less relevant than a List Item?  How does that work?

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Tuesday, February 12, 2008 3:16 PM by Dan Bauhaus

At Coveo, we encourage evaluators to actually measure relevance with their own documents and content.  It's pretty easy and very revealing as to whether the underlying search software is doing relevance well.  In a nutshell, gather 50-100 queries, decide in advance which document is the best result, set up the search software (Coveo takes about 30 minutes even integrated with SharePoint), run your searches, and assign the position of each result as the score (use 100 if the result is past the first 100).  Add up all the scores, and divide by the number of queries and you have your overall score.  A score of 1 means the best result on average was first on the first page.  A score of 50 means on average the 50th result, and so on.  Small note, take into account that the search engine may have found a better result...you never know, so look at the results before your pre-chosen "best".  Coveo has won many many bakeoffs this way.

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Friday, May 02, 2008 1:02 PM by Robin

open search of doc libs shows a "view duplicates" option where multiple files satisfy the search criterion (say found in different doc libraries). If I then go to Advanced search, and use a property value such as "name" contains "text of interest" and I get a single return rather than the say 8 items that open search revealed. This behavior seems just wrong headed. In my thinking if I wanted less precision I would choose the open search, and going to the trouble of Advanced property searching I obviously want *more* precision of search results. Less bulk, more detail... or is it just me?

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Wednesday, June 04, 2008 8:52 AM by Andy

Hi

Is there any way to stop the search from indexing columns, eg not to index the author, createdby columns.  I've tried checking the columns in Searchable columns at the top level site but this doesn't seem to be being identified.

Thanks

Andy

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Friday, June 12, 2009 12:37 PM by 出会い

ヒマだょ…誰かかまってぉ…会って遊んだりできる人募集!とりあえずメール下さい☆ uau-love@docomo.ne.jp

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Sunday, June 14, 2009 10:43 AM by 家出掲示板

カワイイ子ほど家出してみたくなるようです。家出掲示板でそのような子と出会ってみませんか?彼女たちは夕食をおごってあげるだけでお礼にHなご奉仕をしてくれちゃったりします

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Monday, June 15, 2009 11:30 AM by 右脳左脳

あなたは右脳派?もしくは左脳派?隠されたあなたの性格分析が3分で出来ちゃう診断サイトの決定版!合コンや話のネタにも使える右脳左脳チェッカーを試してみよう

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Wednesday, June 17, 2009 11:33 AM by 逆援助

セレブ達は一般の人達とは接する機会もなく、その出会う唯一の場所が「逆援助倶楽部」です。 男性はお金、女性はSEXを要求する場合が多いようです。これは女性に圧倒的な財力があるから成り立つことの出来る関係ではないでしょうか?

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Thursday, June 18, 2009 10:51 AM by 救援部

貴方のオ○ニーライフのお手伝い、救援部でHな見せたがり女性からエロ写メ、ムービーをゲットしよう!近所の女の子なら実際に合ってHな事ができちゃうかも!?夏に向けて開放的になっている女の子と遊んじゃおう

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Sunday, June 21, 2009 9:28 AM by 家出

家出中でネットカフェやマンガ喫茶にいる女の子たちは、お金が無くなり家出掲示板で今晩泊めてくれる男性を探しています。ご飯を食べさせてあげたり泊めてあげることで彼女たちはHなお礼をしてくれる事が多いようです

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Monday, June 22, 2009 9:29 AM by 勝ち組負け組

当サイトは、みんなの「勝ち組負け組度」をチェックする性格診断のサイトです。ホントのあなたをズバリ分析しちゃいます!勝ち組負け組度には、期待以上の意外な結果があるかもしれません

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Tuesday, June 23, 2009 9:26 AM by 素人

男性が主役の素人ホストでは、男性のテクニック次第で女性会員様から高額な謝礼がもらえます。欲求不満な人妻や、男性と出会いが無い女性が当サイトで男性を求めていらっしゃいます。興味のある方はTOPページからどうぞ

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Wednesday, June 24, 2009 9:15 AM by エロ漫画

エロ漫画やエロゲーなどでかわいい女の子が淫らな肉欲に溺れる様子をみて「こんなの現実にあるわけない」そう思った事ありませんか?それが当サイトでは現実に実現できるのです!羨ましさを憶えた2次元の中での出来事。あなたと同じように望む女の子が当サイトに集まっているのです

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Thursday, June 25, 2009 9:04 AM by 高級チェリー

高級チェリーの夏は童貞卒業の夏です。セレブ達も童貞を卒業させたくてウズウズしながら貴方との出会いを待っています。そんなセレブ達に童貞を捧げ、貴方もハッピーライフを送ってみませんか

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Friday, June 26, 2009 9:56 AM by 助けて〜!

何回かメールして会える人一緒に楽しいことしょ?お給料もらったばかりだからご飯くらいならごちそうしちゃうょ♪ cha-a@docomo.ne.jp とりあえずメールくださぃ★

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Saturday, June 27, 2009 8:36 AM by セレブラブ

セレブラブではココロとカラダに癒しを求めるセレブ達と会って頂ける男性を募集しています。セレブ女性が集まる当サイトではリッチな彼女たちからの謝礼を保証、安心して男性はお金、女性は体の欲求を満たしていただけます。無料登録は当サイトトップページからどうぞ

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Sunday, June 28, 2009 9:10 AM by SOS少女

家出中でお金が無く、ネットカフェを泊り歩いているSOS少女たちは、家出掲示板で泊めてくれたり遊んでくれる男性を探しています。泊めてあげたりすると彼女たちはHなお礼をしてくれるかもしれません。家出少女と遊びたい方は当サイトはどうぞ

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Monday, June 29, 2009 9:40 AM by 精神年齢

あなたの精神年齢を占ってみよう!当サイトは、みんなの「精神年齢度」をチェックする性格診断のサイトです。精神年齢度には、期待以上の意外な結果があるかも??興味がある方はぜひどうぞ

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Tuesday, June 30, 2009 9:55 AM by 出張ホスト

マダムと甘い時間を過ごしてみませんか?性欲を持て余しているセレブたちは出張ホストサービスで男性を探し、セックスを求めているのです。ホスト希望の方なら容姿や年齢は一切不問!ご近所の女性を探して、多額の報酬をゲットしよう

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Wednesday, July 01, 2009 9:04 AM by スローセックス

楽しく、気持ちよく絶頂を味わえることで若い女性から熟女の女性まで幅広い世代で爆発的な人気がある、スローセックス。当サイトはプレイに興味がある、あるいは試してみたいけれど相手がいない…といった方の支援サイトです。当サイトでSEXパートナーを探してみませんか

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Friday, July 03, 2009 9:51 AM by メル友募集

恋することって怖くないですか?最近ちょっと臆病になってて…そういうの抜きでえっちなことしたくて… lovely-i0709@docomo.ne.jp優しい人がいたらメール待ってます☆

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Saturday, July 04, 2009 9:54 AM by 逆円助

さあ、今夏も新たな出会いを経験してみませんか?当サイトは円助交際の逆、つまり女性が男性を円助する『逆円助交際』を提供します。逆円交際を未経験の方でも気軽に遊べる大人のマッチングシステムです。年齢上限・容姿・経験一切問いません。男性の方は無料で登録して頂けます。貴方も新たな出会いを経験してみませんか

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Monday, July 06, 2009 10:00 AM by 精神年齢

みんなの精神年齢を測定できる、メンタル年齢チェッカーで秘められた年齢がズバリわかっちゃう!かわいいあの子も実は精神年齢オバサンということも…合コンや話のネタに一度チャレンジしてみよう

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Tuesday, July 07, 2009 10:15 AM by 童貞卒業

童貞卒業を考えているなら、迷わずココ!今まで童貞とヤッた事がない女性というのは意外と多いものです。そんな彼女たちは一度童貞とやってみたいと考えるのは自然な事と言えるでしょう。当サイトにはそんな好奇心旺盛な女性たちが登録されています

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Wednesday, July 08, 2009 8:37 AM by 素人

素人ホストでは日頃のストレスを発散したい、もう一度恋がしたい、そういた女性が癒しを求めて登録されています。当サイトは癒やされたい女性・寂しい女性を癒やす男性が集うカップリングサイトです

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Thursday, July 09, 2009 9:57 AM by 熟女

熟女だって性欲がある、貴方がもし人妻とSEXしてお金を稼ぎたいのなら、一度人妻ワイフをご利用ください。当サイトには全国各地からお金持ちのセレブたちが集まっています。女性から男性への報酬は、 最低15万円からと決めております。興味のある方は一度当サイト案内をご覧ください

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Friday, July 10, 2009 10:19 AM by メル友募集

恥ずかしいけどやらしいことしたくてしょうがありません…誰か一緒にしてくれませんか?とりあえず連絡待ってます☆ cute.y.0902@docomo.ne.jp

# re: Search: Fine Tuning search relevancy in Microsoft SharePoint Server 2007: Getting the search results your user expects

Saturday, July 11, 2009 11:16 AM by オナニー

女の子のオナニーを手伝って報酬をもらう仕事に興味はありませんか?新感覚SNSの当サイトで見るだけで3万円、お手伝いで5万円の高額アルバイトを始めてみたい方は当サイトへどうぞ。

Leave a Comment

(required) 
required 
(required) 
 
Page view tracker