<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/atom.xsl" media="screen"?><feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-US"><title type="html">Microsoft Enterprise Search Blog</title><subtitle type="html" /><id>http://blogs.msdn.com/enterprisesearch/atom.xml</id><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/default.aspx" /><link rel="self" type="application/atom+xml" href="http://blogs.msdn.com/enterprisesearch/atom.xml" /><generator uri="http://communityserver.org" version="2.1.61025.2">Community Server</generator><updated>2008-09-02T16:47:00Z</updated><entry><title>Observations from the Text Analytics Summit 2009</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2009/06/10/observations-from-the-text-analytics-summit-2009.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2009/06/10/observations-from-the-text-analytics-summit-2009.aspx</id><published>2009-06-10T01:56:00Z</published><updated>2009-06-10T01:56:00Z</updated><content type="html">&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;One of the hard parts about organizing a conference like the 5&lt;SUP&gt;th&lt;/SUP&gt; annual &lt;/FONT&gt;&lt;A href="http://www.textanalyticsnews.com/usa/"&gt;&lt;FONT size=3 face=Calibri&gt;Text Analytics Summit&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt;, held last week in Boston, must be selecting the industry case studies. Text analytics is a highly specialized, but broad reaching topic that has applications in life sciences, financial service, legal, retail, government, media, and entertainment, to name a few. Any one of these industries could have filled the conference with interesting examples. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;As it was, most of the case studies and vendor briefings at this conference were about Voice of the Customer or Market Intelligence. I suspect that some attendees might have preferred a little more variety in the cases presented. The absence of any government case studies, for example, was conspicuous, but understandable given the special nature of that domain. We’d all probably have needed security clearances to attend those sessions anyway. &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;Overall, I appreciated the more commercial/consumer focus and felt that the conference organizers did a great job of finding representative examples and balancing the practical (vendor briefings and case studies) with the theoretical.&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;As a first time attendee to the conference, I was interested in just getting the lay of the land in text analytics, but I was also interested to learn how people were answering the “what’s next” question. It came up several times over the 2 days during Q&amp;amp;A and panel sessions and there were different takes, but I paid close attention to three, in particular, that resonated with my own observations looking through the lens of enterprise search. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Trend 1:&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;ETL-like Tools&lt;?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Ok, this is not really a trend in text analytics, but it is one in enterprise search that is informed by text and data analytics. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Many of the vendors at the conference demonstrated graphical tools designed to simplify the process of building text analysis “pipelines”. These tools look very much like the Extract, Transform, and Load (&lt;/FONT&gt;&lt;A href="https://www.tdwi.org/research/display.asp?id=6716"&gt;&lt;FONT color=#0000ff size=3 face=Calibri&gt;ETL&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt;) tools that have been around for many years in the data integration world. The difference is that the text analysis versions of these tools focus on operations for handling unstructured text. For example, &lt;/FONT&gt;&lt;A href="http://en.wikipedia.org/wiki/Named_entity_recognition"&gt;&lt;FONT size=3 face=Calibri&gt;named entity recognizers&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt; are a common text analytics task for automatically recognizing and tagging things like person names, company names, and locations in text. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;This ETL “pattern” exists in enterprise search, as well, where information must be &lt;I style="mso-bidi-font-style: normal"&gt;&lt;U&gt;extracted&lt;/U&gt;&lt;/I&gt; from a source repository (e.g. an email archive), &lt;I style="mso-bidi-font-style: normal"&gt;&lt;U&gt;transformed&lt;/U&gt;&lt;/I&gt; into an enhanced, canonical representation (e.g. annotated XML), and &lt;I style="mso-bidi-font-style: normal"&gt;&lt;U&gt;loaded&lt;/U&gt;&lt;/I&gt; into a database or index for searching. &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;The demand for graphical tools to manage the ETL process for search has not been as high as for text or data analysis. I think this partly because, for search applications, it is usually a one-time set up process and not an iterative modeling exercise as it is with text analytics. It may also be because historically the operations performed on content before it’s indexed for search have not been as sophisticated as the operations performed for in-depth text analytics. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;This is changing.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;To start, extensible pipeline processing frameworks that incorporate advanced text analysis capabilities have become more common in enterprise search products. By now, most mainstream enterprise search platforms include entity extractors, for example. We are also seeing more ETL-like graphical consoles for managing content integration and analysis. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;The adoption of these tools and techniques for enterprise search is motivated, in part, by a desire to more easily harness text analytics features that increase search precision and create richer search experiences. It’s also the case that, while text analytics shares a heritage more with information retrieval (search) than with business intelligence (BI), it includes technologies relevant to both and sits smack in the middle of the &lt;/FONT&gt;&lt;A href="http://www.bing.com/search?q=%22search+and+bi%22+convergence&amp;amp;FORM=SOLTDF"&gt;&lt;FONT color=#0000ff size=3 face=Calibri&gt;convergence&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt; between these two spaces. Sue Feldman and Hadley Reynolds of &lt;/FONT&gt;&lt;A href="http://www.idc.com/"&gt;&lt;FONT color=#0000ff size=3 face=Calibri&gt;IDC&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt; reinforced this role of text analytics by describing it as a cornerstone of &lt;/FONT&gt;&lt;A href="http://www.idc.com/getdoc.jsp?containerId=IDC_P10654"&gt;&lt;FONT size=3 face=Calibri&gt;Unified Information Access&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt; during their Market Report at the conference. Given this, it shouldn’t be surprising to see that, as text analytic tools and concepts have found their way into BI applications, traditional BI tools and concepts, like ETL, are finding a place within enterprise search. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Trend 2:&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;Empowering the End User &lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Another topic that popped up at various times during the conference was the challenge of delivering the richness of text analysis tools to users other than specially trained analysts. As with traditional BI tools, many text analysis tools assume a trained user or “analyst” capable of designing sophisticated workflows or advanced analytical models. One question posed to a speaker after he finished describing his text mining process was “when do you think you’ll be out of your job?” &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;- meaning, when will the tools be so easy to use that your end users won’t need you to do their investigation for them? &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;I’m sure this exact question was asked at a conference of professional research librarians some 15-20 years ago - back when online search services and later Internet search engines were becoming easier and easier to use and obviating the need for “professional searchers”. The answer is likely the same, too. There will always be specialists and “power users”, but as the tools become easier to use, end users will become more empowered to do their own increasingly advanced analysis. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;In practice, we are seeing more applications that combine conventional search with advanced text analytics in ways that bring a more powerful search experience to relatively unsophisticated end users. &lt;/FONT&gt;&lt;A href="http://www.silobreaker.com/"&gt;&lt;FONT size=3 face=Calibri&gt;Silobreaker.com&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt; is a clever site that combines the richness of text analytics within what is fundamentally a news search application. Unlike other news search sites, Silobreaker offers options and tools that help to uncover and &lt;U&gt;discover&lt;/U&gt; interesting and potentially novel connections and patterns in the news. There are still some usability challenges with a consumer site like Silobreaker, but I like it as an example of ad hoc search converging with iterative knowledge discovery. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;The trend toward empowering users with more than just a search box and list of blue links also&amp;nbsp;reaches into less “analytical” consuemr applications. Two examples are &lt;/FONT&gt;&lt;A href="http://www.oodle.com/"&gt;&lt;FONT color=#0000ff size=3 face=Calibri&gt;www.oodle.com&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt; and &lt;/FONT&gt;&lt;A href="http://www.globrix.com/"&gt;&lt;FONT color=#0000ff size=3 face=Calibri&gt;www.globrix.com&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;. Both sites show the power of applying analytics to both structured and textual data (classifieds in the case of Oodle, real estate postings in the case of Globrix) in what are otherwise fundamentally search applications. &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Trend 3: &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;Taking Sentiment Analysis to the next level&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Sentiment analysis is the ability to recognize the mood, opinion, or intent of a writer by analyzing written text. It is sometimes called the “thumbs up, thumbs down” problem because the most common application is establishing whether a writer is positive or negative on a particular subject. In this form, it is often used to analyze written product reviews (see this example on Microsoft’s new &lt;/FONT&gt;&lt;A href="http://www.bing.com/shopping/search?q=52%22%20lcd&amp;amp;p1=%5bCommerceService+scenario%3d%22f%22+a%3d%22ra%22+p%3d%22df5c7f1ba4404c05b3f423c4e307eee6%22%5d&amp;amp;wf=Commerce&amp;amp;FORM=GZCA"&gt;&lt;FONT size=3 face=Calibri&gt;Bing Web search&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3 face=Calibri&gt;). &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Sentiment was a much mentioned topic at the conference. This is not surprising given the focus on Voice of the Customer and Market Intelligence – two areas where accurately establishing the sentiment of customers and consumers toward products, services, and brand is highly desirable. One of the presenters at the conference was Roddy Lindsay from Facebook. I missed that session, but it doesn’t take much imagination to appreciate the possible applications for text analytics and sentiment analysis, in particular, with the information available on Facebook and other social networking platforms. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Every vendor present had something to show or say on the subject of sentiment analysis, but all the panelists in the vendor-only panel acknowledged the difficulties of increasing the precision of sentiment classification. Currently, the number tossed around is 80%. That is, a sentiment classifier will get it right about 80% of the time compared to human judgments. This number is higher in some applications - for example, when analyzing short, strongly opinionated product reviews. It is lower when analyzing longer pieces of text where just fixing the subject can difficult – like this blog post. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Progress is being made, though. The first step has been a shift away from “document-level” sentiment to “topic-level” sentiment. This allows sentiment classification to be more accurate when confronting documents, like this post, that touch on and offer opinion on multiple topics. It also helps with more concrete problems like the ones represented in this sentence: &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="TEXT-INDENT: 0.5in; MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;“Acme’s new P40 digital camera has a good viewer, but its controls are awkward.”&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;While it’s relatively easy for a human, it takes some heavy linguistic lifting for a machine to recognize that the sentiment of this opinion is directed not just at Acme or at the P40 digital camera, but specifically at the &lt;I style="mso-bidi-font-style: normal"&gt;viewer&lt;/I&gt; (positive sentiment) and the &lt;I style="mso-bidi-font-style: normal"&gt;controls&lt;/I&gt; (negative sentiment). It’s ever trickier establishing what the word “its” refers to in the 2nd part of the sentence. Is it the Acme P40 itself, or just the viewer?&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Sentiment is admittedly a niche topic, even within text analytics, but getting it right matters a lot for enterprise search applications in eCommerce (think product reviews), Market Intelligence (reputation tracking and competitive intelligence), eDiscovery, and Government Intelligence. One presenter suggested that all the remaining hard problems in sentiment analysis will be solved, at least academically, in a couple years. It will be interesting to see how soon these improvements surface in real-life applications.&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN: 0in 0in 10pt" class=MsoNormal&gt;&lt;FONT size=3 face=Calibri&gt;Nate&lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9721226" width="1" height="1"&gt;</content><author><name>ntreloar</name><uri>http://blogs.msdn.com/members/ntreloar.aspx</uri></author><category term="search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/search/default.aspx" /><category term="enterprise search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/enterprise+search/default.aspx" /><category term="etl" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/etl/default.aspx" /><category term="sentiment analysis" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/sentiment+analysis/default.aspx" /><category term="'text analytics&amp;quot;" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/_2700_text+analytics_2600_quot_3B00_/default.aspx" /></entry><entry><title>Thinking Big – Search Scale and Performance on a Budget</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2009/05/22/thinking-big-search-scale-and-performance-on-a-budget.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2009/05/22/thinking-big-search-scale-and-performance-on-a-budget.aspx</id><published>2009-05-22T20:02:00Z</published><updated>2009-05-22T20:02:00Z</updated><content type="html">&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I recently came across Paul Nelson’s informative post on &lt;/FONT&gt;&lt;A href="http://searchtechnologies.com/searchchronicles.html"&gt;&lt;FONT face=Calibri size=3&gt;search scalability&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;. I don’t know how long it’s been up there, but reading it made me think of customers I’ve spoken with recently who are looking to scale up their search deployments, but, due to tight budgets, want to do so without simply buying more hardware. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Paul focuses on document count as the main consideration for architecting scalable search, saying:&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt; TEXT-INDENT: 0.5in"&gt;&lt;I style="mso-bidi-font-style: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;There is really only one dimension of size: The total count of documents in the system.&lt;?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/I&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;He goes on to describe several useful strategies for scaling search for “large” systems – those with document counts of &amp;gt;500 million. Importantly, imo, he also points out that even medium sized systems (10-100 million docs) will have special scaling needs depending on their performance requirements:&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt 0.5in"&gt;&lt;I style="mso-bidi-font-style: normal"&gt;&lt;SPAN style="mso-bidi-font-size: 9.0pt"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;If these systems have any kind of query or index performance requirements — for example, it is a public web site with 10-30 queries per second, or that new documents arrive at a rate of 10 documents per second — then you will likely need an array of machines to handle your needs. &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/SPAN&gt;&lt;/I&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I mostly want to reinforce and build on this second point. Effective scaling search means getting the most out of your search infrastructure (i.e. maximizing the number of documents per unit of hardware), but scale and performance are two sides of the same coin, and whether a system can squeeze ten thousand or ten billion documents on a machine, it must still satisfy the applications performance requirements. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;If you can’t just add hardware, what then? Well, there are still options for getting more capacity out of a search system that provides the right level of control for optimization and tuning. Understanding these options requires understanding how search system performance is measured and the associated trade-offs that exist. Paul alludes to some of these trade-offs, but it’s worth providing a few more details and examples to drive this point home. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Search System Performance Metrics&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Metrics for search system performance typically fall into two categories: &lt;B style="mso-bidi-font-weight: normal"&gt;query performance &lt;/B&gt;and &lt;B style="mso-bidi-font-weight: normal"&gt;indexing performance&lt;/B&gt;. In turn, these categories each have two measures associated with them:&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;U&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Query performance&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/U&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpFirst style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l1 level1 lfo1"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;Query latency&lt;/B&gt; (or response time) – the time it takes for a query to be processed and results to be returned.&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpLast style="MARGIN: 0in 0in 10pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l1 level1 lfo1"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;Query rate&lt;/B&gt; – the rate at which the system can process queries. Usually measured in queries per second (or QPS). &lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;U&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Indexing performance*&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/U&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpFirst style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo2"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;Indexing latency&lt;/B&gt; – the time it takes for a document to be indexed and made available to search. &lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpLast style="MARGIN: 0in 0in 10pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo2"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;Indexing rate&lt;/B&gt; – the rate at which the system can process and index documents. Measured in documents per second. &lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;*&lt;I style="mso-bidi-font-style: normal"&gt;Indexing performance assumes systems that actually create an index or some other sort of database optimized for information retrieval. This rules out “federated search” engines, which rely on other systems to create and manage these indices.&lt;o:p&gt;&lt;/o:p&gt;&lt;/I&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;There are some variations on these measurements. For example, you can track average or peak values for each. &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;Document count per node (where a node = a Processing/Memory/Storage unit on a network) impacts all of these measures, but there’s a balance between query performance and index performance that also influences how many documents you can squeeze onto a single node.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;The perhaps obvious explanation is that the more system resources you allocate to serve query performance, the fewer resources you’ll have available for indexing, and vice versa. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Applications with rapidly changing content or with very time sensitive data place high demand on indexing performance. Other applications, like highly trafficked Web sites, place high demand on query performance. Different applications place different demands on scalability depending on the performance requirements across these dimensions. To take a specific example, consider an eDiscovery application that provides search across 100s of millions of archived emails. The query rate and indexing latency requirements for this type of application are typically lower than what a reasonably popular social networking site with an equivalent document count might see. As a result, eDiscovery search applications are able to squeeze more documents per node than highly trafficked Web sites – even if they serve the same total number of documents. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;For another comparison, large eCommerce sites can have extreme query performance requirements - in some cases handling several thousand queries per second during peak traffic times, while still delivering sub-second responses. Even with these extreme query requirements, these sites can have relatively modest indexing performance requirements when compared to, say, financial news applications where content “freshness” and, so, low index latency are a priority. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Impact of Features&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;An often neglected factor that impacts performance is feature set. Features like faceted searching, results clustering, automatic query completion, and advanced query operators can each add incremental overhead to indexing performance, query performance, or both, depending on the feature and the system. For example, queries used for eDiscovery are sometimes crafted by teams of lawyers. This can result in queries made up of dozens or even hundreds of carefully selected search terms combined in a maze of (also carefully selected) Boolean, proximity, and other types of search operators. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I remember one FAST partner describing how their legacy eDiscovery tool (built on relational database technology) took up to &lt;B style="mso-bidi-font-weight: normal"&gt;2 weeks&lt;/B&gt; to process a particularly long and complex query. Needless to say, they were delighted when we demonstrated the same query taking only a few seconds. It was not sub-second, but the point is that they would have been happy with this particular query if it came back in a few hours. In fact, our conversations on optimization included whether we could squeeze more capacity (docs per node) by relaxing the query response time requirements to 10-15 seconds for these queries in their application. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Different search systems are better (faster) than others, but parsing and evaluating very long and complex queries will generally take more cycles and resources than the usual 1 or 2 term ad hoc query. Relative to absolute document count, the individual impact on performance and scale of any one feature may be small, but taken as a whole and for certain applications, like the one in the example above, they can represent meaningful tuning options.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Know Your Options&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;The moral of the story is that getting enterprise search scale and performance right for large systems can be somewhat nuanced - especially if you’re on a tight budget. If you’re embarked on, or about to embark on a large scale enterprise search project, make sure you understand these performance considerations. Best of breed enterprise search platforms support many tuning strategies that factor in all the key dimensions of search performance and scale. Read your system’s deployment guide (if it comes with one) to understand these options.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Lastly, if you’re not sure if your project has what might be considered demanding scale or performance requirements, consider getting some expert advice. Below are some good online forums you can tap for expert advice and to get a sense for whether your system might be considered “demanding”. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;A href="http://tech.groups.yahoo.com/group/search_dev/"&gt;&lt;FONT face=Calibri size=3&gt;http://tech.groups.yahoo.com/group/search_dev/&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;(Search Engine Developers group on Yahoo)&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;A href="http://www.linkedin.com/groups?gid=161594"&gt;&lt;FONT face=Calibri size=3&gt;http://www.linkedin.com/groups?gid=161594&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;(Enterprise Search Engine Professionals on LinkedIn)&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Nate &lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9635856" width="1" height="1"&gt;</content><author><name>ntreloar</name><uri>http://blogs.msdn.com/members/ntreloar.aspx</uri></author></entry><entry><title>Actionable Search – From What to Why?</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2009/05/12/actionable-search-from-what-to-why.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2009/05/12/actionable-search-from-what-to-why.aspx</id><published>2009-05-12T21:45:00Z</published><updated>2009-05-12T21:45:00Z</updated><content type="html">&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Day 1 at the &lt;/FONT&gt;&lt;A href="http://www.enterprisesearchsummit.com/"&gt;&lt;FONT face=Calibri size=3&gt;Enterprise Search Summit&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; in NYC is wrapping up and I’ve just listened to &lt;/FONT&gt;&lt;A href="http://www.enterprisesearchsummit.com/2009/speaker.shtml?speaker=LisaDenissen"&gt;&lt;FONT face=Calibri size=3&gt;Lisa Denissen&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; from Shearman &amp;amp; Sterling talk about Actionable Search. Actionable search is a key tenet of Microsoft’s enterprise search strategy, so it was good to see promotion of the concept.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;For many organizations, just adding basic, no-frills search to an intranet can have a big impact on employee productivity, but to really create an optimal search experience it helps to understand the processes that drive users to search in the first place. Too often search is treated as an end unto itself, without consideration for the larger processes that it ultimately serves. Users care about finding relevant information, sure, but they care even more about using that information to complete tasks and achieve outcomes. These tasks and desired outcomes are what ultimately define success for an enterprise search application and, it may be argued, for any type of search app.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Understanding what motivates people to search means going beyond capturing requirements like “I need to be able to search all of Product Marketing’s PowerPoints” to addressing more precise needs like &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;“I need to quickly assemble targeted presentations for sales prospects based on existing marketing material”. This second statement doesn’t sound like a search problem, but it speaks clearly to a desired outcome (“targeted presentations”) and the word “quickly” suggests that search may offer some help here. Importantly, the statement also focuses on the question of &lt;U&gt;why&lt;/U&gt; the user is searching, not just &lt;U&gt;what&lt;/U&gt; they hope to find.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;The phrase “actionable search” refers to the idea that items in search results can be directly acted on in a way that moves the searcher toward completion of a specific task – an outcome. While general Web search engines have us accustomed to results sets that contain only links to relevant Web pages, the richness of applications and content types in the enterprise and on targeted Internet sites promise a bit more than just a blue link. eCommerce sites have supported actionable search for years by allowing users to directly add items from a search result to a shopping cart. &lt;/FONT&gt;&lt;A href="http://www.facebook.com/"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;Facebook&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; provides contextual actions directly from its general search results that let you Join Groups, Add Friends, Join Events, or Send Messages. To take the earlier example, once a relevant PowerPoint presentation is found, an actionable search experience would be to offer the user help with the next steps of finding the right individual slide and then quickly incorporating that slide into their work-in-progress presentation.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;One argument for enterprise search starts with the question “What good is an enterprise content management and collaboration if you can’t easily find the information you create, manage, and share? We might switch the question around and ask, “What good is enterprise search if you can’t easily act on the information you find?”&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;Actionable search promises to close this gap between information access and outcomes. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Nate&lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9608267" width="1" height="1"&gt;</content><author><name>ntreloar</name><uri>http://blogs.msdn.com/members/ntreloar.aspx</uri></author><category term="enterprise search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/enterprise+search/default.aspx" /><category term="actionable search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/actionable+search/default.aspx" /></entry><entry><title>Search and Natural User Interfaces - Part 2</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2009/05/03/search-and-natural-user-interfaces-part-2.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2009/05/03/search-and-natural-user-interfaces-part-2.aspx</id><published>2009-05-03T16:47:00Z</published><updated>2009-05-03T16:47:00Z</updated><content type="html">&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;FONT face=Calibri size=3&gt;In my &lt;/FONT&gt;&lt;A href="http://blogs.msdn.com/enterprisesearch/archive/2009/04/27/search-and-natural-user-interfaces-nuis-part-1.aspx" mce_href="http://blogs.msdn.com/enterprisesearch/archive/2009/04/27/search-and-natural-user-interfaces-nuis-part-1.aspx"&gt;&lt;FONT face=Calibri size=3&gt;first post&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; on this subject last week, I referred to a scene in the movie “Minority Report” as a visionary example of a natural user interfaces (NUIs) and, more to the theme of this blog, a visionary example of ad hoc search within a NUI.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;I realize that I didn’t offer a definition of NUIs in that post, so, before I go back to the search connection, here’s a quick primer.&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;NUIs Defined&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;?xml:namespace prefix = o /&gt;&lt;o:p&gt;&lt;FONT face=Calibri size=3&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/o:p&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;FONT face=Calibri size=3&gt;Natural user interfaces or NUIs rely on natural expressions like touches and gestures to directly and intuitively control the experience of a software application. The word “natural” means that the interaction is not controlled through an artificial device, like a mouse or keyboard. &lt;I style="mso-bidi-font-style: normal"&gt;(I take this to imply that a Nintendo Wii is &lt;U&gt;not&lt;/U&gt; an example of a NUI, since there are still artificial controllers involved. Other opinions and thoughts on this are welcomed).&lt;/I&gt; &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;FONT face=Calibri size=3&gt;NUIs have been described as the next evolutionary step in human-computer interaction – the successor to graphical user interfaces (GUIs), which succeeded command line interfaces (CLIs), which succeeded physical input devices like card readers. Touch screens on hand-held devices are the most common examples of NUIs, but there are number of other emerging NUI platforms and technologies. This &lt;/FONT&gt;&lt;A href="http://www.pcmag.com/article2/0,2817,2336839,00.asp" mce_href="http://www.pcmag.com/article2/0,2817,2336839,00.asp"&gt;&lt;FONT face=Calibri size=3&gt;article on touch computing from PC Magazine&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; offers a catalog of some of the systems currently available. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Microsoft Surface&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;o:p&gt;&lt;FONT face=Calibri size=3&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/o:p&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;FONT face=Calibri size=3&gt;One of the technologies mentioned in the PC Magazine story is &lt;/FONT&gt;&lt;A href="http://www.microsoft.com/surface" mce_href="http://www.microsoft.com/surface"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Microsoft &lt;SPAN style="mso-bidi-font-family: 'Times New Roman'; mso-fareast-font-family: 'Times New Roman'; mso-bidi-font-size: 11.0pt; mso-ascii-font-family: Calibri; mso-hansi-font-family: Calibri"&gt;Surface&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;Microsoft Surface&lt;SPAN style="mso-bidi-font-family: 'Times New Roman'; mso-fareast-font-family: 'Times New Roman'; mso-bidi-font-size: 11.0pt; mso-ascii-font-family: Calibri; mso-hansi-font-family: Calibri"&gt; is a Windows powered device in the form factor of a table - a coffee table, if you will - with a surface that supports touch and gesture interaction. There are other NUI platforms, but t&lt;/SPAN&gt;here are a couple things that make Microsoft Surface different and interesting. &lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;SPAN style="mso-bidi-font-family: 'Times New Roman'; mso-fareast-font-family: 'Times New Roman'; mso-bidi-font-size: 11.0pt; mso-ascii-font-family: Calibri; mso-hansi-font-family: Calibri"&gt;&lt;o:p&gt;&lt;FONT face=Calibri size=3&gt;&lt;/FONT&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;First, the Microsoft Surface form factor and interface are designed to allow multiple users to interact with the device at the same time. The interface can detect and track dozens of touch points simultaneously. It can even recognize the orientation of fingers prints and infer, in turn, the physical orientation of a user relative to the display. Because of these capabilities, many applications created for Microsoft Surface emphasize multi-user collaboration and interaction – for example, there are multi-user games and other collaborative consumer applications for things like music and picture sharing. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Second, Microsoft Surface devices have built-in cameras that can not only track touches and gestures, but can recognize digitally tagged objects and can initiate specific actions when these objects are placed on the table. For example, &lt;/FONT&gt;&lt;A href="http://www.infusiondev.com/" mce_href="http://www.infusiondev.com/"&gt;&lt;FONT face=Calibri size=3&gt;Infusion Development&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; has created an &lt;/FONT&gt;&lt;A href="http://www.youtube.com/watch?v=Qf0WxOo3O4g" mce_href="http://www.youtube.com/watch?v=Qf0WxOo3O4g"&gt;&lt;FONT face=Calibri size=3&gt;application&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;&amp;nbsp;designed to&amp;nbsp;enhance the doctor patient consultation experience. By placing a tagged card on Microsoft Surface, doctors can use and access interactive cardiac images, dynamic charts and clinical documents to help explain medical conditions and procedures to their patients. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;NUIs:&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;Where’s the Search?&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I was wowed by my first experience with Microsoft Surface - as many are when the first get a chance to play with one - but being a search guy, I looked for applications that included some sort of search function. So far, of the NUI applications I’ve seen to date, whether on Microsoft Surface or in other NUI technologies, I’ve seen very few that provide true ad hoc search. In one or two examples I’ve seen, a virtual keyboard is used to enter search terms and traditional GUI search metaphors are used to render search results. More often, though, finding information requires the user to navigate through some pre-defined structure. Even this &lt;/FONT&gt;&lt;A href="http://www.youtube.com/watch?v=KqKC5A9JWTg" mce_href="http://www.youtube.com/watch?v=KqKC5A9JWTg"&gt;&lt;FONT face=Calibri size=3&gt;TouchWall demo by Bill Gates&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; from last year’s CEO Summit focused on navigation. Where’s the search?&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I’ll grant that structural navigation metaphors in NUIs are really cool and work pretty well.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;For example, I’ve seen a medical app that allows you to visually navigate a representation of the human body to explore different anatomical concepts. You can tap on the virtual head to explore the brain and then drill down further to learn about neurons. It looks like a fun and an interesting way to explore human anatomy, but the problem with&amp;nbsp;this navigation-only approach is that if you don’t happen to know that neurons are in the brain, it will take you a while to find them. It is browsing, not ad hoc search and, as we learned from the Yahoo Directory experience back in the 90s, people tend to prefer searching over browsing.&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;B style="mso-bidi-font-weight: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;A Prototype and a Request&lt;/FONT&gt;&lt;/FONT&gt;&lt;/B&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;FONT face=Calibri size=3&gt;At our FASTforward’09 user conference in Las Vegas in February, we showed a prototype application,&amp;nbsp;built in collaboration with a very sharp team of developers at EMC Consulting, which brought together ad hoc search and the natural user interface experience of Microsoft Surface. You can see a short video of this demo &lt;/FONT&gt;&lt;A href="http://www.youtube.com/watch?v=Xb5N5uZjpVQ" mce_href="http://www.youtube.com/watch?v=Xb5N5uZjpVQ"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;here&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;, or the longer keynote presentation from the event &lt;/FONT&gt;&lt;A href="http://www.youtube.com/watch?v=mUQWVpqcuaE" mce_href="http://www.youtube.com/watch?v=mUQWVpqcuaE"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;here&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;&lt;/FONT&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;When Mark Stone, Global Enterprise Search Lead at EMC Consulting, and I first conceived this demo, we were inspired by three things:&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpFirst style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo1"&gt;&lt;SPAN style="mso-bidi-font-family: Calibri; mso-fareast-font-family: Calibri; mso-fareast-theme-font: minor-latin; mso-bidi-theme-font: minor-latin"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT face=Calibri size=3&gt;1)&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;The dramatic growth and potential of NUI technologies, particularly Microsoft Surface.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpMiddle style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo1"&gt;&lt;SPAN style="mso-bidi-font-family: Calibri; mso-fareast-font-family: Calibri; mso-fareast-theme-font: minor-latin; mso-bidi-theme-font: minor-latin"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT face=Calibri size=3&gt;2)&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;The dearth of search examples in all these NUI applications.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpLast style="MARGIN: 0in 0in 10pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo1"&gt;&lt;SPAN style="mso-bidi-font-family: Calibri; mso-fareast-font-family: Calibri; mso-fareast-theme-font: minor-latin; mso-bidi-theme-font: minor-latin"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT face=Calibri size=3&gt;3)&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;The potential for creating transformative user experiences that combine search and NUIs .&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;You can judge for yourself how successful the team was in combining ad hoc search with Microsoft Surface by looking at the demos, but one thing is for sure, we were in uncharted waters when building this app. The user interface patterns for search within a NUI are not well established. Even without considering search, building user interfaces in Microsoft Surface requires setting aside the old GUI models and learning brand new patterns and metaphors. As for search in a NUI, well, what are the equivalents to the search box, the search result list, navigation facets, document links, and all the interaction patterns around this “controls”?&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;How can we use a 3&lt;SUP&gt;rd&lt;/SUP&gt; dimension (“depth”) and what role does “zoom” play in search? Working within a NUI environment even challenges the basic containers of information. Should you first show documents, or just extracted facts and information summaries? All these questions and more came up during the development of this prototype. Some of the answers are now known, or at least we have a better feel for the right direction to go, but others require more research and experimentation. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;There is the opportunity here, and a challenge to be met by the search community. NUIs are here to stay and are demanding new patterns for true ad hoc search that satisfy the intuitive and natural interaction requirements of these environments. Reverting to browsing metaphors is not the answer; nor is simply recreating the GUI patterns of keyword search boxes and lists of blue links.&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="MARGIN-BOTTOM: 0pt"&gt;&lt;FONT face=Calibri size=3&gt;I’m very interested in this topic and am on a hunt for any good examples of true search within NUIs. If you know of an example, please send whatever pointer you can - links to demo videos, screen shots, academic papers, … anything. You can respond to this post or email me &lt;/FONT&gt;&lt;A href="mailto:natetrel@microsoft.com" mce_href="mailto:natetrel@microsoft.com"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;directly&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;&lt;/FONT&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I look forward to seeing your examples and will summarize what I find in a future post. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;In the mean time, I feel like we need a new name for search interfaces within NUIs. I like the phrase “Natural Search Interface” used by the &lt;/FONT&gt;&lt;A href="http://www.microsoft.com/germany/aktionen/partnerfinden/default.mspx?solutionid=314582e1-4079-4726-899b-f107cb2f8297" mce_href="http://www.microsoft.com/germany/aktionen/partnerfinden/default.mspx?solutionid=314582e1-4079-4726-899b-f107cb2f8297"&gt;&lt;FONT face=Calibri size=3&gt;Microsoft Germany Partner site in reference to the Microsoft/EMC Consulting prototype&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;. I’ll use that. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Nate&lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9584780" width="1" height="1"&gt;</content><author><name>ntreloar</name><uri>http://blogs.msdn.com/members/ntreloar.aspx</uri></author><category term="NUIs" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/NUIs/default.aspx" /><category term="natural user interfaces" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/natural+user+interfaces/default.aspx" /><category term="FASTForward'09" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/FASTForward_2700_09/default.aspx" /><category term="Infusion Development" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Infusion+Development/default.aspx" /><category term="Microsoft Surface" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Microsoft+Surface/default.aspx" /><category term="EMC Consulting" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/EMC+Consulting/default.aspx" /><category term="enterprise search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/enterprise+search/default.aspx" /></entry><entry><title>Search and Natural User Interfaces (NUIs) - Part 1</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2009/04/27/search-and-natural-user-interfaces-nuis-part-1.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2009/04/27/search-and-natural-user-interfaces-nuis-part-1.aspx</id><published>2009-04-27T16:08:00Z</published><updated>2009-04-27T16:08:00Z</updated><content type="html">&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;About five years ago or so, I participated in a conference panel where the question was asked: “What will search interfaces look like 20 years from now?”. I had just seen Steven Spielberg’s sci-fi film “Minority Report” starring Tom Cruise, so I referred to the &lt;/FONT&gt;&lt;A href="http://www.dailymotion.com/video/x5e06m_minority-report-trailer-3_shortfilms"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;scene&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; where Cruise’s character is interacting with a futuristic looking visual display and using appropriately dramatic gestures to grab, spin, shrink, expand, and otherwise manipulate the various news stories and images floating on the display. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I heard later that Spielberg, while developing the script for the film, had consulted a number of futurists to create as realistic picture of the year 2050 as possible (from the point of view of those futurists at least). Interestingly, over the past several years, that scene has become a conceptual benchmark for so-called natural user interfaces (NUIs), to the point where if you search for “minority report” in your favorite Web video search engine you’re as likely to find examples of prototype NUI products as you are trailers for the actual film. It’s not a stretch, imo, to say that the film has inspired and perhaps even accelerated advancements in NUI products and technology. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;There are now many good and &lt;I style="mso-bidi-font-style: normal"&gt;real &lt;/I&gt;examples of NUIs and even some actual products that come close to the vision in "Minority Report", but despite the impact the film appears to have had on the development of NUIs, there is a very strong connection to search that gets overlooked. Cruise’s character in that scene is searching. His various gestures and other contortions are queries, navigation, and refinements intended to help him find answers and collect information. Granted the depiction is not quite up to the vision of the smooth-voiced computer on Star Trek, but it’s a step beyond the keyboard and mouse and, if you look past the theatrics, I think it paints a realistic view of not just the future of natural user interfaces, but of the type of natural &lt;U&gt;search-driven&lt;/U&gt; &lt;/FONT&gt;&lt;FONT face=Calibri size=3&gt;user interfaces we will be seeing soon… in much less than 20 years time.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Nate&lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9571245" width="1" height="1"&gt;</content><author><name>ntreloar</name><uri>http://blogs.msdn.com/members/ntreloar.aspx</uri></author><category term="search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/search/default.aspx" /><category term="NUIs" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/NUIs/default.aspx" /><category term="natural user interfaces" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/natural+user+interfaces/default.aspx" /><category term="vision" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/vision/default.aspx" /></entry><entry><title>One Year with Microsoft – a FAST Perspective</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2009/04/17/one-year-with-microsoft-a-fast-perspective.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2009/04/17/one-year-with-microsoft-a-fast-perspective.aspx</id><published>2009-04-17T22:49:00Z</published><updated>2009-04-17T22:49:00Z</updated><content type="html">&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;After years of writing customer proposals, internal memoranda, and various stuffily formal documents, it feels like a luxury to be able to just write what I think about enterprise search.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;It’s actually part of my job these days and I’m looking forward to sharing a perspective from 13 years in the industry – the past 6 years with FAST and, most recently, with Microsoft. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;As a reminder, it’s been a more than a year since the original offer came down from Microsoft to acquire FAST. To be precise, the bid was announced on &lt;/FONT&gt;&lt;A href="http://www.microsoft.com/presspass/press/2008/jan08/01-08FastSearchPR.mspx"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;January 8&lt;SUP&gt;th&lt;/SUP&gt;, 2008&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; and the deal closed on &lt;/FONT&gt;&lt;A href="http://www.microsoft.com/presspass/press/2008/apr08/04-25LervikPR.mspx"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;April 25&lt;SUP&gt;th&lt;/SUP&gt;, 2008&lt;/FONT&gt;&lt;/A&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;. The FAST team now makes up a large part of the new Enterprise Search Group (ESG) within the Microsoft Business Division (MBD) – the division that makes SharePoint, the Office line of products, Exchange, etc… .&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;When I get asked about my reaction to the FAST acquisition by Microsoft, I tend to&amp;nbsp;point out that, while those of us in the business have always understood the value of search, nothing says “Ata boy!” like having the largest software company in the world take notice. Maybe we could ask why it took so long, but even if you didn’t happen to work at FAST, you can’t help but feel that Microsoft’s move is validation of our growing corner of the IT industry. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;I admit that the answer above, while maybe heartwarming, doesn’t get to the core of what people really want to know. Not surprisingly, folks are more interested in Microsoft’s vision for enterprise search and plans for the FAST people, products, partners, and customers than they are in my emotions.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;Now, with a year under the belt at Microsoft, I have a few more insights to offer than just the initial “nice validation” response. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;In his keynote presentation at &lt;/FONT&gt;&lt;A href="http://fastforward09.com/"&gt;&lt;FONT face=Calibri size=3&gt;FASTforward’09&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; in February, Kirk Koenigsbauer addressed three key topics related to Microsoft’s interest in enterprise search (a transcript of Kirk’s keynote can be found &lt;/FONT&gt;&lt;A href="http://www.microsoft.com/presspass/exec/koenigsbauer/02-11FASTkeynote.mspx"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;here&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;). These were:&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpFirst style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l1 level1 lfo1"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;Commitment (to enterprise search)&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpMiddle style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l1 level1 lfo1"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;Vision&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpLast style="MARGIN: 0in 0in 10pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l1 level1 lfo1"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;Product Plans &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;These topics provide a useful framework for sharing my own observations.&lt;U&gt;&lt;?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/U&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;U&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Commitment&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/U&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;There are a number of anecdotal facts that point to Microsoft’s commitment to being a leader in enterprise search. Kirk shared a few of these in his keynote – things like the percentage of Microsoft Research investment going to search (appx 15%), the size of the Enterprise Search Group R&amp;amp;D organization (several hundred engineers and growing), &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;and of course the investment itself to acquire FAST (US$1.2B). There are other supporting data points, like the &lt;/FONT&gt;&lt;A href="http://www.usatoday.com/tech/products/2008-09-30-3790007627_x.htm"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;announcement&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; of Oslo (FAST’s headquarters) as a key R&amp;amp;D center for business search. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Any one of these facts is a strong indication of Microsoft’s ambitions in this space, but my take is that the evidence of Microsoft’s commitment to search comes from more than these metrics or executive statements. It comes from a growing grass roots interest in search across all of Microsoft. &lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&lt;/SPAN&gt;For example, I often get a question like this from customers and partners: &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;I style="mso-bidi-font-style: normal"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;“Have you guys talked with the folks over in Microsoft’s &amp;lt;product name&amp;gt; team?”&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/I&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;…and then…&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;“&lt;I style="mso-bidi-font-style: normal"&gt; Man, you should because FAST technology added to what they’re doing would be powerful combination.”&lt;o:p&gt;&lt;/o:p&gt;&lt;/I&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;The usual answer is, yes, we’ve talked to the &amp;lt;product name&amp;gt; team and, yes, there are some very interesting ideas and even some specific activity that we mostly can’t talk about yet. In fact, what’s been most interesting and fun for us former FAST folks is the breadth of technologies that we can now&amp;nbsp;include in our conversations with customers and partners. SharePoint is the “hero SKU”, as we say here, and the combination of FAST search with the capabilities of SharePoint makes for an impressive offering for both intranet and Internet applications that are focused on helping people consume and use information.&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;It’s not a leap to recognize that Microsoft has something to offer at almost every level of an IT solution “stack” complementing the capabilities of both SharePoint and search – from the operating system to application development tools and even cloud-based services. To put it in perspective, ask yourself how many companies offer both a world class enterprise search platform and a world class relational database. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;To be honest, search is such a generally valued concept and the possibilities are so compelling when it’s combined with other Microsoft products and technology that it’s all we can do to stay focused on our main priorities. It’s a good problem.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;U&gt;&lt;FONT face=Calibri size=3&gt;Vision&lt;/FONT&gt;&lt;/U&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;At some point prior to the acquisition, the Microsoft enterprise search team came to a vision of search that matched&amp;nbsp;what we had developed at FAST. Specifically, that search is more than just a search box and a list of blue document links, but represents a set of capabilities that are enabling new ways to engage users by creating personalized, conversational experiences that cater to the way people prefer to consume and interact with information. This vision was behind the principle theme for the &lt;/FONT&gt;&lt;A href="http://www.fastforward09.com/"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;FASTforward’09 conference&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; this past February – “Engage Your Users”.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Whether the original Microsoft team came to this vision independently or after talking to FAST folks (ego would like to think the latter) is less important than the fact that it is now a shared vision throughout the Microsoft Enterprise Search Group and is shaping how we are investing in product development. It’s also a vision that is permeating into other areas within Microsoft. For example, I recently had a chance to apply this way of thinking about search to some other very interesting Microsoft technology, &lt;/FONT&gt;&lt;A href="http://surface.microsoft.com/"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;Microsoft Surface&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;, but that’s a topic for another post.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;U&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;Product Plans&lt;o:p&gt;&lt;/o:p&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/U&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT size=3&gt;&lt;FONT face=Calibri&gt;At FASTforward’09 we announced our plans to target enterprise search in two areas:&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpFirst style="MARGIN: 0in 0in 0pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo2"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;Business productivity – applications inside the firewall where, in particular, SharePoint provides the framework for content management and collaboration. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoListParagraphCxSpLast style="MARGIN: 0in 0in 10pt 0.5in; TEXT-INDENT: -0.25in; mso-list: l0 level1 lfo2"&gt;&lt;SPAN style="FONT-FAMILY: Symbol; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol"&gt;&lt;SPAN style="mso-list: Ignore"&gt;&lt;FONT size=3&gt;·&lt;/FONT&gt;&lt;SPAN style="FONT: 7pt 'Times New Roman'"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;FONT face=Calibri size=3&gt;Internet business – “outside the firewall” applications for attracting, retaining, and otherwise monetizing customers.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;The intentions are to have a common search platform supporting both of these general markets and to include application specific capabilities and templates that are unique to each. FAST had already started down this path. For example, FAST AdMomentum is an ad platform that interoperates with search and is relevant to monetization strategies in Internet Businesses, but not so obvious of a fit for inside the firewall apps.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;This relatively straightforward strategy and message was very important to get out to the FAST customers base, especially given that Internet Businesses have made up well more than half of FAST’s business to date. Also, most&amp;nbsp;industry pundits will tell you that the requirements for search inside the corporate firewall are simply different than search in consumer facing applications. Even so, what’s so promising to me about this strategy is that there are opportunities to “bleed” capabilities between these two application spaces. We saw this &lt;/FONT&gt;&lt;A href="http://en.wikipedia.org/wiki/Consumerization"&gt;&lt;FONT face=Calibri size=3&gt;“consumerization”&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt; of search features happen more than once at FAST. Features that we initially designed for consumer search found their way into intranet search deployments (one simple example is the “best bets” concept like the one found in &lt;/FONT&gt;&lt;A href="http://office.microsoft.com/en-us/sharepointserver/HA011605771033.aspx"&gt;&lt;FONT face=Calibri color=#0000ff size=3&gt;SharePoint&lt;/FONT&gt;&lt;/A&gt;&lt;FONT face=Calibri size=3&gt;). The opposite has also happened. Now, consider the capabilities in SharePoint, which is already powering many consumer facing Web sites, and you can see where this can lead. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;There you have it, my first post for the Microsoft Enterprise Search Blog. Look for more posts from me in this general category of enterprise search vision and strategy. I welcome all comments on this and future entries. &lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Next up – Search plus Natural User Interfaces.&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 10pt"&gt;&lt;FONT face=Calibri size=3&gt;Nate &lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9554713" width="1" height="1"&gt;</content><author><name>ntreloar</name><uri>http://blogs.msdn.com/members/ntreloar.aspx</uri></author><category term="SharePoint" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/SharePoint/default.aspx" /><category term="FAST" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/FAST/default.aspx" /><category term="Strategy" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Strategy/default.aspx" /></entry><entry><title>Microsoft Presents FAST forward 09: Engage Your User</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/12/18/microsoft-presents-fast-forward-09-engage-your-user.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/12/18/microsoft-presents-fast-forward-09-engage-your-user.aspx</id><published>2008-12-18T20:31:29Z</published><updated>2008-12-18T20:31:29Z</updated><content type="html">&lt;p&gt;The Mirage, Las Vegas, Feb 9-11&lt;/p&gt;  &lt;p&gt;Since its inaugural conference in 2006, FAST&lt;i&gt;forward&lt;/i&gt; has been a venue for though leadership and innovation in the field of search. This year, &lt;strong&gt;FAST&lt;/strong&gt;&lt;strong&gt;&lt;i&gt;forward’09&lt;/i&gt;&lt;/strong&gt; is the industry’s largest business and technology conference dedicated to search-driven innovation. Join the discussion! At &lt;strong&gt;FAST&lt;/strong&gt;&lt;strong&gt;&lt;i&gt;forward’09&lt;/i&gt;&lt;/strong&gt;, we explore how businesses are responding – and evolving – in the face of rapid technological change and the growing demands for user control. As The User Revolution continues, we examine search’s critical role in helping companies engage their users. This year’s conference will also highlight Microsoft’s vision for enterprise search technology. &lt;/p&gt;  &lt;p&gt;New this year, a SharePoint technology track covering Enterprise Search, Social Computing, Enterprise Content Management and more!&amp;#160; Other tracks include:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;Monetization via Search (customer-facing)&lt;/li&gt;    &lt;li&gt;Productivity via Search (internal enterprise)&lt;/li&gt;    &lt;li&gt;FAST technology&lt;/li&gt;    &lt;li&gt;Partner Solutions&lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;Top Ten Reasons Why You Should Attend FAST&lt;i&gt;forward’09&lt;/i&gt;:&lt;/p&gt;  &lt;blockquote&gt;   &lt;p&gt;&lt;strong&gt;1. Uncover new opportunities for using search &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;2. Hear what others have done with search technology &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;3. Learn industry best practices for search &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;4. Hear the Microsoft vision for search and FAST &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;5. Learn how SharePoint and FAST products are positioned &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;6. Gain insight on integration plans for SharePoint and FAST products &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;7. Understand how partners can help &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;8. Obtain access to Microsoft and FAST executives and industry luminaries &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;9. Network with colleagues &lt;/strong&gt;&lt;/p&gt;    &lt;p&gt;&lt;strong&gt;10. Attend convenient pre-conference technical training &lt;/strong&gt;&lt;/p&gt; &lt;/blockquote&gt;  &lt;p&gt;&lt;/p&gt;  &lt;p&gt;Come spend three days with us at the Mirage in Las Vegas learning from industry thought leaders, customers, partners, and our own Microsoft experts! &lt;/p&gt;  &lt;p&gt;Learn more at &lt;a href="http://www.fastforward09.com/"&gt;FASTforward ‘09&lt;/a&gt;. Register before January 9 and receive &lt;b&gt;$400 off&lt;/b&gt; of the full registration fee. See you there!&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9237486" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author></entry><entry><title>Microsoft positioned in the Leaders Quadrant of the 2008 Information Access Magic Quadrant</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/10/30/microsoft-a-leader-in-the-gartner-2008-magic-quadrant-for-information-access.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/10/30/microsoft-a-leader-in-the-gartner-2008-magic-quadrant-for-information-access.aspx</id><published>2008-10-30T21:33:00Z</published><updated>2008-10-30T21:33:00Z</updated><content type="html">&lt;P&gt;We’ve got great news to share! Last month, Gartner published the 2008 Magic Quadrant for Information Access Technology, and Microsoft was positioned in the Leaders Quadrant. Since the completion of the acquisition, we’ve worked incredibly hard to communicate and demonstrate a combined vision and strategy to our customers and partners. It’s good to know we’re heading in the right direction!&lt;/P&gt;
&lt;P&gt;When I talk with customers about search, it’s clear that organizations have very different needs. In fact, many people tell me that even within an organization the one-size-fits-all approach just doesn’t work. So over the last year, we’ve announced some bold moves designed to create a compelling portfolio of search applications. With the addition of Search Server Express and the acquisition of FAST, we now have a product line-up designed to meet a broad range of business needs:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Some departments or small organizations need search that is quick and easy to set up; we offer Microsoft Search Server Express as a free download so that you can get it up and running in about 30 minutes. We’re excited to see customers like &lt;A href="http://www.sjm.com/" mce_href="http://www.sjm.com"&gt;St. Jude Medical&lt;/A&gt; and Urbis having quick successes with Express. We’re also seeing partners, such as &lt;A href="http://www.startready.com/" mce_href="http://www.startready.com"&gt;StartReady&lt;/A&gt;, build solutions around Search Server Express to create a search appliance. &lt;/LI&gt;
&lt;LI&gt;Many organizations need search as an integral part of a business productivity infrastructure; Search in Microsoft Office SharePoint Server is integrated with other key SharePoint productivity workloads such as portals, collaboration, ECM, business processes and BI. Customers like McCann Worldgroup and Jones Lang LaSalle are all deriving productivity increases with better search in SharePoint. In particular, both companies are promoting collaboration and leveraging in-house experts with people search enhanced by user profiles in MySites. &lt;/LI&gt;
&lt;LI&gt;Some organizations face business problems that demand high-end search; FAST ESP offers best-in-class search with extreme scalability, query performance, and other advanced capabilities for sophisticated customer-facing or inside-the-firewall applications. For example, &lt;A href="http://www.aerotek.com/Jobs-Employment/Default.aspx" mce_href="http://www.aerotek.com/Jobs-Employment/Default.aspx"&gt;Aerotek&lt;/A&gt; and &lt;A href="http://www.teksystems.com/Careers/Default.aspx" mce_href="http://www.teksystems.com/Careers/Default.aspx"&gt;TEKsystems&lt;/A&gt;, two of the world’s largest staffing companies, deliver job searching to more than 1.3 million users. In more than 164 million queries, greater than 99.5% of query results came back in less than 2 seconds. For inside-the-firewall productivity, they index more than 10 million complex candidate records with low latency during high volume index updates. We’re also excited to see Pfizer pushing the envelope with an Enterprise Collaboration Framework driven by FAST ESP on top of SharePoint &lt;/LI&gt;&lt;/UL&gt;
&lt;P&gt;While our “Leaders Quadrant” position in the Magic Quadrant is an important milestone, we still think of this as the very beginning of our journey. We’re continuing to combine our deep technical expertise with our broad reach to deliver exciting innovations to the market – so you can and should expect great things to come. Stay tuned!&lt;/P&gt;
&lt;P&gt;Kirk Koenigsbauer &lt;BR&gt;General Manager, &lt;BR&gt;SharePoint Business Group &lt;/P&gt;
&lt;P&gt;&lt;A href="http://mediaproducts.gartner.com/reprints/microsoft/vol6/article4/article4.html" mce_href="http://mediaproducts.gartner.com/reprints/microsoft/vol6/article4/article4.html"&gt;Magic Quadrant for Information Access Technology&lt;/A&gt; (Gartner Research, Sept. 30, 2008) Microsoft is positioned in the Leaders Quadrant of Gartner, Inc.'s 2008 Magic Quadrant for Information Access Technology. This report assesses vendors with capabilities that go beyond enterprise search to encompass a range of technologies. Their capabilities include search; federated search, content classification, categorization and clustering; fact and entity extraction; taxonomy creation and management; information presentation (for example, visualization) to support analysis and understanding; and desktop search to address user-controlled repositories in order to locate and "invoke" documents, data, e-mail and intelligence.&lt;/P&gt;&lt;SPAN style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 'Arial','sans-serif'"&gt;&lt;SPAN style="FONT-SIZE: 8.5pt; FONT-FAMILY: 'Verdana','sans-serif'; mso-bidi-font-family: 'Times New Roman'"&gt;&lt;FONT color=#000000&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 12pt; LINE-HEIGHT: 15.6pt"&gt;&lt;SPAN style="FONT-SIZE: 8.5pt; FONT-FAMILY: 'Arial','sans-serif'"&gt;The Magic Quadrant is copyrighted 2008 by Gartner, Inc. and is reused with permission. The Magic Quadrant is a graphical representation of a marketplace at and for a specific time period. It depicts Gartner's analysis of how certain vendors measure against criteria for that marketplace, as defined by Gartner. Gartner does not endorse any vendor, product or service depicted in the Magic Quadrant, and does not advise technology users to select only those vendors placed in the "Leaders" quadrant. The Magic Quadrant is intended solely as a research tool, and is not meant to be a specific guide to action. Gartner disclaims all warranties, express or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose. &lt;?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;/FONT&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9025325" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="Announcements" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Announcements/default.aspx" /><category term="Analyst" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Analyst/default.aspx" /><category term="Search Server" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Search+Server/default.aspx" /><category term="SharePoint" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/SharePoint/default.aspx" /><category term="FAST" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/FAST/default.aspx" /></entry><entry><title>Taking People Search on the Road….</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/10/16/taking-people-search-on-the-road.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/10/16/taking-people-search-on-the-road.aspx</id><published>2008-10-16T17:33:53Z</published><updated>2008-10-16T17:33:53Z</updated><content type="html">&lt;p&gt;In another great blog post Matt McDermott walks you through the steps of enabling SharePoint’s people search capability on a mobile device with the end results looking something like this;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/Media/WindowsLiveWriter/MobilePeopleSearch_1075E/image4.png"&gt;&lt;img border="0" alt="Search Results" src="http://blogs.catapultsystems.com/matthew/Media/WindowsLiveWriter/MobilePeopleSearch_1075E/image4_thumb.png" width="324" height="244" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;The post is here;&lt;/p&gt;  &lt;p&gt;&amp;#160;&lt;a title="http://blogs.catapultsystems.com/matthew/archive/2008/09/27/mobile-people-search.aspx" href="http://blogs.catapultsystems.com/matthew/archive/2008/09/27/mobile-people-search.aspx"&gt;http://blogs.catapultsystems.com/matthew/archive/2008/09/27/mobile-people-search.aspx&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;Richard Riley    &lt;br /&gt;Senior Technical Product Manager     &lt;br /&gt;Microsoft Corp.&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9002027" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="ITPro" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/ITPro/default.aspx" /><category term="Search Server" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Search+Server/default.aspx" /><category term="Samples" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Samples/default.aspx" /><category term="SharePoint" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/SharePoint/default.aspx" /><category term="People Search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/People+Search/default.aspx" /></entry><entry><title>Partner Post: One Stop Search from the Microsoft Office Research Task Pane</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/09/26/partner-post-one-stop-search-from-the-microsoft-office-research-task-pane.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/09/26/partner-post-one-stop-search-from-the-microsoft-office-research-task-pane.aspx</id><published>2008-09-26T20:04:59Z</published><updated>2008-09-26T20:04:59Z</updated><content type="html">&lt;p&gt;Since the release of Microsoft Office 2003, Microsoft desktop applications such as MS Word, PowerPoint, Excel, Outlook and Internet Explorer have contained an internal federated or meta-search capability known as the ‘Research Pane’. To see this in action in office 2003 (see &lt;a href="http://office.microsoft.com/en-us/word/HA102158281033.aspx#16"&gt;link&lt;/a&gt; for instructions for Office 2007), select (i.e. highlight) a word or phrase within MS Word or MS Outlook, and on PC’s right click on the highlighted word, pull down to the “Lookup Up” option and click. Another way to do this is to hold down the ‘Alt’ button while left-clicking on a highlighted word (in Macs use a command-click). The Research Pane should then open up in the application window and execute a search on the highlighted section. Out of the box, MS Office ships with several research sources such as the Microsoft Encarta Dictionary, Microsoft Live Search, MSN Money and some third party offerings from Factiva and Thomson Gale among others. Here is a screenshot of content returned from three enterprise search engines as well as from some public biomedical websites.&lt;/p&gt;  &lt;p&gt;&lt;a href="https://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostOneStopSearchfromtheMicrosoft_D9E5/clip_image002_2.jpg"&gt;&lt;img style="border-right-width: 0px; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="clip_image002" border="0" alt="clip_image002" src="https://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostOneStopSearchfromtheMicrosoft_D9E5/clip_image002_thumb.jpg" width="448" height="320" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;The list of sources that can be searched from the Research Pane is expandable by adding connections to Research Pane service providers. Armed with a URL to a Research Pane “registration service”, a user can install the source into their MS applications using the “Research options…” link. This potentially gives users access to a large set of data sources to choose from. Once a source is installed, the user can select the source from a dropdown list (which causes the search to be executed) or can select a set of sources based on certain pre-defined categories. &lt;/p&gt;  &lt;p&gt;Raritan Technologies specializes in &lt;a href="http://www.raritantechnologies.com/downloads.shtml"&gt;Federated Search solutions&lt;/a&gt; and has created an array of search connectors to a number of web sites, web services, search engines and databases and directory services (to name a few) using our Search Integration Framework Toolkit (&lt;a href="http://www.raritantechnologies.com/Sift.shtml"&gt;SIFT&lt;/a&gt;) and Federation Manager. We and our partner in this effort, &lt;a href="http://www.ideaeng.com/index.html"&gt;New Idea Engineering&lt;/a&gt;, have also provided a number of ways to deploy these federated search connectors to web applications and within web services such as SOAP and Open Search. We have recently added to this list by providing a MS Research Pane service ‘front-end’ to our federated connectors. This enables connections to search engines such as Autonomy IDOL, K2 or Ultraseek, Dieselpoint, Endeca, Exalead, Fast, Lucene, Mark Logic (and others) as well as Sharepoint (out of the box) SQL databases, LDAP directories, SOAP and OpenSearch web services, Z39.50 sources, Internet web sites that have search boxes (a very large list that includes general web search engines and specialized sites such as news or research sites) and Content Management Systems such as Alfresco, Documentum and eRoom, and Archival Systems like Symantec Enterprise Vault to be ‘plugged-in’ to any MS Office application. The modular design of the Raritan Search Integration Framework enables other connectors to be added to this list and as this happens, these new sources will automatically be available to users of the Research Pane once configured as a service.&lt;/p&gt;  &lt;p&gt;The ability to combine internal content sources from content management systems, enterprise search engines, databases and directory services with external content from subscription or public web sites and web services into MS Office applications provides a huge potential for search integration at the “tip of the sword” where thought and knowledge are combined to create new content.&lt;/p&gt;  &lt;p&gt;For more information on the Raritan Technologies “Research Pane Integration” or to arrange for a trial connector please visit &lt;a href="http://www.raritantechnologies.com/ResearchPane.shtml"&gt;http://www.raritantechnologies.com/ResearchPane.shtml&lt;/a&gt;.&lt;/p&gt;  &lt;p&gt;Barry Freindlich    &lt;br /&gt;President Raritan     &lt;br /&gt;Technologies, Inc.&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8966982" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="Partners" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Partners/default.aspx" /></entry><entry><title>How to: Customize the Thesaurus in SharePoint Search and Search Server</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/09/23/how-to-customize-the-thesaurus-in-sharepoint-search-and-search-server.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/09/23/how-to-customize-the-thesaurus-in-sharepoint-search-and-search-server.aspx</id><published>2008-09-23T21:49:04Z</published><updated>2008-09-23T21:49:04Z</updated><content type="html">&lt;p&gt;The thesaurus is an xml file that provides users with a means of automatically expanding or rewriting their queries to include synonyms, acronyms, etc. For example, in a chemical company, product ID 1234, oxygen, O2 and LOX could all refer to the same item.&lt;/p&gt;  &lt;p&gt;A SharePoint Search administrator can modify the thesaurus file to substitute all these words at search query time. This document explains how to set up a thesaurus and where to find the relevant files.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;Supported Thesaurus Syntax:&lt;/strong&gt;     &lt;br /&gt;To use the sample files provided by the product, you need to remove the comment beginning (&amp;lt;!--) and ending lines (--&amp;gt;) from the xml file.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;Explanation of terms:&lt;/strong&gt;&lt;/p&gt;  &lt;table border="0" cellspacing="0" cellpadding="0" width="446"&gt;&lt;tbody&gt;     &lt;tr&gt;       &lt;td valign="top" width="200"&gt;&lt;strong&gt;Term&lt;/strong&gt;&lt;/td&gt;        &lt;td valign="top" width="244"&gt;&lt;strong&gt;Meaning&lt;/strong&gt;&lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;thesaurus&lt;/td&gt;        &lt;td valign="top" width="244"&gt;marks beginning (and end) of thesaurus&lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;diacritics_sensitive&lt;/td&gt;        &lt;td valign="bottom" width="244"&gt;         &lt;p&gt;Diacritics are marks, such as accents that are added to letters that change their pronunciation. For example, the acute accent over and e gives you: é.            &lt;br /&gt;0 – ignore diacritics             &lt;br /&gt;1 – respect diacritics&lt;/p&gt;       &lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;expansion&lt;/td&gt;        &lt;td valign="top" width="244"&gt;A list of alternative forms each marked by &amp;lt;sub&amp;gt; by the sub keyword&lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;sub&lt;/td&gt;        &lt;td valign="top" width="244"&gt;One of several alternatives in an expansion&lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;replacement&lt;/td&gt;        &lt;td valign="top" width="244"&gt;Several patterns will be replaced with a substitution.&lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;pat&lt;/td&gt;        &lt;td valign="top" width="244"&gt;A pattern to be replaced&lt;/td&gt;     &lt;/tr&gt;      &lt;tr&gt;       &lt;td valign="top" width="200"&gt;sub&lt;/td&gt;        &lt;td valign="top" width="244"&gt;Item to be substituted&lt;/td&gt;     &lt;/tr&gt;   &lt;/tbody&gt;&lt;/table&gt;  &lt;p&gt;&lt;strong&gt;Example:&lt;/strong&gt;&lt;/p&gt;  &lt;pre class="csharpcode"&gt;&lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;XML&lt;/span&gt; &lt;span class="attr"&gt;ID&lt;/span&gt;&lt;span class="kwrd"&gt;=&amp;quot;Microsoft Search Thesaurus&amp;quot;&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
  &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;thesaurus&lt;/span&gt; &lt;span class="attr"&gt;xmlns&lt;/span&gt;&lt;span class="kwrd"&gt;=&amp;quot;x-schema:tsSchema.xml&amp;quot;&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;diacritics_sensitive&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;0&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;diacritics_sensitive&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
  &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;expansion&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;Internet Explorer&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;IE&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;IE5&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
  &lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;expansion&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
  &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;replacement&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;pat&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;NT5&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;pat&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;pat&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;W2K&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;pat&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
    &lt;span class="kwrd"&gt;&amp;lt;&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;Windows 2000&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;sub&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
  &lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;replacement&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;
&lt;span class="kwrd"&gt;&amp;lt;/&lt;/span&gt;&lt;span class="html"&gt;thesaurus&lt;/span&gt;&lt;span class="kwrd"&gt;&amp;gt;&lt;/span&gt;&lt;/pre&gt;

&lt;p&gt;&lt;strong&gt;The example means:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;We have elected to ignore accents, etc in the thesaurus &lt;/li&gt;

  &lt;li&gt;Queries containing IE, or any other one of the &amp;lt;sub&amp;gt; clauses will also contain “internet explorer” and “ie5”. &lt;/li&gt;

  &lt;li&gt;If a query contains terms “NT5” or “W2K”, they will be replaced by “Windows 2000”. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;How to Customize the Thesaurus:&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Find the appropriate thesaurus file in the config folder contained in the registry key: [HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Global\Gathering Manager]&amp;quot;DefaultApplicationsPath” &lt;/li&gt;

  &lt;li&gt;Update the thesaurus file(s) for each appropriate language for each desired &amp;lt;expansion&amp;gt; or &amp;lt;replacement&amp;gt;. &lt;/li&gt;

  &lt;li&gt;Replace the file(s) on each index, query and web frontend server for each search application path: 
    &lt;br /&gt;%programfiles%\Microsoft Office Servers\12.0\Data\Office Server\Applications\[GUID]\Config&amp;#160; &lt;br /&gt;Note index propagation does not sync these files on all the servers in the farm. &lt;/li&gt;

  &lt;li&gt;Stop and restart search service (this is needed to load the new thesaurus files). E.G., in a console window, run “net stop osearch &amp;amp; net start osearch” without quotes, or launch Programs\Administrations Tools\Services then right click Office SharePoint Search Service then choose restart. &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Notes:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;See “&lt;i&gt;Finding Important Files&lt;/i&gt;” below for a summary of where to find the key files to manage your thesaurus.&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;(optional) If you want to have the same thesaurus files apply to all newly created SSPs, put your thesaurus files under the main config folder 
    &lt;br /&gt;(e.g., %programfiles%\Microsoft Office Servers\12.0\Data\config). &lt;/li&gt;

  &lt;li&gt;If there is a syntax error in the thesaurus file, all expansions and replacements will be ignored. &lt;/li&gt;

  &lt;li&gt;If a word in the thesaurus file matches a stop word in the stop word file, it will be ignored.&amp;#160;&amp;#160; To avoid this, remove it from the appropriate stop word file. &lt;/li&gt;

  &lt;li&gt;Thesaurus terms are broken into words at query time.&amp;#160; Add words you do not want to be broken into the custom dictionary file customLANG.lex (see Finding Important Files for more details). &lt;/li&gt;

  &lt;li&gt;Search first applies the thesaurus, and then expands words into their alternate forms, when “stemming” functionality is turned on.&amp;#160;&amp;#160; Care should be taken to avoid expanding into too many unnecessary forms as this may harm search performance and accuracy. &lt;/li&gt;

  &lt;li&gt;The “All words” option on the Advanced Search page might no longer work when using multiple term substitution with the thesaurus. This is because an implicit “+” is used between every term.&amp;#160; For example, if we used our example thesaurus above and typed E.G., “browser ie” in the “All words” field, it would look for “+browser +ie” – it would no longer allow “Internet Explorer”.&lt;/li&gt;

  &lt;li&gt;Ambiguous replacements will stop the thesaurus working (this will be noted in the appropriate logs, but will not be obvious to the user). For example, if you replace a with b and a with c, this is an error. Some admins add large thesauri which are automatically populated with items such as “replace a b with c” and “replace a,b with c”. As it turns out, after wordbreaking &lt;i&gt;these two expressions look exactly the same&lt;/i&gt;.&amp;#160; Please check for this kind of problem in the logs if you are building a large thesaurus.&lt;/li&gt;

  &lt;li&gt;There is a 10,000 term limit per language in thesaurus. &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Finding Important Files: &lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The following are the most important files used to manage your thesaurus. &lt;/p&gt;

&lt;p&gt;There are 50 default stop word files and 48 thesaurus sample files for the languages we support.&lt;/p&gt;

&lt;p&gt;The search service install path can be located by examining registry key [HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Global\Gathering Manager]&amp;quot;DefaultApplicationsPath”&lt;/p&gt;

&lt;p&gt;The default location of the thesaurus files (for each index, query and web frontend server) is: 
  &lt;br /&gt;%programfiles%\ Microsoft Office Servers\12.0\Data\Office Server&amp;#160; &lt;br /&gt;When a search application is created, a copy of the thesaurus file will also be placed under: %programfiles%\Microsoft Office Servers\12.0\Data\Office Server\Applications\[GUID]\Config &lt;/p&gt;

&lt;p&gt;Stop word files for each language can be found as noiseLANG.txt, where LANG is the 3 letter acronym for that language. For example, US English is noiseENU.txt, and the language neutral list is noiseNEU.txt.&lt;/p&gt;

&lt;p&gt;To find the appropriate acronym for your language(s), please look them up under: &lt;a href="http://www.microsoft.com/globaldev/nlsweb/default.mspx"&gt;http://www.microsoft.com/globaldev/nlsweb/default.mspx&lt;/a&gt;. 

  &lt;br /&gt;&lt;/p&gt;

&lt;table border="0" cellspacing="0" cellpadding="0" width="400"&gt;&lt;tbody&gt;
    &lt;tr&gt;
      &lt;td valign="top" width="200"&gt;Ping Lin 
        &lt;br /&gt;Senior Test Lead 

        &lt;br /&gt;Microsoft Corp.&lt;/td&gt;

      &lt;td valign="top" width="200"&gt;Victor Poznanski 
        &lt;br /&gt;Senior Program Manager 

        &lt;br /&gt;Microsoft Corp.&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;&lt;/table&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8962775" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author></entry><entry><title>SharePoint Image Search</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/09/19/sharepoint-image-search.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/09/19/sharepoint-image-search.aspx</id><published>2008-09-19T22:03:43Z</published><updated>2008-09-19T22:03:43Z</updated><content type="html">&lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/default.aspx"&gt;Matthew McDermott&lt;/a&gt;, a &lt;a href="https://mvp.support.microsoft.com/profile=9D1EC624-38BA-4652-B746-F04ECE300D3C"&gt;SharePoint MVP&lt;/a&gt;, has written a great 4 part blog post on how to make SharePoint 2007 search (and Search Server) render image results in a way that looks very similar to &lt;a href="http://images.live.com"&gt;http://images.live.com&lt;/a&gt;. &lt;/p&gt;  &lt;p&gt;Not only does this make searching images much easier, it’s also a very thorough step-by-step tutorial on how to customize results using the built in Web Parts and XSL – it’s well worth a read. &lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/archive/2008/08/28/sharepoint-image-search-part-1.aspx"&gt;SharePoint Image Search (Part 1)&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/archive/2008/08/30/sharepoint-image-search-part-2.aspx"&gt;SharePoint Image Search (Part 2)&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/archive/2008/08/31/sharepoint-image-search-part-3.aspx"&gt;SharePoint Image Search (Part 3)&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/archive/2008/09/01/sharepoint-image-search-part-4.aspx"&gt;SharePoint Image Search (Part 4)&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;The end result makes SharePoint Image results look like the screencap below.&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.catapultsystems.com/matthew/default.aspx"&gt;&lt;img style="border-bottom: 0px; border-left: 0px; border-top: 0px; border-right: 0px" title="isearch" border="0" alt="isearch" src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SharePointImageSearch_D3CD/isearch_668bb173-6e80-4b8f-a5a6-384fb0f33b13.png" width="447" height="331" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;Richard Riley   &lt;br /&gt;Senior Technical Product Manager    &lt;br /&gt;Microsoft Corp.&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8959481" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="ITPro" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/ITPro/default.aspx" /><category term="Search Server" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Search+Server/default.aspx" /><category term="Samples" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Samples/default.aspx" /><category term="SharePoint" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/SharePoint/default.aspx" /><category term="Image Search" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Image+Search/default.aspx" /></entry><entry><title>SQL File groups and Search</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/09/16/sql-file-groups-and-search.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/09/16/sql-file-groups-and-search.aspx</id><published>2008-09-16T22:17:00Z</published><updated>2008-09-16T22:17:00Z</updated><content type="html">&lt;p mce_keep="true"&gt;This article has been a long time coming, but it is finally here.&amp;#160; In the post below I will cover how to configure the Search database to span multiple filegroups.&amp;#160; First I'll cover a little about the benefits of doing so:&lt;/p&gt;  &lt;p&gt;General references on what SQL file groups are:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;A basic description of &lt;a href="http://msdn.microsoft.com/en-us/library/ms179316(SQL.90).aspx" mce_href="http://msdn.microsoft.com/en-us/library/ms179316(SQL.90).aspx"&gt;Physical Database Files and Filegroups&lt;/a&gt; &lt;/li&gt;    &lt;li&gt;High level discussion on the benefits of &lt;a href="http://msdn.microsoft.com/en-us/library/ms187087.aspx" mce_href="http://msdn.microsoft.com/en-us/library/ms187087.aspx"&gt;Using Files and Filegroups&lt;/a&gt; &lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;The method that we have chosen to implement filegroups on the Search database is one of segregation.&amp;#160; We have identified all of the tables and indexes within the database that are solely used for crawling and not used at all to satisfy end-user queries.&amp;#160; The remaining tables and indexes are used for end-user queries.&amp;#160; However, the nature of the Search and indexing problem still dictates that the &amp;quot;query&amp;quot; tables are written to during a crawl.&amp;#160; The crawl only tables and indexes are isolated into their own filegroup.&amp;#160; With the crawl and query centric filegroups identified you can now ensure that the IO intensive process of crawling has a reduced impact on the IO subsystem that is hosting the query filegroup by ensuring that these filegroups are on separate spindles. &lt;/p&gt;  &lt;p&gt;The whole goal of using filegroups is to improve the performance of the system.&amp;#160; This is done by providing an additional file.&amp;#160; This file must be placed on a different set of spindles to see any kind of performance enhancement.&amp;#160; If your SQL machine is not IO bound for the Search database then implementing filegroups will not provide you with any benefits.&amp;#160; &lt;/p&gt;  &lt;p&gt;To make the migration process easier we did not actually create a query filegroup.&amp;#160; We simply created a new filegroup called &amp;quot;CrawlFileGroup&amp;quot; and moved the crawl tables out of the PRIMARY filegroup.&amp;#160; Such that PRIMARY effectively becomes the query filegroup.&amp;#160; This migration process is one that can be quite expensive to complete and could take hours to finish.&amp;#160; Keep this in mind when scheduling this on your production servers.&amp;#160; Because the move involves dropping and recreating numerous clustered indexes you should assume that the DB is offline during this move as many long running locks will be taken to recreate the index.&amp;#160;&amp;#160; &lt;/p&gt;  &lt;p&gt;&lt;b&gt;Issues and concerns with using filegroups:&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;&lt;b&gt;Back-up and Restore&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;One concern that you will need to be aware of in you planning for deploying filegroups on the Search database is that your restore process will be slightly impacted.&amp;#160; Out of the box Search restore is unaware of the filegroup that will exist within the backup image.&amp;#160; Because of this there is no way to indicate where this file should be restored to.&amp;#160; As a result the restore process is going to try and place the crawl filegroup file onto the same drive that it existed&amp;#160; on when you ran the back-up.&amp;#160; Once you enable filegroups you will be committed to making sure that all future machines that you restore your back-up to have a drive with the same drive letter that you initially created the filegroup on.&amp;#160;&amp;#160;&amp;#160; &lt;/p&gt;  &lt;p&gt;&lt;b&gt;Future upgrades, Service packs and Hot fixes&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;Each Hotfix, Service Pack and update that you apply to the server has the potential to modify the index that was moved into the CrawlFileGroup or add an new index to one of the tables moved to the filegroup.&amp;#160; When/if this happens the index will be moved back or created in the primary filegroup.&amp;#160; Updates will also clean out any non-product sproc.&amp;#160; Because of the risk of index modification with updates applied you will need to reinstall the stored proc and run the scripts again after each update applied. &lt;/p&gt;  &lt;p&gt;The risk of a new index being added or modified quite low at this time.&amp;#160; We have confirmed that this does not occur if upgrading from RTM to SP1.&amp;#160; But, it does happen when upgrading from SP1&amp;#160; to the Infrastructure Update.&amp;#160; Future Updates are less like to modify the set of indexes. &lt;/p&gt;  &lt;p&gt;However, the risk still exists and you will want to re-run the scripts below after each update that you apply to your system.&amp;#160; In the case when you apply an update and the index did &lt;b&gt;not&lt;/b&gt; change running the script is a no-op and nothing gets moved.&amp;#160; So it is very cheap to run the script on a system that already has the indexes moved.&amp;#160; &lt;/p&gt;  &lt;p&gt;&lt;b&gt;SQL 2005 and greater&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;The script that is moving the indexes is utilizing new features that were released in SQL 2005.&amp;#160; As such you &lt;b&gt;cannot &lt;/b&gt;perform this optimization with SQL 2000.&amp;#160; &lt;/p&gt;  &lt;p&gt;&lt;b&gt;Step- by-Step instructions for applying filegroups to your environment.&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;To deploy this you will need to manually create a file group on the Search database.&amp;#160; To do this execute the following steps:&lt;/p&gt;  &lt;p&gt;a. Go to the &lt;u&gt;Filegroups&lt;/u&gt; section of the Search database &lt;u&gt;properties&lt;/u&gt; within SQL Server Management Studio.&lt;/p&gt;  &lt;p&gt;b. From the &lt;u&gt;Filegroups&lt;/u&gt; section click add and fill in the name &amp;quot;CrawlFileGroup.&amp;quot; &lt;b&gt;The scripts are written assume the filegroup has this name, failure to use this name will result in early failures&amp;#160; in the script&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image001%5B1%5D.jpg" mce_href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image001%5B1%5D.jpg"&gt;&lt;img style="border-right-width: 0px; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="clip_image001[1]" border="0" alt="clip_image001[1]" src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image001%5B1%5D_thumb.jpg" width="461" height="175" mce_src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image001%5B1%5D_thumb.jpg" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;c. Once you have a new filegroup with the name CrawlFileGroup you need add a file into this group.&amp;#160; To do this select the &lt;u&gt;Files&lt;/u&gt; section of the database properties dialog and add a new file into the CrawlFileGroup.&amp;#160; Be sure that you place this file onto a separate drive with isolated spindles. &lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image002%5B1%5D.jpg" mce_href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image002%5B1%5D.jpg"&gt;&lt;img style="border-right-width: 0px; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="clip_image002[1]" border="0" alt="clip_image002[1]" src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image002%5B1%5D_thumb.jpg" width="464" height="192" mce_src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/SQLFilegroupsandSearch_A612/clip_image002%5B1%5D_thumb.jpg" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;d. Next you need to install the stored proc that will move the indexes and tables to the new filegroup.&amp;#160; Open the script named&amp;#160; &lt;a target="_blank" href="http://blogs.msdn.com/enterprisesearch/attachment/8954476.ashx" mce_href="http://blogs.msdn.com/enterprisesearch/attachment/8954476.ashx"&gt;MoveTableToFileGroup.sql&lt;/a&gt; within Management Studio and execute it; ensuring that you are working with the Search database&amp;#160; This will create a stored proc named proc_MoveTableToFileGroup.&amp;#160; Confirm that this sproc does indeed exist within the Search database.&lt;/p&gt;  &lt;p&gt;e. Open and execute the second script named&amp;#160;&amp;#160; &lt;a target="_blank" href="http://blogs.msdn.com/enterprisesearch/attachment/8954472.ashx" mce_href="http://blogs.msdn.com/enterprisesearch/attachment/8954472.ashx"&gt;MoveCrawlTablesToFileGroup.sql&lt;/a&gt;, this is the script that does all of the work by calling proc_MoceTableToFileGroup for each table that is dedicated for crawling.&amp;#160; &lt;/p&gt;  &lt;p&gt;That is all there is to it.&amp;#160; You have now moved you crawl tables on to a separate set of spindles.&amp;#160; &lt;/p&gt;  &lt;p&gt;Thank you for your time and as always I welcome any feedback or questions&lt;/p&gt;  &lt;p&gt;Dan Blood    &lt;br /&gt;Senior Test&amp;#160; Engineer     &lt;br /&gt;Microsoft Corp&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8954463" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="ITPro" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/ITPro/default.aspx" /><category term="Perf &amp; Scale" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Perf+_2600_+Scale/default.aspx" /></entry><entry><title>Partner Post: Announcing conceptClassifier for SharePoint – Automatic Classification within Office</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/09/02/partner-post-announcing-conceptclassifier-for-sharepoint-automatic-classification-within-office.aspx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/09/02/partner-post-announcing-conceptclassifier-for-sharepoint-automatic-classification-within-office.aspx</id><published>2008-09-03T00:10:00Z</published><updated>2008-09-03T00:10:00Z</updated><content type="html">&lt;p mce_keep="true"&gt;Enterprise customers are increasingly struggling with how to apply policy and governance at the desktop. End user adoption is cited as the single most critical barrier to success in ECM and Records Management initiatives. Using Concept Searching’s unique compound term processing concept&lt;b&gt;Classifier&lt;/b&gt; for SharePoint can now be used to automatically classify content from Microsoft Office Applications, upload the documents directly to SharePoint, store the metadata in SharePoint properties and write back the classifications to the custom properties of the document for use within knowledge and workflow applications or enterprise applications such as ECM, Document Management, Records Management, or eDiscovery.&lt;/p&gt;  &lt;p&gt;The classification can take place automatically without end user intervention. Optionally, Subject Matter Experts can be granted the authority to manually adjust the classification based on the taxonomy. A ribbon bar has been added to the familiar Office interface enabling automatic classification of content. When the end user classifies a document the system will retrieve existing concepts as an aid to the classification process as shown below. Subject Matter Experts also have the ability to add or delete classes in the taxonomy.&lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image002_2.jpg" mce_href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image002_2.jpg"&gt;&lt;img style="border-right-width: 0px; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="clip_image002" border="0" alt="clip_image002" src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image002_thumb.jpg" width="442" height="278" mce_src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image002_thumb.jpg" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;Documents are uploaded to SharePoint and the classification metadata is stored in the properties fields. The classification status automatically reflects the manual classification so as to not overwrite the classification classes the Subject Matter Expert entered. The systems administrator features currently enabled include the ability to edit the classifications, classify the document, a batch of documents or the full library. This metadata can now be used by Microsoft Enterprise Search to improve identification of relevant documents when searching. &lt;/p&gt;  &lt;p&gt;&lt;a href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image004_2.jpg" mce_href="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image004_2.jpg"&gt;&lt;img style="border-right-width: 0px; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="clip_image004" border="0" alt="clip_image004" src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image004_thumb.jpg" width="446" height="281" mce_src="http://blogs.msdn.com/blogfiles/enterprisesearch/WindowsLiveWriter/PartnerPostAnnouncingconceptClassifierfo_F193/clip_image004_thumb.jpg" /&gt;&lt;/a&gt;&lt;/p&gt;  &lt;p&gt;For more information visit &lt;a href="http://www.conceptsearching.com/" mce_href="http://www.conceptsearching.com"&gt;www.conceptsearching.com&lt;/a&gt; or &lt;a href="http://www.conceptsearching.com/web/UserFiles/video/Office%20Integration/Office%20Integration.htm" mce_href="http://www.conceptsearching.com/web/UserFiles/video/Office%20Integration/Office%20Integration.htm"&gt;click here&lt;/a&gt; to view a webcast demo of the integrated technology. &lt;/p&gt;  &lt;p&gt;Martin Garland&lt;strong&gt;&amp;#160;&amp;#160;&amp;#160;&amp;#160; &lt;br /&gt;&lt;/strong&gt;President&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160; Concept Searching, Inc &lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8921011" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="Announcements" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Announcements/default.aspx" /><category term="Partners" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Partners/default.aspx" /><category term="Classification" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Classification/default.aspx" /></entry><entry><title>SQL Index defrag and maintenance tasks for Search</title><link rel="alternate" type="text/html" href="http://blogs.msdn.com/enterprisesearch/archive/2008/09/02/sql-index-defrag-and-maintenance-tasks-for-search.aspx" /><link rel="enclosure" type="text/plain" length="3773" href="http://blogs.msdn.com/enterprisesearch/attachment/8920977.ashx" /><id>http://blogs.msdn.com/enterprisesearch/archive/2008/09/02/sql-index-defrag-and-maintenance-tasks-for-search.aspx</id><published>2008-09-02T23:47:00Z</published><updated>2008-09-02T23:47:00Z</updated><content type="html">&lt;p&gt;&lt;/p&gt;  &lt;p&gt;Hi all, this topic is an area that has caused me much pain and work.&amp;#160; My goal for this was to follow the recommended SQL guidelines while minimizing the impact that these maintenance jobs have on Crawling and Queries.&amp;#160; We know from the &lt;a href="http://blogs.msdn.com/enterprisesearch/archive/2008/05/19/sql-monitoring-and-i-o.aspx"&gt;SQL Monitoring an I/O&lt;/a&gt; post that Search is extremely I/O intensive .&amp;#160; As it turns out so is all of the regular maintenance that SQL recommends, so finding the right balance between the two is an interesting scheduling task. &lt;/p&gt;  &lt;p&gt;As a starting point much information about SQL maintenance and MOSS is covered in the following paper:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;&lt;a href="http://go.microsoft.com/fwlink/?LinkId=111531&amp;amp;amp;clcid=0x409"&gt;&lt;b&gt;Database Maintenance for Microsoft&lt;sup&gt;®&lt;/sup&gt; SharePoint&lt;sup&gt;®&lt;/sup&gt; Products and Technologies&lt;/b&gt;&lt;/a&gt;&lt;b&gt; &lt;/b&gt;&lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;There are some key areas from the above paper that I would like to augment here.&lt;/p&gt;  &lt;ol&gt;   &lt;li&gt;The stored procedure (proc_DefragIndexes) identified in this paper will work, but it is extremely expensive to run on the Search DB as it defrags &lt;u&gt;all&lt;/u&gt; of the indexes in the table. &lt;/li&gt;    &lt;li&gt;Maintenance plans generated with the Maintenance Plan Wizard in SQL Server 2005 can cause unexpected results (KB &lt;a href="http://support.microsoft.com/kb/932744"&gt;932744&lt;/a&gt;.)&amp;#160; While this was fixed in SQL 2005 SP2 these maintenance plans also do more work than is necessary to have a healthy functional system.&amp;#160;&amp;#160;&amp;#160; &lt;/li&gt;    &lt;li&gt;Shrinking&amp;#160; the Search DB&amp;#160; should not be a necessary task that you need to perform.&amp;#160; The process of Shrinking the database does not provide a performance benefit.&amp;#160; SQL best practices for &lt;a href="http://msdn.microsoft.com/en-us/library/ms189493.aspx"&gt;DBCC SHRINKFILE&lt;/a&gt; suggest that this operation is most effective after an operation that creates lots of unused space.&amp;#160; Search does not regularly perform these types of operations.&amp;#160; The only time that a SHRINKFILE may make sense is after you have cleaned out your index by removing a Content Source.&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160; &lt;/li&gt;    &lt;li&gt;Rebuilding an index can cause latency issues with SQL Mirroring if the SQL I/O subsystem is constrained.&amp;#160; If you are using SQL Mirroring, be sure you are following the SQL &lt;a href="http://www.microsoft.com/technet/prodtechnol/sql/2005/technologies/dbm_best_pract.mspx"&gt;best practices&lt;/a&gt; and the &lt;a href="http://technet.microsoft.com/en-us/library/cc262910.aspx"&gt;SharePoint mirroring white paper&lt;/a&gt;.&amp;#160; Because Search, SQL Mirroring, and defrag are all very I/O intensive you will want to be extra cautious with your deployment plan for this defrag script and make sure you &lt;b&gt;&lt;i&gt;test the script prior to going into production&lt;/i&gt;&lt;/b&gt;. &lt;/li&gt; &lt;/ol&gt;  &lt;p&gt;&lt;b&gt;DBCC CHECKDB&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;&lt;a href="http://msdn.microsoft.com/en-us/library/ms176064.aspx"&gt;DBCC CHECKDB&lt;/a&gt; is a command used to check the logical and physical integrity of all the objects in a database.&amp;#160; SQL Best practices recommend that you run DBCC CHECKDB periodically.&amp;#160; For a Search deployment we would recommend that you run DBCC CHECKDB WITH PHYSICAL_ONLY on a regular basis.&amp;#160; The PHYSICAL_ONLY option will reduce the overhead of the command.&amp;#160; However, due to the cost of running this you should schedule it during off-peak times.&amp;#160; The frequency of execution depends on your business needs, but a good place to start is once a week just prior to your back-up.&amp;#160; You still need to run DBCC CHECKDB, but less frequently also based on business needs.&amp;#160; Perhaps every other or every third back-up.&amp;#160;&amp;#160; &lt;/p&gt;  &lt;p&gt;When running these commands make sure that you have a monitoring process in-place.&amp;#160; DBCC only reports errors, it does not fix them unless explicitly specified by other options.&amp;#160; You either want to archive the output of the DBCC command for post processing or make sure you have event log monitoring set-up (for example MOM) to check for DBCC errors.&lt;/p&gt;  &lt;p&gt;In very large environments you can run DBCC on an off-line (sandbox) copy of the database.&amp;#160; This will be less intrusive to end-users and the crawl.&amp;#160; In this scenario you would restore your back-up to a separate sandbox and run DBCC CHECKDB in the restored&amp;#160; environment.&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160;&amp;#160; &lt;/p&gt;  &lt;p&gt;&lt;b&gt;Fragmentation and index statistics freshness&lt;/b&gt;&lt;/p&gt;  &lt;p&gt;We started with the proc_DefragIndexes script mentioned above.&amp;#160; After running it became obvious that the script was just too expensive to run on a regular basis.&amp;#160; To reduce the load placed on the I/O system we took a look at all of our indexes in the Search DB and defragged them one-by-one to measuring query performance along the way.&amp;#160; Doing this we were able to identify the indices that provided a performance benefit to the system when they were defragmented.&amp;#160; These indexes are listed below:&lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;IX_MSSDocProps &lt;/li&gt;    &lt;li&gt;IX_MSSDocSdids &lt;/li&gt;    &lt;li&gt;IX_AlertDocHistory &lt;/li&gt;    &lt;li&gt;IX_MSSDEFINITIONS_DOCID &lt;/li&gt;    &lt;li&gt;IX_MSSDEFINITIONS_TERM &lt;/li&gt;    &lt;li&gt;PK_Sdid &lt;/li&gt;    &lt;li&gt;IX_SDHash &lt;/li&gt;    &lt;li&gt;IX_DOCID &lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;Optionally there are two additional indexes that you may want to include in your defrag maintenance plan.&amp;#160; These indexes do not see much use in typical out of box situations and are commented out in the script.&amp;#160; But if your environment is built on a custom UI or makes extensive use of the Advanced Search UI you will see improvements in query latencies if you defrag them. &lt;/p&gt;  &lt;ul&gt;   &lt;li&gt;IX_int -- defrag this index if you have a lot of queries that using numeric properties in the property store.&amp;#160; The classic case is date rage queries. &lt;/li&gt;    &lt;li&gt;IX_Str -- defrag this index if you have a lot of queries that using string properties in the property store.&amp;#160; There is not a common case for this but if you have made changes to your managed properties and are driving your search UI off of exact matches for a string based property you will want to regularly defrag this index. &lt;/li&gt; &lt;/ul&gt;  &lt;p&gt;Once we knew which indexes to defrag we looked at the duration it took for the index to reach a 10% defragmentation rate.&amp;#160; From this we adjusted the &lt;a href="http://technet.microsoft.com/en-us/library/ms188388.aspx"&gt;FILLFACTOR&lt;/a&gt; so we could maintain a longer period of time between actually needing a defrag.&amp;#160; At this point we are seeing a duration somewhere around 2+ weeks between defrags.&amp;#160; Do note that by increasing the FILLFACTOR we did grow the size of the database slightly, the growth rate on SearchBeta was not that large.&lt;/p&gt;  &lt;p&gt;We then looked at the cost/benefit of doing a &lt;a href="http://technet.microsoft.com/en-us/library/ms189858.aspx"&gt;Reorganize versus a Rebuild&lt;/a&gt;.&amp;#160; This was a interesting discovery for us.&amp;#160; Initially we had a script in place similar to proc_DefragIndexes that would choose to Reorganize or Rebuild based on percent fragmentation with 30% being the decision point (IE greater than 30% would do a Rebuild).&amp;#160; What we found was a Reorganize was taking over 8 hours with a 10% fragmentation rate and during this time end-user queries suffered dramatically.&amp;#160; Out of curiosity and desperation we tried a Rebuild which is supposed to be the more expensive of the two operations.&amp;#160; The Rebuild operation is completing in approximately &lt;b&gt;1 hour&lt;/b&gt; while the Reorganize takes as long as &lt;b&gt;8 hours&lt;/b&gt;.&amp;#160; The Rebuild operation is more expensive in the sense that you will see some failed queries during the hour that it runs, where as the Reorganize doesn't have as drastic of an effect on the queries, but the overall cost is much higher since you have an 8 hour window where the query performance is degraded.&amp;#160; &lt;a href="http://msdn.microsoft.com/en-us/library/ms187348.aspx"&gt;UPDATE STATISTICS&lt;/a&gt;:&amp;#160; In the experiments we ran we found that simply doing the rebuild (which also updates statistics) that it was not necessary to regularly use this command. &lt;/p&gt;  &lt;p&gt;Finally we deployed the script into an environment that utilized SQL Mirroring.&amp;#160; Unfortunately this didn't work out very well.&amp;#160; The mirror got so far behind that we eventually had to disconnect the mirror and stop the defrag.&amp;#160; Going through an analysis of this it became clear that the root cause was that the environment was heavily I/O bound and the defrag script generated more I/O than the system could keep up with.&amp;#160;&amp;#160; While the mirror was behind end-user query latencies suffered dramatically.&amp;#160; To recover from this we ultimately had to improve the hardware by increasing the number of spindles.&amp;#160; &lt;/p&gt;  &lt;p&gt;To mitigate this we have added a parameter to the script that allows you to reduce the &lt;a href="http://msdn.microsoft.com/en-us/library/ms181007.aspx"&gt;MAXDOP&lt;/a&gt; used in the index rebuild.&amp;#160; Setting this parameter to 1 on a SQL box that is minimally I/O bound helps, but it may not be enough depending on how constrained the system is.&amp;#160; If you are in an environment&amp;#160; that is I/O bound (with or without SQL Mirroring) we strongly recommend that you go through a test of the defrag before you go live with the deployment.&amp;#160; The easiest thing to try is the following SQL statement:&lt;/p&gt;  &lt;pre class="csharpcode"&gt;&lt;span class="kwrd"&gt;ALTER&lt;/span&gt; &lt;span class="kwrd"&gt;INDEX&lt;/span&gt; IX_MSSDocProps &lt;span class="kwrd"&gt;ON&lt;/span&gt; [dbo].[MSSDocProps]

REBUILD &lt;span class="kwrd"&gt;WITH&lt;/span&gt; (MAXDOP = 1, &lt;span class="kwrd"&gt;FILLFACTOR&lt;/span&gt; = 80, ONLINE = &lt;span class="kwrd"&gt;OFF&lt;/span&gt;)&lt;/pre&gt;
&lt;style type="text/css"&gt;


.csharpcode, .csharpcode pre
{
	font-size: small;
	color: black;
	font-family: consolas, "Courier New", courier, monospace;
	background-color: #ffffff;
	/*white-space: pre;*/
}
.csharpcode pre { margin: 0em; }
.csharpcode .rem { color: #008000; }
.csharpcode .kwrd { color: #0000ff; }
.csharpcode .str { color: #006080; }
.csharpcode .op { color: #0000c0; }
.csharpcode .preproc { color: #cc6633; }
.csharpcode .asp { background-color: #ffff00; }
.csharpcode .html { color: #800000; }
.csharpcode .attr { color: #ff0000; }
.csharpcode .alt 
{
	background-color: #f4f4f4;
	width: 100%;
	margin: 0em;
}
.csharpcode .lnum { color: #606060; }&lt;/style&gt;

&lt;p&gt;The statement above rebuilds the largest index using the lowest possible MAXDOP, this index must be rebuilt OFFLINE so you will need to run this on a test system or during a maintenance window.&amp;#160;&amp;#160; While this command is running keep an eye on the state of your mirroring with:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;The duration of the command.&amp;#160; Will it complete within your service window?&amp;#160; For comparison purposes this command completes in under an hour on the &lt;a href="http://blogs.msdn.com/enterprisesearch/archive/2008/05/03/searchbeta-hardware-configuration.aspx"&gt;SearchBeta hardware&lt;/a&gt; &lt;/li&gt;

  &lt;li&gt;&lt;a href="http://blogs.msdn.com/enterprisesearch/archive/2008/05/19/sql-monitoring-and-i-o.aspx"&gt;SQL I/O latencies&lt;/a&gt; &lt;/li&gt;

  &lt;li&gt;If you have mirroring in place 
    &lt;ul&gt;
      &lt;li&gt;The &lt;a href="http://msdn.microsoft.com/en-us/library/ms365809.aspx"&gt;Database Mirroring Monitor&lt;/a&gt; &lt;/li&gt;

      &lt;li&gt;&lt;a href="http://www.microsoft.com/technet/prodtechnol/sql/2005/technologies/dbm_best_pract.mspx#EKMAE"&gt;Send and Redo Queues&lt;/a&gt;&amp;#160; within perfmon.&amp;#160; The monitor above will tell you if mirroring is too far out of sync, but these counters are useful for comparison if you start changing the MAXDOP parameter. &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Bottom line we feel the rebuild is a much better operation to run and recommend that you:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Run the &lt;a target="_blank" href="http://blogs.msdn.com/enterprisesearch/attachment/8920977.ashx"&gt;script&lt;/a&gt; on a regular basis; once a night or on the weekends depending on your service windows. 

    &lt;ul&gt;
      &lt;li&gt;&lt;b&gt;Weekends or weekly&lt;/b&gt; - reduce the fragmentation rate (sproc parameter) to 5.0 or lower to prevent missing the defrag due to a fraction of a percent (IE - 9.5%) &lt;/li&gt;

      &lt;li&gt;&lt;b&gt;Nightly&lt;/b&gt; - use the defaults for fragmentation rate. The largest index (MSSDocProps) gets rebuilt approximately every 2 weeks on SearchBeta. Running the script nightly will ensure that your indexes are up to date more often, but gives you less control over the exact time that the index rebuild occurs. &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;

  &lt;li&gt;Before running the script the first time test out how your system will behave when rebuilding MSSDocProps. &lt;/li&gt;

  &lt;li&gt;&lt;b&gt;Reduce MAXDOP - &lt;/b&gt;If your environment shows poor I/O response time or unacceptable durations (cannot complete a defrag inside your service window) reducing the MAXDOP value &lt;b&gt;may&lt;/b&gt; reduce the duration of the script and put less pressure on the I/O system.&amp;#160; Reducing the MAXDOP will not help enough if the system is very I/O bound.&amp;#160; &lt;/li&gt;

  &lt;li&gt;&lt;b&gt;SQL Mirroring&lt;/b&gt; - SQL mirroring is sensitive to I/O latencies, adding the defrag may be too much I/O for the system handle. &lt;/li&gt;

  &lt;li&gt;&lt;b&gt;Poor I/O latency&lt;/b&gt; - You should focus on improving the I/O subsystem of your SQL environment before you begin running this script.&amp;#160;&amp;#160;&amp;#160;&amp;#160; &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Stored Procedure syntax:&lt;/p&gt;

&lt;pre class="csharpcode"&gt;&lt;span class="kwrd"&gt;exec&lt;/span&gt; proc_DefragSearchIndexes [MAXDOP &lt;span class="kwrd"&gt;value&lt;/span&gt;], &lt;br /&gt;[fragmentation &lt;span class="kwrd"&gt;percent&lt;/span&gt;]&lt;/pre&gt;
&lt;style type="text/css"&gt;


.csharpcode, .csharpcode pre
{
	font-size: small;
	color: black;
	font-family: consolas, "Courier New", courier, monospace;
	background-color: #ffffff;
	/*white-space: pre;*/
}
.csharpcode pre { margin: 0em; }
.csharpcode .rem { color: #008000; }
.csharpcode .kwrd { color: #0000ff; }
.csharpcode .str { color: #006080; }
.csharpcode .op { color: #0000c0; }
.csharpcode .preproc { color: #cc6633; }
.csharpcode .asp { background-color: #ffff00; }
.csharpcode .html { color: #800000; }
.csharpcode .attr { color: #ff0000; }
.csharpcode .alt 
{
	background-color: #f4f4f4;
	width: 100%;
	margin: 0em;
}
.csharpcode .lnum { color: #606060; }&lt;/style&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;b&gt;MAXDOP value&lt;/b&gt; - Integer value. Default is 0&amp;#160; which means that all available CPUs will be used. &lt;/li&gt;

  &lt;li&gt;&lt;b&gt;Fragmentation percent&lt;/b&gt; - decimal value. Default is 10.0.&amp;#160; This value was explicitly chosen because we able measure query latency improvements on SearchBeta when defragging at the 10% boundary.&amp;#160;&amp;#160; &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;-Thanks &lt;/p&gt;

&lt;p&gt;Dan Blood 
  &lt;br /&gt;Senior Test&amp;#160; Engineer 

  &lt;br /&gt;Microsoft Corp&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=8920977" width="1" height="1"&gt;</content><author><name>enterprisesearch</name><uri>http://blogs.msdn.com/members/enterprisesearch.aspx</uri></author><category term="ITPro" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/ITPro/default.aspx" /><category term="Perf &amp; Scale" scheme="http://blogs.msdn.com/enterprisesearch/archive/tags/Perf+_2600_+Scale/default.aspx" /></entry></feed>