<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Thinking Big – Search Scale and Performance on a Budget</title><link>http://blogs.msdn.com/b/enterprisesearch/archive/2009/05/22/thinking-big-search-scale-and-performance-on-a-budget.aspx</link><description>I recently came across Paul Nelson’s informative post on search scalability. I don’t know how long it’s been up there, but reading it made me think of customers I’ve spoken with recently who are looking to scale up their search deployments, but, due</description><dc:language>en-US</dc:language><generator>Telligent Evolution Platform Developer Build (Build: 5.6.50428.7875)</generator><item><title>SharePoint Kaffeetasse 116</title><link>http://blogs.msdn.com/b/enterprisesearch/archive/2009/05/22/thinking-big-search-scale-and-performance-on-a-budget.aspx#9640162</link><pubDate>Mon, 25 May 2009 11:52:28 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9640162</guid><dc:creator>Michael Greth MVP SharePoint Blog</dc:creator><description>&lt;p&gt;Webcast TechNet Webcast: SharePoint and PowerShell (Level 300), 4 June 2009, 15:30-15:50. For Office SharePoint&lt;/p&gt;
&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9640162" width="1" height="1"&gt;</description></item><item><title>re: Thinking Big – Search Scale and Performance on a Budget</title><link>http://blogs.msdn.com/b/enterprisesearch/archive/2009/05/22/thinking-big-search-scale-and-performance-on-a-budget.aspx#9639268</link><pubDate>Sun, 24 May 2009 22:06:47 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9639268</guid><dc:creator>Paul</dc:creator><description>&lt;P&gt;Hey Nate:&lt;/P&gt;
&lt;P&gt;Excellent points.&lt;/P&gt;
&lt;P&gt;Some additional comments for readers (I know you know all of this already):&lt;/P&gt;
&lt;P&gt;1. &amp;nbsp;Very large systems need to be more concerned about second-order effects overwhelming the system. These are things like hardware failures, configuration complexity, network traffic, etc.&lt;/P&gt;
&lt;P&gt;This is why I often like splitting large systems into multiple smaller, independent systems when things get big. For example, two 5-row FAST installations may be, overall, more reliable than one large 10-row system for query scalability.&lt;/P&gt;
&lt;P&gt;2. &amp;nbsp;Just to emphasize: indexing is very expensive, and a high-bandwidth indexing application (i.e. very frequent updates) will use up a lot of resources.&lt;/P&gt;
&lt;P&gt;So, if possible, make some servers "static" - i.e. no new documents (deletes are okay; they execute very quickly). This will free up large amounts of resources for query and will allow you to pack more documents onto each node, increasing document count scalability. Of course, doing this is tricky and requires some careful architectural analysis.&lt;/P&gt;
&lt;P&gt;Going along with this, indexing becomes more efficient as the number of documents in a batch grows. For example, if indexing 10 documents requires X, then indexing 1000 documents may only require X*2. The larger the batch, the more efficient. This further emphasizes the value of centralizing indexing onto a small number of nodes - if possible.&lt;/P&gt;
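&lt;P&gt;As a rough sketch of the batching arithmetic above: the short Python snippet below compares indexing 1000 documents in batches of 10 against one batch of 1000. The cost figures are only the hypothetical ones from the example (10 documents cost X, 1000 documents cost 2X), and the names BATCH_COST, cost_per_document and total_cost are made up for illustration; none of this is measured from a real system.&lt;/P&gt;

```python
# Illustrative arithmetic only: the cost figures are the hypothetical
# ones from the example above (10 docs cost X, 1000 docs cost 2X),
# not measurements from any real search deployment.

X = 1.0  # arbitrary cost unit for indexing one 10-document batch

# Hypothetical batch costs, keyed by batch size.
BATCH_COST = {10: 1 * X, 1000: 2 * X}


def cost_per_document(batch_size):
    """Amortized cost of one document when indexed in a batch of this size."""
    return BATCH_COST[batch_size] / batch_size


def total_cost(num_documents, batch_size):
    """Cost of indexing num_documents using batches of batch_size."""
    batches = -(-num_documents // batch_size)  # ceiling division
    return batches * BATCH_COST[batch_size]


small = total_cost(1000, 10)    # 100 batches at X each
large = total_cost(1000, 1000)  # 1 batch at 2X
print(f"per-document cost, batch of 10:   {cost_per_document(10):.3f} X")
print(f"per-document cost, batch of 1000: {cost_per_document(1000):.3f} X")
print(f"small batches cost {small / large:.0f} times as much in total")
```

&lt;P&gt;Under those assumed figures, the small batches come to 100X in total against 2X for the single large batch, which is the gap that makes centralized, batched indexing attractive.&lt;/P&gt;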
&lt;P&gt;Of course, a lot of these ideas are non-standard configurations. :-)&lt;/P&gt;
&lt;P&gt;Also, they have other issues - such as interfering with the default TF/IDF relevancy ranking formula and increasing configuration complexity...&lt;/P&gt;
&lt;P&gt;Which is all to say: don't bother unless you're doing something really big. Otherwise, just buy an extra server or two and don't worry about it.&lt;/P&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9639268" width="1" height="1"&gt;</description></item><item><title>Thinking Big – Search Scale and Performance on a Budget | Microsoft Share Point</title><link>http://blogs.msdn.com/b/enterprisesearch/archive/2009/05/22/thinking-big-search-scale-and-performance-on-a-budget.aspx#9635879</link><pubDate>Fri, 22 May 2009 23:15:48 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9635879</guid><dc:creator>Thinking Big – Search Scale and Performance on a Budget | Microsoft Share Point</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://microsoft-sharepoint.simplynetdev.com/thinking-big-%e2%80%93-search-scale-and-performance-on-a-budget/"&gt;http://microsoft-sharepoint.simplynetdev.com/thinking-big-%e2%80%93-search-scale-and-performance-on-a-budget/&lt;/a&gt;&lt;/p&gt;
&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9635879" width="1" height="1"&gt;</description></item></channel></rss>