<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>eScience @ Microsoft : Graywulf</title><link>http://blogs.msdn.com/escience/archive/tags/Graywulf/default.aspx</link><description>Tags: Graywulf</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>Graywulf Takes Byte Out of Data Overload</title><link>http://blogs.msdn.com/escience/archive/2009/05/08/graywulf-takes-byte-out-of-data-overload.aspx</link><pubDate>Fri, 08 May 2009 20:46:17 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9597387</guid><dc:creator>eScience</dc:creator><slash:comments>0</slash:comments><comments>http://blogs.msdn.com/escience/comments/9597387.aspx</comments><wfw:commentRss>http://blogs.msdn.com/escience/commentrss.aspx?PostID=9597387</wfw:commentRss><description>&lt;p&gt;&lt;a href="http://blogs.msdn.com/blogfiles/dan_fay/WindowsLiveWriter/GraywulfTakesByteOutofDataOverload_F3E8/jimgray_2.gif"&gt;&lt;img style="border-right-width: 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; margin-left: 0px; border-left-width: 0px; margin-right: 0px" title="jimgray" border="0" alt="jimgray" align="right" src="http://blogs.msdn.com/blogfiles/dan_fay/WindowsLiveWriter/GraywulfTakesByteOutofDataOverload_F3E8/jimgray_thumb.gif" width="147" height="244" /&gt;&lt;/a&gt;Graywulf is the natural evolution of &lt;a href="http://en.wikipedia.org/wiki/Beowulf_cluster" target="_blank"&gt;Beowulf Clusters&lt;/a&gt; – it brings together HPC clusters and databases to do &lt;a href="http://blogs.msdn.com/blogfiles/dan_fay/WindowsLiveWriter/GraywulfTakesByteOutofDataOverload_F3E8/graywulf-full-color_2.jpg"&gt;&lt;img style="border-right-width: 0px; margin: 5px 5px 5px 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="graywulf-full-color" border="0" alt="graywulf-full-color" align="left" src="http://blogs.msdn.com/blogfiles/dan_fay/WindowsLiveWriter/GraywulfTakesByteOutofDataOverload_F3E8/graywulf-full-color_thumb.jpg" width="244" height="148" /&gt;&lt;/a&gt;efficient processing and data management.&amp;#160; It’s name and design also pays homage to &lt;a href="http://research.microsoft.com/en-us/um/people/gray/" target="_blank"&gt;Jim Gray&lt;/a&gt; – who helped&amp;#160; champion the use of relational databases in the scientific projects.&lt;/p&gt;  &lt;p&gt;At it’s simplest form Graywulf is having a database installed on each of the HPC compute nodes – this brings the data to the computation – one of the points Jim made quite often and utilizes the power of databases (queries, stored procedures, etc).&amp;#160; Since it’s a generic architecture Graywulf clusters can be built using any OS and any database…the ones in the case study below implemented them using &lt;a href="http://www.microsoft.com/hpc"&gt;Windows HPC Server&lt;/a&gt; and &lt;a href="http://www.microsoft.com/sql"&gt;SQL Server&lt;/a&gt; and the motivation was to be more efficient in doing the science – it’s always great to have innovative folks using technologies to do good work.&amp;#160; &lt;/p&gt;  &lt;blockquote&gt;   &lt;p&gt;“To put it simply, a scientist needs to be able to live within the data,” says Alexander Szalay, a cosmologist-turned-computer-scientist at The Johns Hopkins University (JHU) in Baltimore, Maryland. The power of information, Szalay says, is determined not by its quantity so much as how easy it is to access, manipulate and analyze.     &lt;br /&gt;“It’s not just about doing the numerical calculations,” adds Andrew Simms, a biomedical health informatics graduate student working on protein structure analysis in Valerie Daggett’s bioengineering laboratory at the University of Washington (UW) in Seattle. “It’s also about assembling the data so we can run calculations while performing analyses and ad hoc explorations and then feed it all back into the data warehouse.”&lt;/p&gt; &lt;/blockquote&gt;  &lt;blockquote&gt;   &lt;h4&gt;&lt;a title="Graywulf Takes Byte Out of Data Overload" href="http://research.microsoft.com/en-us/collaboration/focus/e3/graywulf.aspx"&gt;Graywulf Takes Byte Out of Data Overload&lt;/a&gt;&lt;/h4&gt;    &lt;p&gt;&lt;img style="margin: 0px 0px 0px 5px; display: inline" title="Graywulf takes byte out of data overload" alt="Graywulf takes byte out of data overload" align="right" src="http://research.microsoft.com/en-us/collaboration/focus/e3/graywulf1.jpg" /&gt;Astronomers at The Johns Hopkins University and protein scientists at the University of Washington are using inexpensive computer hardware combined with powerful computing and database software to help manage and analyze a growing volume of scientific data. &lt;/p&gt;    &lt;p&gt;For details, read the &lt;a href="http://research.microsoft.com/en-us/collaboration/focus/e3/graywulf.pdf"&gt;Graywulf case study&lt;/a&gt;. &lt;/p&gt;    &lt;h5&gt;Project Principals&lt;/h5&gt;    &lt;ul&gt;     &lt;li&gt;&lt;a href="http://physics-astronomy.jhu.edu/people/faculty/szalay.html"&gt;Alexander Szalay&lt;/a&gt;, Alumni Centennial Professor, Department of Physics and Astronomy, The Johns Hopkins University &lt;/li&gt;      &lt;li&gt;&lt;a href="http://depts.washington.edu/daglab/valerie.html"&gt;Valerie Daggett&lt;/a&gt;, Professor of Bioengineering, University of Washington &lt;/li&gt;   &lt;/ul&gt; &lt;/blockquote&gt;  &lt;p&gt;&lt;a href="http://research.microsoft.com/en-us/collaboration/focus/e3/graywulf.aspx"&gt;Graywulf Takes Byte Out of Data Overload - Microsoft Research&lt;/a&gt;&lt;/p&gt;
Cross Posted from Dan Fay's Blog (http://blogs.msdn.com/dan_fay)&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=9597387" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/escience/archive/tags/Research/default.aspx">Research</category><category domain="http://blogs.msdn.com/escience/archive/tags/SQL+Server/default.aspx">SQL Server</category><category domain="http://blogs.msdn.com/escience/archive/tags/WinHPC/default.aspx">WinHPC</category><category domain="http://blogs.msdn.com/escience/archive/tags/Science/default.aspx">Science</category><category domain="http://blogs.msdn.com/escience/archive/tags/Data+Analysis/default.aspx">Data Analysis</category><category domain="http://blogs.msdn.com/escience/archive/tags/Graywulf/default.aspx">Graywulf</category></item></channel></rss>