<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Jamie's Junk : Association Rules</title><link>http://blogs.msdn.com/jamiemac/archive/tags/Association+Rules/default.aspx</link><description>Tags: Association Rules</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>How can we mine?  Let me count the ways...</title><link>http://blogs.msdn.com/jamiemac/archive/2007/11/19/how-can-we-mine-let-me-count-the-ways.aspx</link><pubDate>Tue, 20 Nov 2007 00:18:10 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:6410880</guid><dc:creator>JamieMac</dc:creator><slash:comments>1</slash:comments><comments>http://blogs.msdn.com/jamiemac/comments/6410880.aspx</comments><wfw:commentRss>http://blogs.msdn.com/jamiemac/commentrss.aspx?PostID=6410880</wfw:commentRss><description>&lt;p&gt;Recently I received some customer feedback that SQL Server Data Mining &amp;quot;doesn't have enough algorithms.&amp;quot;&amp;#160; More specifically, the comment was that we have the same capabilities are other Data Mining providers, we just &amp;quot;hide&amp;quot; many facilities as algorithm parameters rather than separating out each as a named algorithm.&amp;#160; So let's count the Microsoft algorithms a few different ways to work this out.&lt;/p&gt;  &lt;p&gt;First - let's go by the box.&amp;#160; This is the list of algorithms as specified in Books Online&lt;/p&gt;  &lt;ol&gt;   &lt;li&gt;Microsoft Decision Trees&lt;/li&gt;    &lt;li&gt;Microsoft Clustering&lt;/li&gt;    &lt;li&gt;Microsoft Naive Bayes&lt;/li&gt;    &lt;li&gt;Microsoft Association Rules&lt;/li&gt;    &lt;li&gt;Microsoft Neural Networks&lt;/li&gt;    &lt;li&gt;Microsoft Time Series&lt;/li&gt;    &lt;li&gt;Microsoft Sequence Clustering&lt;/li&gt;    &lt;li&gt;Microsoft Linear Regression&lt;/li&gt;    &lt;li&gt;Microsoft Logistic Regression&lt;/li&gt; &lt;/ol&gt;  &lt;p&gt;So that's nine - count 'em &lt;em&gt;nine&lt;/em&gt; algorithms.&amp;#160;&amp;#160; But that's just one way.&amp;#160; If you look at my book, Data Mining with SQL Server 2005 written with Zhaohui Tang, there are only &lt;em&gt;seven &lt;/em&gt;algorithms!&amp;#160; What?&amp;#160; You say!&amp;#160; How can it be?&lt;/p&gt;  &lt;p&gt;Let me explain.&amp;#160; During the development of SQL Server 2005, we realized a couple of tricks; 1) linear regression was the same as our tree algorithm,&amp;#160; just forced to not split; and 2) logistic regression was the same as our Neural Nets, just with zero hidden layers.&amp;#160; However, we got similar feedback - people want &lt;em&gt;more algorithms&lt;/em&gt;, and specifically these ones, so we set up two &amp;quot;new algorithms&amp;quot; by forcibly setting parameters on the Decision Tree and Neural Network algorithms and voila! we shipped with nine named algorithms.&amp;#160; It would have been hard to fill up two entire chapters explaining that last sentence, so Zhaohui and I decided just to stick to the seven core algorithms.&lt;/p&gt;  &lt;p&gt;Anyway, this posting isn't really about how to count &lt;em&gt;less&lt;/em&gt; algorithms, I really wanted to show you how to count &lt;em&gt;more.&amp;#160; &lt;/em&gt;When we set about designing SQL Server Data Mining, we really and truly tried to make data mining operations simpler.&amp;#160; We thought at the time, rightly or wrongly, that the more options end users have, the more complicated and difficult the product would be to use.&amp;#160; Therefore, we tried to determine the best behavior in a class, and make more advanced options available through parameters.&lt;/p&gt;  &lt;p&gt;For example, take our clustering algorithm.&amp;#160; We assumed that if people wanted clustering, most likely didn't care about the details of the algorithm, they just wanted to get the job done, and that those people who wanted more would look for it (the design principal - make the simple things simple, and the complex things possible).&amp;#160; So we bundled up different flavors of clustering into a single package that many vendors would have broken apart.&amp;#160; So let's start counting with clustering.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;1&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Our default clustering behavior is &lt;strong&gt;EM (Expectation Maximization) clustering&lt;/strong&gt; using the Bradley-Fayyad scalable framework&lt;/p&gt; &lt;strong&gt;&lt;font size="5"&gt;&lt;/font&gt;&lt;/strong&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;2&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Setting a parameter changes that to a &lt;strong&gt;K-Means clustering &lt;/strong&gt;implementation using the same framework&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;3+4&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Setting the same parameter another way, provides &lt;strong&gt;non-scalable&lt;/strong&gt; versions of the two clustering varieties.&amp;#160; (I know it's hard to swallow that the non-scalable versions count as separate algorithms, but if you &lt;em&gt;started&lt;/em&gt; with the vanilla versions and &lt;em&gt;added&lt;/em&gt; scalability, then &lt;em&gt;of course&lt;/em&gt; you would consider those versions as new algorithms - I'm just working backwards here.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;5&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Let's move to our Decision Tree algorithm and we will consider our classification tree as one algorithm.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;6&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;But our Decision Tree also predicts continuous and counts as a &lt;em&gt;regression&lt;/em&gt; tree, so we will count that as another algorithm.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;7&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Oops!&amp;#160; Our Decision Tree &lt;em&gt;also &lt;/em&gt;creates full linear regressions at each of the leaf nodes.&amp;#160; To get the typical regression tree behavior you need to make sure that none of the continuous inputs have the REGRESSOR flag and you get yet another algorithm.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;8&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Oh yeah, our trees allow for multiple targets in each model, allowing the discovery and display of interrelated patterns through our dependency net.&amp;#160; I've seen other vendors advertise such functionality as an &amp;quot;algorithm&amp;quot; so there's our #8.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;9&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;How about collaborative filtering with Trees - just slap a PREDICT flag on a nested table, and you have a complete recommendation system.&amp;#160; Let's call it Associative Trees&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;10&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Naive Bayes.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;11+12&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;If we're going to count Associative Trees, we also have &amp;quot;Associative Bayes&amp;quot;.&amp;#160; I guess the multiple target interrelated pattern thing counts here as well.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;13&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Association Rules.&amp;#160; A-priori style&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;14&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;It seems odd to count association rules twice since we can do predictions with it, but nobody else does it (or didn't before - correct me if I'm wrong), so Predictive Association Rules makes the cut.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;15+16+17+18&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Well if we're going to go and call predictive association an algorithm, we had better do the same for our clustering algorithm.&amp;#160; Granted, clustering doesn't make a great classifier or estimator, but the great Highlight Exceptions functionality of the Data Mining addins comes from this ability.&amp;#160; Yes, we can do nested table prediction as well with clustering, but I wouldn't recommend it to my mom, so I won't take another four for that.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;19+20+21+22+23&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;Neural Networks, Sequence Clustering, Time Series, Linear Regression and Logistic Regression.&amp;#160; Yeah, yeah, I could get into varieties here, but I think you get the point.&lt;/p&gt;  &lt;p&gt;&amp;#160;&lt;/p&gt;  &lt;p&gt;So by that count, and not being &lt;em&gt;too &lt;/em&gt;creative (trust me, I can do more) we're looking at &lt;font size="5"&gt;&lt;strong&gt;23 &lt;/strong&gt;&lt;/font&gt;algorithms in SQL Server 2005 Data Mining.&amp;#160; There are a few more options coming up in SQL Server 2008 that are worth discussing as well.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;24&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;The time series of SQL Server 2007 uses the ARTXP algorithm - &amp;quot;Auto Regression Trees with Cross Predict&amp;quot;.&amp;#160; In 2008, we're adding ARIMA as well, for algorithm #24.&lt;/p&gt;  &lt;p&gt;&lt;strong&gt;&lt;font size="5"&gt;25&lt;/font&gt;&lt;/strong&gt;&lt;/p&gt;  &lt;p&gt;And yet again with Time Series, the default mode of operation is to blend ARTXP and ARIMA results in an intelligent way to maximize accuracy and stability for #25.&lt;/p&gt;  &lt;p&gt;&amp;#160;&lt;/p&gt;  &lt;p&gt;Arbitrarily there are 23 algorithms in SQL 2005 and 25 in SQL 2008, with the option of teasing out even more varieties depending on how you apply parameters and flags to the base nine (or seven - depending on how you count!).&amp;#160;&amp;#160; Next time someone quips that SQL Server only has &amp;quot;nine&amp;quot; algorithms, tell them that's just the packaging - each of those nine provides a wealth of value in each box.&lt;/p&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=6410880" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Decision+Trees/default.aspx">Decision Trees</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Clustering/default.aspx">Clustering</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Association+Rules/default.aspx">Association Rules</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Time+Series/default.aspx">Time Series</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Algorithms/default.aspx">Algorithms</category></item><item><title>Social Network Analysis with SQL Server 2005 Data Mining</title><link>http://blogs.msdn.com/jamiemac/archive/2007/02/26/social-network-analysis-with-sql-server-2005-data-mining.aspx</link><pubDate>Tue, 27 Feb 2007 02:34:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:1766258</guid><dc:creator>JamieMac</dc:creator><slash:comments>0</slash:comments><comments>http://blogs.msdn.com/jamiemac/comments/1766258.aspx</comments><wfw:commentRss>http://blogs.msdn.com/jamiemac/commentrss.aspx?PostID=1766258</wfw:commentRss><description>&lt;P&gt;A few years ago &lt;A class="" href="http://blogs.msdn.com/johnchancock/default.aspx" mce_href="http://blogs.msdn.com/johnchancock/default.aspx"&gt;John Hancock&lt;/A&gt; spent some time with our team and did some interesting work around using the Association Rules algorithm and some custom Analysis Services stored procedures to implement link analysis (a.k.a. social network analysis).&amp;nbsp; His implementaion was extremely novel, and the product team (self included) was duly impressed by what John was able to eek out of our system.&lt;/P&gt;
&lt;P&gt;Well, as he puts it himself he "&lt;A class="" href="http://blogs.msdn.com/johnchancock/archive/2007/02/25/social-relationship-analysis-with-data-mining.aspx" mce_href="http://blogs.msdn.com/johnchancock/archive/2007/02/25/social-relationship-analysis-with-data-mining.aspx "&gt;finally got around to posting&lt;/A&gt;" all the source, etc. for this work for the enlightenment of everyone!&lt;/P&gt;
&lt;P&gt;I'm happy to hear of (and promote!) any other slick projects using SSDM that you want to share!&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=1766258" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Code/default.aspx">Code</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Association+Rules/default.aspx">Association Rules</category></item><item><title>Predicting based on rules alone and getting everything you always wanted</title><link>http://blogs.msdn.com/jamiemac/archive/2006/12/05/predicting-based-on-rules-alone-and-getting-everything-you-always-wanted.aspx</link><pubDate>Tue, 05 Dec 2006 10:53:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:1209953</guid><dc:creator>JamieMac</dc:creator><slash:comments>1</slash:comments><comments>http://blogs.msdn.com/jamiemac/comments/1209953.aspx</comments><wfw:commentRss>http://blogs.msdn.com/jamiemac/commentrss.aspx?PostID=1209953</wfw:commentRss><description>&lt;P&gt;In this &lt;A class="" href="http://blogs.msdn.com/jamiemac/archive/2006/04/19/579409.aspx" mce_href="http://blogs.msdn.com/jamiemac/archive/2006/04/19/579409.aspx"&gt;post&lt;/A&gt;&amp;nbsp;I explained how to couch an associative prediction that omitted predictions that were based solely on marginal statistics.&amp;nbsp; The one catch was that if you asked for 5 items, you might not get 5, you could get 4 or 3 or even down to 0, since all predictions without a corresponding prediction node id were filtered out.&lt;/P&gt;
&lt;P&gt;However, by leveraging the TopCount function, you can have your bread and butter it too (it's late).&amp;nbsp; Use syntax like the following:&lt;/P&gt;
&lt;P&gt;SELECT TopCount(&lt;BR&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (SELECT Product, $Probability FROM &lt;BR&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; PredictAssociation(Products, INCLUDE_STATISTICS, INCLUDE_NODE_ID)&lt;BR&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; WHERE $NodeID &amp;lt;&amp;gt; ''),&lt;BR&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; $PredictProbability,&lt;BR&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 5) AS PredictedProducts&lt;BR&gt;FROM MyAssociationModel&lt;BR&gt;NATURAL PREDICTION JOIN&lt;BR&gt;.....&lt;/P&gt;
&lt;P mce_keep="true"&gt;&amp;nbsp;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=1209953" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/jamiemac/archive/tags/DMX/default.aspx">DMX</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Association+Rules/default.aspx">Association Rules</category></item><item><title>Predict based on rules alone</title><link>http://blogs.msdn.com/jamiemac/archive/2006/04/19/predict-based-on-rules-alone.aspx</link><pubDate>Wed, 19 Apr 2006 23:18:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:579409</guid><dc:creator>JamieMac</dc:creator><slash:comments>1</slash:comments><comments>http://blogs.msdn.com/jamiemac/comments/579409.aspx</comments><wfw:commentRss>http://blogs.msdn.com/jamiemac/commentrss.aspx?PostID=579409</wfw:commentRss><description>&lt;P&gt;&lt;FONT face=Verdana size=2&gt;I've been asked this question a few times and never thought of an answer until I was asked again today on the data mining newsgroups.&amp;nbsp; We'll be posting a tip or trick on sqlserverdatamining.com&amp;nbsp;on this, but since it's fresh in my mind I'll let everyone have it now.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face=Verdana size=2&gt;The problem that some people are having is the behavior of associative prediction returning popular results if the algorithm doesn't actually predict anything.&amp;nbsp; Many seem to want only results returned by predictions (we thought that if you asked for 5 you wanted 5 - who knew?).&amp;nbsp; In any case you can check the node id to see if the prediction was based on a rule - if there's no node id, no rule - you can drop it.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face=Verdana size=2&gt;A sample query using this technique looks like this:&lt;/FONT&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt; mso-layout-grid-align: none"&gt;&lt;SPAN style="FONT-SIZE: 10pt; COLOR: blue; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;SELECT&lt;?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt; mso-layout-grid-align: none"&gt;&lt;SPAN style="FONT-SIZE: 10pt; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;(&lt;SPAN style="COLOR: blue"&gt;SELECT&lt;/SPAN&gt; * &lt;SPAN style="COLOR: blue"&gt;FROM&lt;/SPAN&gt; &lt;SPAN style="COLOR: maroon"&gt;PredictAssociation&lt;/SPAN&gt;([MovieModel].[Movies],5, &lt;BR&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN style="COLOR: blue"&gt;INCLUDE_NODE_ID&lt;/SPAN&gt;) &lt;BR&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN style="COLOR: blue"&gt;WHERE&lt;/SPAN&gt; &lt;SPAN style="COLOR: blue"&gt;$NODEID&lt;/SPAN&gt;&amp;lt;&amp;gt;&lt;SPAN style="COLOR: maroon"&gt;''&lt;/SPAN&gt;)&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt; mso-layout-grid-align: none"&gt;&lt;SPAN style="FONT-SIZE: 10pt; COLOR: blue; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;From&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt; mso-layout-grid-align: none"&gt;&lt;SPAN style="FONT-SIZE: 10pt; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;[MovieModel]&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt; mso-layout-grid-align: none"&gt;&lt;SPAN style="FONT-SIZE: 10pt; COLOR: blue; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;NATURAL&lt;/SPAN&gt;&lt;SPAN style="FONT-SIZE: 10pt; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt; &lt;SPAN style="COLOR: blue"&gt;PREDICTION&lt;/SPAN&gt; &lt;SPAN style="COLOR: blue"&gt;JOIN&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt; mso-layout-grid-align: none"&gt;&lt;SPAN style="FONT-SIZE: 10pt; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;(&lt;SPAN style="COLOR: blue"&gt;SELECT&lt;/SPAN&gt; (&lt;SPAN style="COLOR: blue"&gt;SELECT&lt;/SPAN&gt; &lt;SPAN style="COLOR: maroon"&gt;'What''s New, Pussycat'&lt;/SPAN&gt; &lt;SPAN style="COLOR: blue"&gt;AS&lt;/SPAN&gt; [Movie]&lt;o:p&gt;&lt;/o:p&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class=MsoNormal style="MARGIN: 0in 0in 0pt"&gt;&lt;SPAN style="FONT-SIZE: 10pt; FONT-FAMILY: 'Courier New'; mso-no-proof: yes"&gt;&lt;SPAN style="mso-spacerun: yes"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN style="COLOR: blue"&gt;UNION&lt;/SPAN&gt; &lt;SPAN style="COLOR: blue"&gt;SELECT&lt;/SPAN&gt; &lt;SPAN style="COLOR: maroon"&gt;'28 Days'&lt;/SPAN&gt; &lt;SPAN style="COLOR: blue"&gt;AS&lt;/SPAN&gt; [Movie]) &lt;SPAN style="COLOR: blue"&gt;AS&lt;/SPAN&gt; [Movies]) &lt;SPAN style="COLOR: blue"&gt;AS&lt;/SPAN&gt; t&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face=Verdana size=2&gt;Another interesting flag to try out for PredictAssociation is INCLUDE_STATISTICS, so you can determine the probability and support for each suggestion.&lt;/FONT&gt;&lt;/P&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=579409" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/jamiemac/archive/tags/DMX/default.aspx">DMX</category><category domain="http://blogs.msdn.com/jamiemac/archive/tags/Association+Rules/default.aspx">Association Rules</category></item></channel></rss>