<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Data Quality Services (DQS)</title><link>http://blogs.msdn.com/b/dqs/</link><description>This blog provides information, news, tips, and announcements about the SQL Server Data Quality Services (DQS) feature introduced in SQL Server 2012.</description><dc:language>en-US</dc:language><generator>Telligent Evolution Platform Developer Build (Build: 5.6.50428.7875)</generator><item><title>Matching related and duplicate Customer Records using SQL Server Data Quality Services (DQS)</title><link>http://blogs.msdn.com/b/dqs/archive/2013/06/17/matching-related-and-duplicate-customer-records-using-sql-server-data-quality-services-dqs.aspx</link><pubDate>Mon, 17 Jun 2013 17:11:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10426465</guid><dc:creator>Welly.Lee</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10426465</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2013/06/17/matching-related-and-duplicate-customer-records-using-sql-server-data-quality-services-dqs.aspx#comments</comments><description>&lt;p&gt;SQL Server Data Quality Services (DQS) enables you to match records using your matching rules defined in DQS Knowledge Base. There are two common use cases where you would like to match related records. One is to identify duplicate in your records, such as same customers appear as separate records in your dataset. Another common case is to identify related customers. For example, you would like to identify two separate customer records&amp;nbsp; that belong to the same parent company.&amp;nbsp; In this blog article, I am going to show how to create a simple matching rules and use them to match duplicate&amp;nbsp;and related records.&lt;/p&gt;
&lt;p&gt;For this article, I am using the same data set and the knowledge base created in the previous blog article: &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/06/14/cleansing-customer-data-using-data-quality-services-dqs.aspx"&gt;How to cleanse Customer Data using SQL Server Data Quality Services&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The following are the high level steps:&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;A. Add matching policy to DQS Knowledge Base&lt;/strong&gt;&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;B. Create and execute a DQS matching project&lt;/strong&gt;&lt;/p&gt;
&lt;h3&gt;&amp;nbsp;&lt;strong&gt;A. Add matching policy to DQS Knowledge Base&lt;/strong&gt;&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;Start Data Quality Client. For information about this step, refer to &lt;a href="http://msdn.microsoft.com/en-us/library/hh213023.aspx"&gt;Run the Data Quality Client Application&lt;/a&gt;.&amp;nbsp;&lt;/li&gt;
&lt;li&gt;In the Data Quality Client home screen, under Knowledge Base Management:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Click Open Knowledge Base&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;: this article builds on the an existing Knowledge Base created from the &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/06/14/cleansing-customer-data-using-data-quality-services-dqs.aspx"&gt;previous article&lt;/a&gt; to illustrate how domain definition for cleansing helps matching. However, it is not required that you create matching policy from an existing Knowledge Base. You can start by creating a new Knowledge Base.&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;Select MyCustomerKB then select Matching Policy under Select Activity then click next&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note:&lt;/em&gt; If you accidently click open without changing activity to matching policy, you will open the domain management screen. Unfortunately, there is no way to navigate to matching policy screen from domain management screen, so you have to first publish the Knowledge Base and start over step 2 to go to Matching Policy screen.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/8816.Matching_5F00_A2.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/8816.Matching_5F00_A2.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="3"&gt;
&lt;li&gt;In the Map screen:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Connect to a data source to test your matching policy.&amp;nbsp; Select the columns in your data set to the domains in your Knowledge Base by selecting from the drop down. Click Next to finish mapping.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/3583.Matching_5F00_A3a.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/3583.Matching_5F00_A3a.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="3"&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;You only need to map the columns needed for matching purposes. In my example, I only select CustomerName, City and Province and&amp;nbsp;I left out CustomerID (unique row identifier) and last update columns as I do not need those columns for&amp;nbsp;matching&lt;/li&gt;
&lt;li&gt;If you start with a new Knowledge Base or your Knowledge Base does not have the domain needed for your matching, you can click the Create Domain icon (&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/7853.CreateDomainIcon.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/50x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/7853.CreateDomainIcon.png" alt="" width="17" height="16" border="0" /&gt;&lt;/a&gt;) to create the domain directly from this screen.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;In the Matching Policy Screen:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Create rule for matching by Customer Name&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Click icon to Create a matching rule (&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/4456.CreateAMatchinRule.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/25x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/4456.CreateAMatchinRule.png" alt="" width="22" height="17" border="0" /&gt;&lt;/a&gt;) and give a name (e.g. Match By Customer Name). In this example, we would like to identify related customer by assuming that customer which has similar/same name belong to the same organization (This is a simplified example to illustrate the capability, in reality, you would use many more domains to achieve the goal-- more about using multiple domains for matching rules later).&lt;/li&gt;
&lt;li&gt;Click Add a new domain element icon to add domain for the matching policy. Specify weight = 100% (more about this later).&lt;/li&gt;
&lt;li&gt;Click Start button to test your rule.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 120px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/1460.Matching_5F00_A4.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/1460.Matching_5F00_A4.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;When performing matching, DQS automatically applies all domain definition in the Knowledge Base. As we are using an existing Knowledge Base which includes term definition for "co"&amp;nbsp;as the same as&amp;nbsp;"Company", DQS uses it to match "Consolidate Co Ltd" and "Consolidation Company Ltd". If you start with a new Knowledge Base or do not have the term based relation defined for CustomerName matching, then the 2 records will not be matched.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;li&gt;Create rule for matching by City and State&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Click the icon to Create a matching rule again to create a second rule and give a name (e.g. Match by City State). In this example, we would like to identify duplicate customer recording assuming that a customer has only 1 store for a given city in the same state (It is not a realistic example, but you can imagine extending this to an address where if two businesses records&amp;nbsp;have the same address, then the records could be a duplicate).&lt;/li&gt;
&lt;li&gt;Click Add a new domain element icon to add domain to the matching policy. Click down arrow next to the domain name and from the drop down, select City domain.&amp;nbsp;&lt;/li&gt;
&lt;li&gt;Click Add a new domain element icon again to add the second domain, and select State from the drop down. When you have more than 1 domain in your rule, you will need to distribute weight across the domains. DQS will calculate matching score for each domain then use the weight to calculate overall matching score for the rule.&amp;nbsp; For our example, you can distribute equal weight (50-50) across the 2 domains. Click Start button to test your rule.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;As the case of the CustomerName, DQS use the domain value definition in the city and state for matching purposes. As "LA" was defined as synonym for "Los Angeles", DQS match the 2 records as 100% match&lt;/li&gt;
&lt;li&gt;In the rule editor, you can also specify how the matching should be performed: similar (which we have been using in our example) or exact.&amp;nbsp; The exact matching requires the value to be identical. This is typically used when you want to match things such as BirthDate or Identifier (e.g. SSN, CustomerID, etc.). If you have domains that can use exact matching, it is recommended that you mark it as prerequisite to improve performance. In order to match records from a dataset, DQS compare a record with all others, specifying prerequisite, effectively partition your dataset base on the value of the column.&amp;nbsp;This may significantly improve performance.&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;Click Start to test your rule.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 120px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/5481.Matching_5F00_A5.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/5481.Matching_5F00_A5.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;Click Next to continue.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;li&gt;In the matching results screen:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Click Start to test. While in the previous step, you test against specific matching rule, in this step, you can see how matching results from ALL of the rules. All the records that are matched together have the same Cluster value.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/0181.Matching_5F00_A6.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/0181.Matching_5F00_A6.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;The following is what you could interpret from&amp;nbsp;the results:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;The first cluster shows the two related companies which have similar name (87% matching score). The name is similar enough that the difference in name may be due to human error during data entry.&lt;/li&gt;
&lt;li&gt;The second cluster shows the two companies that have the same address although they are entered as different names (one name in Chinese and another in English) -- the two customer records appears to be duplicate.&lt;/li&gt;
&lt;li&gt;The last cluster match records based on the 2 rules ("Match By Customer Name" and "Match By City State" -- the Rule column provides different color for different rule, you can click the Matching Rules tab at the bottom for the rule color legend). "Big Cheese, The" and "THE BIG CHEESE" appears to be related companies in a different city. The second one "To be filled later" appears to be a placeholder as the name may not available when first entered into the system. Later, a different person may enter the record as separate new record rather than updating an existing one and resulting in a duplicate.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;ol start="6"&gt;
&lt;li&gt;Click Finish to publish your Knowledge Base.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;h3&gt;&lt;strong&gt;B. Create and execute a DQS matching project&lt;/strong&gt;&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;In the Data Quality Client home screen, under Data Quality Projects, click New Data Quality Project.&lt;/li&gt;
&lt;li&gt;Name your new Project (e.g. MyCustomer Matching Project), select Knowledge Base created in the previous step (e.g. MyCustomerKB) and select Matching Activity&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;: Make sure to select the matching activity, once you click Next, you can't go back and change the activity, so you have to create a NEW project and start over.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/4705.Matching_5F00_B2.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/4705.Matching_5F00_B2.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="3"&gt;
&lt;li&gt;In the Map screen:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Select data source, database, and table for your data (e.g. MyCustomers table and data generated from the script at the beginning of this article).&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;: in this example, we are using the same data set when we create the knowledge base, but it is not required. In more realistic example, you build the rule with a representative sample of your data set, then use the full data set for the project. You can reuse the same Knowledge Base for different data set by creating a different project.&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;Select columns to be matched and map to the domains in your knowledge base (you must map all domains used by all the rules to continue).&lt;/li&gt;
&lt;li&gt;You should have your data source columns mapped as below. Click Next to continue.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/7382.Matching_5F00_B3.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/7382.Matching_5F00_B3.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;In the Matching screen:&lt;/li&gt;
&lt;ol&gt;
&lt;li&gt;Click start to begin the matching process&lt;/li&gt;
&lt;li&gt;Review the results and click next to continue.&lt;/li&gt;
&lt;/ol&gt;&lt;/ol&gt;
&lt;p style="padding-left: 60px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/5618.Matching_5F00_B4.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/5618.Matching_5F00_B4.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;li&gt;In the Export screen&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Select destination (e.g. SQL Server)&lt;/li&gt;
&lt;li&gt;Select database name&lt;/li&gt;
&lt;li&gt;Click the checkbox next to the Matching Results and specify table name&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 60px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/4848.Matching_5F00_B5.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/4848.Matching_5F00_B5.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;ul&gt;
&lt;li&gt;Click Export to continue&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;&lt;em&gt;Note&lt;/em&gt;:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Optionally, you can also export survivorship results, for DQS to export only one record from the same cluster. You can specify how the record should be picked from the following option:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Pivot record : the first record in the cluster&lt;/li&gt;
&lt;li&gt;Most complete and longest record&lt;/li&gt;
&lt;li&gt;Most complete&lt;/li&gt;
&lt;li&gt;Longest record&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;Unfortunately, DQS does not support the option to select record based on value in a field (such as updated date or the originating system). For scenario such as picking the most recent record or record from a trusted system, you have to export the entire matching results then implement your own logic for survivorship.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;ol start="6"&gt;
&lt;li&gt;Review the results&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Open SQL Server Management Studio and query the export table&lt;/li&gt;
&lt;li&gt;Review the result. Use the CLUSTER_ID column to identify cluster of matched records.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/3771.Matching_5F00_B6.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSMatching/3771.Matching_5F00_B6.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;In this blog article, I discussed how to create a Knowledge Base to define matching policy. You can then refer to the Knowledge Base to create Data Quality Projects to match your customer data. In this article, I use a simple example to walk through the most common functionalities. For more detailed explanation of DQS Matching, refer to the blog article wrote by my colleague, Gadi Peleg:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://blogs.msdn.com/b/dqs/archive/2011/09/14/introduction-to-data-quality-services-data-matching.aspx"&gt;Introduction to Data Quality Services Data Matching&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="http://blogs.msdn.com/b/dqs/archive/2011/11/02/matching-policy-a-closer-look-into-data-quality-services-data-matching.aspx"&gt;Matching Policy - A closer look into Data Quality Services Data Matching&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;It is also&amp;nbsp;recommended that you&amp;nbsp;run a separate cleansing project on your data set before you run matching project. Although DQS matching process uses the same domain definition for the cleansing, it does not make the correction nor does it flag invalid values. In addition, DQS matching only use the internal domain definition and does not automatically send your data to &lt;a href="https://datamarket.azure.com/browse/data?category=dqs"&gt;external third party providers&lt;/a&gt;. You must explicitly run a cleansing project to send the data externally to the Reference Data Service providers. However, the Service Provider may include additional information about the data you send. For example, &lt;a href="https://datamarket.azure.com/dataset/dnb/dqscompanymatch#images"&gt;Dun &amp;amp; Bradstreet&lt;/a&gt;&amp;nbsp;includes DunsNumber in the output. This may be another useful information to use as part of your matching rule. Refer to my previous article on &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/06/11/cleansing-your-customer-data-using-dun-amp-bradstreet-dqs-service.aspx"&gt;Cleansing your Customer data using Dun &amp;amp; Bradstreet DQS Service&lt;/a&gt; for more information on how to use DQS to cleanse data using Reference Data Service. &amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10426465" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality/">data quality</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/sql+server/">sql server</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+service/">data quality service</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/knowledge+base/">knowledge base</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/known+issues/">known issues</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQLServer/">SQLServer</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQL+Server+2012/">SQL Server 2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/2012/">2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/matching/">matching</category></item><item><title>Cleansing Customer Data Using Data Quality Services (DQS)</title><link>http://blogs.msdn.com/b/dqs/archive/2013/06/14/cleansing-customer-data-using-data-quality-services-dqs.aspx</link><pubDate>Sat, 15 Jun 2013 02:08:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10426090</guid><dc:creator>Welly.Lee</dc:creator><slash:comments>3</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10426090</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2013/06/14/cleansing-customer-data-using-data-quality-services-dqs.aspx#comments</comments><description>&lt;p&gt;SQL Server 2012 Data Quality Services (DQS) enables you to cleanse data using your own Knowledge Base. In this blog article, I am going to show how to create a simple Knowledge Base which you can use to cleanse your customer data.&lt;/p&gt;
&lt;p&gt;The steps for using DQS to cleanse data is as follows:&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;A. Create DQS Knowledge Base&lt;/strong&gt;&lt;br /&gt;&lt;strong&gt;B. Create a DQS project to cleanse your data using the Knowledge Base&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;For this example, I am using the following sample data - you can run the following script on your SQL Server database&amp;nbsp;:&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&amp;nbsp;CREATE TABLE MyCustomers &lt;br /&gt;&amp;nbsp; (&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CustomerID INT,&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CustomerName NVARCHAR(255),&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; City NVARCHAR(32),&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Province NVARCHAR(32),&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; LastUpdate DATETIME&lt;br /&gt;&amp;nbsp; )&lt;br /&gt;&amp;nbsp;&amp;nbsp; &lt;br /&gt;&amp;nbsp;INSERT INTO MyCustomers&lt;br /&gt;&amp;nbsp; VALUES&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (1, 'Consolidate Co Ltd', 'Miami', 'FL','2013-01-01'), &lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (2, 'Consolidation Company Ltd', 'New York', 'NY','2013-01-01'), &lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (3, N'什锦的件', 'LA', 'CA','2013-01-01'), &lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (4, 'Chop-suey Chinese', 'Los Angeles', 'CA', '2013-03-03'),&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (5, 'Big Cheese, The', 'Redmond', 'WA', '2013-02-02'),&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (6, 'THE BIG CHEESE', 'Chicago', 'Il','2013-02-02'),&lt;br /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (7, 'To Be Filled Later', 'Redmond', 'Wash.', '2013-01-01')&lt;/p&gt;
&lt;p style="padding-left: 60px;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;h3&gt;A. Create DQS Knowledge Base&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;Start Data Quality Client. For information about this step, refer to Run the Data Quality Client Application.&lt;/li&gt;
&lt;li&gt;In the Data Quality Client home screen, under Knowledge Base Management, click New Knowledge Base.&lt;/li&gt;
&lt;li&gt;Name your new Knowledge Base (e.g. MyCustomerKB), make sure Domain Management activity is selected and click Next&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8765.DQCleansing_5F00_A3.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8765.DQCleansing_5F00_A3.png" alt="" width="420" height="256" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;Click Create Domain Icon.&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6787.DQCleansing_5F00_A4.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6787.DQCleansing_5F00_A4.png" alt="" width="207" height="71" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;li&gt;From the pop-up window, name your domain (e.g. CustomerName). As an option, you can select to change the format of the output to Capitalize.&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/1325.DQCleansing_5F00_A5.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/1325.DQCleansing_5F00_A5.png" alt="" width="406" height="334" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="6"&gt;
&lt;li&gt;Repeat the steps and create domains for City and State.&amp;nbsp;&lt;/li&gt;
&lt;li&gt;Select City domain and click on domain values tab. In this screen, you can enter all the list of valid and invalid values for the domain.&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Click on Add new domain value icon. Enter a correct value (e.g. Los Angeles) then press enter.&amp;nbsp;&lt;/li&gt;
&lt;li&gt;Click on Add new domain value icon. Enter an invalid value for the city (e.g. United States), click the down arrow in the Type column and select the invalid type (yellow triangle) then press enter. (There are 3 supported type: correct, invalid, and error; invalid suggests that the value is not valid for the given domain, but may be valid in a different domain. For example, United States is not a valid value for city domain but is a valid value for country domain; error suggests that the value is globally incorrect). For invalid or error value, you can optionally specify correct to value for DQS to automatically correct. Otherwise, records with invalid or error value will be flag as invalid during cleansing.&lt;/li&gt;
&lt;li&gt;Click on Add new domain value icon. Enter a synonym value for the city (e.g. LA) then press enter. Highlight both "LA" and "Los Angeles" values, right click then select Set as Synonyms.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6518.DQCleansing_5F00_A73.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6518.DQCleansing_5F00_A73.png" alt="" width="291" height="259" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p style="padding-left: 90px;"&gt;Note: by default, DQS include DQS_NULL as valid value for the domain, you can change the type to invalid if you would like the record with missing value to be flagged&lt;/p&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;Update the leading value by right clicking Los Angeles and select Set as Leading&amp;nbsp;during cleansing.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8103.DQCleansing_5F00_A74.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8103.DQCleansing_5F00_A74.png" alt="" width="322" height="255" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;The final city domain values should look as follows:&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6747.DQCleansing_5F00_A75.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6747.DQCleansing_5F00_A75.png" alt="" width="339" height="184" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="8"&gt;
&lt;li&gt;Select State domain and click on domain values tab. In this step, we will import the values from a spreadsheet.&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Obtain the list of US states, its standard abbreviation and postal abbreviation from &lt;a href="http://www.stateabbreviations.us/"&gt;here&lt;/a&gt;. Copy the first 3 columns into excel and save as csv file.&lt;/li&gt;
&lt;li&gt;Click the down arrow next to import values icon on the state domain management and select import valid values from excel. From the pop-up window, browse to the location of the csv file (make sure to select file type as csv) and update Use First row as header checkbox according to your file (if you do not include header when you prepare the file, then you can leave the default option unchecked.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-filesystemfile.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8132.DQCleansing_5F00_A82.png"&gt;&lt;img style="max-width: 550px;" src="http://blogs.msdn.com/cfs-filesystemfile.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8132.DQCleansing_5F00_A82.png" alt="" width="349" height="107" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li&gt;You should see the State domain values screen populated as below - notice that when you import list of values with more than one column, DQS automatically set the first column as leading value and the subsequent column value as its synonyms.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8547.DQCleansing_5F00_A83.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8547.DQCleansing_5F00_A83.png" alt="" width="455" height="241" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="9"&gt;
&lt;li&gt;Select CustomerName domain and click on Term-Based Relations tab. DQS allows you to define terms within your domain value and standardize them into standard term. For example, business name often include abbreviation such as "corp", etc. When there are variation for the use of the terms (e.g. one record may use abbreviation such as "Microsoft Corp" while another record may have values for "Microsoft Corporation"), you can use DQS to standardize the use of terms within your domain values. In this step, we will define 2 term based relations for our CustomerName domain:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Click Add new relation icon&lt;/li&gt;
&lt;li&gt;Add "co" as value and "Company" as correct to. Press enter to continue.&lt;/li&gt;
&lt;li&gt;Add "ltd" as value and "Limited" as correct to.&amp;nbsp;&lt;/li&gt;
&lt;li&gt;You should see the CompanyName Term-Based Relations screen to be populated as below. Click finish to continue.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/4336.DQCleansing_5F00_A94.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/4336.DQCleansing_5F00_A94.png" alt="" width="403" height="261" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="10"&gt;
&lt;li&gt;Click Publish Knowledge Base button to continue.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Creating your own DQS Knowledge Base sometimes requires a lot of effort. For things such as address cleansing, phone number cleansing, creating your own complete list of all valid and invalid values can be a huge effort. DQS supports integration with third party service provider to cleanse your data through DQS. Refer to my blog article on &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/06/11/cleansing-your-customer-data-using-dun-amp-bradstreet-dqs-service.aspx"&gt;how to cleanse Customer Data using Dun &amp;amp; Bradstreet&lt;/a&gt; for more information.&lt;/p&gt;
&lt;h3&gt;B. Create a DQS project to cleanse your data using the Knowledge Base&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;In the Data Quality Client home screen, under Data Quality Projects, click New Data Quality Project.&lt;/li&gt;
&lt;li&gt;Name your new Project (e.g. MyCustomer Cleansing Project), make sure you select Knowledge Base created in the previous step (e.g. MyCustomerKB) then click Next to continue.&lt;/li&gt;
&lt;li&gt;In the Map screen:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Select data source, database, and table for your data (e.g. MyCustomers table and data generated from the script at the beginning of this article)&lt;/li&gt;
&lt;li&gt;Select columns to be cleansed and map to the domains in your knowledge base&lt;/li&gt;
&lt;li&gt;You should have your datasource columns mapped as below. Click Next to continue.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8360.DQCleansing_5F00_B3c.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/8360.DQCleansing_5F00_B3c.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;In the Cleanse screen, click Start to begin the process. Click Next when the process finishes.&lt;/li&gt;
&lt;li&gt;In the Manage and View results screen:&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;Select City domain and click Corrected tab. You should see "LA" corrected to "Los Angeles"&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/2313.DQCleansing_5F00_B5a.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/2313.DQCleansing_5F00_B5a.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;ul&gt;
&lt;li&gt;Select City domain and click New tab. You should see the list of city not defined in the knowledge base domain value definition listed here. Click Approve all terms icon to continue (Refer to &lt;a href="http://msdn.microsoft.com/en-us/library/hh479581.aspx"&gt;this msdn article&lt;/a&gt; on how do import all approved project values into DQS knowledge base). The approved values are now appear under Correct tab.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6355.DQCleansing_5F00_B5b.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/6355.DQCleansing_5F00_B5b.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;ul&gt;
&lt;li&gt;Select CustomerName domain and click New tab. You should see Customer Names are updated to use standardized terms. Click Next to continue.&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/2330.DQCleansing_5F00_B5c.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/2330.DQCleansing_5F00_B5c.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="6"&gt;
&lt;li&gt;In the Manage and View results screen, you can export the results to SQL Server table or Excel.&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="padding-left: 90px;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/1273.DQCleansing_5F00_B6.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-DQSCleansing/1273.DQCleansing_5F00_B6.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;In this blog article, I discussed how to create a Knowledge Base to cleanse customer data, including use of domain values and term based relations. You can then refer to the Knowledge Base to create Data Quality Projects to cleanse your data. You can use the same knowledge base to perform cleansing on many Data Quality Projects. You can also automate the cleansing using SQL Server 2012 Integration Services. Matt Mason wrote a nice article : &lt;a href="http://blogs.msdn.com/b/mattm/archive/2011/07/14/overview-of-the-dqs-cleansing-transform.aspx"&gt;Overview of DQS Transform&lt;/a&gt; that describes the SSIS DQS Cleansing transform.&lt;/p&gt;
&lt;p&gt;You may also notice that there appears to be duplicate records in the sample data I used. In the &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/06/17/matching-related-and-duplicate-customer-records-using-sql-server-data-quality-services-dqs.aspx"&gt;next article&lt;/a&gt;, I will describe on how to enhance your knowledge base by adding matching policy and identify duplicate and related records in your dataset.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10426090" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality/">data quality</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/sql+server/">sql server</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+cleansing/">dqs cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+cleansing+component/">dqs cleansing component</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/cleansing/">cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+service/">data quality service</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/knowledge+base/">knowledge base</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+cleansing/">data cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQLServer/">SQLServer</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/2012/">2012</category></item><item><title>Cleansing your Customer Data using Dun &amp; Bradstreet DQS service</title><link>http://blogs.msdn.com/b/dqs/archive/2013/06/11/cleansing-your-customer-data-using-dun-amp-bradstreet-dqs-service.aspx</link><pubDate>Tue, 11 Jun 2013 16:39:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10425142</guid><dc:creator>Welly.Lee</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10425142</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2013/06/11/cleansing-your-customer-data-using-dun-amp-bradstreet-dqs-service.aspx#comments</comments><description>&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Starting today you can use SQL Server Data Quality Service (DQS) to cleanse data using &lt;a href="http://www.dnb.com/"&gt;Dun &amp;amp; Bradstreet&lt;/a&gt; (D&amp;amp;B) extensive business information data. This is done using DQS Reference Data Service (RDS) functionality. You can subscribe to the D&amp;amp;B &lt;a href="https://datamarket.azure.com/dataset/dnb/dqscompanymatch"&gt;Company Cleanse &amp;amp; Match for Microsoft SQL Server Data Quality Service&lt;/a&gt; from &lt;a href="https://datamarket.azure.com/"&gt;Windows Azure Marketplace&lt;/a&gt;, then connect to the service from your DQS client tool or SSIS DQS Cleansing transform task to cleanse your data.&lt;/p&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;The following are the step to use the D&amp;amp;B for your DQS data cleansing:&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;A. Subscribe to D&amp;amp;B Reference Data Service from Windows Azure MarketPlace&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;B. Configure DQS to use Reference Data Service&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;C. Create DQS knowledge base and attach to D&amp;amp;B Reference Data Service&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;D. Cleanse Data using D&amp;amp;B Reference Data Service from&amp;nbsp;Data Quality Client&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;E. Automate data cleansing using SSIS DQS Cleansing transform task&lt;/p&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;h1 style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;A. Subscribe to D&amp;amp;B Reference Data Service from Windows Azure MarketPlace&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Create &lt;a href="http://msdn.microsoft.com/library/ff717655.aspx"&gt;Windows Azure Marketplace&lt;/a&gt; account (if you have previously create an account with Windows Azure Marketplace, you can skip this step and login with your account)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Navigate to &lt;a href="https://datamarket.azure.com/dataset/dnb/dqscompanymatch"&gt;Data &amp;gt; Data Quality Services &amp;gt; Company Cleanse &amp;amp; Match for Microsoft SQL Server Data Quality Services&lt;/a&gt; and subscribe. For trial subscription, please contact &lt;a href="mailto:DNB_MS_Partnership_CoreTeam@DNB.com"&gt;DNB_MS_Partnership_CoreTeam@DNB.com&lt;/a&gt; for a promotion code.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Navigate to &lt;a href="https://datamarket.azure.com/account"&gt;My Account&lt;/a&gt; and copy Primary Account Key information&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;h1 style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;B. Configure DQS to use Reference Data Service&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;Start Data Quality Client. For information about this step, refer to &lt;a href="http://msdn.microsoft.com/en-us/library/hh213023.aspx"&gt;Run the Data Quality Client Application&lt;/a&gt;.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Data Quality Client home screen, under Administration, click Configuration.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Reference Data tab, under the Network Settings area, type appropriate values in the Proxy Server and Port boxes if you or your organization uses proxy server to connect to the Internet.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Specify the Marketplace primary account key (See Step A3 above) in the DataMarket Account ID box, and click the Validate DataMarket Account ID icon to validate the account key.&amp;nbsp;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Click close to go back to the main menu.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-filesystemfile.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2664.DQ_5F00_DNB_5F00_B5.png"&gt;&lt;img style="max-width: 550px;" src="http://blogs.msdn.com/cfs-filesystemfile.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2664.DQ_5F00_DNB_5F00_B5.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;Note that you may want to extend the DQS RDS timeout setting. Refer to &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/04/10/extending-timeout-setting-for-reference-data-services-rds.aspx"&gt;this article&lt;/a&gt;&amp;nbsp;for the instruction.&amp;nbsp;&lt;/p&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;Reference : &lt;a href="http://msdn.microsoft.com/en-us/library/hh213070.aspx"&gt;Configure DQS to use Reference Data (msdn)&lt;/a&gt;&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;h1 style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;C. Create DQS knowledge base and attach to D&amp;amp;B Reference Data Service&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;From Data Quality Client main menu, click New Knowledge Base&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Specify Name for your knowledge base (e.g. DNBCleansing), make sure "Domain Management" is selected as the activity and click Next to continue&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Click on create domain icon (see the screenshot below) and create the following domains (you can use default properties for this example; refer to &lt;a href="http://msdn.microsoft.com/en-us/library/hh510401.aspx"&gt;Create a Domain&lt;/a&gt; for more information about creating DQS Knowledge Base domain)&lt;/p&gt;
&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;CompanyName&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;State&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Country&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;AddressLine (&lt;em&gt;optional&lt;/em&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;City (&lt;em&gt;optional&lt;/em&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Zip (&lt;em&gt;optional&lt;/em&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;ZipPlus4 (&lt;em&gt;optional&lt;/em&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;DunsNumber (&lt;em&gt;optional&lt;/em&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;OrderReasonCode (&lt;em&gt;required only if you are cleansing non US company&lt;/em&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5287.DQ_5F00_DNB_5F00_C3.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5287.DQ_5F00_DNB_5F00_C3.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Create a composite domain by clicking on create composite domain icon (see the screenshot below), specify name (e.g. FullCompanyInformation) and select all domains you created in Step C3. The final configuration should look like below:&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4428.DQ_5F00_DNB_5F00_C5.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4428.DQ_5F00_DNB_5F00_C5.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Make sure the composite domain you created in step 4 is selected in the domain list and click Reference Data tab&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4645.DQ_5F00_DNB_5F00_C6.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4645.DQ_5F00_DNB_5F00_C6.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="6"&gt;
&lt;li&gt;
&lt;div style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Click Browse and from the Online Reference Data Provider Catalog pop-up window, select D&amp;amp;B Cleanse and Match and map the schema to your domain (from step C3 above). If you are cleansing international company (which require "OrderReasonCode") or you want to include more domains, then click Create More Schema icon (see screenshot below), select additional schema, and map to your domains. Make sure that all domains in your composite domain (step C4) are mapped to a schema. The final mapping should look like below, click OK to continue.&lt;/div&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7380.DQ_5F00_DNB_5F00_C7.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7380.DQ_5F00_DNB_5F00_C7.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="7"&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Click Finish publish your knowledge base.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;Reference : &lt;a href="http://msdn.microsoft.com/en-us/library/gg524799.aspx"&gt;DQS Knowledge Bases and Domains&lt;/a&gt; (msdn)&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;h1 style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;D. Cleanse Data using D&amp;amp;B Reference Data Service from Data Quality Client&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Start Data Quality Client. For information about this step, refer to &lt;a href="http://msdn.microsoft.com/en-us/library/hh213023.aspx"&gt;Run the Data Quality Client Application&lt;/a&gt;.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Data Quality Client home screen, click New data quality project&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the New Data Quality Project screen:&lt;/p&gt;
&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Name box, type a name for the new data quality project (e.g. New Customer Data Cleansing)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Use Knowledge base list, click to select a knowledge base created in Step C (e.g. DNBCleansing)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Ensure that Cleansing is selected as activity&amp;nbsp;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Click Next&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7103.DQ_5F00_DNB_5F00_D3.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7103.DQ_5F00_DNB_5F00_D3.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Map screen:&lt;/p&gt;
&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Select data source.&amp;nbsp; For testing purpose, I used the following file I created using Excel and save the file to C:\temp\NewCustomerData.xls (if you are using x64 machine, you may need to save the file as type "Excel 97-2003 Workbook". If you are cleansing international data, you need to include a column for "order reason code" and populate the column with value of "1" (make sure to set the excel column as string data type as you can only map data with the same type as your domain data type definition).&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 90px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4760.DQ_5F00_DNB_5F00_D4_5F00_a.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4760.DQ_5F00_DNB_5F00_D4_5F00_a.png" alt="" width="241" height="92" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="4"&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Select the input file, specify worksheet, and map the column to the domains created in Step C3 - the final mapping should looks like below and click Next to continue.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 90px; font-family: Calibri; font-size: 11pt;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/6403.DQ_5F00_DNB_5F00_D4_5F00_b.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/6403.DQ_5F00_DNB_5F00_D4_5F00_b.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ol start="5"&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Cleanse screen: click Start button, click Next to continue&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In the Manage and View Results you can review the results. Refer to Cleanse Data Using Reference Data (external) knowledge for more information on reviewing results.&amp;nbsp; Note that D&amp;amp;B also provides additional information about your data. You can review this information on the last step. For example, the below screenshot shows additional information such as address and DUNS number added to the valid company.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4456.DQ_5F00_DNB_5F00_D5.png"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4456.DQ_5F00_DNB_5F00_D5.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;Reference: &lt;a href="http://msdn.microsoft.com/en-us/library/hh510392.aspx"&gt;Cleanse Data Using Reference Data (External) Knowledge&lt;/a&gt; (msdn)&lt;/p&gt;
&lt;p style="margin: 0in; padding-left: 30px; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;
&lt;h1 style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;E. Automate data cleansing using SSIS DQS Cleansing transform task&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Create a new SQL Server Integration Services project using SQL Server 2012 Data Tool&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Configure your data source&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Select DQS Cleansing transform task from Data Flow toolbox&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Configure the&amp;nbsp; DQS Cleansing transform task (for more information, refer to &lt;a href="http://blogs.msdn.com/b/mattm/archive/2011/07/14/overview-of-the-dqs-cleansing-transform.aspx"&gt;Overview of the DQS Cleansing Transform&lt;/a&gt;)&lt;/p&gt;
&lt;/li&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Connect to Data Quality Server which host the Knowledge Base (created in Step C).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Map the column in your data source to the Knowledge Base domain (created in Step C3)&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Configure the data destination&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;Execute SSIS&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;In this blog article, I describe using SQL Server Data Quality Service (DQS) to connect to 3rd party Service Provider such as Dun &amp;amp; Bradstreet to cleanse customer data. Alternatively, you can also create your own Knowledge Base to cleanse data. In the &lt;a href="http://blogs.msdn.com/b/dqs/archive/2013/06/14/cleansing-customer-data-using-data-quality-services-dqs.aspx"&gt;next article&lt;/a&gt;, I describe how to create your own DQS Knowledge Base for data cleansing.&lt;/p&gt;
&lt;p style="margin: 0in; font-family: Calibri; font-size: 11pt;"&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10425142" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality/">data quality</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+ssis/">dqs ssis</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+cleansing/">dqs cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/ssis/">ssis</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/cleansing/">cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+enrichment/">data enrichment</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+service/">data quality service</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/reference+data+service/">reference data service</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/datamarket/">datamarket</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/azure+datamarket/">azure datamarket</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+cleansing/">data cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/reference+data/">reference data</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQLServer/">SQLServer</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQL+Server+2012/">SQL Server 2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/2012/">2012</category></item><item><title>Extending timeout setting for Reference Data Services (RDS)</title><link>http://blogs.msdn.com/b/dqs/archive/2013/04/10/extending-timeout-setting-for-reference-data-services-rds.aspx</link><pubDate>Wed, 10 Apr 2013 22:21:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10410183</guid><dc:creator>Welly.Lee</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10410183</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2013/04/10/extending-timeout-setting-for-reference-data-services-rds.aspx#comments</comments><description>&lt;p&gt;DQS, by default, uses connection timeout of 1 minutes when connecting to external Reference Data Services (RDS) provider.&amp;nbsp;&amp;nbsp;You can modify the timeout setting, using the following steps:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Run the following query on SQL Server where DQS is installed:&lt;/li&gt;
&lt;/ul&gt;
&lt;blockquote&gt;
&lt;p&gt;SELECT [ID]&lt;/p&gt;
&lt;p&gt;,[SERVICE]&lt;/p&gt;
&lt;p&gt;,[NAME]&lt;/p&gt;
&lt;p&gt;,[UPDATE_TIME]&lt;/p&gt;
&lt;p&gt;,[DESCRIPTION]&lt;/p&gt;
&lt;p&gt;,[VALUE]&lt;/p&gt;
&lt;p&gt;,[REVISION]&lt;/p&gt;
&lt;p&gt;FROM [DQS_MAIN].[dbo].[A_CONFIGURATION]&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;where id=916&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Click on the value cell (XML) and copy the XML output&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-metablogapi/1780.clip_5F00_image002_5F00_747F01C4.jpg"&gt;&lt;img style="display: inline; background-image: none;" title="clip_image002" src="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78-metablogapi/3817.clip_5F00_image002_5F00_thumb_5F00_586E42D9.jpg" alt="clip_image002" width="606" height="69" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Change the ConnectionTimeout parameter (default 60000 or 1 minute) and update the record (&lt;strong&gt;&lt;span style="color: #ff0000;"&gt;Tips&lt;/span&gt;&lt;/strong&gt;: Make sure to update DQS_MAIN database before you make the update), for example:&lt;/li&gt;
&lt;/ul&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;BEFORE&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;update [DQS_MAIN].[dbo].[A_CONFIGURATION]&lt;/p&gt;
&lt;p&gt;set value = '&amp;lt;configuration name="ReferenceData"&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheEnabled" value="1" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheCleanupIntervalMinutes" value="1440" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheInvalidationTimeHours" value="24" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheSizeEntries" value="1000000" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="BatchQuerySizeEntries" value="100" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="QueryMinConfidence" value="0.4" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="QueryMaxSuggestions" value="5" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="ProxyAddress" value="" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&amp;lt;param name="ConnectionTimeout" value="60000" /&amp;gt;&lt;/strong&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="DataMarketCatalogEntryPoint" value="services/my/Datasets?$filter=Category%20eq%20''Data Quality Services''" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="MetadataEntryPoint" value="/$metadata" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="RetryErrorCodes" value="504" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="RetryReduceFactor" value="5" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;lt;/configuration&amp;gt;'&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;where&lt;/strong&gt;&lt;strong&gt; id=916&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; AFTER&lt;/strong&gt;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;update [DQS_MAIN].[dbo].[A_CONFIGURATION]&lt;/p&gt;
&lt;p&gt;set value = '&amp;lt;configuration name="ReferenceData"&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheEnabled" value="1" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheCleanupIntervalMinutes" value="1440" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheInvalidationTimeHours" value="24" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="CacheSizeEntries" value="1000000" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="BatchQuerySizeEntries" value="100" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="QueryMinConfidence" value="0.4" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="QueryMaxSuggestions" value="5" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="ProxyAddress" value="" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&amp;lt;param name="ConnectionTimeout" value="300000" /&amp;gt;&lt;/strong&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="DataMarketCatalogEntryPoint" value="services/my/Datasets?$filter=Category%20eq%20''Data Quality Services''" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="MetadataEntryPoint" value="/$metadata" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="RetryErrorCodes" value="504" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp; &amp;lt;param name="RetryReduceFactor" value="5" /&amp;gt;&lt;/p&gt;
&lt;p&gt;&amp;lt;/configuration&amp;gt;'&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;where&lt;/strong&gt;&lt;strong&gt; id=916&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Note: make sure to qualify single quote(&amp;lsquo;) for DatamarketCatalogEntryPoint with two single quote (&amp;lsquo;&amp;rsquo;)&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Restart sql server&lt;/li&gt;
&lt;/ul&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10410183" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality/">data quality</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+cleansing/">dqs cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+service/">data quality service</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/reference+data+service/">reference data service</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/azure+datamarket/">azure datamarket</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/reference+data/">reference data</category></item><item><title>Languages supported by DQS for Cleansing and Matching</title><link>http://blogs.msdn.com/b/dqs/archive/2013/03/14/languages-supported-by-dqs-for-cleansing-and-matching.aspx</link><pubDate>Thu, 14 Mar 2013 07:39:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10402213</guid><dc:creator>Kumar Vivek</dc:creator><slash:comments>2</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10402213</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2013/03/14/languages-supported-by-dqs-for-cleansing-and-matching.aspx#comments</comments><description>&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Data Quality Services (DQS) internally stores the domain values in a knowledge base in the Unicode format, and uses the trigram algorithm, which is language agnostic, to compare the domain values with your source data for the cleansing and matching operations. Therefore, you can practically use DQS to cleanse and match data in all the languages that are supported by the Windows operating system.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;OK! But, why do you have limited set of language options while creating a string domain?&lt;/h2&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;While creating a string domain in DQS, one can specify the language&amp;nbsp;for the domain.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5657.CreateDomain.jpg"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5657.CreateDomain.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The &lt;strong&gt;Language&lt;/strong&gt; drop-down list displays a limited set of languages, and this selection is only applicable for the Speller feature in DQS. The Speller feature works only for those languages that are listed in the &lt;strong&gt;Language&lt;/strong&gt; drop-down list. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;However, if you have values in a non-listed language (for example, Greek, Chinese, and so on), you must select &lt;strong&gt;Other&lt;/strong&gt; from the &lt;strong&gt;Language&lt;/strong&gt; drop-down list.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;strong&gt;NOTE&lt;/strong&gt;: Selecting &lt;strong&gt;Other&lt;/strong&gt; from the &lt;strong&gt;Language&lt;/strong&gt; drop-down list&amp;nbsp;disables the Speller feature for the domain.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;Does collation setting impact the cleansing/matching in DQS?&lt;/h2&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The collation setting determines the rules for comparing data in SQL Server. Although DQS stores all values in the Unicode format, the collation setting does influence the comparison rules. For example, the characters that are considered to be different in a collation setting might be considered the same in another collation setting. Therefore, you should choose a different collation setting, other than the default collation setting, while installing DQS &lt;span style="text-decoration: underline;"&gt;only if&lt;/span&gt; you are completely aware of the collation comparison rules, and are sure about using the same comparison rules in DQS for cleansing and matching.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;However, if you are not sure about the comparison rules in a collation, you must install DQS with the default server collation, and things should work fine for you. For more information about installing DQS and specifying collation settings, see &lt;a href="http://msdn.microsoft.com/en-us/library/hh231682.aspx" target="_blank"&gt;here&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Thanks,&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;DQS Team&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10402213" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/cleansing/">cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/language+support/">language support</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/collation/">collation</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/matching/">matching</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/Unicode+support/">Unicode support</category></item><item><title>Upgrading DQS to SQL Server 2012 Service Pack 1 (SP1) Release</title><link>http://blogs.msdn.com/b/dqs/archive/2012/11/29/upgrading-dqs-to-sql-server-2012-service-pack-1-sp1-release.aspx</link><pubDate>Thu, 29 Nov 2012 08:34:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10372987</guid><dc:creator>Kumar Vivek</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10372987</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2012/11/29/upgrading-dqs-to-sql-server-2012-service-pack-1-sp1-release.aspx#comments</comments><description>&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;This post provides information about the Data Quality Services (DQS) improvements in the SQL Server 2012 Service Pack 1 (SP1) release, and step-by-step instructions&amp;nbsp;to upgrade your current installation of DQS to the SP1 release. The steps to upgrade &lt;span style="text-decoration: underline;"&gt;from&lt;/span&gt; the SQL Server 2012 RTM or any of the cumulative update (CU)&amp;nbsp;releases &lt;span style="text-decoration: underline;"&gt;to&lt;/span&gt; the SP1 release are the same.&lt;/span&gt;&lt;/p&gt;
&lt;h1&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: large;"&gt;List of DQS bugs fixed in the SQL Server 2012 SP1 release&lt;/span&gt;&lt;/h1&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;For detailed information about the DQS bugs fixed in the SP1 release, see the article at &lt;a href="http://support.microsoft.com/kb/2674319"&gt;http://support.microsoft.com/kb/2674319&lt;/a&gt;, and search for "DQS" or "data quality".&lt;/span&gt;&lt;/p&gt;
&lt;h1&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: large;"&gt;Before You Upgrade&lt;/span&gt;&lt;/h1&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Upgrading DQS to the SP1 release involves upgrading the schema of DQS databases as well. Ensure that you back up your DQS databases before you upgrade to the SP1 release. Although, the DQS installer reminds you about backing up your databases during the process, it does not back up the databases as part of the process. You can manually perform a full backup of your DQS databases&amp;nbsp;by running the following example script in SQL Server Management Studio:&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;USE MASTER;&lt;br /&gt;GO&lt;br /&gt;BACKUP DATABASE DQS_MAIN TO DISK='C:\mybackups\DQS_Main_fullbackup.bak';&lt;br /&gt;BACKUP DATABASE DQS_PROJECTS TO DISK='C:\mybackups\DQS_Projects_fullbackup.bak';&lt;br /&gt;BACKUP DATABASE DQS_STAGING_DATA TO DISK='C:\mybackups\DQS_Staging_fullbackup.bak';&lt;br /&gt;GO&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;For more information&amp;nbsp;about backing up and restoring DQS databases, see &lt;a href="http://msdn.microsoft.com/en-us/library/hh213068(SQL.110).aspx"&gt;http://msdn.microsoft.com/en-us/library/hh213068(SQL.110).aspx&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;h1&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: large;"&gt;How To Upgrade&lt;/span&gt;&lt;/h1&gt;
&lt;ol&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Download the appropriate SQL Server 2012 SP1 installer file (32-bit or 64-bit) from &lt;a href="http://www.microsoft.com/en-us/download/details.aspx?id=35575"&gt;http://www.microsoft.com/en-us/download/details.aspx?id=35575&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2671.Upgrade1.jpg"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2671.Upgrade1.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;On the SQL Server computer, logon using an administrator account that is also a member of the sysadmin role in the corresponding database engine instance of SQL Server 2012 where DQS is installed.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Double-click the downloaded file to launch the update process on your SQL Server computer.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;On the &lt;strong&gt;License Terms&lt;/strong&gt; page, accept the terms, and click &lt;strong&gt;Next&lt;/strong&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;On the &lt;strong&gt;Select Features&lt;/strong&gt; page, ensure that all the features for your instance are visible. If you have multiple instances, you can select an instance that you want to update. &lt;strong&gt;Shared Features&lt;/strong&gt; are always updated since there is only one copy to be shared by all instances. Click &lt;strong&gt;Next&lt;/strong&gt;.&lt;br /&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7875.Upgrade2.JPG"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7875.Upgrade2.JPG" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The installer checks for files in use to help avoid unnecessary reboots, and warns you if an in-use file is detected so you can manually kill those applications. If you don&amp;rsquo;t kill the apps prompted during the check, the installer might prompt you for a reboot at the end of the update process.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Click &lt;strong&gt;Update&lt;/strong&gt; to begin the update process. This might take a while to complete (~15-20 minutes) depending on the installed features in your SQL Server instance. After the update process completes successfully, a success message is displayed. Click &lt;strong&gt;Close&lt;/strong&gt; to close the installer page.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;We will now upgrade the DQS database schemas. To do so, start Command Prompt, and change your directory to the location where DQSInstaller.exe is available. If you installed the default instance of SQL Server, the DQSInstaller.exe file will be available at C:\Program Files\Microsoft SQL Server\MSSQL11.MSSQLSERVER\MSSQL\Binn:&lt;br /&gt;&lt;span style="font-family: courier new,courier;"&gt;cd C:\Program Files\Microsoft SQL Server\MSSQL11.MSSQLSERVER\MSSQL\Binn&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;At the command prompt, type the following command, and press ENTER:&lt;br /&gt;&lt;span style="font-family: courier new,courier;"&gt;dqsinstaller.exe &amp;ndash;upgrade&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;span style="font-family: courier new,courier;"&gt;&lt;span style="font-family: trebuchet ms,geneva;"&gt;&lt;strong&gt;NOTE&lt;/strong&gt;: The DQS installer checks for any pending restarts on your computer owing to any other updates that have been recently installed on your computer. If any pending restarts are found, a message appears to notify you about the same, and you can choose to continue or abort the installation by pressing Y or N respectively. We recommend that if there are any pending restarts, you must abort the DQS schema upgrade process, restart your computer, and then perform the schema upgrade by starting from step 8.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;hr /&gt;&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The installer prompts you to back up the DQS databases before proceeding. Also, during the database schema upgrade process, current DQS activity will be terminated, and all the DQS users will be disconnected from the Data Quality Server.&lt;br /&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/1805.Upgrade3.JPG"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/1805.Upgrade3.JPG" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Since you have already backed up your databases, type &lt;strong&gt;Y&lt;/strong&gt; or &lt;strong&gt;Yes&lt;/strong&gt;, and then press ENTER to continue with the upgrade.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;A completion message is displayed after the successful upgrade of the DQS databases schema.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;
&lt;hr /&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;strong&gt;NOTE&lt;/strong&gt;: If Data Quality Client is installed on a different computer, you will have to run the SP1 installer file on each computer where Data Quality Client is installed to upgrade it by following steps 2-7 earlier.&lt;/span&gt;&lt;/p&gt;
&lt;hr /&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;h1&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: large;"&gt;Verifying the Upgrade&lt;/span&gt;&lt;/h1&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;To verify that you have successfully upgraded your DQS installation, you can do either of the following:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Use Data Quality Client to logon to Data Quality Server, click the &lt;strong&gt;Help&lt;/strong&gt; icon in the top-right corner of the Data Quality Client home screen, and then click &lt;strong&gt;About SQL Server Data Quality Services&lt;/strong&gt;. The version dialog box appears that should display the version of server and client as &lt;strong&gt;11.0.3000.0&lt;/strong&gt;:&lt;br /&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/8637.Upgrade4.JPG"&gt;&lt;img src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/8637.Upgrade4.JPG" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;You can check the current assembly version in the DQS_MAIN and DQS_PROJECTS databases by querying the A_DB_VERSION table in each database. To do so, run the following query using SQL Server Management Studio:&lt;br /&gt;&lt;span style="font-family: courier new,courier;"&gt;SELECT * FROM DQS_MAIN.dbo.A_DB_VERSION WHERE STATUS=2;&amp;nbsp; &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new,courier;"&gt;SELECT * FROM DQS_PROJECTS.dbo.A_DB_VERSION WHERE STATUS=2;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Each upgrade step will show an entry, and the date of the upgrade. The maximum VERSION_ID and ASSEMBLY_VERSION on the latest date is the current version. In this case, it should be &lt;strong&gt;1600&lt;/strong&gt; and &lt;strong&gt;11.0.3000.0&lt;/strong&gt; respectively. Sample output:&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;
&lt;table border="1" frame="border" align="left"&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;ID&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;UPGRADE_DATE&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;VERSION_ID&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;ASSEMBLY_VERSION&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;USER_NAME&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;STATUS&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&lt;strong&gt;ERROR&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;1000&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;2012-04-09 16:27:18.680&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;1200&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;11.0.2316.0&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;DOMAIN\UserName&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;2&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&amp;nbsp;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;1001&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;2012-04-09 16:27:25.083&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;1400&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;11.0.2316.0&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;DOMAIN\UserName&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;2&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&amp;nbsp;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;1002&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;2012-11-27 17:26:43.180&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;1600&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;11.0.3000.0&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;DOMAIN\UserName&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;&amp;nbsp;2&lt;/span&gt;&lt;/td&gt;
&lt;td&gt;&amp;nbsp;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h1&gt;&amp;nbsp;&lt;/h1&gt;
&lt;h1&gt;&amp;nbsp;&lt;/h1&gt;
&lt;h1&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: large;"&gt;Having trouble upgrading DQS to SP1?&lt;/span&gt;&lt;/h1&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Post your query on the &lt;a href="http://social.msdn.microsoft.com/Forums/en-US/sqldataqualityservices/threads" target="_blank"&gt;DQS Forum&lt;/a&gt; or contact &lt;a href="http://support.microsoft.com/select/default.aspx?target=assistance" target="_blank"&gt;Microsoft Support&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Thanks,&lt;br /&gt;- DQS Team&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10372987" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQL+Server+2012/">SQL Server 2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/upgrade/">upgrade</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/service+pack+1/">service pack 1</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SP1/">SP1</category></item><item><title>Pattern Matching in DQS Domain Rules</title><link>http://blogs.msdn.com/b/dqs/archive/2012/10/08/pattern-matching-in-dqs-domain-rules.aspx</link><pubDate>Mon, 08 Oct 2012 16:43:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10357412</guid><dc:creator>Kumar Vivek</dc:creator><slash:comments>3</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10357412</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2012/10/08/pattern-matching-in-dqs-domain-rules.aspx#comments</comments><description>&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;This post provides you information on how to use the pattern matching rules in DQS while creating domain rules. These are the four pattern matching rules in DQS:&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7167.Pic1.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/7167.Pic1.jpg" width="230" height="264" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;For a pattern matching rule:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Any &amp;nbsp;letter (A&amp;hellip;Z) &amp;nbsp;can be used as a pattern for any letter; case insensitive&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Any digit (0&amp;hellip;9) can be used as a pattern for any digit&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Any special character, except a letter or a digit, can be used as a pattern for itself&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Brackets, [], define optional matching&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;&lt;b&gt;Example 1&lt;/b&gt;:&lt;/h4&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Consider the following pattern matching rule to represent Employee Code:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;/span&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/8484.Pic2.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/8484.Pic2.jpg" width="396" height="50" /&gt;&lt;/a&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2502.Pic2.jpg"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;This rule implies that the Employee Code will contain three parts: &lt;span style="text-decoration: underline;"&gt;&lt;em&gt;any three&lt;/em&gt;&lt;/span&gt; letters followed by a colon (:), which is again followed by &lt;span style="text-decoration: underline;"&gt;&lt;em&gt;any four&lt;/em&gt;&lt;/span&gt; digits.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;In this case, following are some &lt;strong&gt;valid&lt;/strong&gt; values as per the rule:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2620.Pic3.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/2620.Pic3.jpg" width="573" height="199" /&gt;&lt;/a&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;And, following are some &lt;strong&gt;invalid&lt;/strong&gt; values as per the rule:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="text-decoration: underline;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5342.Pic4.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5342.Pic4.jpg" width="408" height="144" /&gt;&lt;/a&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/4265.Pic4.jpg"&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h4&gt;&lt;span style="text-decoration: underline;"&gt;&lt;b&gt;Example 2&lt;/b&gt;:&lt;/span&gt;&lt;/h4&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Consider the following pattern matching rule for an Employee Code that shows &lt;span style="text-decoration: underline;"&gt;optional matching&lt;/span&gt;:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/3857.Pic5.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/3857.Pic5.jpg" width="408" height="46" /&gt;&lt;/a&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/0825.Pic5.jpg"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;This rule implies that the Employee Code will contain three parts: &lt;em&gt;&lt;span style="text-decoration: underline;"&gt;any three&lt;/span&gt;&lt;/em&gt; letters followed by a&amp;nbsp;hyphen (-), which is again followed by&lt;span style="text-decoration: underline;"&gt;&lt;/span&gt;&lt;span style="text-decoration: underline;"&gt;&lt;/span&gt;&lt;em&gt;&lt;span style="text-decoration: underline;"&gt; any four or five digits&lt;/span&gt;&lt;/em&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;In this case,&amp;nbsp;following are some &lt;strong&gt;valid&lt;/strong&gt; values as per the rule:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/8562.Pic6.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/8562.Pic6.jpg" width="564" height="199" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;And the following are some&amp;nbsp;&lt;strong&gt;invalid&lt;/strong&gt; values as per the rule:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5165.Pic7.jpg"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/5165.Pic7.jpg" width="405" height="151" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Thanks,&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;DQS Team&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10357412" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/pattern+matching/">pattern matching</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/domain+rules/">domain rules</category></item><item><title>Excessive database growth issue fixed in DQS in the CU2 Release for SQL Server 2012</title><link>http://blogs.msdn.com/b/dqs/archive/2012/07/03/excessive-database-growth-issue-fixed-in-dqs-in-the-cu2-release-for-sql-server-2012.aspx</link><pubDate>Tue, 03 Jul 2012 13:55:00 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10326470</guid><dc:creator>Kumar Vivek</dc:creator><slash:comments>2</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10326470</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2012/07/03/excessive-database-growth-issue-fixed-in-dqs-in-the-cu2-release-for-sql-server-2012.aspx#comments</comments><description>&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The Cumulative Update 2 (CU2) release for SQL Server 2012 is out, and it fixes a DQS issue where the DQS_PROJECTS database use to grow excessively during the knowledge discovery or the cleansing activity. For detailed information about this issue, see &lt;a href="http://support.microsoft.com/kb/2685743"&gt;http://support.microsoft.com/kb/2685743&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;You can download and install CU2 release from here: &lt;a href="http://support.microsoft.com/kb/2703275"&gt;http://support.microsoft.com/kb/2703275&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;strong&gt;Note&lt;/strong&gt;: Before installing the CU2 release, ensure that you back up your DQS databases. For more information, see &lt;a href="http://msdn.microsoft.com/en-us/library/hh213040"&gt;http://msdn.microsoft.com/en-us/library/hh213040&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;To update your existing SQL Server installation with the&amp;nbsp;CU2 release, do the following:&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Visit &lt;a href="http://support.microsoft.com/kb/2703275,"&gt;http://support.microsoft.com/kb/2703275,&lt;/a&gt; and click&amp;nbsp;&lt;strong&gt;Get Hotfix Now&lt;/strong&gt; at the top of the page.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;In the next page, select the hotfix to download, type in your email address where you want the hotfix link to be sent, and then click &lt;strong&gt;Request Hotfix&lt;/strong&gt;.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Check your email for the hotfix link, and click the link to download the hotfix.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;After downloding the hotfix file, double-click the file to extract the CU2 installer file to a folder on your computer.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Double-click the CU2 installer file to launch the update process on your SQL Server computer.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;In the License Terms page, accept the terms, and click &lt;strong&gt;Next&lt;/strong&gt;.&amp;nbsp;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;On the Select Features page, ensure&amp;nbsp;that all the features&amp;nbsp;for your instance are visible. If you have multiple instances, you can select an&amp;nbsp;instance that you want to update. &lt;strong&gt;Shared Features&lt;/strong&gt; are always updated since there is only one copy to be shared by all instances. Click &lt;strong&gt;Next&lt;/strong&gt;.&lt;/span&gt;&lt;br /&gt;&lt;a href="http://blogs.msdn.com/cfs-file.ashx/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/1323.SelectFeatures.JPG"&gt;&lt;img border="0" alt="" src="http://blogs.msdn.com/resized-image.ashx/__size/550x0/__key/communityserver-blogs-components-weblogfiles/00-00-01-45-78/1323.SelectFeatures.JPG" /&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The installer checks for files in use to help avoid unnecessary reboots, and warns you if an in-use file is detected so you can manually kill those applications. If you don&amp;rsquo;t kill the apps prompted during the check, the installer might prompt you for a reboot at the end of the&amp;nbsp;update process.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Click &lt;strong&gt;Next&lt;/strong&gt; to begin the&amp;nbsp;update process. This might take a while to complete (~10-20 mins).&amp;nbsp;After the update process&amp;nbsp;completes successfully, a success message is displayed. Click &lt;strong&gt;Close&lt;/strong&gt; to close the&amp;nbsp;installer page.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Start Command Prompt.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;At the command prompt, change your directory to the location where DQSInstaller.exe is available. If you installed the default instance of SQL Server, the DQSInstaller.exe file will be available at C:\Program Files\Microsoft SQL Server\MSSQL11.MSSQLSERVER\MSSQL\Binn:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;cd C:\Program Files\Microsoft SQL Server\MSSQL11.MSSQLSERVER\MSSQL\Binn&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;At the command prompt, type the following command, and press ENTER:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new,courier; font-size: small;"&gt;dqsinstaller.exe -upgrade&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The installer prompts you for backing up the DQS databases before proceeding.&amp;nbsp;Type Y or Yes and press ENTER to continue with the upgrade.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;A completion message is displayed after successful upgrade of the DQS databases schema.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;strong&gt;Credits&lt;/strong&gt;: &lt;a href="http://social.msdn.microsoft.com/profile/jason%20h%20-%20sql/"&gt;Jason Howell&lt;/a&gt; for helping with the detailed steps and the screenshot.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;- DQS Team&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10326470" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQL+Server+2012/">SQL Server 2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/CU2/">CU2</category></item><item><title>DQS supported in a SQL Server 2012 cluster</title><link>http://blogs.msdn.com/b/dqs/archive/2012/06/25/dqs-supported-in-a-sql-server-2012-cluster.aspx</link><pubDate>Mon, 25 Jun 2012 12:00:05 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10323540</guid><dc:creator>Kumar Vivek</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10323540</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2012/06/25/dqs-supported-in-a-sql-server-2012-cluster.aspx#comments</comments><description>&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;The SQL Server 2012 Cumulative Update 1 (CU1) release enables support for DQS as part of a clustered SQL Server 2012 setup.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;Ensure that you install the CU1 release&amp;nbsp;as mentioned under the &lt;strong&gt;Resolution&lt;/strong&gt; section of the following KB article to enable cluster support for DQS: &lt;a href="http://support.microsoft.com/kb/2674817"&gt;http://support.microsoft.com/kb/2674817&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;You can download the CU1&amp;nbsp;package from here: &lt;a href="http://support.microsoft.com/kb/2679368"&gt;http://support.microsoft.com/kb/2679368&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;&lt;strong&gt;NOTE&lt;/strong&gt;:&amp;nbsp;When you run the SQL Server 2012 setup, you are prompted for the "Scan for Product Updates" step. However,&amp;nbsp;this step can find and download only Microsoft General Distribution Release (GDR) and service pack updates. The&amp;nbsp;CU1 package is not listed. Therefore, you must manually download the CU1 package from the location mentioned earlier, and point the SQL Server setup to the package for the installation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: trebuchet ms,geneva; font-size: small;"&gt;- DQS Team&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10323540" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality/">data quality</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/sql+server/">sql server</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/CU1/">CU1</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/2012/">2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/cluster+support/">cluster support</category></item><item><title>DQS on the Twelve Days of SQL Server 2012</title><link>http://blogs.msdn.com/b/dqs/archive/2012/04/18/dqs-on-the-twelve-days-of-sql-server-2012.aspx</link><pubDate>Wed, 18 Apr 2012 21:12:28 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:10295109</guid><dc:creator>Matthew Roche [MSFT]</dc:creator><slash:comments>2</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://blogs.msdn.com/b/dqs/rsscomments.aspx?WeblogPostID=10295109</wfw:commentRss><comments>http://blogs.msdn.com/b/dqs/archive/2012/04/18/dqs-on-the-twelve-days-of-sql-server-2012.aspx#comments</comments><description>&lt;p&gt;&lt;em&gt;&lt;strong&gt;Call to Action:&lt;/strong&gt; When you&amp;rsquo;re done reading this article and checking out the resources it references, please take a moment to comment and let the DQS team know what additional resources you would like provided. If there&amp;rsquo;s a topic you would like covered in a blog post or webcast or demo, we want to know about it. We can&amp;rsquo;t promise we&amp;rsquo;ll be able to create everything that&amp;rsquo;s requested, but if you don&amp;rsquo;t ask, odds are you won&amp;rsquo;t receive.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;This morning the Data Quality Services team hosted as session as part of the ongoing &amp;ldquo;Twelve Days of SQL Server 2012&amp;rdquo; webcast series. Senior Program Manager Matthew Roche ran the session, while team members Howie Kroehl, Gadi Peleg, Welly Lee, Cim Ryan and Matt Masson hung out in the Q&amp;amp;A area answering attendee questions. We ended up having just over 200 attendees for the session, and the feedback was overwhelmingly positive. If you&amp;rsquo;re interested in viewing the recorded session on-demand, please take a look at the &lt;a href="http://swrt.worktankseattle.com/series/59/seriessignup.aspx"&gt;Twelve Days of SQL Server 2012 event web site&lt;/a&gt; &amp;ndash; the recording should be online within 24 to 48 hours.&lt;/p&gt;
&lt;p&gt;While the recording includes the session content, it does not include the questions asked (and answers provided) during the live session. For your reference, here is the Q&amp;amp;A transcript, slightly formatted for ease of reading:&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Will Microsoft ever do at source profiling?&lt;br /&gt;&lt;strong&gt;Private Answer&lt;/strong&gt;: Can you clarify what you mean by "at source profiling"?&lt;br /&gt;&lt;strong&gt;Question:&lt;/strong&gt; At source Profiling = going to the source DB2 system and looking at the data instead of moving the data to the SQL Server in chunks to profile it&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; You're right that profiling the data in non-SQL Server systems isn't really feasible using DQS or SSIS, directly, in SQL Server 2012; the data does need to be staged in SQL Server, first. As for the future, all we can say here is that Microsoft hasn't publicly announced capabilities along those lines.&lt;br /&gt;&lt;strong&gt;Question:&lt;/strong&gt; If i can get the right drivers to set up a linked server for the AS400 db i should be able to profile it correct?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; Correct&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; What is the cleansing speed?&lt;br /&gt;&lt;strong&gt;Answer: &lt;/strong&gt;&lt;a href="http://blogs.msdn.com/b/dqs/archive/2012/04/17/significant-performance-enhancements-in-dqs-with-the-cumulative-update-1-cu1-release-for-sql-server-2012.aspx"&gt;http://blogs.msdn.com/b/dqs/archive/2012/04/17/significant-performance-enhancements-in-dqs-with-the-cumulative-update-1-cu1-release-for-sql-server-2012.aspx&lt;/a&gt; &lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; The table in the blog post above describes the performance of DQS in different activities comparing the RTM version to the latest improvements in CU1 (cumulative update).&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Can this be integrated with SSIS so as to make it a part of my process of loading the data?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; Yes, there is a DQS Cleansing Transform for use in the SSIS Data Flow.&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Do you have to have Windows Azure to use Data Quality Services?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; No, DQS is not a cloud product in SQL Server 2012. The RDS capabilities are useful on top of the capabilities offered in the box.&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; This is amazing !!! I love this Demo&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; Thanks!&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Can data in xml source be used as source data for DQS?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; Unfortunately no, you can use Excel, CSV, SQL Table/view or linked server to connect to any other source.&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; do the azure RDS have a fee?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; Yes, you can subscribe to these services and pay according to the plan you select: &amp;lt;&lt;a href="https://datamarket.azure.com/browse/Data"&gt;https://datamarket.azure.com/browse/Data&lt;/a&gt;&amp;gt;&lt;br /&gt;&lt;strong&gt;Question:&lt;/strong&gt; Are the RDS's used with no fee? Is it included in the DQS license?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; Most RDS do have additional fees, although some provide cleansing free of charge. For more information, see &lt;a href="https://datamarket.azure.com/"&gt;https://datamarket.azure.com/&lt;/a&gt;&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; did I hear this SSIS integration is not now in 2012?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; The DQS cleansing component for SSIS is in SQL Server 2012&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Can you post the link to DQS online tutorial please?&lt;br /&gt;&lt;strong&gt;Answer: &lt;/strong&gt;&lt;a href="http://technet.microsoft.com/en-us/sqlserver/hh780961"&gt;http://technet.microsoft.com/en-us/sqlserver/hh780961&lt;/a&gt;&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Can I use local SQL Server data as a reference data set (i.e. a lookup table as a reference data set)?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; DQS does not support lookup matching however you can import or discover your data into your domains in the knowledge base and use it to cleans your data via SSIS or project in DQS.&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;strong&gt;Question:&lt;/strong&gt; Is MDS the tool to manage the data between the systems and DQS for cleansing and matching?&lt;br /&gt;&lt;strong&gt;Answer:&lt;/strong&gt; To "manage", yes. To transfer, use SSIS.&lt;/p&gt;
&lt;p&gt;In the final slide of the session we included links to additional DQS resources online. One of them is the DQS blog (which you have already found) while the other were for the &amp;ldquo;DQS Movies&amp;rdquo; video series and the DQS forums. Here are the URLs for these resources and a few more:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;DQS &amp;ldquo;How Do I?&amp;rdquo; videos: &lt;a href="http://msdn.microsoft.com/en-us/sqlserver/hh323832.aspx"&gt;http://msdn.microsoft.com/en-us/sqlserver/hh323832.aspx&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;DQS Forums: &lt;a href="http://social.msdn.microsoft.com/Forums/en-us/sqldataqualityservices/threads"&gt;http://social.msdn.microsoft.com/Forums/en-us/sqldataqualityservices/threads&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;DQS resources page: &lt;a href="http://msdn.microsoft.com/en-us/sqlserver/hh780961.aspx"&gt;http://msdn.microsoft.com/en-us/sqlserver/hh780961.aspx&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;DQS on the TechNet Wiki: &lt;a href="http://social.technet.microsoft.com/wiki/contents/articles/4385.data-quality-services-dqs.aspx"&gt;http://social.technet.microsoft.com/wiki/contents/articles/4385.data-quality-services-dqs.aspx&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;em&gt;Please&lt;/em&gt;&lt;/strong&gt; let us know what you think (both about the session and about DQS in general) and what additional resources you&amp;rsquo;d like to see us create. We can&amp;rsquo;t wait to hear from you!&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://blogs.msdn.com/aggbug.aspx?PostID=10295109" width="1" height="1"&gt;</description><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality/">data quality</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs/">dqs</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+quality+services/">data quality services</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/introduction/">introduction</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+cleansing/">dqs cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/ssis/">ssis</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/dqs+cleansing+component/">dqs cleansing component</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/cleansing/">cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/knowledge+base/">knowledge base</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/data+cleansing/">data cleansing</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/learning+resources/">learning resources</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/demo/">demo</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/SQL+Server+2012/">SQL Server 2012</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/webcast/">webcast</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/links/">links</category><category domain="http://blogs.msdn.com/b/dqs/archive/tags/2012/">2012</category></item></channel></rss>