<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx</link><description>Being able to remove duplicate rows of information from a table of data is a request we hear fairly often from our customers (and one of the top questions in the comments in this blog). Some users know that this capability exists in Excel today; unfortunately</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489269</link><pubDate>Sat, 05 Nov 2005 02:10:53 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489269</guid><dc:creator>Harlan Grove</dc:creator><description>In current versions when one uses advanced filters to remove duplicates, all fields in the result (so all fields in the source range if filtered in place) are included when checking for duplicates. Will XL12 default to all fields selected in the new dialog?&lt;br&gt;&lt;br&gt;Also, using advanced filters, one has the option of copying the filtered data to a different location. Will that be an option in XL12 using the new UI?&lt;br&gt;&lt;br&gt;As for removing duplicates, presumably this is top-to-bottom, so the topmost record of possibly many duplicates remains and subsequent duplicate records deleted.&lt;br&gt;&lt;br&gt;One thing that screws up unique record filtering is visually duplicate records that have different amounts of leading or trailing spaces in various fields. It's not always practical simply to remove all spaces, and Excel currently provides no means other than the TRIM function and macros to remove only leading/trailing spaces. It'd be VERY USEFUL if this new feature included a checkbox to ignore (or not) leading and trailing spaces (and define space liberally as either characters with decimal codes 32 or 160).</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489286</link><pubDate>Sat, 05 Nov 2005 02:53:51 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489286</guid><dc:creator>David Gainer</dc:creator><description>Hi - yes, all fields selected by default.  The feature removes data in place.  Good feedback on the spaces - as the feature currently sits, you would need to use the extra column.</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489299</link><pubDate>Sat, 05 Nov 2005 03:10:55 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489299</guid><dc:creator>Rob van Gelder</dc:creator><description>Nice work on making this quicker.&lt;br&gt;Unique Advanced Filter is something I use a lot - and, you're right - there are a lot of clicks to get there.&lt;br&gt;&lt;br&gt;The ability to select just some columns seems, at first, like a useful feature.  Once I started thinking, I dont think I'll use it much.&lt;br&gt;The problem is that some of your checkout times are lost.&lt;br&gt;I cant think of a time where I've wanted to destroy remaining values but just keep the first.&lt;br&gt;&lt;br&gt;It would be more useful if there were options for what to do with the remaining columns.&lt;br&gt;Such as:&lt;br&gt;Use first, use last, sum, min, max, average, join with comma separator, join with &amp;lt;user defined separator&amp;gt;&lt;br&gt;&lt;br&gt;In your example, I might have wanted to keep the min(checkout) and max(checkin).&lt;br&gt;&lt;br&gt;&lt;br&gt;Regarding Harlan's comments (are you renaming to the David and Harlan blog yet? ;)&lt;br&gt;&lt;br&gt;Agree there should be a &amp;quot;copy filtered to&amp;quot; feature.&lt;br&gt;&lt;br&gt;On the autotrim checkbox:&lt;br&gt;I would not like to see that feature.  In my experience, the users who would use that feature are the very same who let untrimmed strings slide in their raw data - which leads to all sorts of integrity issues later.&lt;br&gt;Better to have those features in a 'data cleaning' window which concentrates such features. eg. trim spaces, Proper Case, alphanum only, text stored numbers to real numbers, etc...&lt;br&gt;&lt;br&gt;&lt;br&gt;Nice work - I cant wait to try the beta...&lt;br&gt;</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489302</link><pubDate>Sat, 05 Nov 2005 03:25:43 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489302</guid><dc:creator>Jean Martineau</dc:creator><description>Thank’s for the remove duplicates improvement.&lt;br&gt;&lt;br&gt;I agree with Harlan that we need to keep&lt;br&gt;|the option of copying the filtered data to a&lt;br&gt;|different location.&lt;br&gt;In this case, I also need to keep an option to specify a Criteria range.&lt;br&gt;&lt;br&gt;Jean</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489321</link><pubDate>Sat, 05 Nov 2005 04:51:30 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489321</guid><dc:creator>Dave Solimini</dc:creator><description>wow. it would be difficult for me to express how incredibly happy i am to see this feature improved! You folks look to be doing some amazing work. Hats-off!</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489329</link><pubDate>Sat, 05 Nov 2005 05:40:29 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489329</guid><dc:creator>Ken Clifton</dc:creator><description>I too would like to see options to move the data to a different location, sheet, file, etc.&lt;br&gt;&lt;br&gt;Also have an option for each field you could pick keep first, last, Nth position.&lt;br&gt;&lt;br&gt;Also be able to pick what each field will do like MIN, MAX, AVG, COUNT, SUM, OLDEST, NEWEST, etc.&lt;br&gt;&lt;br&gt;That way if you had an employee with data you could average some columns, sum others, get the MIN, MAX what ever you might need with this one feature.  This would be a GREAT selling point.</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489356</link><pubDate>Sat, 05 Nov 2005 08:04:04 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489356</guid><dc:creator>Biff</dc:creator><description>I'll echo what many have already noted:&lt;br&gt;&lt;br&gt;Copying filtered data to another location is very useful, especially the unique values.</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489357</link><pubDate>Sat, 05 Nov 2005 08:06:59 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489357</guid><dc:creator>Orion Adrian</dc:creator><description>I love the data cleaning idea. Stuff like trim, change case, reformat, remove charactes (like ()$.-_, etc.) would be very useful ot me.&lt;br&gt;&lt;br&gt;Orion Adrian</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489408</link><pubDate>Sat, 05 Nov 2005 16:27:29 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489408</guid><dc:creator>Roy</dc:creator><description>All the tables in your screenshots look great!  I hope there's a new easy way to format them like that.</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489576</link><pubDate>Sun, 06 Nov 2005 20:17:40 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489576</guid><dc:creator>Harlan Grove</dc:creator><description>Rob van Gelder...&lt;br&gt;...&lt;br&gt;|Better to have those features in a 'data&lt;br&gt;|cleaning' window which concentrates such&lt;br&gt;|features. eg. trim spaces, Proper Case, alphanum&lt;br&gt;|only, text stored numbers to real numbers,&lt;br&gt;|etc...&lt;br&gt;...&lt;br&gt;&lt;br&gt;I agree. If the goal of all these changes is the greatest good for the greatest number of users for the fewest keystrokes or mouse clicks, then someone on the development team need to realize that a very great deal of the data stored and manipulated in spreadsheets is text rather than numbers, and Excel provides very few tools for manipulating text that don't require using formulas in worksheet cells.&lt;br&gt;&lt;br&gt;With respect to autotrim, it kinda exists already in Data &amp;gt; Text to Columns, which can be run using fixed width fields but only one field. Excel will strip off leading and trailing ACSII spaces (decimal character code 32) but not nonbreaking HTML spaces (decimal character code 160). This is pretty obscure.&lt;br&gt;&lt;br&gt;There are also frequent questions about converting ranges to all upper case or proper case. The only practical approach currently is writing UPPER or PROPER formulas elsewhere, pasting their values on top of the original range, and clearing the range of formulas. A lot of work for most users.&lt;br&gt;&lt;br&gt;If it's a good idea to relieve users of the complexity of using formulas in conditional formatting, wouldn't it also be a good idea to relieve them ot the complexity of using formulas in data cleansing?</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489725</link><pubDate>Mon, 07 Nov 2005 09:11:45 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489725</guid><dc:creator>Arun Philip</dc:creator><description>It would be nice if there's some way of highlighting the values and then getting a user approval prior to removing the duplicates.  Basically, it would combine the functionality of Conditional Formatting's Highlight Duplicate Values, with the Remove Duplicates Feature.  </description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#489907</link><pubDate>Mon, 07 Nov 2005 21:38:55 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:489907</guid><dc:creator>headtoadie</dc:creator><description>Sometimes it is the duplicates that we *really* want and not the uniques.&lt;br&gt;&lt;br&gt;For example, if I have a customer history file and I want to identify the repeat customers then it is the duplicates that I'm really after.</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#490017</link><pubDate>Tue, 08 Nov 2005 00:32:46 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:490017</guid><dc:creator>David Gainer</dc:creator><description>Hello folks - again, thanks for the comments and suggestions from everybody. &lt;br&gt;&lt;br&gt;headtoadie - you can use conditional formatting (in Excel 12) to highlight dupes and then filter on the conditional formatting, for a total of about 3-4 mouse clicks. &lt;br&gt;&lt;br&gt;everyone - &lt;br&gt;&lt;br&gt;... &lt;br&gt;|Better to have those features in a 'data &lt;br&gt;|cleaning' window which concentrates such &lt;br&gt;|features. eg. trim spaces, Proper Case, alphanum &lt;br&gt;|only, text stored numbers to real numbers, &lt;br&gt;|etc... &lt;br&gt;... &lt;br&gt;|I agree. If the goal of all these changes is &lt;br&gt;|the greatest good for the greatest number of&lt;br&gt;|users for the fewest keystrokes or mouse &lt;br&gt;|clicks, then someone on the development team &lt;br&gt;|need to realize that a very great deal of the &lt;br&gt;|data stored and manipulated in spreadsheets &lt;br&gt;|is text rather than numbers, and Excel &lt;br&gt;|provides very few tools for manipulating text &lt;br&gt;|that don't require using formulas in |worksheet cells. &lt;br&gt;...&lt;br&gt;&lt;br&gt;What are the most common 'data clensing' tasks you find yourself doing? &lt;br&gt;</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#490061</link><pubDate>Tue, 08 Nov 2005 01:53:32 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:490061</guid><dc:creator>Harlan Grove</dc:creator><description>David Gainer...&lt;br&gt;...&lt;br&gt;|What are the most common 'data clensing' tasks&lt;br&gt;|you find yourself doing?&lt;br&gt;&lt;br&gt;Myself? I don't use Excel for data cleansing.&lt;br&gt;&lt;br&gt;My impression of users asking about this in newsgroups or stating problems that would be resolved by data cleansing, deleting leading and trailing ASCII spaces, deleting nonbreaking spaces, changing letter case and converting text that looks numeric into numbers are the most frequent data cleansing/conditioning needs. All except changing letter case could be accomplished using Data &amp;gt; Text to Columns.</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#490260</link><pubDate>Tue, 08 Nov 2005 12:10:49 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:490260</guid><dc:creator>XL-Dennis</dc:creator><description>David,&lt;br&gt;&lt;br&gt;I fully agree with Harlan Grove. Per se the data cleansing should be less in view of the XML-era and general development but it tend to be the opposite.&lt;br&gt;&lt;br&gt;The keyissue is that many corporate central systems in use can create, as output, so called Excel-files (core textfiles with the xls-extension) which looks great on it's face but create a demand for cleaning up before anything can be done with the files. It also involve the issue with dot and comma as  separators for values.&lt;br&gt;&lt;br&gt;Of course we can argue that this is something that the system vendors should solve but usually this is not a keyissue for central IT-departments in their dialogs with the vendors. In the end it's up to to the end-users to solve it in Excel which is reflected in newsgroups and public forums.&lt;br&gt;&lt;br&gt;Kind regards,&lt;br&gt;Dennis</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#490279</link><pubDate>Tue, 08 Nov 2005 13:52:18 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:490279</guid><dc:creator>Nigel Harper</dc:creator><description>Dave,&lt;br&gt;&lt;br&gt;I think it is more important to ask why we end up with duplicates in the first place.  XL-Dennis rightly points out that many text files created by exporting data from other apps and renaming with an xls extension often contain extra characters and confuse numeric text and numbers.&lt;br&gt;&lt;br&gt;Additionally gadgets containing contacts to be imported into Outlook are a major bugbear.  When I worked at a large investment bank, part of my duties were to be the in-house gadget guru.  Almost every new banker arrived with his/her gizmo.  This generally contained around 2500 contacts made up of generations of contacts from the various preceding gadget of the day.  Each one, sometimes even each model from the same manufacturer, has different ways of storing 'multi-field' fields such as address, the synchronising software is not much better.  Assistants would tear their hair out trying to filter and sort for duplicates that had built up over the years sometimes because of a linefeed or tab character etc.  Inevitably it would come to me in the form of a spreadsheet to use text function formulas to sort things out.&lt;br&gt;&lt;br&gt;Harlan is correct that much can be done with text to columns but there is more that it could do.  To be able to evaluate rows by accounting for leading, trailing and instances of strings, automatic trimming and cleaning, numeric evaluation and case sensitivity WITHOUT actually having to change the contents would be very useful.  &lt;br&gt;&lt;br&gt;'Duplicates' is actually a bit of a misnomer, most of the time the reason there are duplicates is because the similar rows are not actually duplicates.  Hope this helps,&lt;br&gt;&lt;br&gt;Nigel</description></item><item><title>re: Tables Part 6: Removing duplicates from tables (and ranges) of data</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#490360</link><pubDate>Tue, 08 Nov 2005 18:38:51 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:490360</guid><dc:creator>headtoadie</dc:creator><description>On data cleaning...one area where we experience problems is with text strings that contain a single &amp;quot;E&amp;quot; (ex. 72E0050). Excel tends automatically convert these to scientific notation. Once it is converted, the only way to get it back to original format is to retype it.&lt;br&gt;</description></item><item><title>  Revealed - Thousands of leads in 6 minutes and 37 seconds</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#8763135</link><pubDate>Tue, 22 Jul 2008 08:23:54 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:8763135</guid><dc:creator>  Revealed - Thousands of leads in 6 minutes and 37 seconds</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://crewealthtraining.com/members/?p=52"&gt;http://crewealthtraining.com/members/?p=52&lt;/a&gt;&lt;/p&gt;
</description></item><item><title> Microsoft Excel Tables Part 6 Removing duplicates from tables and | Outdoor Ceiling Fans</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#9670420</link><pubDate>Sun, 31 May 2009 19:39:48 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9670420</guid><dc:creator> Microsoft Excel Tables Part 6 Removing duplicates from tables and | Outdoor Ceiling Fans</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://outdoorceilingfansite.info/story.php?id=5628"&gt;http://outdoorceilingfansite.info/story.php?id=5628&lt;/a&gt;&lt;/p&gt;
</description></item><item><title> Microsoft Excel Tables Part 6 Removing duplicates from tables and | Outdoor Ceiling Fans</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#9670905</link><pubDate>Sun, 31 May 2009 21:45:24 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9670905</guid><dc:creator> Microsoft Excel Tables Part 6 Removing duplicates from tables and | Outdoor Ceiling Fans</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://outdoorceilingfansite.info/story.php?id=23264"&gt;http://outdoorceilingfansite.info/story.php?id=23264&lt;/a&gt;&lt;/p&gt;
</description></item><item><title> Microsoft Excel Tables Part 6 Removing duplicates from tables and | Hair Growth Products</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#9721991</link><pubDate>Wed, 10 Jun 2009 05:44:25 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9721991</guid><dc:creator> Microsoft Excel Tables Part 6 Removing duplicates from tables and | Hair Growth Products</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://hairgrowthproducts.info/story.php?id=7467"&gt;http://hairgrowthproducts.info/story.php?id=7467&lt;/a&gt;&lt;/p&gt;
</description></item><item><title> Microsoft Excel Tables Part 6 Removing duplicates from tables and | adirondack chairs</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#9783767</link><pubDate>Fri, 19 Jun 2009 12:17:52 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9783767</guid><dc:creator> Microsoft Excel Tables Part 6 Removing duplicates from tables and | adirondack chairs</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://adirondackchairshub.info/story.php?id=3737"&gt;http://adirondackchairshub.info/story.php?id=3737&lt;/a&gt;&lt;/p&gt;
</description></item><item><title> Microsoft Excel Tables Part 6 Removing duplicates from tables and | patio set</title><link>http://blogs.msdn.com/excel/archive/2005/11/04/489259.aspx#9784753</link><pubDate>Fri, 19 Jun 2009 13:14:51 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:9784753</guid><dc:creator> Microsoft Excel Tables Part 6 Removing duplicates from tables and | patio set</dc:creator><description>&lt;p&gt;PingBack from &lt;a rel="nofollow" target="_new" href="http://patiosetsite.info/story.php?id=663"&gt;http://patiosetsite.info/story.php?id=663&lt;/a&gt;&lt;/p&gt;
</description></item></channel></rss>