When I talk about bug data - or potential uses for bug data, there are two things I typically mention. Both are strong opinions of mine, and while the first is now becoming more accepted, the second is less so (as far as I know).
The first opinion I share is the notion of using bug data to measure tester performance. This is dumb, and I'm betting most of you agree. Over the years, I see fewer and fewer cases of this, both inside and outside of Microsoft, so I probably don't need to repeat the story here.
The second opinion is my take on duplicate bugs. On most test teams (ok - nearly all test teams) I have worked with, entering a duplicate bug is viewed as a bad thing. (Some teams (typically those who disagree with me on the first point) even have fancy algorithms that use found, fixed, and resolution to come up with a magic tester efficiency number).
I guess duplicate bugs are viewed as bad because they can waste someone's time. The nerve of Bobby Tester entering a bug that Jane entered so clearly just a few days before - now someone has to take the time to resolve that bug as duplicate. So what if they test different areas of the product and didn't realize that the bug was actually in a shared component. Bad Bobby, Bad Bobby!
I don't think entering duplicate bugs is bad at all - in fact, I think worrying about them is bad. let me tell you why.
So, the next time someone tells you that entering duplicate bugs is bad, tell them that I said they were wrong. Of course, if you are entering verbatim duplicates of your own bugs, none of the above applies :-}.
** The term "triage" is taken from emergency medicine, and is often used to describe the process of examining and dealing with bugs. I assume it's familiar to everyone, but can elaborate as necessary.