Kentc's SOAPbox

Disclaimer: All opinions are solely mine and don't necessarily reflect the opinions of Microsoft or my family.

MTBF for HDDs

Do you remember when hard drives were 10 MB and bad sectors would eliminate 20% of the drive?  Hard drives were not very robust (nor were the PCs, the OSes, …) but that was pretty outrageous. Not only would the sectors go bad which wiping out large percentages of the drive but the drives crashed pretty often as well.  Over the course of four years I had to replace at least three drives.  

That sets the stage for something I thought was a thing of the past—hard drive failures.  I’m not suggesting that drives never crash but with the exception of a x-coworker who used to claim his HDD crashed in order to get out of doing a demo/presentation (more than was even remotely reasonable) the MTBF must be pretty low.  Even with the five – seven laptops and Tablet PCs I’ve had which at least have a reason for failing do to all of the banging around in my bag I’ve had zero HDD problems.  

In the past three months I’ve had two hard drives crash.  One drive was in my Ultimate TV (upside: I now have 3x the capacity I had—room enough for recording almost all of the SciFi Channel’s Twilight Zone New Year’s Day marathon!) and just this past Tuesday the 160GB HDD in my Windows Media Center Edition PC (I popped in the drive that it replaced and it worked fine so it is a drive failure).  The UTV was two or three years old and that drive was always being written or read from because that’s just how it works.  The latter drive was only three months old.  It was manufactured by a well respected, name brand OEM.  They have a great warranty/return policy all completely done via their web site.  They even give the warranty holder the option of using a credit card so you don’t have to wait until they receive your drive before sending you the replacement.  Kudos for the fully functional web site.  However, maybe drive crashes happen a little too often which is why they have the return section really nailed. <g> I’m joking, sort of.  What a bummer though if I hadn’t of had a backup of my data.  

We never turned off the MCE PC and it has far too little memory (see earlier RDRAM post) so it swaps constantly.  I always felt bad for the HDD as the light indicating disk activity looked like the one on my firewall that indicates packet traffic—it was constantly on.  Did that contribute to its failure?  It might have but I’ve got another PC that is six years old that is always on and it’s humming only nicely (but with 756 MB of SDRAM it never swaps).  

I have to find an advocate within Microsoft to get my PC RAID5 idea on the feature list (unless it is already being developed and I just don’t know it which is quite possible). Because I’d gone so long between crashes it almost seemed like it wasn’t worth pursuing but now I know it has merit.  If it weren’t for doing very regular backups I’d been hosed.  Fortunately, between the script (which several of you were kind enough to help me with only last week—see earlier post) which distributed our music and photos to all of the other PCs and my external HDD I’ve not lost anything (well maybe my recently updated shared OneNote file containing future blog ideas).  

I’m really trying to dive down into the details of Longhorn’s WinFS because it’s a fascinating idea.  How do you build redundancy into the “database” engine such that the inevitable corruption (unexpected outages might not happen on servers because of UPSes but they certainly do on PCs) doesn’t mean data loss?  I’m glad there are some really smart people thinking about it because I certainly don’t fully understand how to make it 100% fault tolerant.  I’ve tried to grab some of the WinFS PM’s blogs but I haven’t read anyone discuss that issue (though it could be a trade secret).

BTW, doesn’t the capacity of the drives fifteen years ago seem preposterous compared to the capacity we have today?  I’m sure that’s exactly what we’ll be saying in ten years when terabytes of storage are the norm. <g>

 

Published Saturday, January 24, 2004 12:42 PM by Kentc

Comments

 

Mario Goebbels said:

In a cardboard box around here I've more broken 60gig IBMs than you'd want me to throw at you. Which is a sad thing compared to the way higher lifetime of some older 1.2gig, 3gig and 10gig drives I had in the past. Some of them which I sold still work in some old boxes.

But I worked as sysadmin at a computer wholesales sometime ago, so I know how rough the workers handle the crates in the warehouse. Outright scary.
January 24, 2004 11:10 AM
 

Andy said:

yeah, I've never had a hard drive crash until this past fall. A brand new Dell laptop I bought for my wife to use at school crapped out after 3 months. They diagnosed and overnighted a new disk. Still, would have rather had it happen to me. She didn't back up and I spent 2 days with all sorts of low level utilties to get it back. I explained backups and she's happily burning CD backups now.

I have to say that http://www.ontrack.com/ saved my butt. They have a limited version that lets you recover 25 files at a time. The NTFS partition was destroyed but the util still recoverd the 300 files she needed. Phew!!! Figured I'd pass that on to anyone else who might run into the same problem.

I wonder if the downturn in the economy has drive makers taking bigger risks by shipping marginal disks? Conceivable.
January 24, 2004 8:15 PM
 

Kent said:

All is well. I recieved the replacement drive and spent an unbelievable amount of money for 512Mb of RDRAM and my computer has never run better. I'm now wondering if some of the strangeness I was seeing under the old configuration (the WinTV card stopped working) was a result of the drive starting to go bad. My Pentium 4 PC has also never been quieter. Its almost as if I bought a new PC. :-)

BTW, what do the HDD OEMs do with the drives that are sent back to them? In my case, I have some fairly confidential information on the drive (e.g. my Money file). Do they zap it with an enormously large magnet field after seeing what mechanism broke? Do they recover the data and have the world's wildest holiday party ("watch me tranfer $1000 from his Quicken checking account to an account in Switzerland")?
February 1, 2004 9:24 AM
 

Desktop Computers » Kentc’s SOAPbox : MTBF for HDDs said:

March 19, 2008 12:16 PM
 

Kentc s SOAPbox MTBF for HDDs | Paid Surveys said:

May 28, 2009 7:13 PM
 

Kentc s SOAPbox MTBF for HDDs | storage bench said:

June 19, 2009 3:49 AM
Anonymous comments are disabled

© 2010 Microsoft Corporation. All rights reserved. Terms of Use  |  Trademarks  |  Privacy Statement
Microsoft
Page view tracker