alejandro_m1
10-17-2006, 08:27 AM
So here goes my tragic story
I had a file server set up with SUSE Linux, using a couple of Maxtor DiamondMax Plus 9 250 GB SATA HDDs in RAID 1 for "extra protection". I set it up with the YaST partitioning tool because, to be honest, I'm quite a noob in Linux and know very few commands, so I need a graphical interface most of the time. Everything worked fine for a few months, from February until last week, when I decided it was time to put all my files in order.
While I was classifying some textures I started to get a very slow response from the server. For example, when Max tried to access textures stored on the file server, it took about 10 minutes to load a simple scene, so I knew something was wrong. After confirming that the file server really was terribly slow, I decided to restart it; after all, it had been running for about 2 weeks without being turned off.
After the restart there was no way to get into Linux: it kept looping a disk check, trying without success to move data off some damaged sectors. So I thought, OK, one of the disks died, what a shame, I'll get the data from the other disk... The problem is that the "clone" disk showed the exact same error. To make the story short, after a lot of work trying to get either disk working and the array running again, I finally managed to recover some data by plugging them into my main workstation and running some ext2 recovery programs in XP. Most of the data I had been moving and classifying was lost on both HDDs, but most of the other files were OK.
So I decided that Linux isn't for me. I think that maybe if I knew a bit more about Linux I would have recovered more data. I went back to XP for the file server, reformatted the HDDs with NTFS, mounted them both as slaves with no RAID, and ran chkdsk on each one to find out how bad the problem was. To my surprise, both reported the exact same number of KB in bad sectors (around 560 KB). So after this long story, here goes my question: how could this happen? How can they have the same damaged sectors and fail at the exact same time? I can't find any culprit for the HDDs' death: no power failure, no problems with the PSU (as far as I know), and the temperatures are a bit high but not that much.
BTW, the Maxtor utility states that both disks are healthy.
I only had one HDD die on me before, years ago, and that was a factory problem (a Seagate). That's why I changed to Maxtor and recently to Western Digital, but now two Maxtors dying together is way too much.
PS
So I tried to RMA the disks and found out that the warranty expired on the 8th of this month. HDDs are pretty cheap now, so for my file server I'm getting new ones (most probably WD; I'll stay away from Maxtor this time). But just to avoid wasting these fairly new HDDs: how bad do you think the damage to the disks is? Would you use them for non-critical data, or is it time to toss them both in the trash? In the old days I remember using an HDD with a lot of damaged sectors, just running chkdsk from time to time, and it worked for years. But with new HDDs I've heard that once you get some bad sectors you should kiss your disk goodbye. Is this true?
