[olug] Hard Drive Failure?

William E. Kempf wekempf at cox.net
Mon Feb 10 21:49:37 UTC 2003


> From: ktb <xyf at nixnotes.org>
> On Mon, Feb 10, 2003 at 10:26:30AM -0500, William E. Kempf wrote:
> > Had a bad weekend.  Some of you know from the last Install Fest that I had a HD going bad.  Well, I replaced it last week with a brand new 80 GB Western Digital.  From the very beginning I was nervous... the drive seemed fairly loud to me (not the spinning of the platters, but the movement of the head) and it sure *seemed* to be running slower than my previous drive when I expected it to be faster.  But it appeared to be running properly and a forced fsck didn't reveal any problems (that's the extent of my diagnotic abilities using Linux, so if there's other things I should have done, let me know).
> > 
> > It ran for most of the week.  Twice during the week I had the entire system lock up, but didn't know what the culprit was.  Then Friday things started to get worse.  I had to reboot because the system was locked up, and though the OS would come up, it ran so slowly as to be basically unusable.  I fought with trying to diagnose the problem for a little while, and another reboot resulted in the error "no drq after issuing write" displayed by the kernel.  Eventually I also received errors such as (and this is from memory, so not 100% accurate) "dma_intr status=0x61 {DeviceReady DeviceFault Error}" and the same thing basically but with status=0x04.  Amazingly, on two different reboots the BIOS couldn't even detect the drive, and I had to do a cold boot to get the BIOS to recognize the drive's presence again.
> > 
> <snip>
> 
> Please set your mailer to wrap at 72 characters as it raises havoc with
> those of us who use text based mailers.

Sorry, but until I can get this system running again, I'm basically stuck with using Cox's webmail, which I can't configure in this way.

> I would try looking at your dma settings.  Do a search for "hdparm" and
> your specific error message.  This might also be of use -

I can't use hdparm if I can't boot to the drive ;).

I also spent a significant amount of time searching on these error messages, and got only a bunch of conflicting answers.  Some suggest this is nothing, others say it's a sign of a failing HD, and yet others say it's something that needs to be adjusted through hdparm (but they disagree on what needs adjusting, and since I can't boot into the drive to use hdparm...).

> http://www.linuxgazette.com/issue76/tag/10.html

Thanks.  I'll check this link. 

Bill Kempf

William E. Kempf
wekempf at cox.net




More information about the OLUG mailing list