I know we’re way OT, but I thought I’d give one more cautionary tale about hard drives and Linux. Just because a drive is diagnosed as “healthy” in smartmon or other testing utilities does not mean it’s 100% solid.
This story happened a while back when the kernel was updated from I believe 4.14 to 4.19. After the 4.19 update I started to have transient full lock up’s on my system. Not every day, but often enough to be really annoying.
If I stuck with kernel 4.14 no freezes occurred, but I really didn’t want to remain stuck on an older kernel forevever. I started trying to find a fix for the freezes. After checking my logs I found I/O errors when running any newer kernel and I was affected by freezes.
Kernel 4.19 saw the introduction of a different I/O scheduler. So after finding the I/O errors in my logs I switched back to the old scheduler in use on kernel 4.14. Using the old scheduler the system seemed stable, so I put the freezes down to changes with the kernel. I figured the issue would resolve itself as the kernel recieved updates, but it never did.
Over time I began to suspect an old drive hard drive was part of the problem, as it had started making the occasional weird noise. I ran numerous dignostic tests and every time the drive passed the diagnostic tests with flying colors.
Before too long I realized that if the old drive wasn’t attached to the computer when running a newer kernel things were fine. However, as soon as I plugged in the old drive the freezing issue came back. I retired the old drive, and the freezing issue was a thing of the past.
The moral of this story is, you can’t rely on software diagnostic tests to be a 100% reliable predictor of problems/failure with hardware. Ram can also test fine, but when you eliminate 1 stick of 4 matching ram sticks sequentially and reboot you can sometimes find the freezes disappear (even though all the Ram tests fine). Sometimes selectively removing hardware is the only way to truly be sure if it’s causing a problem.
Way off topic, but sometimes other people’s experiences can save you a lot of frustration and wasted time if you are warned in advance.
Storage technologies change over time and one thing I’ve found is the old tech is far from perfect when stored for long periods of time.
I was shocked how many floppies were unreadable or contained errors when I tried reading floppy disks that had been stored for years. The same thing happened when I moved and temporarily had no internet access. I took my movie collection out of mothballs and was shocked to find how many discs couldn’t be played properly after being stored for years. Even DVD’s degrade over time, so I would also suggest cloud backup for irreplaceable data such as photos.