I'm starting to have some issues with my hard drive, a WDC SN730 512GB. Now this happened twice... system crashed, and GRUB loaded got replaced by MS bootloader?? It happened today and I have absolutely nothing Microsoft running. Only explanation I have is that when the hard drive throws an error, perhaps the BIOS attempted to repair the boot and created a bigger problem. Fixed GRUB and system runs again...
Then Garuda has been showing a few messages "WDC is about to fail" lately.
So it does bring the question, was there anything wrong in my setup that could have reduced the drive durability, or I just got unlucky?
And how severe are these errors? I just ordered a 2TB replacement drive. Is this one still good enough as secondary drive?
Please send output of sudo smartctl -a /dev/your-drive
smartctl 7.3 2022-02-28 r5338 [x86_64-linux-6.0.3-x64v2-xanmod1-1] (local build)
Copyright (C) 2002-22, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Number: WDC PC SN730 SDBQNTY-512G-1014
Serial Number: 1948GT442307
Firmware Version: 11101100
PCI Vendor/Subsystem ID: 0x15b7
IEEE OUI Identifier: 0x001b44
Total NVM Capacity: 512,110,190,592 [512 GB]
Unallocated NVM Capacity: 0
Controller ID: 8215
NVMe Version: 1.3
Number of Namespaces: 1
Namespace 1 Size/Capacity: 512,110,190,592 [512 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64: 001b44 4a44aa3630
Local Time is: Sat Feb 25 13:45:32 2023 EST
Firmware Updates (0x14): 2 Slots, no Reset required
Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test
Optional NVM Commands (0x005f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp
Log Page Attributes (0x1e): Cmd_Eff_Lg Ext_Get_Lg Telmtry_Lg Pers_Ev_Lg
Maximum Data Transfer Size: 128 Pages
Warning Comp. Temp. Threshold: 84 Celsius
Critical Comp. Temp. Threshold: 88 Celsius
Namespace 1 Features (0x02): NA_Fields
Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 5.50W - - 0 0 0 0 0 0
1 + 3.50W - - 1 1 1 1 0 0
2 + 3.00W - - 2 2 2 2 0 0
3 - 0.0700W - - 3 3 3 3 4000 10000
4 - 0.0025W - - 4 4 4 4 4000 40000
Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 2
1 - 4096 0 1
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x00
Temperature: 25 Celsius
Available Spare: 100%
Available Spare Threshold: 10%
Percentage Used: 6%
Data Units Read: 291,263,284 [149 TB]
Data Units Written: 101,372,655 [51.9 TB]
Host Read Commands: 2,072,503,288
Host Write Commands: 1,127,785,668
Controller Busy Time: 7,325
Power Cycles: 305
Power On Hours: 7,607
Unsafe Shutdowns: 142
Media and Data Integrity Errors: 0
Error Information Log Entries: 0
Warning Comp. Temperature Time: 485
Critical Comp. Temperature Time: 177
Thermal Temp. 1 Transition Count: 2819
Thermal Temp. 2 Transition Count: 13224
Thermal Temp. 1 Total Time: 43972
Thermal Temp. 2 Total Time: 501116
Error Information (NVMe Log 0x01, 16 of 256 entries)
No Errors Logged