We love big disks (the storage thread)

2»

Comments

  • @root said:
    Something is wrong with this forum. Every time I read this thread on front-page, I read it as "We love big dicks (the storage thread)".

    i'm more into tits tho. i mean tb's. :p

  • === START OF INFORMATION SECTION ===
    Model Family:     Crucial/Micron Client SSDs
    Device Model:     CT1000MX500SSD1
    Serial Number:    2338E8779233
    LU WWN Device Id: 5 00a075 1e8779233
    Firmware Version: M3CR046
    User Capacity:    1,000,204,886,016 bytes [1.00 TB]
    Sector Sizes:     512 bytes logical, 4096 bytes physical
    Rotation Rate:    Solid State Device
    Form Factor:      2.5 inches
    TRIM Command:     Available
    Device is:        In smartctl database 7.3/5319
    ATA Version is:   ACS-3 T13/2161-D revision 5
    SATA Version is:  SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is:    Fri Mar 27 11:17:55 2026 -05
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled
    
    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED
    See vendor-specific Attribute list for marginal Attributes.
    
    General SMART Values:
    Offline data collection status:  (0x80) Offline data collection activity
                        was never started.
                        Auto Offline Data Collection: Enabled.
    Self-test execution status:      (   0) The previous self-test routine completed
                        without error or no self-test has ever 
                        been run.
    Total time to complete Offline 
    data collection:        (    0) seconds.
    Offline data collection
    capabilities:            (0x7b) SMART execute Offline immediate.
                        Auto Offline data collection on/off support.
                        Suspend Offline collection upon new
                        command.
                        Offline surface scan supported.
                        Self-test supported.
                        Conveyance Self-test supported.
                        Selective Self-test supported.
    SMART capabilities:            (0x0003) Saves SMART data before entering
                        power-saving mode.
                        Supports SMART auto save timer.
    Error logging capability:        (0x01) Error logging supported.
                        General Purpose Logging supported.
    Short self-test routine 
    recommended polling time:    (   2) minutes.
    Extended self-test routine
    recommended polling time:    (  30) minutes.
    Conveyance self-test routine
    recommended polling time:    (   2) minutes.
    SCT capabilities:          (0x0031) SCT Status supported.
                        SCT Feature Control supported.
                        SCT Data Table supported.
    
    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   100   100   000    Pre-fail  Always       -       0
      5 Reallocate_NAND_Blk_Cnt 0x0032   100   100   010    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       17840
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       81
    171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
    172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
    173 Ave_Block-Erase_Count   0x0032   000   000   000    Old_age   Always       -       1002
    174 Unexpect_Power_Loss_Ct  0x0032   100   100   000    Old_age   Always       -       58
    180 Unused_Reserve_NAND_Blk 0x0033   000   000   000    Pre-fail  Always       -       41
    183 SATA_Interfac_Downshift 0x0032   100   100   000    Old_age   Always       -       0
    184 Error_Correction_Count  0x0032   100   100   000    Old_age   Always       -       0
    187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
    194 Temperature_Celsius     0x0022   076   045   000    Old_age   Always       -       24 (Min/Max 15/55)
    196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
    197 Current_Pending_ECC_Cnt 0x0032   100   100   000    Old_age   Always       -       0
    198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       1
    202 Percent_Lifetime_Remain 0x0030   000   000   001    Old_age   Offline  FAILING_NOW 100
    206 Write_Error_Rate        0x000e   100   100   000    Old_age   Always       -       0
    210 Success_RAIN_Recov_Cnt  0x0032   100   100   000    Old_age   Always       -       0
    246 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       253743124217
    247 Host_Program_Page_Count 0x0032   100   100   000    Old_age   Always       -       2740494855
    248 FTL_Program_Page_Count  0x0032   100   100   000    Old_age   Always       -       14721802449
    
    SMART Error Log Version: 1
    No Errors Logged
    
    SMART Self-test log structure revision number 1
    No self-tests have been logged.  [To run self-tests, use: smartctl -t]
    
    SMART Selective self-test log data structure revision number 1
     SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
        1        0        0  Not_testing
        2        0        0  Not_testing
        3        0        0  Not_testing
        4        0        0  Not_testing
        5        0        0  Completed [00% left] (0-65535)
    Selective self-test flags (0x0):
      After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.
    

    Percent_Lifetime_Remain FAILING_NOW.

    Should I discard this drive?

  • @imok said:
    SMART overall-health self-assessment test result: PASSED
    1 Raw_Read_Error_Rate 0x002f 100 100 000 Pre-fail Always - 0
    5 Reallocate_NAND_Blk_Cnt 0x0032 100 100 010 Old_age Always - 0
    172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0
    184 Error_Correction_Count 0x0032 100 100 000 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
    197 Current_Pending_ECC_Cnt 0x0032 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
    206 Write_Error_Rate 0x000e 100 100 000 Old_age Always - 0

    Looks like everything is dandy. And you have backups in the event that the entire PC catches fire, of course.

    Percent_Lifetime_Remain FAILING_NOW.

    Should I discard this drive?

    Why, because the manufacturer? If so, please send to my inbox. I will use it only to yabs.

  • It's part of a new array so it's a good "real world" test for a RAID failure and replace, which I haven't encountered before.

  • @imok said:
    ..9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 17840
    180 Unused_Reserve_NAND_Blk 0x0033 000 000 000 Pre-fail Always - 41
    202 Percent_Lifetime_Remain 0x0030 000 000 001 Old_age Offline FAILING_NOW 100

    This SMART looks a bit weird.

    You supposedly managed to write more data than this drive can sustain (how?). At the same time "Power on hours" is a perfect 100, while its raw value is 17000+.

    Also there are seemingly no spare blocks left (NAND began failing?), but the raw value is 41. Same is "Lifetime remaining" - it's zero, but the raw is 100.

    My only guess is that smartctl is unable to properly interpret some SMART attributes this drive returns.

    BF StorageAMD EPYC VDS (ref), up to 4TB NVMe / NVMe + big HDD VPS (ref) from $2.29/TB/mo / Storage/GPU dedis (ref) [Final stock, then price 🠙]

  • @imok said:
    Percent_Lifetime_Remain FAILING_NOW.

    Should I discard this drive?

    AI is very good at this kind of things. Copy paste it to your favorite AI to process the RAW data.

    Overall: the drive is in good condition, with normal wear and no immediate failure signs.

    Here’s the breakdown of the important parts:


    ✅ Health status

    • SMART overall result: PASSED
    • No read/write errors, no uncorrectable sectors, no reallocated blocks
    • Error log: empty

    👉 This is what you want to see — nothing critical is failing.


    🧠 Wear / lifespan

    • Percent_Lifetime_Remain = 100 (even though it shows FAILING_NOW — more on that below)
    • Ave_Block-Erase_Count = 1002

    For a Crucial MX500 1TB SSD, ~1000 erase cycles is moderate usage, not near end-of-life. These drives are typically rated for much higher endurance.

    👉 Realistically, this drive still has plenty of life left.


    ⏱ Usage

    • Power-on hours: 17,840 (~2 years of continuous uptime)
    • Power cycles: 81 (low, good)
    • Unexpected power losses: 58 (a bit high, but not catastrophic)

    👉 The power loss count suggests:

    • hard shutdowns
    • crashes
    • or power cuts

    Not ideal, but the drive hasn’t suffered damage from it (good sign).


    💾 NAND / reliability indicators

    • Reallocated blocks: 0
    • Program/erase failures: 0
    • Pending sectors: 0
    • Uncorrectable errors: 0

    👉 This is excellent — NAND is still healthy.


    🌡 Temperature

    • Current: 24°C
    • Max: 55°C

    👉 Completely normal.


    ⚠️ The weird part (important)

    202 Percent_Lifetime_Remain ... FAILING_NOW 100
    

    This looks scary but is actually a known quirk/bug on some Crucial/Micron firmware:

    • VALUE = 0, THRESH = 1 → triggers “FAILING_NOW”
    • RAW_VALUE = 100 → means 100% life remaining

    👉 In reality:
    This is NOT a failure. It’s just misinterpreted SMART formatting.


    🔧 Minor note

    • UDMA_CRC_Error_Count = 1

    👉 Usually caused by:

    • SATA cable hiccup
    • temporary connection issue

    Not a concern unless it keeps increasing.


    🧾 Verdict

    ✔ Healthy SSD
    ✔ No bad blocks or errors
    ✔ Moderate wear only
    ✔ Safe to keep using


    👍 Recommendations

    • Keep regular backups (standard practice)
    • Avoid hard power cuts if possible
    • Optionally run a test:

      smartctl -t long /dev/sdX
      

    If you want, I can estimate how much data you've written (TBW) and how that compares to the rated endurance — that gives an even clearer lifespan prediction.

    If you want information, feign ignorance reply with the wrong answer. Internet people will correct you ASAP!
    It’s OK if you disagree with me. I can’t force you to be right!

Sign In or Register to comment.