We love big disks (the storage thread)

2»

Comments

  • @root said:
    Something is wrong with this forum. Every time I read this thread on front-page, I read it as "We love big dicks (the storage thread)".

    i'm more into tits tho. i mean tb's. :p

  • === START OF INFORMATION SECTION ===
    Model Family:     Crucial/Micron Client SSDs
    Device Model:     CT1000MX500SSD1
    Serial Number:    2338E8779233
    LU WWN Device Id: 5 00a075 1e8779233
    Firmware Version: M3CR046
    User Capacity:    1,000,204,886,016 bytes [1.00 TB]
    Sector Sizes:     512 bytes logical, 4096 bytes physical
    Rotation Rate:    Solid State Device
    Form Factor:      2.5 inches
    TRIM Command:     Available
    Device is:        In smartctl database 7.3/5319
    ATA Version is:   ACS-3 T13/2161-D revision 5
    SATA Version is:  SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is:    Fri Mar 27 11:17:55 2026 -05
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled
    
    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED
    See vendor-specific Attribute list for marginal Attributes.
    
    General SMART Values:
    Offline data collection status:  (0x80) Offline data collection activity
                        was never started.
                        Auto Offline Data Collection: Enabled.
    Self-test execution status:      (   0) The previous self-test routine completed
                        without error or no self-test has ever 
                        been run.
    Total time to complete Offline 
    data collection:        (    0) seconds.
    Offline data collection
    capabilities:            (0x7b) SMART execute Offline immediate.
                        Auto Offline data collection on/off support.
                        Suspend Offline collection upon new
                        command.
                        Offline surface scan supported.
                        Self-test supported.
                        Conveyance Self-test supported.
                        Selective Self-test supported.
    SMART capabilities:            (0x0003) Saves SMART data before entering
                        power-saving mode.
                        Supports SMART auto save timer.
    Error logging capability:        (0x01) Error logging supported.
                        General Purpose Logging supported.
    Short self-test routine 
    recommended polling time:    (   2) minutes.
    Extended self-test routine
    recommended polling time:    (  30) minutes.
    Conveyance self-test routine
    recommended polling time:    (   2) minutes.
    SCT capabilities:          (0x0031) SCT Status supported.
                        SCT Feature Control supported.
                        SCT Data Table supported.
    
    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   100   100   000    Pre-fail  Always       -       0
      5 Reallocate_NAND_Blk_Cnt 0x0032   100   100   010    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       17840
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       81
    171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
    172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
    173 Ave_Block-Erase_Count   0x0032   000   000   000    Old_age   Always       -       1002
    174 Unexpect_Power_Loss_Ct  0x0032   100   100   000    Old_age   Always       -       58
    180 Unused_Reserve_NAND_Blk 0x0033   000   000   000    Pre-fail  Always       -       41
    183 SATA_Interfac_Downshift 0x0032   100   100   000    Old_age   Always       -       0
    184 Error_Correction_Count  0x0032   100   100   000    Old_age   Always       -       0
    187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
    194 Temperature_Celsius     0x0022   076   045   000    Old_age   Always       -       24 (Min/Max 15/55)
    196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
    197 Current_Pending_ECC_Cnt 0x0032   100   100   000    Old_age   Always       -       0
    198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       1
    202 Percent_Lifetime_Remain 0x0030   000   000   001    Old_age   Offline  FAILING_NOW 100
    206 Write_Error_Rate        0x000e   100   100   000    Old_age   Always       -       0
    210 Success_RAIN_Recov_Cnt  0x0032   100   100   000    Old_age   Always       -       0
    246 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       253743124217
    247 Host_Program_Page_Count 0x0032   100   100   000    Old_age   Always       -       2740494855
    248 FTL_Program_Page_Count  0x0032   100   100   000    Old_age   Always       -       14721802449
    
    SMART Error Log Version: 1
    No Errors Logged
    
    SMART Self-test log structure revision number 1
    No self-tests have been logged.  [To run self-tests, use: smartctl -t]
    
    SMART Selective self-test log data structure revision number 1
     SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
        1        0        0  Not_testing
        2        0        0  Not_testing
        3        0        0  Not_testing
        4        0        0  Not_testing
        5        0        0  Completed [00% left] (0-65535)
    Selective self-test flags (0x0):
      After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.
    

    Percent_Lifetime_Remain FAILING_NOW.

    Should I discard this drive?

  • @imok said:
    SMART overall-health self-assessment test result: PASSED
    1 Raw_Read_Error_Rate 0x002f 100 100 000 Pre-fail Always - 0
    5 Reallocate_NAND_Blk_Cnt 0x0032 100 100 010 Old_age Always - 0
    172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0
    184 Error_Correction_Count 0x0032 100 100 000 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0
    197 Current_Pending_ECC_Cnt 0x0032 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
    206 Write_Error_Rate 0x000e 100 100 000 Old_age Always - 0

    Looks like everything is dandy. And you have backups in the event that the entire PC catches fire, of course.

    Percent_Lifetime_Remain FAILING_NOW.

    Should I discard this drive?

    Why, because the manufacturer? If so, please send to my inbox. I will use it only to yabs.

  • It's part of a new array so it's a good "real world" test for a RAID failure and replace, which I haven't encountered before.

  • @imok said:
    ..9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 17840
    180 Unused_Reserve_NAND_Blk 0x0033 000 000 000 Pre-fail Always - 41
    202 Percent_Lifetime_Remain 0x0030 000 000 001 Old_age Offline FAILING_NOW 100

    This SMART looks a bit weird.

    You supposedly managed to write more data than this drive can sustain (how?). At the same time "Power on hours" is a perfect 100, while its raw value is 17000+.

    Also there are seemingly no spare blocks left (NAND began failing?), but the raw value is 41. Same is "Lifetime remaining" - it's zero, but the raw is 100.

    My only guess is that smartctl is unable to properly interpret some SMART attributes this drive returns.

    VDS & StorageBudget AMD EPYC VDS (ref) from $4/m / Big HDD VPS (ref) from $2/TB/mo / EPYC/Gold/Platinum dedis (ref) w/ GPU support

  • @imok said:
    Percent_Lifetime_Remain FAILING_NOW.

    Should I discard this drive?

    AI is very good at this kind of things. Copy paste it to your favorite AI to process the RAW data.

    Overall: the drive is in good condition, with normal wear and no immediate failure signs.

    Here’s the breakdown of the important parts:


    ✅ Health status

    • SMART overall result: PASSED
    • No read/write errors, no uncorrectable sectors, no reallocated blocks
    • Error log: empty

    👉 This is what you want to see — nothing critical is failing.


    🧠 Wear / lifespan

    • Percent_Lifetime_Remain = 100 (even though it shows FAILING_NOW — more on that below)
    • Ave_Block-Erase_Count = 1002

    For a Crucial MX500 1TB SSD, ~1000 erase cycles is moderate usage, not near end-of-life. These drives are typically rated for much higher endurance.

    👉 Realistically, this drive still has plenty of life left.


    ⏱ Usage

    • Power-on hours: 17,840 (~2 years of continuous uptime)
    • Power cycles: 81 (low, good)
    • Unexpected power losses: 58 (a bit high, but not catastrophic)

    👉 The power loss count suggests:

    • hard shutdowns
    • crashes
    • or power cuts

    Not ideal, but the drive hasn’t suffered damage from it (good sign).


    💾 NAND / reliability indicators

    • Reallocated blocks: 0
    • Program/erase failures: 0
    • Pending sectors: 0
    • Uncorrectable errors: 0

    👉 This is excellent — NAND is still healthy.


    🌡 Temperature

    • Current: 24°C
    • Max: 55°C

    👉 Completely normal.


    ⚠️ The weird part (important)

    202 Percent_Lifetime_Remain ... FAILING_NOW 100
    

    This looks scary but is actually a known quirk/bug on some Crucial/Micron firmware:

    • VALUE = 0, THRESH = 1 → triggers “FAILING_NOW”
    • RAW_VALUE = 100 → means 100% life remaining

    👉 In reality:
    This is NOT a failure. It’s just misinterpreted SMART formatting.


    🔧 Minor note

    • UDMA_CRC_Error_Count = 1

    👉 Usually caused by:

    • SATA cable hiccup
    • temporary connection issue

    Not a concern unless it keeps increasing.


    🧾 Verdict

    ✔ Healthy SSD
    ✔ No bad blocks or errors
    ✔ Moderate wear only
    ✔ Safe to keep using


    👍 Recommendations

    • Keep regular backups (standard practice)
    • Avoid hard power cuts if possible
    • Optionally run a test:

      smartctl -t long /dev/sdX
      

    If you want, I can estimate how much data you've written (TBW) and how that compares to the rated endurance — that gives an even clearer lifespan prediction.

    I speak fluent sarcasm and broken logic. | I would agree with you, but thæn we’d both be wrong.

Sign In or Register to comment.