I think it's helpful to put on our statistics hats when looking at data like this... We have some observed values and a number of available covariates which, perhaps, help explain the observed variability. Some legitimate sources of variation (e.g., proximity to cooling in the NFS box, whether the hard drive was dropped as a child, stray cosmic rays) will remain obscured to us - we cannot fully explain all the variation. But when we average over more instances, those unexplainable sources of variation are captured as a residual to the explanations we can make, given the available covariates. The averaging acts as a kind of low-pass filter over the data, which helps reveal meaningful trends.
Meanwhile, if we slice the data up three ways to hell and back, /all/ we see is unexplainable variation - every point is unique.
This is where PCA is helpful - given our set of covariates, what combination of variables best explains the variation, and how much of the residual remains? If there's a lot of residual, we should look for other covariates. If it's a tiny residual, we don't care, and can work on optimizing the known major axes.
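A minimal sketch of what I mean, in Python - the file name and covariate columns here are made up (not Backblaze's actual schema); it just runs PCA over a standardized covariate matrix and reports how much variance the leading components capture, with the remainder as residual:

    # Sketch: how much of the covariate variation do a few components capture?
    # File and column names are hypothetical, not Backblaze's real schema.
    import numpy as np
    import pandas as pd
    from sklearn.decomposition import PCA
    from sklearn.preprocessing import StandardScaler

    df = pd.read_csv("drive_stats.csv")  # hypothetical input
    covariates = ["age_days", "power_on_hours", "avg_temp_c", "workload_tb_written"]
    X = StandardScaler().fit_transform(df[covariates])

    pca = PCA().fit(X)
    cumulative = np.cumsum(pca.explained_variance_ratio_)
    for k, frac in enumerate(cumulative, start=1):
        print(f"first {k} components explain {frac:.1%} of the variance")
    # A large leftover (1 - cumulative) after a few components suggests
    # the covariates we have aren't the whole story.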
Exactly. I used to pore over the Backblaze data but so much of it is in the form of “we got 1,200 drives four months ago and so far none have failed”. That is a relatively small number over a small amount of time.
On top of that it seems like by the time there is a clear winner for reliability, the manufacturer no longer makes that particular model and the newer models are just not a part of the dataset yet. Basically, you can’t just go “Hitachi good, Seagate bad”. You have to look at specific models and there are what? Hundreds? Thousands?
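As a rough illustration of the first point: the rule of three says that zero failures out of n drives gives an approximate 95% upper bound of 3/n on the failure probability over the observation window, which annualized is still in the same ballpark as ordinary drive failure rates - so "none have failed yet" barely narrows anything down. A quick back-of-the-envelope, using only the numbers from the comment above:

    # Rule of three: with 0 failures observed among n drives over some window,
    # an approximate 95% upper bound on the per-drive failure probability is 3/n.
    n_drives = 1200
    months_observed = 4

    upper_bound_window = 3 / n_drives                                   # ~0.25% over 4 months
    upper_bound_annualized = upper_bound_window * 12 / months_observed  # ~0.75%/yr, crudely scaled

    print(f"95% upper bound: {upper_bound_window:.2%} over {months_observed} months, "
          f"roughly {upper_bound_annualized:.2%} annualized")
    # Annualized failure rates for healthy models are often quoted around 1%,
    # so this observation can't separate a good model from a mediocre one yet.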
> On top of that it seems like by the time there is a clear winner for reliability, the manufacturer no longer makes that particular model and the newer models are just not a part of the dataset yet.
That's how things work in general. Even if it is the same model, the parts have likely changed anyway. For data storage, you can expect all devices to fail, so redundancy and backup plans are key, and once you have that set, reliability is mostly just an input into your cost calculations. (Ideally you also do something to mitigate correlated failures from bad manufacturing or bad firmware.)
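e.g. a back-of-the-envelope version of that cost input, with every number made up:

    # Hypothetical numbers: once redundancy is in place, treat the failure
    # rate purely as an expected-cost input.
    n_drives = 100
    afr = 0.015              # assumed 1.5% annualized failure rate
    drive_price = 250        # assumed replacement cost per drive, USD
    labor_per_swap = 50      # assumed handling cost per failure, USD

    expected_failures_per_year = n_drives * afr
    expected_annual_cost = expected_failures_per_year * (drive_price + labor_per_swap)
    print(f"~{expected_failures_per_year:.1f} failures/yr, ~${expected_annual_cost:.0f}/yr expected")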
"Actually HGST was better on average than WD"is probably about the only kind of conclusion you can make. As you have noted, looking at specific models doesn't get you anything useful because by the time you have enough data the model is already replaced by a different one - but you can make out trends for manufacturers.
> if we slice the data up three ways to hell and back, /all/ we see is unexplainable variation
It's certainly true that you can go too far, but this is a case where we know a priori that the manufacture date could be biasing the numbers they're showing: the estimated failure rate at 5 years cannot include data from any drive newer than 2020, whereas the failure rate at 1 year can. At a minimum you might want to exclude newer drives from the analysis - e.g., exclude anything made after 2020 - if you want to draw conclusions about how the failure rate changes up to the 5-year mark.
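Something like this, assuming a hypothetical per-drive table with manufacture dates and observed drive-years (not Backblaze's exact schema):

    # Restrict the cohort so every drive could in principle have reached 5 years,
    # avoiding the bias where only pre-2020 drives contribute to the 5-year estimate.
    import pandas as pd

    df = pd.read_csv("drive_days.csv", parse_dates=["mfg_date"])  # hypothetical file/columns
    cohort = df[df["mfg_date"] < "2020-01-01"]

    # Failure rate by year of life, computed only within the consistent cohort
    by_year = cohort.groupby("age_years").agg(
        failures=("failed", "sum"),
        drive_years=("observed_years", "sum"),
    )
    by_year["annualized_failure_rate"] = by_year["failures"] / by_year["drive_years"]
    print(by_year)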