What Can 49,056 Hard Drives Tell Us? Hard Drive Reliability Stats for Q3 2015

Q3 2015 Hard Drive Reliability Stats

For the most recent hard drive reliability statistics, as well as the raw hard drive test data, visit Hard Drive Data and Stats.

As of the end of Q3 2015, there were 50,228 drives spinning in the Backblaze data center. Subtracting boot drives, drive models with fewer than 45 drives, and drives in testing systems, we are publishing data on 49,056 hard drives spread across 26 different models, ranging from 1TB to 8TB in size.

What’s New for the Q3 2015 Results?

In this edition, we are publishing the data on our 1TB drives for the first time. The data was always available in the data files we publish on our Hard Drive Data webpage, but now we’re reporting it here too. We are also including “Average Drive Age” for each model, and we’ll summarize the data by manufacturer and by size as well.

Hard Drive Failure Rates

Let’s start by breaking down the drives by size and comparing them over time:

[Table: Hard drive failure rates by model, 2013 through Q3 2015]
There’s a lot going on in the chart above; here are a few things to help out:

  • The 2013, 2014, and 2015 failure rates are cumulative for the given year. In the case of 2015 that is through Q3 (September).
  • If the failure rate is listed as 0.00%, there were drives in use, but none of them failed during that period.
  • If the failure rate is blank, there were no drives in use during that period.
  • The “All Periods” failure rates are cumulative for all data (2013-Q3 2015).
  • The “Max # in Service” column is the maximum number of drives ever in service for the given hard drive model.
  • The “Avg Age (Months)” column is the average age of all the hard drives of the given hard drive model. This is based on SMART 9 (power-on hours) data.
  • If the “Avg Age (Months)” data is 0.0, the given drive model was not in service during 2015, making the value difficult to compute. (We’ll try to figure out a better way to compute this value by the next report.)
  • A note on the HGST (*) model name: we’ve been asked to use HGST in place of Hitachi and we are honoring that request, but these drives report their model as Hitachi and are listed as such in the data files.
  • The Low Rate and High Rate are the boundaries of the confidence interval for the failure rate listed; a sketch of how such an interval can be computed follows this list.
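For the curious, here is a minimal sketch (in Python) of one way to compute an annualized failure rate and a confidence interval from failure counts and drive-days. It assumes failures follow a Poisson process over the observed drive-days and uses the exact Poisson (Garwood) interval; the Low Rate/High Rate columns in the table may be computed differently, and the sample numbers below are made up for illustration.

```python
# Sketch: annualized failure rate and an exact Poisson confidence interval.
# Assumes failures follow a Poisson process over the observed drive-days;
# the table's Low Rate/High Rate may be computed differently.
from scipy.stats import chi2

def annual_failure_rate(failures, drive_days):
    """Failures per drive-year, expressed as a percentage."""
    return 100.0 * failures / (drive_days / 365.0)

def poisson_interval(failures, drive_days, alpha=0.05):
    """Exact (Garwood) confidence interval for the annualized rate."""
    drive_years = drive_days / 365.0
    low = chi2.ppf(alpha / 2, 2 * failures) / 2 if failures > 0 else 0.0
    high = chi2.ppf(1 - alpha / 2, 2 * failures + 2) / 2
    return 100.0 * low / drive_years, 100.0 * high / drive_years

# Made-up example: 4,500 drives running for a full year, 95 failures.
failures, drive_days = 95, 4500 * 365
print(f"AFR: {annual_failure_rate(failures, drive_days):.2f}%")  # ~2.11%
low, high = poisson_interval(failures, drive_days)
print(f"95% interval: {low:.2f}% to {high:.2f}%")
```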

If the chart is too much data all at once, you can download a ZIP file that, when unzipped, contains a Microsoft Excel file of the data from the chart, so you can parse the facts and figures at your leisure.

Some Observations Based on This Data

  • In the chart below are the failure stats for all of the drives in this review, broken down by year. We’ve also computed the average failure rate across all periods for all drives: 4.81%.

[Chart: Failure rates for all drives by year, with the all-period average of 4.81%]

  • The Western Digital 1TB drives in use are nearly six years old on average, and several have nearly seven years of service. It wasn’t until 2015 that their failure rate rose above the annual average for all drives. This makes sense given the “bathtub” curve of drive failure, where drives over four years old start to fail at a higher rate. Still, the WD 1TB drives have performed well for a long time.
  • Nearly all of the 1TB and 1.5TB drives were installed in Storage Pod 1.0 chassis. Yet, these two sizes have very different failure rates.
  • Nearly all of the 2TB and 3TB drives were installed in 2.0 chassis. Yet, these two drive sizes have very different failure rates.
  • Always consider the number of drives (Max # in Service) when looking at the failure rate. For example, the 1.5TB Seagate Barracuda Green drive has a failure rate of 130.9%, but that is based on only 51 drives. We tested these Seagate drives in one Storage Pod in our environment and they were not a good fit. In general, we’ve found it takes at least six Storage Pods’ worth of drives (270 drives) to get a good sense of how a given drive will perform in our environment.
  • 4TB drives, regardless of their manufacturer, are performing well. The 2.10% overall failure rate means that over the course of a year, we have to replace only one drive in a Storage Pod filled with these drives. In other words, on average, a Pod comes down for maintenance once a year due to drive failure. The math: 2% is one out of 50, and there are 45 drives in a Pod, so about once a year, one of those 45 drives, on average, will fail. Yes, the math is approximate, but you get the idea (it’s worked through in the sketch after this list).
  • 6TB drives, especially the Seagate drives, are also performing well, on par with the 4TB drives so far. The 6TB drives yield 270TB Storage Pods, 50% more storage at the same overall cost per gigabyte.
  • The 5TB and 8TB drives are performing well, but we only have 45 of each in testing. That is not enough to feel confident in the numbers yet, as can be seen in the wide confidence intervals (low rate/high rate) for these drives.
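To make the 4TB back-of-the-envelope math above concrete, here is the expected-failures arithmetic; the only inputs are the 2.10% rate and the 45-drive Pod size from this post.

```python
# Expected drive failures per Storage Pod per year for 4TB drives,
# using the 2.10% overall failure rate and 45 drives per Pod.
drives_per_pod = 45
annual_failure_rate = 0.021  # 2.10%

expected_failures = drives_per_pod * annual_failure_rate
print(f"Expected failures per Pod per year: {expected_failures:.2f}")  # ~0.95
# Roughly one drive replacement, and one maintenance visit, per Pod per year.
```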

Drive Failures by Manufacturer

Below is the chart of failure percentages by manufacturer. This is for all the drives in this analysis for the years noted:

[Chart: Hard drive failure rates by manufacturer, 2013 through Q3 2015]

Our Environment

All the hard drives in this review are used in the production systems in our data center. The environment is climate controlled and all drives are individually monitored. Each day we pull the available SMART stats reported by every drive. These stats are available for download from our Hard Drive Data webpage, and they form the basis for this blog post. The files are large, but if you give them a try, please let us know if you find anything interesting; a short sketch of one way to get started follows.
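The sketch below assumes the data files are daily CSV snapshots with one row per drive per day and columns including model, failure (0/1), and smart_9_raw (power-on hours); the directory name is hypothetical, so check the downloaded files for the actual layout.

```python
# Sketch: per-model failure rates and rough average ages from the
# published daily drive-stats CSVs. Assumes one row per drive per day
# with 'model', 'failure' (0/1), and 'smart_9_raw' (power-on hours)
# columns; the directory name below is hypothetical.
import glob
import pandas as pd

days = pd.concat(
    (pd.read_csv(path) for path in glob.glob("drive_stats/*.csv")),
    ignore_index=True,
)

# Each row is one drive-day, so rates fall out of simple group-bys.
per_model = days.groupby("model").agg(
    drive_days=("failure", "size"),
    failures=("failure", "sum"),
)
per_model["afr_pct"] = 100 * per_model["failures"] / (per_model["drive_days"] / 365)

# Rough average age in months from SMART 9 (power-on hours),
# averaged over all drive-day rows for each model.
per_model["avg_age_months"] = days.groupby("model")["smart_9_raw"].mean() / (24 * 30)

print(per_model.sort_values("afr_pct", ascending=False).head(10))
```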


About Andy Klein

Andy Klein is the Principal Cloud Storage Storyteller at Backblaze. He has over 25 years of experience in technology marketing and during that time, he has shared his expertise in cloud storage and computer security at events, symposiums, and panels at RSA, SNIA SDC, MIT, the Federal Trade Commission, and hundreds more. He currently writes and rants about drive stats, Storage Pods, cloud storage, and more.