Hard Drive Test Data

Since 2013, Backblaze has collected, curated, and published the annualized failure rates (AFR) and related statistics from the hard disk drives (HDDs) and solid state drives (SSDs) in our data centers. This collection is the Backblaze Drive Stats dataset. Each quarter we publish updates to the dataset, which is open source and can be downloaded using the links in the “Downloading the Drive Stats dataset” section below.

Drive Stats Q1 2025 Snapshot

Drive count: 308,861
Drive failures: 1,064
Drive days: 27,388,225

Drive population by manufacturer

[Chart: drive population for HGST, Seagate, Toshiba, and WDC]

Drive reliability: annualized failure rates (AFR)

Period               Drive days     Drives failed   AFR
Quarterly: Q1 2025   27,388,225     1,064           1.42%
Annual: 2024         101,906,290    4,372           1.57%
Lifetime             452,991,106    16,388          1.32%
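
The AFR figures above can be reproduced from the drive days and failure counts. This page does not state the formula, so the sketch below assumes AFR = failures / (drive days / 365) × 100, that is, failures per drive-year expressed as a percentage. The function name is illustrative only.

```python
def annualized_failure_rate(failures: int, drive_days: int) -> float:
    """Return AFR as a percentage (assumed formula: failures per drive-year)."""
    if drive_days <= 0:
        raise ValueError("drive_days must be positive")
    drive_years = drive_days / 365  # one drive running for 365 days = 1 drive-year
    return failures / drive_years * 100


# (failures, drive days) taken from the table above
periods = {
    "Q1 2025": (1_064, 27_388_225),
    "2024": (4_372, 101_906_290),
    "Lifetime": (16_388, 452_991_106),
}

for name, (failures, drive_days) in periods.items():
    print(f"{name}: {annualized_failure_rate(failures, drive_days):.2f}%")
# Q1 2025: 1.42%
# 2024: 1.57%
# Lifetime: 1.32%
```

Under this assumption the computed values match the published 1.42%, 1.57%, and 1.32%.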

Drive Stats related podcasts and webinars

StorageReview: Live with Backblaze and Their Latest Drive Stats Report (May 2024)

Sharing insights live from the StorageReview lab

BrightTalk: Predicting Hard Drive Failure Rates with AI (September 2024)

A session from Backblaze Tech Day 2024

Drive Stats quarterly reports and related articles

We regularly publish analyses, observations, and insights based on the Drive Stats dataset on the Backblaze Blog. These include the quarterly Hard Drive Stats reports and SSD reports, as well as related topics such as the cost of storage, the bathtub curve and hard drives, and more.


Overview of the Drive Stats dataset

How we collect the data

Each day at each Backblaze data center, we take a snapshot of each operational drive. This snapshot includes basic drive information along with the S.M.A.R.T. statistics reported by that drive. The daily snapshot of one drive is one record, or row, of data. All of the drive snapshots for a given day are collected into a single file with one row for each active drive. The file is in .csv (comma-separated values) format and is named YYYY-MM-DD.csv, for example, 2024-03-25.csv.
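
As a rough illustration of the daily file layout, the sketch below builds a file name in the YYYY-MM-DD.csv format and summarizes one day's snapshot. The sample rows are made up, and the column names (date, serial_number, model, failure) are assumptions for illustration; check the published schema file for the actual fields.

```python
import csv
import io
from datetime import date


def snapshot_filename(day: date) -> str:
    """Build the daily file name in the YYYY-MM-DD.csv format."""
    return f"{day.isoformat()}.csv"


# Tiny made-up sample standing in for one daily file.
# Column names here are assumptions, not the confirmed schema.
SAMPLE = """date,serial_number,model,failure
2024-03-25,SN0001,MODEL-A,0
2024-03-25,SN0002,MODEL-A,1
2024-03-25,SN0003,MODEL-B,0
"""


def summarize_day(csv_text: str) -> tuple[int, int]:
    """Return (drive_days, failures) for one daily snapshot."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    drive_days = len(rows)  # one row = one drive observed for one day
    failures = sum(int(row["failure"]) for row in rows)
    return drive_days, failures


print(snapshot_filename(date(2024, 3, 25)))  # 2024-03-25.csv
print(summarize_day(SAMPLE))                 # (3, 1)
```

Because each row is one drive on one day, the row count of a daily file is that day's contribution to the drive days total.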


How the data is organized

The Drive Stats schema comprises the fields Backblaze includes for each drive record, plus the raw and normalized S.M.A.R.T. attributes reported by each drive.

Download a .csv file of the current Drive Stats schema: https://f001.backblazeb2.com/file/Backblaze-Hard-Drive-Data/Drive_Stats_Schema_Current.csv
Download a .csv file of the Drive Stats schema changes from Q1 2018 onward: https://f001.backblazeb2.com/file/Backblaze-Hard-Drive-Data/Drive_Stats_Schema_2018_Onward.csv

Please note that the schema can change from quarter to quarter, so check for changes each quarter and align your data accordingly.
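
One way to handle such changes is to compare the header rows of files from two quarters and re-key older rows to the newer column list. The sketch below is only an illustration: the helper names are hypothetical, and the sample columns (such as smart_2_raw) are made up rather than taken from the actual schema.

```python
import csv
import io


def read_header(csv_text: str) -> list[str]:
    """Return the column names from the first line of a CSV."""
    return next(csv.reader(io.StringIO(csv_text)))


def schema_diff(old: list[str], new: list[str]) -> tuple[list[str], list[str]]:
    """Return (added, removed) column names, preserving file order."""
    added = [c for c in new if c not in old]
    removed = [c for c in old if c not in new]
    return added, removed


def align_rows(csv_text: str, columns: list[str]) -> list[dict[str, str]]:
    """Re-key every row to `columns`, filling missing fields with ''."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [{c: row.get(c, "") for c in columns} for row in reader]


# Made-up headers and rows for two consecutive quarters.
OLD = "date,serial_number,smart_1_raw\n2024-12-31,SN0001,0\n"
NEW = "date,serial_number,smart_1_raw,smart_2_raw\n2025-01-01,SN0001,0,7\n"

old_cols, new_cols = read_header(OLD), read_header(NEW)
print(schema_diff(old_cols, new_cols))  # (['smart_2_raw'], [])
print(align_rows(OLD, new_cols))
```

Filling missing fields with an empty string keeps older rows loadable alongside newer ones; whether empty, null, or dropped is the right choice depends on your analysis.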


How you can use the data

The Drive Stats dataset is open source and available for you to download below. All we ask is the following:

you cite Backblaze as the source if you use the data,
you accept that you are solely responsible for how you use the data,
you may sell derivative works based on the data, but
you cannot sell the data itself to anyone; it is free.

Downloading the Drive Stats dataset

Beginning in 2016, we have uploaded the Drive Stats dataset quarterly. Prior to 2016, the uploaded datasets were annual (2013, 2014, and 2015). Each item listed below is a ZIP file containing the .csv files for the named quarter or year.

2025 Data, Q1

1.10GB Zip file, 10.79GB on Disk, 92 files
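
Since a quarter expands to roughly ten times its compressed size, it can be convenient to read the .csv files directly from the ZIP file without extracting them. The following is a minimal sketch using Python's standard library. The function name is hypothetical, the internal layout of the archive is not described on this page, and the demo uses a small in-memory archive with made-up rows instead of a real download.

```python
import csv
import io
import zipfile
from typing import BinaryIO, Iterator, Union


def iter_rows(archive: Union[str, BinaryIO]) -> Iterator[dict[str, str]]:
    """Yield every row from every .csv member of a ZIP archive."""
    with zipfile.ZipFile(archive) as zf:
        for name in sorted(zf.namelist()):
            if not name.lower().endswith(".csv"):
                continue  # skip folders and any non-CSV members
            with zf.open(name) as raw:
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                yield from csv.DictReader(text)


# Demo with an in-memory archive so the sketch runs without a download.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as zf:
    zf.writestr("2025-01-01.csv", "date,serial_number\n2025-01-01,SN0001\n")
    zf.writestr(
        "2025-01-02.csv",
        "date,serial_number\n2025-01-02,SN0001\n2025-01-02,SN0002\n",
    )
buffer.seek(0)

rows = list(iter_rows(buffer))
print(len(rows))  # 3
```

For a downloaded quarter, the same function can be given a path on disk in place of the in-memory buffer. Streaming rows this way avoids holding a full quarter in memory at once.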
