{"id":93200,"date":"2019-10-22T08:18:56","date_gmt":"2019-10-22T15:18:56","guid":{"rendered":"https:\/\/www.backblaze.com\/blog\/?p=93200"},"modified":"2025-12-14T15:45:13","modified_gmt":"2025-12-14T23:45:13","slug":"smart-stats-exposed-a-drive-stats-remix","status":"publish","type":"post","link":"https:\/\/www.backblaze.com\/blog\/smart-stats-exposed-a-drive-stats-remix\/","title":{"rendered":"SMART Stats Exposed \u2014 a Drive Stats Remix"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-93217 size-full\" title=\"SMART Stats Exposed -- a Drive Stats Remix\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/header-stats.jpg\" alt=\"SMART Stats On Trial\" width=\"1440\" height=\"820\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg 1440w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats-300x171.jpg 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats-1024x583.jpg 1024w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats-768x437.jpg 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats-560x319.jpg 560w\" sizes=\"auto, (max-width: 1440px) 100vw, 1440px\" \/><\/p>\n<p><strong>Editor\u2019s Note:<\/strong>\u00a0 Since 2013, Backblaze has <a href=\"\/blog\/category\/cloud-storage\/hard-drive-stats\/\" target=\"_blank\" rel=\"noopener noreferrer\">published statistics and insights<\/a> based on the hard drives in our data centers. Why? Well, we like to be helpful, and we thought sharing would help others who rely on hard drives, but don\u2019t have reliable data on performance to make informed purchasing decisions. We also hoped the data might aid manufacturers in improving their products. Given the millions of people who\u2019ve read our Hard Drive Stats posts and the increasingly collaborative relationships we have with manufacturers, it seems we might have been right.<\/p>\n<p>But we don\u2019t only share our take on the numbers, we also provide the raw data underlying our reports so that anyone who wants to can reproduce them or draw their own conclusions, and many have. We love it when people reframe our reports, question our logic (maybe even our sanity?), and provide their own take on what we should do next. That\u2019s why we\u2019re featuring Ryan Smith today.<\/p>\n<p>Ryan has held a lot of different roles in tech, but lately he\u2019s been dwelling in the world of storage as a product strategist for Hitachi. On a personal level, he explains that he has, \u201cpassion for data, finding insights from data, and helping others see how easy and rewarding it can be to look under the covers.\u201d It shows.<\/p>\n<p>A few months ago we happened on a <a href=\"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/\" target=\"_blank\" rel=\"noopener noreferrer\">post by Ryan<\/a> with an appealing header featuring our logo with an <span style=\"color: red; font-weight: bold;\">EXPOSED<\/span> stamp superimposed in red over our humble name. It looked like we had been caught in a sting operation. As a company that loves <a href=\"\/blog\/transparency-in-business\/\" target=\"_blank\" rel=\"noopener noreferrer\">transparency<\/a>, we were <em>delighted<\/em>. Reading on, we found a lot to love and plenty to argue over, but more than anything, we appreciated how Ryan took data we use to analyze hard drive failure rates and extrapolated out all sorts of other gleanings about our business. As he puts it, \u201cit\u2019s not the value at the surface but the story that can be told by tying data together.\u201d So, we thought we\u2019d share his original post with you to (hopefully) incite some more arguments and some more tying together of data.<\/p>\n<p>While we think his conclusions are reasonable based on the data available to him, the views and analysis below are entirely Ryan&#8217;s. We appreciate how he flagged some areas of uncertainty, but thought it most interesting to share his thoughts without rebuttal. If you\u2019re curious about how he reached them, you can find his <a href=\"https:\/\/www.soothsawyer.com\/how-i-analyzed-backblaze-smart-data\/\" target=\"_blank\" rel=\"noopener noreferrer\">notes on process<\/a> here. He doesn\u2019t have the full story, but we think he did amazing work with the public data.<\/p>\n<p>Our 2019 Q3 Hard Drive Stats post will be out in a few weeks, and we hope some of you will take Ryan\u2019s lead and do your own deep dive into the reporting when it\u2019s public. For those of you who can\u2019t wait, we\u2019re hoping this will tide you over for a little while.<\/p>\n<p>If you&#8217;re interested in taking a look at the data yourselves, here&#8217;s our <a href=\"https:\/\/www.backblaze.com\/cloud-storage\/resources\/hard-drive-test-data\" target=\"_blank\" rel=\"noopener noreferrer\">Hard Drive Data and Stats<\/a> webpage that has links to all our past Hard Drive Stats posts and zip files of the raw data.<\/p>\n<hr style=\"border: 0; height: 2px; background-image: linear-gradient(to right, rgba(0, 0, 0, 0), rgba(0, 0, 0, 0.75), rgba(0, 0, 0, 0)); max-width: 95%; margin: 36px auto;\" \/>\n<h2>Ryan Smith Uses Backblaze\u2019s SMART Stats to Illustrate the Power of Data<\/h2>\n<p>(Originally published July 8, 2019 on <a href=\"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/\" target=\"_blank\" rel=\"noopener noreferrer\">Soothsawyer.com<\/a>.)<\/p>\n<p id=\"bzdropcap\">It is now common practice for end-customers to share telemetry (call home) data with their vendors. My analysis below shares some insights about <strong>your<\/strong> business that vendors might gain from seemingly innocent data that you are sending them every day.<\/p>\n<p>On a daily basis, <a href=\"https:\/\/www.backblaze.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Backblaze (a cloud backup and storage provider)<\/a>, logs all its drive health data (aka SMART data) for over 100,000 of its hard drives. With 100K+ records a day, each year can produce over 30 million records. They share this raw data on their website, but most people probably don\u2019t really dig into it much. I decided to see what this data could tell me and what I found was fascinating.<\/p>\n<p>Rather than looking at nearly 100 million records, I decided to only look at just over one million which consisted of the last day of every quarter from Q1\u201916 to Q1\u201919. This would give me enough granularity to see what is happening inside Backblaze\u2019s cloud backup storage business. For those interested, I used MySQL to import and transform the data into something easy to work with (<a href=\"https:\/\/www.soothsawyer.com\/2019\/07\/08\/how-i-analyzed-backblaze-smart-data\/\" target=\"_blank\" rel=\"noopener noreferrer\">see more details on my SQL query<\/a>); I then imported the data into Excel where I could easily pivot the data and look for insights. Below are the results of this effort.<\/p>\n<h2>Capacity Growth<\/h2>\n<h3>User Data vs Physical Capacity<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/01_Backblaze_user_data_stored_vs_physical_capacity.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93220\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/01_Backblaze_user_data_stored_vs_physical_capacity.png\" alt=\"User Data Stored vs Physical Capacity\" width=\"972\" height=\"705\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/01_Backblaze_user_data_stored_vs_physical_capacity.png 972w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/01_Backblaze_user_data_stored_vs_physical_capacity-300x218.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/01_Backblaze_user_data_stored_vs_physical_capacity-768x557.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/01_Backblaze_user_data_stored_vs_physical_capacity-560x406.png 560w\" sizes=\"auto, (max-width: 972px) 100vw, 972px\" \/><\/a><\/p>\n<p>I grabbed the publicly posted \u201cPetabytes stored\u201d that BackBlaze claims on their website (\u201cUser Petabytes\u201d) and compared that to the total capacity from the SMART data they log (\u201cPhysical Petabytes\u201d) and then compared them against each other to see how much overhead or unused capacity they have. The Theoretical Max (green line) is based on their ECC protection scheme (13+2 and\/or 17+3) that they use to protect user data. If the \u201c% User Petabytes\u201d is below that max then this means Backblaze either has unused capacity or they didn\u2019t update their website with the actual data stored.<\/p>\n<h3>Data Read\/Written vs Capacity Growth<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/02_Backblaze_ReadsWrites_vs_CapacityGrowth_YoY.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93222\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/02_Backblaze_ReadsWrites_vs_CapacityGrowth_YoY.png\" alt=\"Reads\/Writes versus Capacity Growth (Year-over-Year)\" width=\"991\" height=\"720\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/02_Backblaze_ReadsWrites_vs_CapacityGrowth_YoY.png 991w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/02_Backblaze_ReadsWrites_vs_CapacityGrowth_YoY-300x218.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/02_Backblaze_ReadsWrites_vs_CapacityGrowth_YoY-768x558.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/02_Backblaze_ReadsWrites_vs_CapacityGrowth_YoY-560x407.png 560w\" sizes=\"auto, (max-width: 991px) 100vw, 991px\" \/><\/a><\/p>\n<p>Looking at the last two years, by quarter, you can see a healthy amount of year-over-year growth in their write workload; roughly 80% over the last four quarters! This is good since writes likely correlate with new user data, which means broader adoption of their offering. For some reason their read workloads spiked in Q2\u201917 and have maintained a higher read workload since then (as indicated by the YoY spikes from Q2\u201917 to Q1\u201918, and then settling back to less than 50% YoY since); my guess is this was likely driven by a change to their internal workload rather than a migration because I didn\u2019t see subsequent negative YoY reads.<\/p>\n<h2>Performance<\/h2>\n<p>Now let\u2019s look at some performance insights. A quick note: Only Seagate hard drives track the needed information in their SMART data in order to get insights about performance. Fortunately, roughly 80% of Backblaze\u2019s drive population (both capacity and units) are Seagate so it\u2019s a large enough population to represent the overall drive population. Going forward, it does look like the new 12 TB WD HGST drive is starting to track bytes read\/written.<\/p>\n<h3>Pod (Storage Enclosure) Performance<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/04_Backblaze_Pod_Hard_Drive_Enclosure_Performance.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93223\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/04_Backblaze_Pod_Hard_Drive_Enclosure_Performance.png\" alt=\"Pod (Hard Drive Enclosure) Performance\" width=\"985\" height=\"699\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/04_Backblaze_Pod_Hard_Drive_Enclosure_Performance.png 985w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/04_Backblaze_Pod_Hard_Drive_Enclosure_Performance-300x213.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/04_Backblaze_Pod_Hard_Drive_Enclosure_Performance-768x545.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/04_Backblaze_Pod_Hard_Drive_Enclosure_Performance-560x397.png 560w\" sizes=\"auto, (max-width: 985px) 100vw, 985px\" \/><\/a><\/p>\n<p>Looking at Power-on-hours of each drive, I was able to calculate the vintage of each drive and the number of drives in each \u201cpod\u201d (this is the terminology that Backblaze gives to its <a href=\"https:\/\/www.backblaze.com\/cloud-storage\/resources\/storage-pod\" target=\"_blank\" rel=\"noopener noreferrer\">storage enclosures<\/a>). This lets me calculate the number of pods that Backblaze has in its data centers. Their original pods stored 45 drives and this improved to 60 drives in ~Q2\u201916 (according to <a href=\"\/blog\/category\/cloud-storage\/storage-pod\/\" target=\"_blank\" rel=\"noopener noreferrer\">past blog posts by Backblaze<\/a>). The power-on-date allowed me to place the drive into the appropriate enclosure type and provide you with pod statistics like the Mbps per pod. This is definitely an educated guess as some newer vintage drives are replacement drives into older enclosures but the overall percentage of drives that fail is low enough to where these figures should be pretty accurate.<\/p>\n<p><a href=\"\/blog\/vault-cloud-storage-architecture\/\" target=\"_blank\" rel=\"noopener noreferrer\">Backblaze has stated that they can achieve up to 1 Gbps per pod<\/a>, but as you can see they are only reaching an average throughput of 521 Mbps. I have to admit I was surprised to see such a low performance figure since I believe their storage servers are equipped with 10 Gbps ethernet.<\/p>\n<p>Overall, Backblaze\u2019s data centers are handling over 100 GB\/s of throughput across all their pods, which is quite an impressive figure. This number keeps climbing and is a result of new pods as well as overall higher performance per pod. From quick research, this is across three different data centers (Sacramento x 2, Phoenix x 1) and maybe a fourth on its way in Europe.<\/p>\n<h3>Hard Drive Performance<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/03_Backblaze_HardDrive_Read_Write_Performance.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93224\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/03_Backblaze_HardDrive_Read_Write_Performance.png\" alt=\"Hard Drive Read\/Write Performance\" width=\"1020\" height=\"690\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/03_Backblaze_HardDrive_Read_Write_Performance.png 1020w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/03_Backblaze_HardDrive_Read_Write_Performance-300x203.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/03_Backblaze_HardDrive_Read_Write_Performance-768x520.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/03_Backblaze_HardDrive_Read_Write_Performance-560x379.png 560w\" sizes=\"auto, (max-width: 1020px) 100vw, 1020px\" \/><\/a><\/p>\n<p>Since each pod holds between 45 and 60 drives, with an overall max pod performance of 1 Gbps, I wasn\u2019t surprised to see such average low drive performance. You can see that Backblaze\u2019s workload is read heavy with less than 1 MB\/s and writes only a third of that. Just to put that in perspective, these drives can deliver over 100 MB\/s, so Backblaze is not pushing the limits of these hard drives.<\/p>\n<p>As discussed earlier, you can also see how the read workload changed significantly in Q2\u201917 and has not reverted back since.<\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93225\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance.png\" alt=\"Seagate Hard Drive Read\/Write Performance, by Density\" width=\"1126\" height=\"587\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance.png 1126w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance-300x156.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance-1024x534.png 1024w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance-768x400.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/05_Backblaze_Seagate_Hard_Drive_Performance-560x292.png 560w\" sizes=\"auto, (max-width: 1126px) 100vw, 1126px\" \/><\/a><\/p>\n<p>As I expected, the read and write performance is highly correlated to the drive capacity point. So, it appears that most of the growth in read\/write performance per drive is really driven by the adoption of higher density drives. This is very typical of public storage-as-a-service (STaaS) offerings where it\u2019s really about $\/GB, IOPS\/GB, MBs\/GB, etc.<\/p>\n<p>As a side note, the black dashed lines (average between all densities) should correlate with the previous chart showing overall read\/write performance per drive.<\/p>\n<h2>Purchasing<\/h2>\n<p>Switching gears, let\u2019s look at Backblaze\u2019s purchasing history. This will help suppliers look at trends within Backblaze to predict future purchasing activities. I used power-on-hours to calculate when a drive entered the drive population.<\/p>\n<h3>Hard Drives Purchased by Density, by Year<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/06_Backblaze_HardDrive_Purchased_by_Capacity.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93226\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/06_Backblaze_HardDrive_Purchased_by_Capacity.png\" alt=\"Hard Drives Purchased by Capacity\" width=\"983\" height=\"712\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/06_Backblaze_HardDrive_Purchased_by_Capacity.png 983w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/06_Backblaze_HardDrive_Purchased_by_Capacity-300x217.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/06_Backblaze_HardDrive_Purchased_by_Capacity-768x556.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/06_Backblaze_HardDrive_Purchased_by_Capacity-560x406.png 560w\" sizes=\"auto, (max-width: 983px) 100vw, 983px\" \/><\/a><\/p>\n<p>This chart helps you see how Backblaze normalized on 4 TB, 8 TB, and now 12 TB densities. The number of drives that Backblaze purchases every year has been climbing until 2018 where it saw its first decline in units. However, this is mainly due to the efficiencies of the capacity per drive.<\/p>\n<p>A question to ponder: Did 2018 reach a point where <strong>capacity<\/strong> growth per HDD surpassed the actual demand required to maintain <strong>unit<\/strong> growth of HDDs? Or is this trend limited to Backblaze?<\/p>\n<h3>Petabytes Purchased by Quarter<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/07_Backblaze_HardDrive_Petabytes_Units_Purchased_by_Quarter.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93227\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/07_Backblaze_HardDrive_Petabytes_Units_Purchased_by_Quarter.png\" alt=\"Drives\/Petabytes Purchased, by Quarter\" width=\"977\" height=\"702\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/07_Backblaze_HardDrive_Petabytes_Units_Purchased_by_Quarter.png 977w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/07_Backblaze_HardDrive_Petabytes_Units_Purchased_by_Quarter-300x216.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/07_Backblaze_HardDrive_Petabytes_Units_Purchased_by_Quarter-768x552.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/07_Backblaze_HardDrive_Petabytes_Units_Purchased_by_Quarter-560x402.png 560w\" sizes=\"auto, (max-width: 977px) 100vw, 977px\" \/><\/a><\/p>\n<p>This looks at the number of drives purchased over the last five years, along with the amount of capacity added. It\u2019s not quite regular enough to spot a trend, but you can quickly spot that the amount of capacity purchased over the last two years has grown dramatically compared to previous years.<\/p>\n<h3>HDD Vendor Market Share<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93228\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare.png\" alt=\"Hard Drive Supplier Market Share\" width=\"1075\" height=\"691\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare.png 1075w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare-300x193.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare-1024x658.png 1024w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare-768x494.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/08_Backblaze_HardDrive_Supplier_MarketShare-560x360.png 560w\" sizes=\"auto, (max-width: 1075px) 100vw, 1075px\" \/><\/a><\/p>\n<h4>Western Digital\/WDC, Toshiba\/TOSYY, Seagate\/STX<\/h4>\n<p>Seagate is definitely the preferred vendor, capturing almost 100% of the market share save for a few quarters where WD HGST wins 50% of the business. This information could be used by Seagate or its competitors to understand where it stands within the account for future bids. However, the industry is monopolistic so it\u2019s not hard to guess who won the business if a given HDD vendor didn\u2019t.<\/p>\n<h2>Drives<\/h2>\n<h3>Drive Population by Quarter<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/09_Backblaze_HardDrive_Population_by_Quarter.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93229\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/09_Backblaze_HardDrive_Population_by_Quarter.png\" alt=\"Total Drive Population, by Quarter\" width=\"979\" height=\"715\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/09_Backblaze_HardDrive_Population_by_Quarter.png 979w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/09_Backblaze_HardDrive_Population_by_Quarter-300x219.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/09_Backblaze_HardDrive_Population_by_Quarter-768x561.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/09_Backblaze_HardDrive_Population_by_Quarter-560x409.png 560w\" sizes=\"auto, (max-width: 979px) 100vw, 979px\" \/><\/a><\/p>\n<p>This shows the total drive population over the past three years. Even though the number of drives being <strong>purchased<\/strong> has been falling lately, the overall drive <strong>population<\/strong> is still growing.<\/p>\n<p>You can quickly see that 4 TB drives saw its peak population in Q1\u201917 and has rapidly declined. In fact, let\u2019s look at the same data but with a different type of chart.<\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/10_Backblaze_HardDrive_Population_by_Quarter_line.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93230\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/10_Backblaze_HardDrive_Population_by_Quarter_line.png\" alt=\"Total Drive Population, by Quarter\" width=\"974\" height=\"721\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/10_Backblaze_HardDrive_Population_by_Quarter_line.png 974w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/10_Backblaze_HardDrive_Population_by_Quarter_line-300x222.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/10_Backblaze_HardDrive_Population_by_Quarter_line-768x569.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/10_Backblaze_HardDrive_Population_by_Quarter_line-560x415.png 560w\" sizes=\"auto, (max-width: 974px) 100vw, 974px\" \/><\/a><\/p>\n<p>That\u2019s better. We can see that 12 TBs really had a dramatic effect on both 4 TB and 8 TB adoption. In fact, Backblaze has been proactively retiring 4 TB drives. This is likely due to the desire to slow the growth of their data center footprint which comes with costs (more on this later).<\/p>\n<p>As a drive vendor, I could use this data to use the 4 TB trend to calculate how much drive replacement will be occurring next quarter, along with natural PB growth. I will look more into Backblaze\u2019s drive\/pod retirement later.<\/p>\n<h3>Current Drive Population, by Deployed Date<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93231\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate.png\" alt=\"Q1'2019 Drive Population, by Deployed Date\" width=\"966\" height=\"721\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate.png 966w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate-300x224.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate-768x573.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate-260x195.png 260w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/11_Backblaze_HardDrive_Population_by_DeployedDate-560x418.png 560w\" sizes=\"auto, (max-width: 966px) 100vw, 966px\" \/><\/a><\/p>\n<p>Be careful when interpreting this graph. What we are looking at here is the Q1\u201919 drive population where the date on the x-axis is the date the drive entered the population. This helps you see of all the drives in Backblaze\u2019s population <strong>today<\/strong>, in which the oldest drives are from 2015 (with the exception of a few stragglers).<\/p>\n<p>This indicates that the useful life of drives within Backblaze\u2019s data centers are ~4 years. In fact, a later chart will look at how drives\/pods are phased out, by year.<\/p>\n<p>Along the top of the chart, I noted when the 60-drive pods started entering into the mix. The rack density is much more efficient with this design (rather than the 45-drive pod). Combine this with the efficiency of the 4 TB to 12 TB drives and it\u2019s clear why Backblaze has aggressively been retiring its 4 TB\/45-drive enclosures. There is still a large population of these remaining so expect some further migration to occur.<\/p>\n<h3>Boot Drive Population<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/12_Backblaze_BootDrive_Population.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93232\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/12_Backblaze_BootDrive_Population.png\" alt=\"Total Boot Drive Population, by Quarter\" width=\"992\" height=\"696\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/12_Backblaze_BootDrive_Population.png 992w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/12_Backblaze_BootDrive_Population-300x210.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/12_Backblaze_BootDrive_Population-768x539.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/12_Backblaze_BootDrive_Population-560x393.png 560w\" sizes=\"auto, (max-width: 992px) 100vw, 992px\" \/><\/a><\/p>\n<p>This is the overall boot drive population over time. You can see that it is currently dominated by the 500 GB with only a few remaining smaller densities in the population today. For some reason, Toshiba has been the preferred vendor with Seagate only recently gaining some new business.<\/p>\n<p>The boot drive population is also an interesting data point to use for verifying the number of pods in the population. For example, there were 1,909 boot drives in Q1\u201919 and my calculation of pods based on the 45\/60-drive pod mix was 1,905. I was able to use the total boot drives each quarter to double check my mix of pods.<\/p>\n<h2>Pods (Drive Enclosures)<\/h2>\n<p>As discussed earlier, pods are the drive enclosures that house all of Backblaze\u2019s hard drives. Let\u2019s take a look at a few more trends that show what\u2019s going on within the walls of their data center.<\/p>\n<h3>Pods Population by Deployment Date<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/13_Backblaze_Pod_Population_by_Deployment_date.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93233\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/13_Backblaze_Pod_Population_by_Deployment_date.png\" alt=\"Pods (HDD Enclosure) Population by Deployment Date\" width=\"981\" height=\"699\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/13_Backblaze_Pod_Population_by_Deployment_date.png 981w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/13_Backblaze_Pod_Population_by_Deployment_date-300x214.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/13_Backblaze_Pod_Population_by_Deployment_date-768x547.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/13_Backblaze_Pod_Population_by_Deployment_date-560x399.png 560w\" sizes=\"auto, (max-width: 981px) 100vw, 981px\" \/><\/a><\/p>\n<p>This one is interesting. Each line in the graph indicates a particular snapshot in time of the total population. And the x-axis represents the vintage of the pods for that snapshot. By comparing snapshots, this allows you to see changes over time to the population &#8212; namely, new pods being deployed and old pods being retired. To capture this, I looked at the last day of Q1 data for the last four years and calculated the date the drives entered the population. Using the \u201cPower On Date\u201d I was able to deduce the type of pod (45 or 60 drive) it was deployed in.<\/p>\n<p>Some insights from this chart:<\/p>\n<ul>\n<li>From Q2\u201916 to Q1\u201917, they retired some pods from 2010-11<\/li>\n<li>From Q2\u201917 to Q1\u201918, they retired a significant number of pods from 2011-14<\/li>\n<li>From Q2\u201918 to Q1\u201919, they retired pods from 2013-2015<\/li>\n<li>Pods that were deployed since late 2015 have been untouched (you can tell this by seeing the lines overlap with each other)<\/li>\n<li>The most pods deployed in a quarter was 185 in Q2\u201916<\/li>\n<li>Since Q2\u201916, the number of pods deployed has been declining, on average; this is due to the increase in # of drives per pod and density of each drive<\/li>\n<li>There are still a significant number of 45-drive pods to retire<\/li>\n<\/ul>\n<h3>Pods Deployed\/Retired<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93234\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year.png\" alt=\"Total Pods (HDD Enclosure) Population\" width=\"1060\" height=\"715\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year.png 1060w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year-300x202.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year-1024x691.png 1024w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year-768x518.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/14_Backblaze_Pod_Population_Increase_Decrease_by_Year-560x378.png 560w\" sizes=\"auto, (max-width: 1060px) 100vw, 1060px\" \/><\/a><\/p>\n<p>Totaling up all the new pods being deployed and retired, it is easier to see the yearly changes happening within Backblaze\u2019s operation. Keep in mind that these are all calculations and may erroneously include drive replacements as new pods; but I don\u2019t expect it to vary significantly from what is shown here.<\/p>\n<p>The data shows that any new pods that have been deployed in the past few years have mainly been driven by replacing older, less dense pods. In fact, the pod population has plateaued at around 1,900 pods.<\/p>\n<h3>Total Racks<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/15_Backblaze_Rack_Population.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93235\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/15_Backblaze_Rack_Population.png\" alt=\"Total Racks\" width=\"972\" height=\"684\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/15_Backblaze_Rack_Population.png 972w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/15_Backblaze_Rack_Population-300x211.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/15_Backblaze_Rack_Population-768x540.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/15_Backblaze_Rack_Population-560x394.png 560w\" sizes=\"auto, (max-width: 972px) 100vw, 972px\" \/><\/a><\/p>\n<p>Based on blog posts, Backblaze\u2019s pods are all designed at 4U (4 rack units) and pictures on their site indicate 10 pods fit in a rack; this equates to 40U racks. Using this information, along with the drive population and the power-on-date, I was able to calculate the number of pods on any given date as well as the total number of racks. I did not include their networking racks in which I believe they have two of these racks per row in their data center.<\/p>\n<p>You can quickly see that Backblaze has done a great job at slowing the growth of the racks in their data center. This all results in lower costs for their customers.<\/p>\n<h2>Retiring Pods<\/h2>\n<p>What interested me when looking at Backblaze\u2019s SMART data was the fact that drives were being retired more than they were failing. This means the cost of failures is fairly insignificant in the scheme of things. It is actually efficiencies driven by technology improvements such as drive and enclosure densities that drove most of the costs. However, the benefits must outweigh the costs. Being that Backblaze uses Sungard AS for its data centers, let\u2019s try to visualize the benefit of retiring drives\/pods.<\/p>\n<h3>Colocation Costs, Assuming a Given Density<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/16_Backblaze_Yearly_Colocation_Costs_Assuming_One_Density.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93236\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/16_Backblaze_Yearly_Colocation_Costs_Assuming_One_Density.png\" alt=\"Yearly Colocation Costs, Assuming One Drive Density\" width=\"972\" height=\"684\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/16_Backblaze_Yearly_Colocation_Costs_Assuming_One_Density.png 972w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/16_Backblaze_Yearly_Colocation_Costs_Assuming_One_Density-300x211.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/16_Backblaze_Yearly_Colocation_Costs_Assuming_One_Density-768x540.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/16_Backblaze_Yearly_Colocation_Costs_Assuming_One_Density-560x394.png 560w\" sizes=\"auto, (max-width: 972px) 100vw, 972px\" \/><\/a><\/p>\n<p>This shows the total capacity over time in Backblaze\u2019s data centers, along with the colocation costs assuming all the drives were a given density. As you can see, in Q1\u201919 it would take $7.7M a year to pay for colocating costs of 861 PB if all the drives were 4 TB in size. By moving the entire population to 12 TB this can be reduced to $2.6M. So, just changing the drive density can have significant impacts on Backblaze\u2019s operational costs. I did assume $45\/RU costs in the analysis which means their costs may be as low as $15\/RU based on the scale of their operation.<\/p>\n<p>I threw in 32 TB densities to illustrate a hypothetical SSD-type density so you can see the colocation cost savings by moving to SSDs. Although lower, the acquisition costs are far too high at the moment to justify a move to SSDs.<\/p>\n<h3>Break-Even Analysis of Retiring Pods<\/h3>\n<p><a href=\"\/blog\/wp-content\/uploads\/2019\/10\/17_Backblaze_BreakEven_Analysis_Retiring_Old_Pods_Drives.png\" data-rel=\"lightbox-gallery-zIZ8GMfU\" data-rl_title=\"\" data-rl_caption=\"\" title=\"\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93237\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/17_Backblaze_BreakEven_Analysis_Retiring_Old_Pods_Drives.png\" alt=\"Break-Even Analysis of Retiring Older Pods\/Drives\" width=\"980\" height=\"713\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/17_Backblaze_BreakEven_Analysis_Retiring_Old_Pods_Drives.png 980w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/17_Backblaze_BreakEven_Analysis_Retiring_Old_Pods_Drives-300x218.png 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/17_Backblaze_BreakEven_Analysis_Retiring_Old_Pods_Drives-768x559.png 768w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/17_Backblaze_BreakEven_Analysis_Retiring_Old_Pods_Drives-560x407.png 560w\" sizes=\"auto, (max-width: 980px) 100vw, 980px\" \/><\/a><\/p>\n<p>This chart helps illustrate the math behind deciding to retire older drives\/pods based on the break-even point.<\/p>\n<p>Let\u2019s break down how to read this chart:<\/p>\n<ul>\n<li>This chart is looking at whether Backblaze should replace older drives with the newer 12 TB drives<\/li>\n<li>Assuming a cost of $0.02\/GB for a 12 TB drive, that is a $20\/TB acquisition cost you see on the far left<\/li>\n<li>Each line represents the cumulative cost over time (acquisition + operational costs)<\/li>\n<li>The grey lines (4 TB and 8 TB) all assume they were already acquired so they only represent operational costs ($0 acquisition cost) since we are deciding on replacement costs<\/li>\n<li>The operational costs (incremental yearly increase shown) is calculated off of the $45 per RU colocation cost and how many of this drive\/enclosure density fits per rack unit. The more TBs you can cram into a rack unit, the lower your colocation costs are<\/li>\n<\/ul>\n<p>Assuming you are still with me, this shows that the break-even point for retiring 4 TB 4U45 pods is just over two years! And 4 TB 4U60 pods at 3 years! It\u2019s a no brainer to kill the 4 TB enclosures and replace them with 12 TB drives. Remember that this assumes a $45RU colocation cost so the break-even point will shift to the right if the colocation costs are lower (which they surely are).<br \/>\nYou can see that the math to replace 8 TB drives with 12 TB doesn\u2019t make as much sense so we may see Backblaze\u2019s retirement strategy slow down dramatically after it retires the 4 TB capacity points.<\/p>\n<p>As hard drive densities get larger and $\/GB decreases, I expect the cumulative costs to start lower (less acquisition cost) and rise slower (less RU operational costs) making future drive retirements more attractive. Eyeballing it, it would be once $\/GB approaches $0.01\/GB to $0.015\/GB.<\/p>\n<h2>Things Backblaze Should Look Into<\/h2>\n<p><i>Top of mind, Backblaze should look into these areas:<\/i><\/p>\n<ul>\n<li>The architecture around performance is not balanced; investigate having a caching tier to handle bursts and put more drives behind each storage node to reduce \u201cenclosure\/slot tax\u201d costs.<\/li>\n<li>Look into designs like 5U84 from Seagate\/Xyratex providing 16.8 drives per RU versus the 15 being achieved on Backblaze\u2019s own 4U60 design; Another 12% efficiency!\n<ul>\n<li>5U allows for 8 pods to fit per rack versus the 10.<\/li>\n<\/ul>\n<\/li>\n<li>Look at when SSDs will be attractive to replace HDDs at a given $\/GB, density, idle costs, # of drives that fit per RU (using 2.5\u201d drives instead of 3.5\u201d) so that they can stay on top of this trend [there is no rush on this one].\n<ul>\n<li>Performance and endurance of SSDs is irrelevant since the performance requirements are so low and the WPD is almost non-existence, making QLC and beyond a great candidate.<\/li>\n<\/ul>\n<\/li>\n<li>Look at allowing pods to be more flexible in handling different capacity drives to handle drive failures more cost efficiently without having to retire pods. Having concepts of \u201cvirtual pods\u201d that don\u2019t have physical limits will better accommodate the future that Backblaze has where it won\u2019t be retiring pods as aggressively, yet still let them grow their pod densities seamlessly.<\/li>\n<\/ul>\n<h2>In Closing<\/h2>\n<p>It is kind of ironic that the reason Backblaze posted all their SMART data is to share insights around failures when I didn\u2019t even analyze failures once! There is much more analysis that could be done around this data set which I may revisit as time permits.<\/p>\n<p>As you can see, even simple health data from drives, along with a little help from other data sources, can help expose a lot more than you would initially think. I have long felt that people have yet to understand the full power of giving data freely to businesses (e.g. Facebook, Google Maps, LinkedIn, Mint, Personal Capital, News Feeds, Amazon). I often hear things like, \u201cI have nothing to hide,\u201d which indicates the lack of value they assign to their data. It\u2019s not the value at its surface but the story that can be told by tying data together.<\/p>\n<p>Until next time, Ryan Smith.<\/p>\n<p style=\"text-align: center;\">\u2022 \u00a0 \u2022 \u00a0 \u2022<\/p>\n<div class=\"one-third first\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-93264\" title=\"Ryan Smith\" src=\"https:\/\/www.backblaze.com\/blog\/wp-content\/uploads\/2019\/10\/ryan_smith.jpg\" alt=\"Ryan Smith\" width=\"522\" height=\"522\" srcset=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/ryan_smith.jpg 522w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/ryan_smith-300x300.jpg 300w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/ryan_smith-150x150.jpg 150w, https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/ryan_smith-80x80.jpg 80w\" sizes=\"auto, (max-width: 522px) 100vw, 522px\" \/><\/div>\n<div class=\"two-thirds\" style=\"font-size: 0.9em;\">Ryan Smith is currently a product strategist at Hitachi Vantara. Previously, he served as the director of NAND product marketing at Samsung Semiconductor, Inc. He is extremely passionate about uncovering insights from just about any data set. He just likes to have fun by making a notable difference, influencing others, and working with smart people.<\/div>\n<div class=\"clearfix\"><\/div>\n<p>&nbsp;<\/p>\n<hr style=\"border: 0; height: 2px; background-image: linear-gradient(to right, rgba(0, 0, 0, 0), rgba(0, 0, 0, 0.75), rgba(0, 0, 0, 0)); max-width: 95%; margin: 36px auto;\" \/>\n<p>Tell us what you think about Ryan\u2019s take on data, or better yet, give us your own! You can find all the data you would ever need on <a href=\"https:\/\/www.backblaze.com\/cloud-storage\/resources\/hard-drive-test-data\" target=\"_blank\" rel=\"noopener noreferrer\">Backblaze&#8217;s Hard Drive Data and Stats<\/a> webpage. Share your thoughts in the comments below or email us at <a href=\"&#109;&#97;i&#108;to&#58;m&#97;&#105;&#108;&#98;ag&#64;b&#97;&#99;&#107;&#98;laze.c&#111;&#109;\">m&#97;ilbag&#64;&#98;a&#99;&#107;&#98;&#108;aze&#46;c&#111;&#109;<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A fan of the Hard Drive Stats posts offers his unique and in-depth analysis of what can be learned from the raw data we collect from our data centers.<\/p>\n","protected":false},"author":144,"featured_media":93217,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"content-type":"","footnotes":""},"categories":[7,457],"tags":[468],"class_list":["post-93200","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cloud-storage","category-hard-drive-stats","tag-b2cloud","entry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>What S.M.A.R.T. Stats Can Tell You About Hard Drive Health<\/title>\n<meta name=\"description\" content=\"Backblaze logs all its drive health data (aka SMART data) for over 100,000 of its hard drives. I took a look at the data and what I found was fascinating.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What S.M.A.R.T. Stats Can Tell You About Hard Drive Health\" \/>\n<meta property=\"og:description\" content=\"Backblaze logs all its drive health data (aka SMART data) for over 100,000 of its hard drives. I took a look at the data and what I found was fascinating.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/\" \/>\n<meta property=\"og:site_name\" content=\"Backblaze Blog | Cloud Storage &amp; Cloud Backup\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/backblaze\" \/>\n<meta property=\"article:published_time\" content=\"2019-10-22T15:18:56+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-12-14T23:45:13+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1440\" \/>\n\t<meta property=\"og:image:height\" content=\"820\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Patrick Thomas\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@backblaze\" \/>\n<meta name=\"twitter:site\" content=\"@backblaze\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Patrick Thomas\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"18 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"What S.M.A.R.T. Stats Can Tell You About Hard Drive Health","description":"Backblaze logs all its drive health data (aka SMART data) for over 100,000 of its hard drives. I took a look at the data and what I found was fascinating.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/","og_locale":"en_US","og_type":"article","og_title":"What S.M.A.R.T. Stats Can Tell You About Hard Drive Health","og_description":"Backblaze logs all its drive health data (aka SMART data) for over 100,000 of its hard drives. I took a look at the data and what I found was fascinating.","og_url":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/","og_site_name":"Backblaze Blog | Cloud Storage &amp; Cloud Backup","article_publisher":"https:\/\/www.facebook.com\/backblaze","article_published_time":"2019-10-22T15:18:56+00:00","article_modified_time":"2025-12-14T23:45:13+00:00","og_image":[{"width":1440,"height":820,"url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg","type":"image\/jpeg"}],"author":"Patrick Thomas","twitter_card":"summary_large_image","twitter_creator":"@backblaze","twitter_site":"@backblaze","twitter_misc":{"Written by":"Patrick Thomas","Est. reading time":"18 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#article","isPartOf":{"@id":"https:\/\/www.backblaze.com\/blog\/smart-stats-exposed-a-drive-stats-remix\/"},"author":{"name":"Patrick Thomas","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/7939165675bc36f0862dbe0b25d3657f"},"headline":"SMART Stats Exposed \u2014 a Drive Stats Remix","datePublished":"2019-10-22T15:18:56+00:00","dateModified":"2025-12-14T23:45:13+00:00","mainEntityOfPage":{"@id":"https:\/\/www.backblaze.com\/blog\/smart-stats-exposed-a-drive-stats-remix\/"},"wordCount":3679,"commentCount":5,"publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg","keywords":["B2Cloud"],"articleSection":["Cloud Storage","Hard Drive Stats"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.backblaze.com\/blog\/smart-stats-exposed-a-drive-stats-remix\/","url":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/","name":"What S.M.A.R.T. Stats Can Tell You About Hard Drive Health","isPartOf":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#primaryimage"},"image":{"@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#primaryimage"},"thumbnailUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg","datePublished":"2019-10-22T15:18:56+00:00","dateModified":"2025-12-14T23:45:13+00:00","description":"Backblaze logs all its drive health data (aka SMART data) for over 100,000 of its hard drives. I took a look at the data and what I found was fascinating.","breadcrumb":{"@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#primaryimage","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg","width":1440,"height":820,"caption":"SMART Stats On Trial"},{"@type":"BreadcrumbList","@id":"https:\/\/www.soothsawyer.com\/ryan-smith-uses-backblazes-smart-data-to-illustrate-the-power-of-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/"},{"@type":"ListItem","position":2,"name":"SMART Stats Exposed \u2014 a Drive Stats Remix"}]},{"@type":"WebSite","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#website","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","name":"Backblaze Cloud Solutions Blog","description":"Cloud Storage &amp; Cloud Backup","publisher":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#organization","name":"Backblaze","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.backblaze.com\/blog\/wp-content\/uploads\/2017\/12\/backblaze_icon_transparent.png?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"Backblaze"},"image":{"@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/backblaze","https:\/\/x.com\/backblaze","https:\/\/www.youtube.com\/user\/Backblaze","https:\/\/en.wikipedia.org\/wiki\/Backblaze"]},{"@type":"Person","@id":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/#\/schema\/person\/7939165675bc36f0862dbe0b25d3657f","name":"Patrick Thomas","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/09\/patrick_thomas-e1569451539653-150x150.png","url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/09\/patrick_thomas-e1569451539653-150x150.png","contentUrl":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/09\/patrick_thomas-e1569451539653-150x150.png","caption":"Patrick Thomas"},"description":"Patrick Thomas is the Vice President of Marketing at Backblaze. He has managed all aspects of content development and strategy across a number of industries, including literary publishing, gaming, and tech. He has developed and edited New York Times bestsellers and Wall Street Journal books of the year, and written for National Geographic and the San Francisco Chronicle. He loves nothing more than learning, and Backblaze\u2019s steady beat of innovation feeds that love every day. LinkedIn: Patrick Thomas.","url":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/author\/patrick\/"}]}},"jetpack_featured_media_url":"https:\/\/backblazeprod.wpenginepowered.com\/wp-content\/uploads\/2019\/10\/header-stats.jpg","_links":{"self":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/93200","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/users\/144"}],"replies":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/comments?post=93200"}],"version-history":[{"count":0,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/posts\/93200\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media\/93217"}],"wp:attachment":[{"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/media?parent=93200"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/categories?post=93200"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/backblazeprod.wpenginepowered.com\/blog\/wp-json\/wp\/v2\/tags?post=93200"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}