https://choosealicense.com/licenses/other/
Drive Stats
Drive Stats is a public data set of daily metrics on the hard drives in Backblaze’s cloud storage infrastructure that Backblaze has open-sourced since April 2013. Currently, Drive Stats comprises over 388 million records, rising by over 240,000 records per day. Drive Stats is an append-only dataset effectively logging daily statistics that once written are never updated or deleted. This is our first Hugging Face dataset; feel free to suggest improvements by creating a… See the full description on the dataset page: https://huggingface.co/datasets/backblaze/Drive_Stats.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
2018 Hard Drive Failure Rates: What 100,000+ Hard Drives Tell Us
At the end of 2018 Backblaze was monitoring 104,954 hard drives used to store data. For our evaluation we remove from consideration those drives that were used for testing purposes and those drive models for which we did not have at least 45 drives (see why below). This leaves us with 104,778 hard drives. The table below covers what happened just in 2018.
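The model-size filter described above can be sketched as follows. This is only an illustration, not Backblaze's actual procedure: the function name `filter_small_models`, the `(serial, model)` tuple layout, and the sample records are assumptions, and the step that removes test drives is left out because the text does not say how those drives are identified.

```python
from collections import Counter

def filter_small_models(drives, min_drives=45):
    """Keep only drives whose model has at least `min_drives` units.

    `drives` is a list of (serial_number, model) tuples (hypothetical layout).
    """
    # Count how many drives exist for each model.
    per_model = Counter(model for _, model in drives)
    # Drop every drive belonging to a model below the threshold.
    return [(serial, model) for serial, model in drives
            if per_model[model] >= min_drives]

# Made-up records; a threshold of 2 keeps the example small.
sample = [("SN1", "MODEL-A"), ("SN2", "MODEL-A"), ("SN3", "MODEL-B")]
kept = filter_small_models(sample, min_drives=2)
print(kept)
```

With the default threshold of 45, a model represented by only a handful of drives is excluded from the evaluation entirely.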
How often do our hard disks fail? Check out the test data from Backblaze!
Each day, the Backblaze data center takes a snapshot of each operational hard drive. This snapshot includes basic drive information along with the S.M.A.R.T. statistics reported by that drive. The daily snapshot of one drive is one record or row of data. All of the drive snapshots for a given day are collected into a file consisting of a row for each active hard drive. The format of this file is a "csv" (Comma Separated Values) file. Each day this file is named in the format YYYY-MM-DD.csv, for example, 2019-07-01.csv.
The first row of each file contains the column names; the remaining rows are the actual data. The columns are as follows:
Date – The date of the file in yyyy-mm-dd format.
Serial Number – The manufacturer-assigned serial number of the drive.
Model – The manufacturer-assigned model number of the drive.
Capacity – The drive capacity in bytes.
Failure – Contains a “0” if the drive is OK. Contains a “1” if this is the last day the drive was operational before failing.
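As a minimal sketch of working with one daily file, the Python below builds a file name in the YYYY-MM-DD.csv pattern and counts the rows and failures in a snapshot. The header spellings (`date`, `serial_number`, `model`, `capacity_bytes`, `failure`) are assumptions about how the columns listed above appear in the CSV, and the sample rows and helper names are invented for illustration; check the header row of a real file before relying on them.

```python
import csv
import io
from datetime import date

def snapshot_filename(day):
    """Return the daily file name, e.g. 2019-07-01.csv."""
    return f"{day.isoformat()}.csv"

# Made-up sample data; header names are assumed, not taken from the source.
SAMPLE = """date,serial_number,model,capacity_bytes,failure
2019-07-01,SN0001,MODEL-A,4000787030016,0
2019-07-01,SN0002,MODEL-A,4000787030016,1
2019-07-01,SN0003,MODEL-B,12000138625024,0
"""

def summarize_snapshot(csv_text):
    """Return (number of drives, serial numbers flagged as failed)."""
    reader = csv.DictReader(io.StringIO(csv_text))  # first row = column names
    total = 0
    failed = []
    for row in reader:
        total += 1
        # "1" marks the last operational day before the drive failed.
        if row["failure"] == "1":
            failed.append(row["serial_number"])
    return total, failed

name = snapshot_filename(date(2019, 7, 1))
total, failed = summarize_snapshot(SAMPLE)
print(name, total, failed)
```

To process a real download, the same `csv.DictReader` loop can be pointed at an opened file instead of the in-memory sample.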
Data is sourced from: https://www.backblaze.com/b2/hard-drive-test-data.html#how-you-can-use-the-data.