4 datasets found

Amount of data created, consumed, and stored 2010-2023, with forecasts to...
statista.com
Updated Jun 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Amount of data created, consumed, and stored 2010-2023, with forecasts to 2028 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
Explore at:
Dataset updated
Jun 30, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
May 2024
Area covered
Worldwide
Description
The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly, reaching *** zettabytes in 2024. Over the next five years up to 2028, global data creation is projected to grow to more than *** zettabytes. In 2020, the amount of data created and replicated reached a new high. The growth was higher than previously expected, caused by the increased demand due to the COVID-19 pandemic, as more people worked and learned from home and used home entertainment options more often. Storage capacity also growing Only a small percentage of this newly created data is kept though, as just * percent of the data produced and consumed in 2020 was saved and retained into 2021. In line with the strong growth of the data volume, the installed base of storage capacity is forecast to increase, growing at a compound annual growth rate of **** percent over the forecast period from 2020 to 2025. In 2020, the installed base of storage capacity reached *** zettabytes.
R
Compulon Dat 54 Total Dataset
universe.roboflow.com
zip
Updated Aug 23, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
scrapinglabs (2022). Compulon Dat 54 Total Dataset [Dataset]. https://universe.roboflow.com/scrapinglabs/compulon-dat-54-total/dataset/2
Explore at:
zipAvailable download formats
Dataset updated
Aug 23, 2022
Dataset authored and provided by
scrapinglabs
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Letters And Numbers Bounding Boxes
Description
Here are a few use cases for this project:

Optical Character Recognition: This model can be employed to digitize printed or hand-written documents by recognizing characters in images. It will enable searchability, editing, and easy sharing of physical texts—even in situations when direct scanning isn't possible.

Automated License Plate Recognition: The model can be applied in traffic control units to automatically read license plates in images or video footage. This would be helpful in tracking stolen vehicles, monitoring traffic violations, and managing parking.

Educational Tools: The model can be used to develop educational applications for children. Such as an interactive game app where children are required to identify letters and numbers in various images, helping them to learn and recognize characters better.

Assistive Technology: In aiding visually impaired individuals, the model can interpret text in real-world images, converting the recognized letters and numbers into audio output.

Inventory Management: It can help in recognizing and classifying alphanumeric codes on inventory items in a warehouse, assisting in better tracking and management of stock.
d
GDELT Dataset.
datadiscoverystudio.org
Updated Jul 18, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2014). GDELT Dataset. [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/bc78bc49f2674dec9b9d5c75648ad57a/html
Explore at:
Dataset updated
Jul 18, 2014
Description
description: The Global Database of Events, Language, and Tone (GDELT Project) monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, counts, themes, sources, and events driving our global society every second of every day, creating a free open platform for computing on the entire world. It was uploaded via the World Wide Human Geography Data (WWHGD) working group.; abstract: The Global Database of Events, Language, and Tone (GDELT Project) monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, counts, themes, sources, and events driving our global society every second of every day, creating a free open platform for computing on the entire world. It was uploaded via the World Wide Human Geography Data (WWHGD) working group.
COVID-19
kaggle.com
data.world
zip
Updated May 25, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Atila Madai (2020). COVID-19 [Dataset]. https://www.kaggle.com/atilamadai/covid19
Explore at:
zip(68606230 bytes)Available download formats
Dataset updated
May 25, 2020
Authors
Atila Madai
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

The novel coronavirus that has infected more than 79,551 people worldwide (as of time of writing this context) is spreading rapidly, and independently, in countries outside of China, including Italy, South Korea, and Iran. The viral illness is being diagnosed among hundreds of people in South Korea, Italy and Iran who have no connection to China.

Content

In the notebook I use the time series data. Time series data columns are described in the column description.

Acknowledgements

Thanks to the Johns Hopkins University for providing this data-set for educational purposes. https://github.com/CSSEGISandData/COVID-19

Inspiration

To visualize COVID-19 spread world wide.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Statista (2025). Amount of data created, consumed, and stored 2010-2023, with forecasts to 2028 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/

Amount of data created, consumed, and stored 2010-2023, with forecasts to 2028

Explore at:

Dataset updated

Jun 30, 2025

Dataset authored and provided by

Statistahttp://statista.com/

Time period covered

May 2024

Area covered

Worldwide

Description

The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly, reaching *** zettabytes in 2024. Over the next five years up to 2028, global data creation is projected to grow to more than *** zettabytes. In 2020, the amount of data created and replicated reached a new high. The growth was higher than previously expected, caused by the increased demand due to the COVID-19 pandemic, as more people worked and learned from home and used home entertainment options more often. Storage capacity also growing Only a small percentage of this newly created data is kept though, as just * percent of the data produced and consumed in 2020 was saved and retained into 2021. In line with the strong growth of the data volume, the installed base of storage capacity is forecast to increase, growing at a compound annual growth rate of **** percent over the forecast period from 2020 to 2025. In 2020, the installed base of storage capacity reached *** zettabytes.

Clear search

Close search

Google apps

Main menu

Amount of data created, consumed, and stored 2010-2023, with forecasts to...

Compulon Dat 54 Total Dataset

GDELT Dataset.

COVID-19

Context

Content

Acknowledgements

Inspiration

Amount of data created, consumed, and stored 2010-2023, with forecasts to 2028