100+ datasets found

Registry of Open Data on AWS
registry.opendata.aws
Updated Aug 13, 2021
Cite
Amazon Web Services (2021). Registry of Open Data on AWS [Dataset]. https://registry.opendata.aws/registry-open-data/
Dataset updated
Aug 13, 2021
Dataset provided by
Amazon Web Services (http://aws.amazon.com/)
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
The Registry of Open Data on AWS contains publicly available datasets that are available for access from AWS resources. Note that datasets in this registry are available via AWS resources, but they are not provided by AWS; these datasets are owned and maintained by a variety of government organizations, researchers, businesses, and individuals. This dataset contains derived forms of the data in https://github.com/awslabs/open-data-registry that have been transformed for ease of use with machine interfaces. Currently, only the ndjson form of the registry is populated here.
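Since the registry is published in ndjson form (one JSON document per line), a minimal sketch of reading such a file could look like the following. The sample records and the field names `Name` and `Tags` are illustrative assumptions, not the registry's confirmed schema.

```python
import json


def parse_ndjson(text):
    """Parse newline-delimited JSON: one JSON document per non-empty line."""
    records = []
    for line_number, line in enumerate(text.split("\n"), start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        try:
            records.append(json.loads(line))
        except json.JSONDecodeError as exc:
            raise ValueError(f"line {line_number} is not valid JSON: {exc}") from exc
    return records


# Hypothetical sample; the real registry fields may differ.
sample = (
    '{"Name": "Example dataset A", "Tags": ["weather"]}\n'
    "\n"
    '{"Name": "Example dataset B", "Tags": []}\n'
)
records = parse_ndjson(sample)
names = [record["Name"] for record in records]
```

Parsing line by line keeps memory use flat for large files and makes it easy to report which line is malformed.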
AWS Public Blockchain Data
registry.opendata.aws
Updated Sep 23, 2022
Cite
Amazon Web Services (2022). AWS Public Blockchain Data [Dataset]. https://registry.opendata.aws/aws-public-blockchain/
Dataset updated
Sep 23, 2022
Dataset provided by
Amazon Web Services (http://aws.amazon.com/)
Description
The AWS Public Blockchain Data initiative provides free access to blockchain datasets through collaboration with data providers. The data is optimized for analytics by being transformed into compressed Parquet files, partitioned by date for efficient querying.

Datasets
Blockchain dataset - Maintained by - Path:
- Bitcoin - AWS - s3://aws-public-blockchain/v1.0/btc/
- Ethereum - AWS - s3://aws-public-blockchain/v1.0/eth/
- Arbitrum - SonarX - s3://aws-public-blockchain/v1.1/sonarx/arbitrum/
- Aptos - SonarX - s3://aws-public-blockchain/v1.1/sonarx/aptos/
- Base - SonarX - s3://aws-public-blockchain/v1.1/sonarx/base/
- Provenance - SonarX - s3://aws-public-blockchain/v1.1/sonarx/provenance/
- XRP Ledger - SonarX - s3://aws-public-blockchain/v1.1/sonarx/xrp/
- Stellar (XDR files) - Stellar - s3://aws-public-blockchain/v1.1/stellar/
- The Open Network (TON) - TON - s3://aws-public-blockchain/v1.1/ton/
- Cronos - Cronos - s3://aws-public-blockchain/v1.1/cronos/
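The paths listed above are S3 URIs, and the description says the data is partitioned by date. A minimal sketch of splitting such a URI into bucket and key prefix, and of building a per-day prefix, is shown below. The `date=YYYY-MM-DD` partition layout and the `blocks` table name are assumptions made for illustration; the listing does not spell out the directory layout.

```python
from datetime import date
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split an s3:// URI into (bucket, key_prefix)."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


def date_partition_prefix(base_uri, table, day):
    """Build a per-day prefix under a dataset path.

    ASSUMPTION: a Hive-style "date=YYYY-MM-DD" partition directory beneath a
    table directory. Both the layout and the table name are hypothetical.
    """
    return f"{base_uri.rstrip('/')}/{table}/date={day.isoformat()}/"


bucket, prefix = split_s3_uri("s3://aws-public-blockchain/v1.0/btc/")
day_prefix = date_partition_prefix(
    "s3://aws-public-blockchain/v1.0/btc/", "blocks", date(2022, 9, 23)
)
```

Querying a single day's prefix rather than the whole dataset is what makes date partitioning efficient: only the files under that prefix need to be read.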

Become a Data Provider

We welcome additional blockchain data providers to join this initiative. If you're interested in contributing datasets to the AWS Public Blockchain Data program, please contact our team at aws-public-blockchain@amazon.com.
Aws Data Dataset
universe.roboflow.com
zip
Updated Nov 14, 2023
+ more versions
Cite
san vanwel (2023). Aws Data Dataset [Dataset]. https://universe.roboflow.com/san-vanwel-vduan/aws-data
Available download formats: zip
Dataset updated
Nov 14, 2023
Dataset authored and provided by
san vanwel
License
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Variables measured
Object Polygons
Description
AWS Data

## Overview
AWS Data is a dataset for instance segmentation tasks. It contains Object annotations for 2,886 images.

## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.

## License
This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
Amazon Web Services - Public Data Sets
data.wu.ac.at
Updated Oct 10, 2013
+ more versions
Cite
Global (2013). Amazon Web Services - Public Data Sets [Dataset]. https://data.wu.ac.at/schema/datahub_io/NTYxNjkxNmYtNmZlNS00N2EwLWJkYTktZjFjZWJkNTM2MTNm
Dataset updated
Oct 10, 2013
Dataset provided by
Global
Description
About

From website:

Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge for the community, and like all AWS services, users pay only for the compute and storage they use for their own applications. An initial list of data sets is already available, and more will be added soon.

Previously, large data sets such as the mapping of the Human Genome and the US Census data required hours or days to locate, download, customize, and analyze. Now, anyone can access these data sets from their Amazon Elastic Compute Cloud (Amazon EC2) instances and start computing on the data within minutes. Users can also leverage the entire AWS ecosystem and easily collaborate with other AWS users. For example, users can produce or use prebuilt server images with tools and applications to analyze the data sets. By hosting this important and useful data with cost-efficient services such as Amazon EC2, AWS hopes to provide researchers across a variety of disciplines and industries with tools to enable more innovation, more quickly.
Availability zones of AWS data centers APAC 2025, by market
statista.com
Updated Jul 9, 2025
Cite
Statista (2025). Availability zones of AWS data centers APAC 2025, by market [Dataset]. https://www.statista.com/statistics/1609480/apac-aws-availability-zones-by-market/
Dataset updated
Jul 9, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
Apr 2025
Area covered
APAC
Description
As of April 2025, Amazon Web Services (AWS) cloud data centers operated in ** markets in the Asia-Pacific region, with ** availability zones in total. An availability zone (AZ) is one or more separate data centers located within a specific region, within which cloud services originate and operate. Each AZ has independent power, cooling, and physical security.
AWS Cloudtrails Dataset from flaws.cloud
kaggle.com
zip
Updated Dec 12, 2023
Cite
N Y Kim (2023). AWS Cloudtrails Dataset from flaws.cloud [Dataset]. https://www.kaggle.com/datasets/nobukim/aws-cloudtrails-dataset-from-flaws-cloud
Available download formats: zip (316277173 bytes)
Dataset updated
Dec 12, 2023
Authors
N Y Kim
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
Featured here: https://medium.com/@george.fekkas/quick-and-dirty-cloudtrail-threat-hunting-log-analysis-b64af10ef923

https://summitroute.com/blog/2020/10/09/public_dataset_of_cloudtrail_logs_from_flaws_cloud/

The columns should be flattened. Some columns were dropped because they are not good features for NVIDIA Morpheus digital fingerprinting autoencoders.
Availability zones of AWS data centers worldwide 2024, by region
statista.com
Updated Sep 24, 2024
Cite
Statista (2024). Availability zones of AWS data centers worldwide 2024, by region [Dataset]. https://www.statista.com/statistics/1491283/aws-availability-zones-globally-by-region/
Dataset updated
Sep 24, 2024
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2024
Area covered
Worldwide
Description
Amazon Web Services (AWS) global cloud data centers operate in ** geographic regions, each containing several availability zones (AZs). As of 2024, Europe/Middle East/Africa and Asia Pacific and China had ** zones combined, which is over ** percent of all AWS' AZs.
Test Private AWS S3 data. This is for TEST PURPOSES ONLY
catalog.data.gov
Updated May 28, 2021
Cite
NOAA CoastWatch, West Coast Node (Point of Contact) (2021). Test Private AWS S3 data. This is for TEST PURPOSES ONLY [Dataset]. https://catalog.data.gov/dataset/test-private-aws-s3-data-this-is-for-test-purposes-only
Dataset updated
May 28, 2021
Dataset provided by
National Oceanic and Atmospheric Administration (http://www.noaa.gov/)
Description
Test Private AWS S3 data. This is for TEST PURPOSES ONLY
Amazon AWS SaaS Sales Dataset
kaggle.com
Updated May 5, 2023
Cite
Nhat Thanh, Nguyen (2023). Amazon AWS SaaS Sales Dataset [Dataset]. https://www.kaggle.com/datasets/nnthanh101/aws-saas-sales
Croissant: Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 5, 2023
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
Nhat Thanh, Nguyen
License
http://www.gnu.org/licenses/fdl-1.3.html
Description
This dataset contains transaction data from a fictitious SaaS company selling sales and marketing software to other companies (B2B). In the dataset, each row represents a single transaction/order (9,994 transactions), and the columns include:

Here is the Original Dataset: https://ee-assets-prod-us-east-1.s3.amazonaws.com/modules/337d5d05acc64a6fa37bcba6b921071c/v1/SaaS-Sales.csv

Features

| #  | Name of the attribute | Description                                              |
| -- | --------------------- | -------------------------------------------------------- |
| 1  | Row ID                | A unique identifier for each transaction.                |
| 2  | Order ID              | A unique identifier for each order.                      |
| 3  | Order Date            | The date when the order was placed.                      |
| 4  | Date Key              | A numerical representation of the order date (YYYYMMDD). |
| 5  | Contact Name          | The name of the person who placed the order.             |
| 6  | Country               | The country where the order was placed.                  |
| 7  | City                  | The city where the order was placed.                     |
| 8  | Region                | The region where the order was placed.                   |
| 9  | Subregion             | The subregion where the order was placed.                |
| 10 | Customer              | The name of the company that placed the order.           |
| 11 | Customer ID           | A unique identifier for each customer.                   |
| 13 | Industry              | The industry the customer belongs to.                    |
| 14 | Segment               | The customer segment (SMB, Strategic, Enterprise, etc.). |
| 15 | Product               | The product that was ordered.                            |
| 16 | License               | The license key for the product.                         |
| 17 | Sales                 | The total sales amount for the transaction.              |
| 18 | Quantity              | The total number of items in the transaction.            |
| 19 | Discount              | The discount applied to the transaction.                 |
| 20 | Profit                | The profit from the transaction.                         |
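As a small illustration of working with these columns, the sketch below converts a `Date Key` value (YYYYMMDD) into a calendar date and derives a profit margin from `Sales` and `Profit`. The sample row is made up for the example, and the margin definition (profit divided by sales) is an assumption rather than something the dataset page specifies.

```python
from datetime import datetime


def parse_date_key(date_key):
    """Convert a YYYYMMDD value (int or str) into a datetime.date."""
    return datetime.strptime(str(date_key), "%Y%m%d").date()


def profit_margin(sales, profit):
    """Return profit as a fraction of sales, or None when sales is zero."""
    if sales == 0:
        return None
    return profit / sales


# Hypothetical row using the column names from the table above.
row = {"Date Key": 20220115, "Sales": 200.0, "Profit": 50.0}
order_date = parse_date_key(row["Date Key"])
margin = profit_margin(row["Sales"], row["Profit"])
```

`strptime` rejects impossible dates such as 20221301, which makes it a convenient validity check on the key as well.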

Inspiration: the CRoss Industry Standard Process for Data Mining (CRISP-DM) methodology

[ ] Understanding the business

[ ] Understanding the data

[x] Preparing the data

[ ] Modelling

[ ] Evaluating

[ ] Implementing the analysis.
DRI AWS Data
data.ucar.edu
ckanprod.data-commons.k8s.ucar.edu
netcdf
Updated Oct 7, 2025
Cite
Vanda Grubišić (2025). DRI AWS Data [Dataset]. http://doi.org/10.26023/8625-B8TB-SR0W
Available download formats: netcdf
Unique identifier
https://doi.org/10.26023/8625-B8TB-SR0W
Dataset updated
Oct 7, 2025
Authors
Vanda Grubišić
Time period covered
Mar 1, 2006 - Apr 30, 2006
Description
This data set represents the automatic weather station (AWS) data from the 16 stations of the Desert Research Institute network for the period 00 PST March 1 to 00 PST May 1, 2006 during the Terrain-induced Rotor Experiment (T-REX) field campaign. The data have a temporal resolution of 30 seconds, and are in netCDF format files.
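As a rough sanity check on the stated coverage, the expected number of records can be estimated from the period and the 30-second resolution. This sketch assumes uninterrupted sampling at every station, a half-open interval (start included, end excluded), and a fixed PST offset with no daylight-saving shift; the actual files may contain gaps.

```python
from datetime import datetime, timedelta

# Period and resolution as stated in the description (naive times, read as PST).
start = datetime(2006, 3, 1, 0, 0)
end = datetime(2006, 5, 1, 0, 0)
step = timedelta(seconds=30)
stations = 16

# Dividing two timedeltas with // yields an integer count of whole steps.
samples_per_station = (end - start) // step
total_samples = samples_per_station * stations

print(samples_per_station)  # 175680
print(total_samples)        # 2810880
```

Comparing these figures with the record counts in the netCDF files is a quick way to spot missing intervals.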
Amazon Web Services: Landsat GLS (Global Land Survey)
catalog.data.gov
data.amerigeoss.org
+1 more
Updated Sep 4, 2025
+ more versions
Cite
AWS NEX (2025). Amazon Web Services: Landsat GLS (Global Land Survey) [Dataset]. https://catalog.data.gov/dataset/amazon-web-services-landsat-gls-global-land-survey
Dataset updated
Sep 4, 2025
Dataset provided by
Amazon Web Services (http://aws.amazon.com/)
Description
In the past, the U.S. Geological Survey (USGS) and NASA collaborated on the creation of four global land data sets from Landsat images: one from the 1970s, and one each from circa 1990, 2000, and 2005. Each of these global data sets was created from the primary Landsat sensor in use at the time: the Multispectral Scanner (MSS) in the 1970s, the Thematic Mapper (TM) in 1990, Enhanced Thematic Mapper Plus (ETM+) in 2000, and a combination of TM and ETM+ in 2005.
COVID-19 Data Lake
registry.opendata.aws
Updated Apr 8, 2020
Cite
Amazon Web Services (2020). COVID-19 Data Lake [Dataset]. https://registry.opendata.aws/aws-covid19-lake/
Dataset updated
Apr 8, 2020
Dataset provided by
Amazon Web Services (http://aws.amazon.com/)
Description
A centralized repository of up-to-date and curated datasets on or related to the spread and characteristics of the novel coronavirus (SARS-CoV-2) and its associated illness, COVID-19. Globally, there are several efforts underway to gather this data, and we are working with partners to make this crucial data freely available and keep it up-to-date. Hosted on the AWS cloud, we have seeded our curated data lake with COVID-19 case tracking data from Johns Hopkins and The New York Times, hospital bed availability from Definitive Healthcare, and over 45,000 research articles about COVID-19 and related coronaviruses from the Allen Institute for AI.
AWS Pricing Dataset
kaggle.com
huggingface.co
zip
Updated Nov 29, 2025
Cite
Sahil (2025). AWS Pricing Dataset [Dataset]. https://www.kaggle.com/datasets/justsahil/aws-pricing-dataset/code
Available download formats: zip (369440674 bytes)
Dataset updated
Nov 29, 2025
Authors
Sahil
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
The following data is pulled from the official AWS pricing API. It contains all pricing data across AWS services.

Source: https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/using-price-list-query-api.html
ApRES Internal Layer Power and AWS Data
datasetcatalog.nlm.nih.gov
figshare.com
Updated Sep 25, 2018
Cite
Todd, Joe; Hubbard, Alun; Brennan, Paul; Christoffersen, Poul; Kendrick, Alexander; Chu, Winnie; Doyle, Samuel; Hubbard, Bryn; Lok, Lai Bun; Nicholls, Keith; Young, Tun Jan; Schroeder, Dustin; Box, Jason E. (2018). ApRES Internal Layer Power and AWS Data [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000677349
Dataset updated
Sep 25, 2018
Authors
Todd, Joe; Hubbard, Alun; Brennan, Paul; Christoffersen, Poul; Kendrick, Alexander; Chu, Winnie; Doyle, Samuel; Hubbard, Bryn; Lok, Lai Bun; Nicholls, Keith; Young, Tun Jan; Schroeder, Dustin; Box, Jason E.
Description
Radar and weather station data collected in 2014 as part of the Subglacial Access and Fast Ice Research Experiment (SAFIRE) and used to quantify englacial water storage in the paper, Surface Meltwater Impounded by Seasonal Englacial Storage in West Greenland. See the following link to the manuscript: https://doi.org/10.1029/2018GL079787
For additional data from the same field campaign see: https://doi.org/10.6084/m9.figshare.5745294
And items 112029 and 112009 located at http://www.bgs.ac.uk/services/ngdc/accessions/index.html
NEXRAD on AWS
registry.opendata.aws
Updated Apr 19, 2018
Cite
Unidata (2018). NEXRAD on AWS [Dataset]. https://registry.opendata.aws/noaa-nexrad/
Dataset updated
Apr 19, 2018
Dataset provided by
Unidata (https://www.unidata.ucar.edu/)
Description
Real-time and archival data from the Next Generation Weather Radar (NEXRAD) network.
Update
The NEXRAD Level II archive data is moving to a new bucket: unidata-nexrad-level2 and SNS topic: arn:aws:sns:us-east-1:684042711724:NewNEXRADLevel2Archive. The old bucket and SNS topic are now deprecated and will no longer be available starting September 1, 2025.
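For readers updating configuration to the new SNS topic, the sketch below splits the ARN quoted above into its standard colon-separated components (partition, service, region, account, resource). The `Arn` type and `parse_arn` helper are hypothetical names introduced for this example; they are not part of any AWS SDK.

```python
from typing import NamedTuple


class Arn(NamedTuple):
    partition: str
    service: str
    region: str
    account_id: str
    resource: str


def parse_arn(arn):
    """Split an ARN of the form arn:partition:service:region:account:resource."""
    # Limit the split so any colons inside the resource part are preserved.
    parts = arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn":
        raise ValueError(f"not a valid ARN: {arn!r}")
    return Arn(*parts[1:])


topic = parse_arn("arn:aws:sns:us-east-1:684042711724:NewNEXRADLevel2Archive")
print(topic.service, topic.region, topic.resource)
```

Knowing the region matters here because a subscriber must address the topic in the region where it lives, in this case `us-east-1`.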
Amazon Cloud Locations
kaggle.com
zip
Updated Oct 7, 2020
Cite
iyoob_utexas (2020). Amazon Cloud Locations [Dataset]. https://www.kaggle.com/i2i2i2/amazon-cloud-locations
Available download formats: zip (843 bytes)
Dataset updated
Oct 7, 2020
Authors
iyoob_utexas
Description
This data file lists approximate locations of Amazon Web Services (AWS) data centers around the world. Some of this was collected manually by searching local news articles on real estate purchases by Amazon in each region, and other information was obtained from https://www.datacenterdynamics.com/. Note that in most regions AWS has multiple data centers, and so the selected location may only reflect one of them in that region.

This data is helpful for AWS users to quickly view where their assets are housed across the world and help them ensure that they are meeting information privacy guidelines.
ABLE Automatic Weather Station (AWS) Data
data.ucar.edu
ascii
Updated Oct 7, 2025
+ more versions
Cite
Richard L. Coulter (2025). ABLE Automatic Weather Station (AWS) Data [Dataset]. http://doi.org/10.26023/VR0R-66BG-700G
Available download formats: ascii
Unique identifier
https://doi.org/10.26023/VR0R-66BG-700G
Dataset updated
Oct 7, 2025
Authors
Richard L. Coulter
Time period covered
May 20, 2003 - Jul 7, 2003
Description
This data set contains 1-minute resolution surface meteorological data from the Atmospheric Boundary Layer Experiments (ABLE) operated by the Argonne National Laboratory in the Walnut River Watershed in Butler County, Kansas (east of Wichita). The ABLE Automated Weather Station (AWS) Network consists of five stations. Data cover the period from 20 May to 7 July 2003. The data are in columnar ASCII format.
Data from: data-science-on-aws
huggingface.co
Updated Nov 22, 2025
Cite
Dao Xuan Tan (2025). data-science-on-aws [Dataset]. https://huggingface.co/datasets/tan-0909/data-science-on-aws
Dataset updated
Nov 22, 2025
Authors
Dao Xuan Tan
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
tan-0909/data-science-on-aws dataset hosted on Hugging Face and contributed by the HF Datasets community
Mt. Hunter AWS data
arcticdata.io
search.dataone.org
Updated May 18, 2020
Cite
Karl Kreutz (2020). Mt. Hunter AWS data [Dataset]. http://doi.org/10.18739/A2C946
Unique identifier
https://doi.org/10.18739/A2C946
Dataset updated
May 18, 2020
Dataset provided by
Arctic Data Center
Authors
Karl Kreutz
Time period covered
Jun 6, 2013 - Sep 22, 2013
Description
Meteorological data and images collected on the Mt. Hunter plateau, Denali National Park, Alaska. Data were collected with an automatic weather station using instrumentation from Campbell Scientific. Large-scale atmospheric circulation systems affect the geographic distribution of precipitation in western North America, yet little is known about how these systems may have varied before the instrumental period of the last 150 years. The main goal of this project is to reconstruct the history of precipitation in Alaska during the last thousand years using ice core records of snow accumulation. The researchers plan to collect several new ice cores from the Mt. Hunter Plateau in the Alaska Range of Denali National Park and the new ice cores will be combined with an existing spatial array of ice cores in the region to map changes in the spatial patterns of precipitation. Because changes in atmospheric circulation patterns caused by ENSO and the Pacific Decadal Oscillation (PDO) affect where the precipitation falls, this spatial array of ice cores will provide a record of how these larger scale climate systems have varied during the last thousand years. The project will focus on determining the differences in the precipitation patterns at the Little Ice Age (approximately 200 to 600 years ago) and Medieval Climate Anomaly (approximately 800 to 1,200 years ago).
Amazon Dataset
brightdata.com
.json, .csv, .xlsx
Updated Mar 31, 2022
Cite
Bright Data (2022). Amazon Dataset [Dataset]. https://brightdata.com/products/datasets/amazon
Available download formats: .json, .csv, .xlsx
Dataset updated
Mar 31, 2022
Dataset authored and provided by
Bright Data (https://brightdata.com/)
License
https://brightdata.com/license
Area covered
Worldwide
Description
Gain extensive insights with our Amazon datasets, encompassing detailed product information including pricing, reviews, ratings, brand names, product categories, sellers, ASINs, images, and much more. Ideal for market researchers, data analysts, and eCommerce professionals looking to excel in the competitive online marketplace.

- Over 425M records available
- Price starts at $250/100K records
- Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet
- 100% ethical and compliant data collection

Included datapoints:

Title Asin Main Image Brand Name Description Availability Subcategory Categories Parent Asin Type Product Type Name Model Number Manufacturer Color Size Date First Available Released Model Year Item Model Number Part Number Price Total Reviews Total Ratings Average Rating Features Best Sellers Rank Subcategory Buybox Buybox Seller Id Buybox Is Amazon Images Product URL And more
