100+ datasets found
  1. Registry of Open Data on AWS

    • registry.opendata.aws
    Updated Aug 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amazon Web Services (2021). Registry of Open Data on AWS [Dataset]. https://registry.opendata.aws/registry-open-data/
    Explore at:
    Dataset updated
    Aug 13, 2021
    Dataset provided by
    Amazon Web Serviceshttp://aws.amazon.com/
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Registry of Open Data on AWS contains publicly available datasets that are available for access from AWS resources. Note that datasets in this registry are available via AWS resources, but they are not provided by AWS; these datasets are owned and maintained by a variety of government organizations, researchers, businesses, and individuals. This dataset contains derived forms of the data in https://github.com/awslabs/open-data-registry that have been transformed for ease of use with machine interfaces. Currently, only the ndjson form of the registry is populated here.

  2. AWS Public Blockchain Data

    • registry.opendata.aws
    Updated Sep 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amazon Web Services (2022). AWS Public Blockchain Data [Dataset]. https://registry.opendata.aws/aws-public-blockchain/
    Explore at:
    Dataset updated
    Sep 23, 2022
    Dataset provided by
    Amazon Web Serviceshttp://aws.amazon.com/
    Description

    The AWS Public Blockchain Data initiative provides free access to blockchain datasets through collaboration with data providers. The data is optimized for analytics by being transformed into compressed Parquet files, partitioned by date for efficient querying.

    Datasets

    Blockchain dataset - Maintained by - Path:
    - Bitcoin - AWS - s3://aws-public-blockchain/v1.0/btc/
    - Ethereum - AWS - s3://aws-public-blockchain/v1.0/eth/
    - Arbitrum - SonarX - s3://aws-public-blockchain/v1.1/sonarx/arbitrum/
    - Aptos - SonarX - s3://aws-public-blockchain/v1.1/sonarx/aptos/
    - Base - SonarX - s3://aws-public-blockchain/v1.1/sonarx/base/
    - Provenance - SonarX - s3://aws-public-blockchain/v1.1/sonarx/provenance/
    - XRP Ledger - SonarX - s3://aws-public-blockchain/v1.1/sonarx/xrp/
    - Stellar(XDR files) - Stellar - s3://aws-public-blockchain/v1.1/stellar/
    - The Open Network (TON) - TON - s3://aws-public-blockchain/v1.1/ton/
    - Cronos - Cronos - s3://aws-public-blockchain/v1.1/cronos/

    Become a Data Provider

    We welcome additional blockchain data providers to join this initiative. If you're interested in contributing datasets to the AWS Public Blockchain Data program, please contact our team at aws-public-blockchain@amazon.com.

  3. R

    Aws Data Dataset

    • universe.roboflow.com
    zip
    Updated Nov 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    san vanwel (2023). Aws Data Dataset [Dataset]. https://universe.roboflow.com/san-vanwel-vduan/aws-data
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 14, 2023
    Dataset authored and provided by
    san vanwel
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Variables measured
    Object Polygons
    Description

    AWS Data

    ## Overview
    
    AWS Data is a dataset for instance segmentation tasks - it contains Object annotations for 2,886 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
    
  4. w

    Amazon Web Services - Public Data Sets

    • data.wu.ac.at
    Updated Oct 10, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Global (2013). Amazon Web Services - Public Data Sets [Dataset]. https://data.wu.ac.at/schema/datahub_io/NTYxNjkxNmYtNmZlNS00N2EwLWJkYTktZjFjZWJkNTM2MTNm
    Explore at:
    Dataset updated
    Oct 10, 2013
    Dataset provided by
    Global
    Description

    About

    From website:

    Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge for the community, and like all AWS services, users pay only for the compute and storage they use for their own applications. An initial list of data sets is already available, and more will be added soon.

    Previously, large data sets such as the mapping of the Human Genome and the US Census data required hours or days to locate, download, customize, and analyze. Now, anyone can access these data sets from their Amazon Elastic Compute Cloud (Amazon EC2) instances and start computing on the data within minutes. Users can also leverage the entire AWS ecosystem and easily collaborate with other AWS users. For example, users can produce or use prebuilt server images with tools and applications to analyze the data sets. By hosting this important and useful data with cost-efficient services such as Amazon EC2, AWS hopes to provide researchers across a variety of disciplines and industries with tools to enable more innovation, more quickly.

  5. Availability zones of AWS data centers APAC 2025, by market

    • statista.com
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Availability zones of AWS data centers APAC 2025, by market [Dataset]. https://www.statista.com/statistics/1609480/apac-aws-availability-zones-by-market/
    Explore at:
    Dataset updated
    Jul 9, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Apr 2025
    Area covered
    APAC
    Description

    As of April 2025, Amazon Wed Services (AWS) cloud data centers operated in ** markets in the Asia-Pacific region, with ** availability zones in total. An availability zone (AZs) is one or more separate data centers located within specific regions within which cloud services originate and operate. Each AZ has independent power, cooling, and physical security.

  6. AWS Cloudtrails Dataset from flaws.cloud

    • kaggle.com
    zip
    Updated Dec 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    N Y Kim (2023). AWS Cloudtrails Dataset from flaws.cloud [Dataset]. https://www.kaggle.com/datasets/nobukim/aws-cloudtrails-dataset-from-flaws-cloud
    Explore at:
    zip(316277173 bytes)Available download formats
    Dataset updated
    Dec 12, 2023
    Authors
    N Y Kim
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    featured here: https://medium.com/@george.fekkas/quick-and-dirty-cloudtrail-threat-hunting-log-analysis-b64af10ef923

    https://summitroute.com/blog/2020/10/09/public_dataset_of_cloudtrail_logs_from_flaws_cloud/

    The columns should be flattened. Some columns dropped because they are not good features for NVIDIA Morpheus digital fingerprinting autoencoders.

  7. Availability zones of AWS data centers worldwide 2024, by region

    • statista.com
    Updated Sep 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Availability zones of AWS data centers worldwide 2024, by region [Dataset]. https://www.statista.com/statistics/1491283/aws-availability-zones-globally-by-region/
    Explore at:
    Dataset updated
    Sep 24, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2024
    Area covered
    Worldwide
    Description

    Amazon Web Services (AWS) global cloud data centers operate in ** geographic regions, each containing several availability zones (AZs). As of 2024, Europe/Middle East/Africa and Asia Pacific and China had ** zones combined, which is over ** percent of all AWS' AZs.

  8. Test Private AWS S3 data. This is for TEST PURPOSES ONLY

    • catalog.data.gov
    Updated May 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOAA CoastWatch, West Coast Node (Point of Contact) (2021). Test Private AWS S3 data. This is for TEST PURPOSES ONLY [Dataset]. https://catalog.data.gov/dataset/test-private-aws-s3-data-this-is-for-test-purposes-only
    Explore at:
    Dataset updated
    May 28, 2021
    Dataset provided by
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    Description

    Test Private AWS S3 data. This is for TEST PURPOSES ONLY

  9. Amazon AWS SaaS Sales Dataset

    • kaggle.com
    Updated May 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nhat Thanh, Nguyen (2023). Amazon AWS SaaS Sales Dataset [Dataset]. https://www.kaggle.com/datasets/nnthanh101/aws-saas-sales
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 5, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Nhat Thanh, Nguyen
    License

    http://www.gnu.org/licenses/fdl-1.3.htmlhttp://www.gnu.org/licenses/fdl-1.3.html

    Description

    This dataset contains transaction data from a fictitious SaaS company selling sales and marketing software to other companies (B2B). In the dataset, each row represents a single transaction/order (9,994 transactions), and the columns include:

    Here is the Original Dataset: https://ee-assets-prod-us-east-1.s3.amazonaws.com/modules/337d5d05acc64a6fa37bcba6b921071c/v1/SaaS-Sales.csv

    Features

    | # | Name of the attribute | Description | | -- | --------------------- | -------------------------------------------------------- | | 1 | Row ID | A unique identifier for each transaction. | | 2 | Order ID | A unique identifier for each order. | | 3 | Order Date | The date when the order was placed. | | 4 | Date Key | A numerical representation of the order date (YYYYMMDD). | | 5 | Contact Name | The name of the person who placed the order. | | 6 | Country | The country where the order was placed. | | 7 | City | The city where the order was placed. | | 8 | Region | The region where the order was placed. | | 9 | Subregion | The subregion where the order was placed. | | 10 | Customer | The name of the company that placed the order. | | 11 | Customer ID | A unique identifier for each customer. | | 13 | Industry | The industry the customer belongs to. | | 14 | Segment | The customer segment (SMB, Strategic, Enterprise, etc.). | | 15 | Product | The product was ordered. | | 16 | License | The license key for the product. | | 17 | Sales | The total sales amount for the transaction. | | 18 | Quantity | The total number of items in the transaction. | | 19 | Discount | The discount applied to the transaction. | | 20 | Profit | The profit from the transaction. |

    Inspiration: The CRoss Industry Standard Process for Data Mining (CRISP-DM) CRISP-DM methodology

    • [ ] Understanding the business
    • [ ] Understanding the data
    • [x] Preparing the data
    • [ ] Modelling
    • [ ] Evaluating
    • [ ] Implementing the analysis.
  10. u

    DRI AWS Data

    • data.ucar.edu
    • ckanprod.data-commons.k8s.ucar.edu
    netcdf
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vanda GrubišiÄ (2025). DRI AWS Data [Dataset]. http://doi.org/10.26023/8625-B8TB-SR0W
    Explore at:
    netcdfAvailable download formats
    Dataset updated
    Oct 7, 2025
    Authors
    Vanda Grubišić
    Time period covered
    Mar 1, 2006 - Apr 30, 2006
    Area covered
    Description

    This data set represents the automatic weather station (AWS) data from the 16 stations of the Desert Research Institute network for the period 00 PST March 1 to 00 PST May 1, 2006 during the Terrain-induced Rotor Experiment (T-REX) field campaign. The data have a temporal resolution of 30 seconds, and are in netCDF format files.

  11. Amazon Web Services: Landsat GLS (Global Land Survey)

    • catalog.data.gov
    • data.amerigeoss.org
    • +1more
    Updated Sep 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AWS NEX (2025). Amazon Web Services: Landsat GLS (Global Land Survey) [Dataset]. https://catalog.data.gov/dataset/amazon-web-services-landsat-gls-global-land-survey
    Explore at:
    Dataset updated
    Sep 4, 2025
    Dataset provided by
    Amazon Web Serviceshttp://aws.amazon.com/
    Description

    In the past, the U.S. Geological Survey (USGS) and NASA collaborated on the creation of four global land data sets from Landsat images: one from the 1970s, and one each from circa 1990, 2000, and 2005. Each of these global data sets was created from the primary Landsat sensor in use at the time: the Multispectral Scanner (MSS) in the 1970s, the Thematic Mapper (TM) in 1990, Enhanced Thematic Mapper Plus (ETM+) in 2000, and a combination of TM and ETM+ in 2005.

  12. COVID-19 Data Lake

    • registry.opendata.aws
    Updated Apr 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amazon Web Services (2020). COVID-19 Data Lake [Dataset]. https://registry.opendata.aws/aws-covid19-lake/
    Explore at:
    Dataset updated
    Apr 8, 2020
    Dataset provided by
    Amazon Web Serviceshttp://aws.amazon.com/
    Description

    A centralized repository of up-to-date and curated datasets on or related to the spread and characteristics of the novel corona virus (SARS-CoV-2) and its associated illness, COVID-19. Globally, there are several efforts underway to gather this data, and we are working with partners to make this crucial data freely available and keep it up-to-date. Hosted on the AWS cloud, we have seeded our curated data lake with COVID-19 case tracking data from Johns Hopkins and The New York Times, hospital bed availability from Definitive Healthcare, and over 45,000 research articles about COVID-19 and related coronaviruses from the Allen Institute for AI.

  13. AWS Pricing Dataset

    • kaggle.com
    • huggingface.co
    zip
    Updated Nov 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sahil (2025). AWS Pricing Dataset [Dataset]. https://www.kaggle.com/datasets/justsahil/aws-pricing-dataset/code
    Explore at:
    zip(369440674 bytes)Available download formats
    Dataset updated
    Nov 29, 2025
    Authors
    Sahil
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The following data is pulled from AWS official pricing API. Contains all pricing data across AWS services

    Source: https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/using-price-list-query-api.html

  14. f

    ApRES Internal Layer Power and AWS Data

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Sep 25, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Todd, Joe; Hubbard, Alun; Brennan, Paul; Christoffersen, Poul; Kendrick, Alexander; Chu, Winnie; Doyle, Samuel; Hubbard, Bryn; Lok, Lai Bun; Nicholls, Keith; Young, Tun Jan; Schroeder, Dustin; Box, Jason E. (2018). ApRES Internal Layer Power and AWS Data [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000677349
    Explore at:
    Dataset updated
    Sep 25, 2018
    Authors
    Todd, Joe; Hubbard, Alun; Brennan, Paul; Christoffersen, Poul; Kendrick, Alexander; Chu, Winnie; Doyle, Samuel; Hubbard, Bryn; Lok, Lai Bun; Nicholls, Keith; Young, Tun Jan; Schroeder, Dustin; Box, Jason E.
    Description

    Radar and weather station data collected in 2014 as part of the Subglacial Access and Fast Ice Research Experiment (SAFIRE) and used to quantify englacial water storage in the paper, Surface Meltwater Impounded by Seasonal Englacial Storage in West Greenland. See the following link to the manuscript: https://doi.org/10.1029/2018GL079787For additional data from the same field campaign see:https://doi.org/10.6084/m9.figshare.5745294And items 112029 and 112009 located athttp://www.bgs.ac.uk/services/ngdc/accessions/index.html

  15. o

    NEXRAD on AWS

    • registry.opendata.aws
    Updated Apr 19, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2018). NEXRAD on AWS [Dataset]. https://registry.opendata.aws/noaa-nexrad/
    Explore at:
    Dataset updated
    Apr 19, 2018
    Dataset provided by
    <a href="https://www.unidata.ucar.edu/">Unidata</a>
    Description

    Real-time and archival data from the Next Generation Weather Radar (NEXRAD) network.

    Update

    The NEXRAD Level II archive data is moving to a new bucket: unidata-nexrad-level2 and SNS topic: arn:aws:sns:us-east-1:684042711724:NewNEXRADLevel2Archive. The old bucket and SNS topic are now deprecated and will no longer be available starting September 1, 2025.

  16. Amazon Cloud Locations

    • kaggle.com
    zip
    Updated Oct 7, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    iyoob_utexas (2020). Amazon Cloud Locations [Dataset]. https://www.kaggle.com/i2i2i2/amazon-cloud-locations
    Explore at:
    zip(843 bytes)Available download formats
    Dataset updated
    Oct 7, 2020
    Authors
    iyoob_utexas
    Description

    This data file lists approximate locations of Amazon Web Services (AWS) data centers around the world. Some of this was collected manually by searching local news articles on real estate purchases by Amazon in each region, and other information was obtained from https://www.datacenterdynamics.com/. Note that in most regions AWS has multiple data centers, and so the selected location may only reflect one of them in that region.

    This data is helpful for AWS users to quickly view where their assets are housed across the world and help them ensure that they are meeting information privacy guidelines.

  17. u

    ABLE Automatic Weather Station (AWS) Data

    • data.ucar.edu
    ascii
    Updated Oct 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Richard L. Coulter (2025). ABLE Automatic Weather Station (AWS) Data [Dataset]. http://doi.org/10.26023/VR0R-66BG-700G
    Explore at:
    asciiAvailable download formats
    Dataset updated
    Oct 7, 2025
    Authors
    Richard L. Coulter
    Time period covered
    May 20, 2003 - Jul 7, 2003
    Area covered
    Description

    This data set contains 1-minute resolution surface meteorological data from the Atmospheric Boundary Layer Experiments (ABLE) operated by the Argonne National Laboratory in the Walnut River Watershed in Butler County Kansas (east of Wichita). The ABLE Automated Weather Station (AWS) Network consists of five stations. Data cover the period from 20 May to 7 July 2003 The data are in columnar ASCII format.

  18. h

    Data from: data-science-on-aws

    • huggingface.co
    Updated Nov 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dao Xuan Tan (2025). data-science-on-aws [Dataset]. https://huggingface.co/datasets/tan-0909/data-science-on-aws
    Explore at:
    Dataset updated
    Nov 22, 2025
    Authors
    Dao Xuan Tan
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    tan-0909/data-science-on-aws dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. a

    Mt. Hunter AWS data

    • arcticdata.io
    • search.dataone.org
    Updated May 18, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karl Kreutz (2020). Mt. Hunter AWS data [Dataset]. http://doi.org/10.18739/A2C946
    Explore at:
    Dataset updated
    May 18, 2020
    Dataset provided by
    Arctic Data Center
    Authors
    Karl Kreutz
    Time period covered
    Jun 6, 2013 - Sep 22, 2013
    Area covered
    Description

    Meteorological data and images collected on the Mt. Hunter plateau, Denali National Park, Alaska. Data were collected with an automatic weather station using instrumentation from Campbell Scientific. Large-scale atmospheric circulation systems affect the geographic distribution of precipitation in western North America, yet little is known about how these systems may have varied before the instrumental period of the last 150 years. The main goal of this project is to reconstruct the history of precipitation in Alaska during the last thousand years using ice core records of snow accumulation. The researchers plan to collect several new ice cores from the Mt. Hunter Plateau in the Alaska Range of Denali National Park and the new ice cores will be combined with an existing spatial array of ice cores in the region to map changes in the spatial patterns of precipitation. Because changes in atmospheric circulation patterns caused by ENSO and the Pacific Decadal Oscillation (PDO) affect where the precipitation falls, this spatial array of ice cores will provide a record of how these larger scale climate systems have varied during the last thousand years. The project will focus on determining the differences in the precipitation patterns at the Little Ice Age (approximately 200 to 600 years ago) and Medieval Climate Anomaly (approximately 800 to 1,200 years ago).

  20. Amazon Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Mar 31, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2022). Amazon Dataset [Dataset]. https://brightdata.com/products/datasets/amazon
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Mar 31, 2022
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Gain extensive insights with our Amazon datasets, encompassing detailed product information including pricing, reviews, ratings, brand names, product categories, sellers, ASINs, images, and much more. Ideal for market researchers, data analysts, and eCommerce professionals looking to excel in the competitive online marketplace. Over 425M records available Price starts at $250/100K records Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet. 100% ethical and compliant data collection Included datapoints:

    Title Asin Main Image Brand Name Description Availability Subcategory Categories Parent Asin Type Product Type Name Model Number Manufacturer Color Size Date First Available Released Model Year Item Model Number Part Number Price Total Reviews Total Ratings Average Rating Features Best Sellers Rank Subcategory Buybox Buybox Seller Id Buybox Is Amazon Images Product URL And more

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Amazon Web Services (2021). Registry of Open Data on AWS [Dataset]. https://registry.opendata.aws/registry-open-data/
Organization logo

Registry of Open Data on AWS

Explore at:
Dataset updated
Aug 13, 2021
Dataset provided by
Amazon Web Serviceshttp://aws.amazon.com/
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

The Registry of Open Data on AWS contains publicly available datasets that are available for access from AWS resources. Note that datasets in this registry are available via AWS resources, but they are not provided by AWS; these datasets are owned and maintained by a variety of government organizations, researchers, businesses, and individuals. This dataset contains derived forms of the data in https://github.com/awslabs/open-data-registry that have been transformed for ease of use with machine interfaces. Currently, only the ndjson form of the registry is populated here.

Search
Clear search
Close search
Google apps
Main menu