32 datasets found
  1. Sales of the H&M Group worldwide 2006-2024

    • statista.com
    • ai-chatbox.pro
    Updated Jun 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Sales of the H&M Group worldwide 2006-2024 [Dataset]. https://www.statista.com/statistics/252190/gross-sales-of-the-h-und-m-group-worldwide/
    Explore at:
    Dataset updated
    Jun 2, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    This statistic depicts the sales of the H&M Group worldwide from 2006 to 2024. In the fiscal year 2024, global net sales of the H&M Group amounted to about 234 billion Swedish kronor. H&MH&M offers a broad and varied range of fashion including collections for women, men, teenagers and children. The range also includes sportswear, underwear, shoes, accessories and cosmetics, as well as home textiles and decorations from H&M Home.Germany is H&M's largest market. In 2024, over 35 billion Swedish kronor were generated from that country alone. The company operates roughly 4,253 stores worldwide and employs approximately 97,710 people. H&M dropped out of the top ten most valuable apparel brands in the world as of 2023.H&M aims to be a more sustainable choice for today’s increasingly aware customers. To this end, H&M’s investments in social improvements and reduced environmental impact extend throughout the product life cycle – from responsible use of natural resources to ensuring good working conditions at suppliers’ factories. Sustainability work is thoroughly integrated into the business, not only because it is an investment in the customer offering,but also because it is vital to the group’s long-term growth and development. However, there have been questions raised as to how effective and trustworthy H&M's sustainability practices really are.

  2. h

    clothes_desc

    • huggingface.co
    Updated Apr 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wolfgang Bensvage (2025). clothes_desc [Dataset]. https://huggingface.co/datasets/wbensvage/clothes_desc
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 21, 2025
    Authors
    Wolfgang Bensvage
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for H&M Clothes captions

    _Dataset used to train/finetune [Clothes text to image model] Captions are generated by using the 'detail_desc' and 'colour_group_name' or 'perceived_colour_master_name' from kaggle/competitions/h-and-m-personalized-fashion-recommendations. Original images were also obtained from the url (https://www.kaggle.com/competitions/h-and-m-personalized-fashion-recommendations/data?select=images)

      For each row the dataset contains image and text… See the full description on the dataset page: https://huggingface.co/datasets/wbensvage/clothes_desc.
    
  3. h

    h-and-m-fashion-caption

    • huggingface.co
    Updated Jun 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomy Tjandra (2025). h-and-m-fashion-caption [Dataset]. https://huggingface.co/datasets/tomytjandra/h-and-m-fashion-caption
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 5, 2025
    Authors
    Tomy Tjandra
    Description

    Dataset Card for "h-and-m-fashion-caption"

    More Information needed

  4. USA Hispanic-White Wage Gap Dataset (1973-2022)

    • kaggle.com
    Updated Nov 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    asaniczka (2023). USA Hispanic-White Wage Gap Dataset (1973-2022) [Dataset]. https://www.kaggle.com/datasets/asaniczka/usa-hispanic-white-wage-gap-dataset-1973-2022
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 7, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    asaniczka
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset provides valuable insights into the wage gap between Hispanic and White workers in the United States.

    The wage gap is expressed as a percentage by which hourly wages of Hispanic workers are less than those of White workers.

    It is an essential measure for understanding income disparities and examining trends over time.

    Interesting Task Ideas:

    1. Analyze the Hispanic-White wage gap trends over the years.
    2. Explore the variations in median and average wages between Hispanic and White workers.
    3. Investigate the gender-specific wage gap and identify any disparities between Hispanic and White men and women.
    4. Conduct regression analysis to analyze the wage gap while controlling for relevant factors such as education, age, and geographic location.
    5. Visualize the wage gap data to emphasize the magnitude of disparities and highlight any patterns over time.
    6. Discover the potential effects of policy changes or economic transformations on the wage gap.

    If you find this dataset insightful, don't forget to upvote it! 😊💝

    Checkout my other datasets

    Poverty-Level Wages in the USA Dataset

    1.4M Amazon Products

    Black-White Wage Gap in the USA Dataset

    Clash of Clans Clans Dataset 2023 (3.5M Clans)

    Productivity and Hourly Compensation

    Photo by Clay Banks on Unsplash

  5. Z

    Data from: SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Mar 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tsiakoulis, Pirros (2025). SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7119399
    Explore at:
    Dataset updated
    Mar 7, 2025
    Dataset provided by
    Nikitaras, Karolos
    Klapsas, Konstantinos
    Tsiakoulis, Pirros
    Ellinas, Nikolaos
    Jho, Gunu
    Vioni, Alexandra
    Chalamandaris, Aimilios
    Sung, June Sig
    Maniati, Georgia
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This is the public release of the Samsung Open Mean Opinion Scores (SOMOS) dataset for the evaluation of neural text-to-speech (TTS) synthesis, which consists of audio files generated with a public domain voice from trained TTS models based on bibliography, and numbers assigned to each audio as quality (naturalness) evaluations by several crowdsourced listeners.DescriptionThe SOMOS dataset contains 20,000 synthetic utterances (wavs), 100 natural utterances and 374,955 naturalness evaluations (human-assigned scores in the range 1-5). The synthetic utterances are single-speaker, generated by training several Tacotron-like acoustic models and an LPCNet vocoder on the LJ Speech voice public dataset. 2,000 text sentences were synthesized, selected from Blizzard Challenge texts of years 2007-2016, the LJ Speech corpus as well as Wikipedia and general domain data from the Internet.Naturalness evaluations were collected via crowdsourcing a listening test on Amazon Mechanical Turk in the US, GB and CA locales. The records of listening test participants (workers) are fully anonymized. Statistics on the reliability of the scores assigned by the workers are also included, generated through processing the scores and validation controls per submission page.

    To listen to audio samples of the dataset, please see our Github page.

    The dataset release comes with a carefully designed train-validation-test split (70%-15%-15%) with unseen systems, listeners and texts, which can be used for experimentation on MOS prediction.

    This version also contains the necessary resources to obtain the transcripts corresponding to all dataset audios.

    Terms of use

    The dataset may be used for research purposes only, for non-commercial purposes only, and may be distributed with the same terms.

    Every time you produce research that has used this dataset, please cite the dataset appropriately.

    Cite as:

    @inproceedings{maniati22_interspeech, author={Georgia Maniati and Alexandra Vioni and Nikolaos Ellinas and Karolos Nikitaras and Konstantinos Klapsas and June Sig Sung and Gunu Jho and Aimilios Chalamandaris and Pirros Tsiakoulis}, title={{SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis}}, year=2022, booktitle={Proc. Interspeech 2022}, pages={2388--2392}, doi={10.21437/Interspeech.2022-10922} }

    References of resources & models used

    Voice & synthesized texts:K. Ito and L. Johnson, “The LJ Speech Dataset,” https://keithito.com/LJ-Speech-Dataset/, 2017.

    Vocoder:J.-M. Valin and J. Skoglund, “LPCNet: Improving neural speech synthesis through linear prediction,” in Proc. ICASSP, 2019.R. Vipperla, S. Park, K. Choo, S. Ishtiaq, K. Min, S. Bhattacharya, A. Mehrotra, A. G. C. P. Ramos, and N. D. Lane, “Bunched lpcnet: Vocoder for low-cost neural text-to-speech systems,” in Proc. Interspeech, 2020.

    Acoustic models:N. Ellinas, G. Vamvoukakis, K. Markopoulos, A. Chalamandaris, G. Maniati, P. Kakoulidis, S. Raptis, J. S. Sung, H. Park, and P. Tsiakoulis, “High quality streaming speech synthesis with low, sentence-length-independent latency,” in Proc. Interspeech, 2020.Y. Wang, R. Skerry-Ryan, D. Stanton, Y. Wu, R. J. Weiss, N. Jaitly, Z. Yang, Y. Xiao, Z. Chen, S. Bengio et al., “Tacotron: Towards End-to-End Speech Synthesis,” in Proc. Interspeech, 2017.J. Shen, R. Pang, R. J. Weiss, M. Schuster, N. Jaitly, Z. Yang, Z. Chen, Y. Zhang, Y. Wang, R. Skerrv-Ryan et al., “Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions,” in Proc. ICASSP, 2018.J. Shen, Y. Jia, M. Chrzanowski, Y. Zhang, I. Elias, H. Zen, and Y. Wu, “Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling,” arXiv preprint arXiv:2010.04301, 2020.M. Honnibal and M. Johnson, “An Improved Non-monotonic Transition System for Dependency Parsing,” in Proc. EMNLP, 2015.M. Dominguez, P. L. Rohrer, and J. Soler-Company, “PyToBI: A Toolkit for ToBI Labeling Under Python,” in Proc. Interspeech, 2019.Y. Zou, S. Liu, X. Yin, H. Lin, C. Wang, H. Zhang, and Z. Ma, “Fine-grained prosody modeling in neural speech synthesis using ToBI representation,” in Proc. Interspeech, 2021.K. Klapsas, N. Ellinas, J. S. Sung, H. Park, and S. Raptis, “WordLevel Style Control for Expressive, Non-attentive Speech Synthesis,” in Proc. SPECOM, 2021.T. Raitio, R. Rasipuram, and D. Castellani, “Controllable neural text-to-speech synthesis using intuitive prosodic features,” in Proc. Interspeech, 2020.

    Synthesized texts from the Blizzard Challenges 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2016:M. Fraser and S. King, "The Blizzard Challenge 2007," in Proc. SSW6, 2007.V. Karaiskos, S. King, R. A. Clark, and C. Mayo, "The Blizzard Challenge 2008," in Proc. Blizzard Challenge Workshop, 2008.A. W. Black, S. King, and K. Tokuda, "The Blizzard Challenge 2009," in Proc. Blizzard Challenge, 2009.S. King and V. Karaiskos, "The Blizzard Challenge 2010," 2010.S. King and V. Karaiskos, "The Blizzard Challenge 2011," 2011.S. King and V. Karaiskos, "The Blizzard Challenge 2012," 2012.S. King and V. Karaiskos, "The Blizzard Challenge 2013," 2013.S. King and V. Karaiskos, "The Blizzard Challenge 2016," 2016.

    Contact

    Alexandra Vioni - a.vioni@samsung.com

    If you have any questions or comments about the dataset, please feel free to write to us.

    We are interested in knowing if you find our dataset useful! If you use our dataset, please email us and tell us about your research.

  6. h

    h-and-m-fashion-caption-12k

    • huggingface.co
    Updated Feb 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomy Tjandra (2024). h-and-m-fashion-caption-12k [Dataset]. https://huggingface.co/datasets/tomytjandra/h-and-m-fashion-caption-12k
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 17, 2024
    Authors
    Tomy Tjandra
    Description

    tomytjandra/h-and-m-fashion-caption-12k dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. u

    Wildfire Risk to Communities: Spatial datasets of wildfire risk for...

    • agdatacommons.nal.usda.gov
    bin
    Updated Jan 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Melissa R. Jaffe; Joe H. Scott; Michael N. Callahan; Gregory K. Dillon; Eva C. Karau; Mitchell T. Lazarz (2025). Wildfire Risk to Communities: Spatial datasets of wildfire risk for populated areas in the United States: 2nd edition [Dataset]. http://doi.org/10.2737/RDS-2020-0060-2
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 22, 2025
    Dataset provided by
    Forest Service Research Data Archive
    Authors
    Melissa R. Jaffe; Joe H. Scott; Michael N. Callahan; Gregory K. Dillon; Eva C. Karau; Mitchell T. Lazarz
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    United States
    Description

    The data included in this publication depict components of wildfire risk specifically for populated areas in the United States. These datasets represent where people live in the United States and the in situ risk from wildfire, i.e., the risk at the location where the adverse effects take place.

    National wildfire hazard datasets of annual burn probability and fire intensity, generated by the USDA Forest Service, Rocky Mountain Research Station and Pyrologix LLC, form the foundation of the Wildfire Risk to Communities data. Vegetation and wildland fuels data from LANDFIRE 2020 (version 2.2.0) were used as input to two different but related geospatial fire simulation systems. Annual burn probability was produced with the USFS geospatial fire simulator (FSim) at a relatively coarse cell size of 270 meters (m). To bring the burn probability raster data down to a finer resolution more useful for assessing hazard and risk to communities, we upsampled them to the native 30 m resolution of the LANDFIRE fuel and vegetation data. In this upsampling process, we also spread values of modeled burn probability into developed areas represented in LANDFIRE fuels data as non-burnable. Burn probability rasters represent landscape conditions as of the end of 2020. Fire intensity characteristics were modeled at 30 m resolution using a process that performs a comprehensive set of FlamMap runs spanning the full range of weather-related characteristics that occur during a fire season and then integrates those runs into a variety of results based on the likelihood of those weather types occurring. Before the fire intensity modeling, the LANDFIRE 2020 data were updated to reflect fuels disturbances occurring in 2021 and 2022. As such, the fire intensity datasets represent landscape conditions as of the end of 2022. The data products in this publication that represent where people live, reflect 2020 estimates of housing units and 2021 estimates of population counts from the U.S. Census Bureau, combined with building footprint data from Onegeo and USA Structures, both reflecting 2022 conditions.

    The specific raster datasets included in this publication include:

    Building Count: Building Count is a 30-m raster representing the count of buildings in the building footprint dataset located within each 30-m pixel.

    Building Density: Building Density is a 30-m raster representing the density of buildings in the building footprint dataset (buildings per square kilometer [km²]).

    Building Coverage: Building Coverage is a 30-m raster depicting the percentage of habitable land area covered by building footprints.

    Population Count (PopCount): PopCount is a 30-m raster with pixel values representing residential population count (persons) in each pixel.

    Population Density (PopDen): PopDen is a 30-m raster of residential population density (people/km²).

    Housing Unit Count (HUCount): HUCount is a 30-m raster representing the number of housing units in each pixel.

    Housing Unit Density (HUDen): HUDen is a 30-m raster of housing-unit density (housing units/km²).

    Housing Unit Exposure (HUExposure): HUExposure is a 30-m raster that represents the expected number of housing units within a pixel potentially exposed to wildfire in a year. This is a long-term annual average and not intended to represent the actual number of housing units exposed in any specific year.

    Housing Unit Impact (HUImpact): HUImpact is a 30-m raster that represents the relative potential impact of fire to housing units at any pixel, if a fire were to occur. It is an index that incorporates the general consequences of fire on a home as a function of fire intensity and uses flame length probabilities from wildfire modeling to capture likely intensity of fire.

    Housing Unit Risk (HURisk): HURisk is a 30-m raster that integrates all four primary elements of wildfire risk - likelihood, intensity, susceptibility, and exposure - on pixels where housing unit density is greater than zero.The geospatial data products described and distributed here are part of the Wildfire Risk to Communities project. This project was directed by Congress in the 2018 Consolidated Appropriations Act (i.e., 2018 Omnibus Act, H.R. 1625, Section 210: Wildfire Hazard Severity Mapping) to help U.S. communities understand components of their relative wildfire risk profile, the nature and effects of wildfire risk, and actions communities can take to mitigate risk. The first edition of these data represented the first time wildfire risk to communities had been mapped nationally with consistent methodology. They provided foundational information for comparing the relative wildfire risk among populated communities in the United States. In this version, the 2nd edition, we use improved modeling and mapping methodology and updated input data to generate the current suite of products.See the Wildfire Risk to Communities website at https://www.wildfirerisk.org for complete project information and an interactive web application for exploring some of the datasets published here. We deliver the data here as zip files by U.S. state (including AK and HI), and for the full extent of the continental U.S.

    This data publication is a second edition and represents an update to any previous versions of Wildfire Risk to Communities risk datasets published by the USDA Forest Service. This second edition was originally published on 06/03/2024. On 09/10/2024, a minor correction was made to the abstract in this overall metadata document as well as the individual metadata documents associated with each raster dataset. The supplemental file containing data product descriptions was also updated. In addition, we separated the large CONUS download into a series of smaller zip files (one for each layer).

    There are two companion data publications that are part of the WRC 2.0 data update: one that characterizes landscape-wide wildfire hazard and risk for the nation (Scott et al. 2024, https://doi.org/10.2737/RDS-2020-0016-2), and one that delineates wildfire risk reduction zones and provides tabular summaries of wildfire hazard and risk raster datasets (Dillon et al. 2024, https://doi.org/10.2737/RDS-2024-0030).

  8. d

    B2B Audience Targeting Data | 2.4M US Human Resources Professional Contact...

    • datarade.ai
    .csv, .xls
    Updated Dec 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allforce (formerly Solution Publishing) (2024). B2B Audience Targeting Data | 2.4M US Human Resources Professional Contact Data Set | B2B Contact Data | Verified Safe to Email [Dataset]. https://datarade.ai/data-products/b2b-audience-targeted-data-2-4m-us-human-resources-professi-solution-publishing
    Explore at:
    .csv, .xlsAvailable download formats
    Dataset updated
    Dec 17, 2024
    Dataset authored and provided by
    Allforce (formerly Solution Publishing)
    Area covered
    United States of America
    Description

    Allforce is a leading data intelligence company specializing in comprehensive audience targeting solutions. We maintain one of the most extensive and accurate databases of professional contact information, with a focus on delivering verified, actionable data that drives measurable marketing results for our clients.

    Dataset Overview: Our US Human Resources Professional Contact Database provides access to 2.4 million verified HR professionals across 475,000 companies nationwide. This premium dataset is specifically curated for B2B marketers seeking to connect with decision-makers in the HR ecosystem.

    Key Features & Benefits: 2.4M+ HR professionals across all specialties 475,000+ companies represented Segmented by HR function: Benefits, Payroll, Recruiting, Training, Compensation, and more Decision-maker level contacts included

    Data Quality & Verification: LinkedIn URL verification for each contact Regular database updates and maintenance High deliverability rates (Email Safe certification) Active professional verification process

    Multi-Channel Marketing Support: Email addresses (newsletter-safe, verified deliverable) Direct phone numbers for telemarketing Postal addresses for direct mail campaigns LinkedIn profile matching for social outreach Digital advertising - Programmatic audiences

    Data Compliance & Safety: All data is collected and maintained in compliance with applicable privacy regulations. Our "Safe to Email" certification ensures subscribers have opted into professional communications, reducing bounce rates and compliance risks.

    Industries Served: Healthcare, Technology, Manufacturing, Financial Services, Retail, Education, Government, and all major industry verticals with HR departments.

    Transform your HR marketing strategy with verified, actionable contact data that delivers results.

  9. Dataset

    • datasets.ai
    • catalog.data.gov
    • +1more
    0, 33
    Updated Sep 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Environmental Protection Agency (2024). Dataset [Dataset]. https://datasets.ai/datasets/dataset
    Explore at:
    0, 33Available download formats
    Dataset updated
    Sep 8, 2024
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Authors
    U.S. Environmental Protection Agency
    Description

    Publicly available weblinks used to develop research project.

    This dataset is associated with the following publication: Hall, E., R. Hall, J. Aron, S. Swanson, M. Philbin, R. Schaefer, T. Jones-Lepp, D. Heggem, J. Lin, E. Wilson, and H. Kahan. An Ecological Function Approach to Managing Harmful Cyanobacteria in Three Oregon Lakes: Beyond Water Quality Advisories and Total Maximum Daily Loads (TMDLs). WATER. MDPI AG, Basel, SWITZERLAND, 11(6): 1125, (2019).

  10. u

    Data from: Measured and simulated carbon dynamics in Midwestern US...

    • agdatacommons.nal.usda.gov
    • s.cnmilf.com
    • +1more
    txt
    Updated Feb 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Dold; Ken M. Wacha; Tom J. Sauer; Jerry L. Hatfield; John H. Prueger (2024). Data from: Measured and simulated carbon dynamics in Midwestern US corn-soybean rotations [Dataset]. http://doi.org/10.15482/USDA.ADC/1518763
    Explore at:
    txtAvailable download formats
    Dataset updated
    Feb 13, 2024
    Dataset provided by
    Ag Data Commons
    Authors
    Christian Dold; Ken M. Wacha; Tom J. Sauer; Jerry L. Hatfield; John H. Prueger
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    The dataset includes information on soil properties collected from two conventionally managed fields under corn-soybean rotation in October 2005 and October 2016, respectively. The fields are located in Story County, Central Iowa. 42 sampling locations per field and year were sampled within a 50 m × 50 m grid, and 1 to 2 samples per location were collected using a hydraulic soil sampler (d= 38.2 mm) from the 0 - 120 cm soil layer. The samples were analyzed to determine carbon and nitrogen concentration, and soil pH in five soil layers (0-15, 15-30, 30-60, 60-90, and 90-120 cm). Presented is the raw data per location (mean of duplicates) with which carbon and nitrogen content can be calculated with either the equivalent soil mass method or by using bulk density. For more information about this dataset contact: Tom Sauer: tom.sauer@usda.gov Christian Dold: c.dold@fz-juelich.de Resources in this dataset:Resource Title: Data Dictionary. File Name: data-dictionary_CD.csvResource Description: Defines variables and their units for the data table, also provided as a separate sheet within xlsx file.Resource Title: Soil Properties. File Name: Soil Chemical Analysis.xlsx

  11. d

    Methane Transect Dataset

    • datasets.ai
    • catalog.data.gov
    • +1more
    53
    Updated Aug 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Environmental Protection Agency (2024). Methane Transect Dataset [Dataset]. https://datasets.ai/datasets/methane-transect-dataset
    Explore at:
    53Available download formats
    Dataset updated
    Aug 28, 2024
    Dataset authored and provided by
    U.S. Environmental Protection Agency
    Description

    This project area “OTM 33 Mobile Emission Measurements” covers research on remote emissions quantification with the various forms of mobile monitoring approaches. There will be multiple data sets included in this project. The metadata and data dictionaries are included with each data set. The data sets with metadata and data dictionaries are as follows:

    Methane Transect data Set: The dataset contains mobile methane concentration measurements acquired at 10 samples per second data acquisition rate, collected while driving along transects downwind of a methane source. The data consists of GPS coordinates, methane concetrion data and transect indicators. The controlled methane release experiment was conducted on May 15, 2010 in Durham, North Carolina, where three passes were made for the one CR experiment and the point-source release rate was controlled at S = 0.6 g/s. Additionally, the dataset contains four field studies conducted in Colorado on four separate days in July 2010, with the number of passes for each study ranging from two to five.

    This dataset is associated with the following publication: Albertson, J., T. Foster-Wittig, A. Swingler, G. Foderaro, S. Ferrari, S. Amin, M. Modrak, H. Brantley , and E. Thoma. A Mobile Sensing Approach for Regional Surveillance of Fugitive Methane Emissions in Oil and Gas Production. ENVIRONMENTAL SCIENCE & TECHNOLOGY. American Chemical Society, Washington, DC, USA, 50(5): 2487-2497, (2015).

  12. National Aquatic Resources Survey datasets

    • catalog.data.gov
    • datasets.ai
    Updated Mar 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2021). National Aquatic Resources Survey datasets [Dataset]. https://catalog.data.gov/dataset/national-aquatic-resources-survey-datasets
    Explore at:
    Dataset updated
    Mar 11, 2021
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    The 4 resource surveys (coastal, rivers and streams, lakes and reservoirs, and wetlands) each have datasets covering the biological, chemical, physical habitat, hydrologic and watershed data. This dataset is associated with the following publications: Stoddard , J., J. Van Sickle, A. Herlihy, J. Brahney, S. Paulsen , D. Peck , R. Mitchell , and A. Pollard. Continental-scale increase in stream and lake phosphorus: Are oligotrophic systems disappearing in the U.S.?. ENVIRONMENTAL SCIENCE & TECHNOLOGY. American Chemical Society, Washington, DC, USA, 50(7): 3409-3415, (2016). Herlihy, A., M. Kentula, T. Magee, G. Lomnicky, A. Nahlik, and G. Serenbetz. Striving for consistency in the National Wetland Condition Assessment: developing a reference condition approach for assessing wetlands at a continental scale. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 327, (2019). Magee, T., K. Blocksom, and S. Fennessy. A national-scale vegetation multimetric index (VMMI) as an indicator of wetland condition across the conterminous United States.. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 322, (2019). Herlihy, A., J. Sifneos, G. Lomnicky, A. Nahlik, M. Kentula, T. Magee, M. Weber, and A. Trebitz. The response of wetland quality indicators to human disturbance indicators across the United States. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 296, (2019). Herlihy, A., S. Paulsen, M. Kentula, T. Magee, A. Nahlik, and G. Lomnicky. Assessing the relative and attributable risk of stressors to wetland condition across the conterminous United States. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 320, (2019). Lomnicky, G., A.T. Herlihy, and P. Kaufmann. Quantifying the extent of human disturbance activities and anthropogenic stressors in wetlands across the conterminous United States: results from the National Wetland Condition Assessment. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 324, (2019). Bowen, G., A. Putman, J.R. Brooks, D. Bowling, E. Oerter, and S. Good. Inferring the source of evaporated waters using stable H and O isotopes.. OECOLOGIA. Springer, New York, NY, USA, 187(4): 1025-1039, (2018). Fox, E., J. Ver Hoef, and T. Olsen. Comparing Spatial Regression to Random Forests for Large Environmental Data Sets.. PLoS ONE. Public Library of Science, San Francisco, CA, USA, 15(3): e0229509, (2020). Nahlik, A., K. Blocksom, A. Herlihy, M. Kentula, T. Magee, and S. Paulsen. Use of national-scale data to examine human-mediated additions of heavy metals to wetland soils of the US. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 336, (2019). Kentula, M., and S. Paulsen. The 2011 National Wetland Condition Assessment: Overview and an Invitation. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 325, (2019). Magee, T., K. Blocksom, A. Herlihy, and A. Nahlik. Characterizing nonnative plants in wetlands across the conterminous United States. ENVIRONMENTAL MONITORING AND ASSESSMENT. Springer, New York, NY, USA, 191: 344, (2019). Feio, M., R. Hughes, M. Callisto, S.J. Nichols, O.N. Odume, B.R. Quintella, M. Kuemmerlen, F.C. Aguiar, S.F.P. Almeida, P. Alonso-EguíaLis , F.O. Arimoro, F.J. Dyer , J.S. Harding , S. Jang , P. Kaufmann, S. Lee, J. Li, D.R. Macedo, A. Mendes, N. Mercado-Silva , W. Monk, K. Nakamura, G.G. Ndiritu , R. Ogden , M. Peat , T.B. Reynoldson , B. Rios-Touma , P. Segurado , and A.G. Yates. The biological assessment and rehabilitation of the world’s rivers: an overview. WATER. MDPI AG, Basel, SWITZERLAND, 13(3): 371, (2021).

  13. G

    Canadian Large Ensembles Adjusted Dataset version 1 (CanLEADv1)

    • open.canada.ca
    netcdf
    Updated Jun 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Environment and Climate Change Canada (2024). Canadian Large Ensembles Adjusted Dataset version 1 (CanLEADv1) [Dataset]. https://open.canada.ca/data/en/dataset/a97edbc1-7fda-4ebc-b135-691505d9a595
    Explore at:
    netcdfAvailable download formats
    Dataset updated
    Jun 9, 2024
    Dataset provided by
    Environment and Climate Change Canada
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Time period covered
    Jan 1, 1950 - Dec 31, 2100
    Area covered
    Canada
    Description

    The dataset contains large ensembles of bias adjusted daily climate model outputs of minimum temperature, maximum temperature, precipitation, relative humidity, surface pressure, wind speed, incoming shortwave radiation, and incoming longwave radiation on a 0.5-degree grid over North America. Intended uses include hydrological/land surface impact modelling and related event attribution studies. The CanLEADv1 dataset is based on archived climate model simulations in the Canadian Regional Climate Model Large Ensemble (CanRCM4 LE) https://open.canada.ca/data/en/dataset/83aa1b18-6616-405e-9bce-af7ef8c2031c and Canadian Earth System Model Large Ensembles (CanESM2 LE) https://open.canada.ca/data/en/dataset/aa7b6823-fd1e-49ff-a6fb-68076a4a477c datasets. Specifically, CanLEADv1 provides bias adjusted daily climate variables over North America derived from 50 member initial condition ensembles of CanESM2 (ALL and NAT radiative forcings) and CanESM2-driven CanRCM4 (ALL radiative forcings) simulations (Scinocca et al., 2016; Fyfe et al., 2017). Raw CanESM2 LE and CanRCM4 LE outputs are bias adjusted (Cannon, 2018; Cannon et al., 2015) so that they are statistically consistent with two observationally-constrained historical meteorological forcing datasets (S14FD, Iizumi et al., 2017; EWEMBI, Lange, 2018). File names, formats, and metadata headers follow the recommended Data Reference Syntax for bias-adjusted Coordinated Regional Downscaling Experiment (CORDEX) simulations (Nikulin and Legutke, 2016). Multiple initial condition simulations can be used to investigate the externally forced response, internal variability, and the relative role of external forcing and internal variability on the climate system (e.g., Fyfe et al., 2017). Large ensembles of ALL and NAT simulations can be compared in event attribution studies (e.g., Kirchmeier-Young et al., 2017). Availability of bias adjusted outputs from the CanESM2-CanRCM4 modelling system can be used to investigate the added value of dynamical downscaling (Scinocca et al., 2016). Multiple observational datasets are used for bias adjustment to partly account for observational uncertainty (Iizumi et al., 2017). For CanESM2 LE, there are two sets of radiative forcing scenarios (ALL, which consists of historical and RCP8.5 forcings for the periods 1950-2005 and 2006-2100, respectively, and NAT, which consists of historicalNat forcings for the period 1950-2020), two observationally-constrained target datasets for bias adjustment (S14FD and EWEMBI), and 50 ensemble members, which gives a total of 2 × 2 × 50 = 200 sets of outputs. For CanRCM4 LE, historicalNat simulations were not run; hence, there are 2 × 50 = 100 sets of outputs. In both cases, CanLEADv1 provides variables on the CORDEX NAM-44i 0.5-degree grid. CanESM2 outputs (~2.8-degree grid) and CanRCM4 outputs (0.44-degree grid), are bilinearly interpolated onto the NAM-44i grid before bias adjustment. A multivariate version of quantile mapping (Cannon, 2018) is used to adjust the distribution of each simulated variable, as well as the statistical dependence between variables, so that these properties match those of the target observational dataset. Bias adjustment is performed on a grid cell by grid cell basis. Outside of the historical calibration period, the climate change signal simulated by the climate model is preserved (Cannon et al., 2015). References: Cannon, A. J. (2018). Multivariate quantile mapping bias correction: an N-dimensional probability density function transform for climate model simulations of multiple variables. Climate Dynamics, 50(1-2), 31-49. Cannon, A. J., Sobie, S. R., & Murdock, T. Q. (2015). Bias correction of GCM precipitation by quantile mapping: How well do methods preserve changes in quantiles and extremes? Journal of Climate, 28(17), 6938-6959. Fyfe, J. C., Derksen, C., Mudryk, L., Flato, G. M., Santer, B. D., Swart, N. C., Molotch, N. P., Zhang, X., Wan, H., Arora, V. K., Scinocca, J., & Jiao, Y. (2017). Large near-term projected snowpack loss over the western United States. Nature Communications, 8, 14996. Iizumi, T., Takikawa, H., Hirabayashi, Y., Hanasaki, N., & Nishimori, M. (2017). Contributions of different bias-correction methods and reference meteorological forcing data sets to uncertainty in projected temperature and precipitation extremes. Journal of Geophysical Research: Atmospheres, 122(15), 7800-7819. Kirchmeier-Young, M. C., Zwiers, F. W., Gillett, N. P., & Cannon, A. J. (2017). Attributing extreme fire risk in Western Canada to human emissions. Climatic Change, 144(2), 365-379. Lange, S. (2018). Bias correction of surface downwelling longwave and shortwave radiation for the EWEMBI dataset. Earth System Dynamics, 9(2), 627-645. Nikulin, G., & Legutke, S. (2016). Data Reference Syntax (DRS) for bias-adjusted CORDEX simulations. https://is-enes-data.github.io/CORDEX_adjust_drs.pdf Scinocca, J. F., Kharin, V. V., Jiao, Y., Qian, M. W., Lazare, M., Solheim, L., Flato, G. M., Biner, S., Desgagne, & Dugas, B. (2016). Coordinated global and regional climate modeling. Journal of Climate, 29(1), 17-35.

  14. d

    There are 26 data sets for the report. The file titles indicate report...

    • datasets.ai
    • cloud.csiss.gmu.edu
    • +1more
    10, 33, 53
    Updated Aug 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Environmental Protection Agency (2024). There are 26 data sets for the report. The file titles indicate report sections where the data are used. [Dataset]. https://datasets.ai/datasets/there-are-26-data-sets-for-the-report-the-file-titles-indicate-report-sections-where-the-d
    Explore at:
    53, 33, 10Available download formats
    Dataset updated
    Aug 27, 2024
    Dataset authored and provided by
    U.S. Environmental Protection Agency
    Description

    Title of each data set starts with the section number of report, which the data were used in analysis.

    This dataset is associated with the following publication: Yang, J., H. Wei, X. Wang, S. Buchberger, M. Liang, N. Chang, B. Bierwagen, S. Julius, Z. Li, D. Boccelli, R. Clark, H. Liu, and J. Neal. National Water Infrastructure Adaptation Assessment: Part II, Smart Urban Designer (SUD) and Application Case Studies. U.S. Environmental Protection Agency, Washington, DC, USA, 2020.

  15. u

    Wildfire Risk to Communities: Spatial datasets of landscape-wide wildfire...

    • agdatacommons.nal.usda.gov
    bin
    Updated Jan 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joe H. Scott; Gregory K. Dillon; Melissa R. Jaffe; Kevin C. Vogler; Julia H. Olszewski; Michael N. Callahan; Eva C. Karau; Mitchell T. Lazarz; Karen C. Short; Karin L. Riley; Mark A. Finney; Isaac C. Grenfell (2025). Wildfire Risk to Communities: Spatial datasets of landscape-wide wildfire risk components for the United States: 2nd edition [Dataset]. http://doi.org/10.2737/RDS-2020-0016-2
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 22, 2025
    Dataset provided by
    Forest Service Research Data Archive
    Authors
    Joe H. Scott; Gregory K. Dillon; Melissa R. Jaffe; Kevin C. Vogler; Julia H. Olszewski; Michael N. Callahan; Eva C. Karau; Mitchell T. Lazarz; Karen C. Short; Karin L. Riley; Mark A. Finney; Isaac C. Grenfell
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    United States
    Description

    The data included in this publication depict the 2024 version of components of wildfire risk for all lands in the United States that: 1) are landscape-wide (i.e., measurable at every pixel across the landscape); and 2) represent in situ risk - risk at the location where the adverse effects take place on the landscape.

    National wildfire hazard datasets of annual burn probability and fire intensity, generated by the USDA Forest Service, Rocky Mountain Research Station and Pyrologix LLC, form the foundation of the Wildfire Risk to Communities data. Vegetation and wildland fuels data from LANDFIRE 2020 (version 2.2.0) were used as input to two different but related geospatial fire simulation systems. Annual burn probability was produced with the USFS geospatial fire simulator (FSim) at a relatively coarse cell size of 270 meters (m). To bring the burn probability raster data down to a finer resolution more useful for assessing hazard and risk to communities, we upsampled them to the native 30 m resolution of the LANDFIRE fuel and vegetation data. In this upsampling process, we also spread values of modeled burn probability into developed areas represented in LANDFIRE fuels data as non-burnable. Burn probability rasters represent landscape conditions as of the end of 2020. Fire intensity characteristics were modeled at 30 m resolution using a process that performs a comprehensive set of FlamMap runs spanning the full range of weather-related characteristics that occur during a fire season and then integrates those runs into a variety of results based on the likelihood of those weather types occurring. Before the fire intensity modeling, the LANDFIRE 2020 data were updated to reflect fuels disturbances occurring in 2021 and 2022. As such, the fire intensity datasets represent landscape conditions as of the end of 2022. Additional methodology documentation is provided in a methods document (\Supplements\WRC_V2_Methods_Landscape-wideRisk.pdf) packaged in the data download.

    The specific raster datasets in this publication include:

    Risk to Potential Structures (RPS): A measure that integrates wildfire likelihood and intensity with generalized consequences to a home on every pixel. For every place on the landscape, it poses the hypothetical question, "What would be the relative risk to a house if one existed here?" This allows comparison of wildfire risk in places where homes already exist to places where new construction may be proposed. This dataset is referred to as Risk to Homes in the Wildfire Risk to Communities web application.

    Conditional Risk to Potential Structures (cRPS): The potential consequences of fire to a home at a given location, if a fire occurs there and if a home were located there. Referred to as Wildfire Consequence in the Wildfire Risk to Communities web application.

    Exposure Type: Exposure is the spatial coincidence of wildfire likelihood and intensity with communities. This layer delineates where homes are directly exposed to wildfire from adjacent wildland vegetation, indirectly exposed to wildfire from indirect sources such as embers and home-to-home ignition, or not exposed to wildfire due to distance from direct and indirect ignition sources.

    Burn Probability (BP): The annual probability of wildfire burning in a specific location. Referred to as Wildfire Likelihood in the Wildfire Risk to Communities web application.

    Conditional Flame Length (CFL): The mean flame length for a fire burning in the direction of maximum spread (headfire) at a given location if a fire were to occur; an average measure of wildfire intensity.

    Flame Length Exceedance Probability - 4 ft (FLEP4): The conditional probability that flame length at a pixel will exceed 4 feet if a fire occurs; indicates the potential for moderate to high wildfire intensity.

    Flame Length Exceedance Probability - 8 ft (FLEP8): the conditional probability that flame length at a pixel will exceed 8 feet if a fire occurs; indicates the potential for high wildfire intensity.

    Wildfire Hazard Potential (WHP): An index that quantifies the relative potential for wildfire that may be difficult to manage, used as a measure to help prioritize where fuel treatments may be needed.The geospatial data products described and distributed here are part of the Wildfire Risk to Communities project. This project was directed by Congress in the 2018 Consolidated Appropriations Act (i.e., 2018 Omnibus Act, H.R. 1625, Section 210: Wildfire Hazard Severity Mapping) to help U.S. communities understand components of their relative wildfire risk profile, the nature and effects of wildfire risk, and actions communities can take to mitigate risk. The first edition of these data represented the first time wildfire risk to communities had been mapped nationally with consistent methodology. They provided foundational information for comparing the relative wildfire risk among populated communities in the United States. In this version, the 2nd edition, we use improved modeling and mapping methodology and updated input data to generate the current suite of products.See the Wildfire Risk to Communities website at https://www.wildfirerisk.org for complete project information and an interactive web application for exploring some of the datasets published here. We deliver the data here as zip files by U.S. state (including AK and HI), and for the full extent of the continental U.S.

    This data publication is a second edition and represents an update to any previous versions of Wildfire Risk to Communities risk datasets published by the USDA Forest Service. There are two companion data publications that are part of the WRC 2.0 data update: one that includes datasets of wildfire hazard and risk for populated areas of the nation, where housing units are currently present (Jaffe et al. 2024, https://doi.org/10.2737/RDS-2020-0060-2), and one that delineates wildfire risk reduction zones and provides tabular summaries of wildfire hazard and risk raster datasets (Dillon et al. 2024, https://doi.org/10.2737/RDS-2024-0030).

  16. h

    hnm-fashion-recommendations-data

    • huggingface.co
    Updated Jun 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Rafly Ash Shiddiqi (2025). hnm-fashion-recommendations-data [Dataset]. https://huggingface.co/datasets/einrafh/hnm-fashion-recommendations-data
    Explore at:
    Dataset updated
    Jun 12, 2025
    Authors
    Muhammad Rafly Ash Shiddiqi
    Description

    Dataset Rekomendasi Fashion H&M

    Dataset ini berisi data transaksi, atribut pelanggan, dan metadata produk yang telah dianonimkan dari H&M Group. Kumpulan data komprehensif ini memungkinkan pemodelan perilaku pembelian pelanggan secara mendalam. Wawasan yang dihasilkan dapat dimanfaatkan untuk berbagai tujuan bisnis yang strategis, mulai dari meningkatkan personalisasi pengalaman berbelanja, mengoptimalkan manajemen inventaris untuk efisiensi produksi, hingga mendukung inisiatif… See the full description on the dataset page: https://huggingface.co/datasets/einrafh/hnm-fashion-recommendations-data.

  17. Z

    Data from: Large Landing Trajectory Data Set for Go-Around Analysis

    • data.niaid.nih.gov
    • zenodo.org
    Updated Dec 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Timothé Krauth (2022). Large Landing Trajectory Data Set for Go-Around Analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7148116
    Explore at:
    Dataset updated
    Dec 16, 2022
    Dataset provided by
    Timothé Krauth
    Benoit Figuet
    Marcel Dettling
    Manuel Waltert
    Raphael Monstein
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Large go-around, also referred to as missed approach, data set. The data set is in support of the paper presented at the OpenSky Symposium on November the 10th.

    If you use this data for a scientific publication, please consider citing our paper.

    The data set contains landings from 176 (mostly) large airports from 44 different countries. The landings are labelled as performing a go-around (GA) or not. In total, the data set contains almost 9 million landings with more than 33000 GAs. The data was collected from OpenSky Network's historical data base for the year 2019. The published data set contains multiple files:

    go_arounds_minimal.csv.gz

    Compressed CSV containing the minimal data set. It contains a row for each landing and a minimal amount of information about the landing, and if it was a GA. The data is structured in the following way:

        Column name
        Type
        Description
    
    
    
    
        time
        date time
        UTC time of landing or first GA attempt
    
    
        icao24
        string
        Unique 24-bit (hexadecimal number) ICAO identifier of the aircraft concerned
    
    
        callsign
        string
        Aircraft identifier in air-ground communications
    
    
        airport
        string
        ICAO airport code where the aircraft is landing
    
    
        runway
        string
        Runway designator on which the aircraft landed
    
    
        has_ga
        string
        "True" if at least one GA was performed, otherwise "False"
    
    
        n_approaches
        integer
        Number of approaches identified for this flight
    
    
        n_rwy_approached
        integer
        Number of unique runways approached by this flight
    

    The last two columns, n_approaches and n_rwy_approached, are useful to filter out training and calibration flight. These have usually a large number of n_approaches, so an easy way to exclude them is to filter by n_approaches > 2.

    go_arounds_augmented.csv.gz

    Compressed CSV containing the augmented data set. It contains a row for each landing and additional information about the landing, and if it was a GA. The data is structured in the following way:

        Column name
        Type
        Description
    
    
    
    
        time
        date time
        UTC time of landing or first GA attempt
    
    
        icao24
        string
        Unique 24-bit (hexadecimal number) ICAO identifier of the aircraft concerned
    
    
        callsign
        string
        Aircraft identifier in air-ground communications
    
    
        airport
        string
        ICAO airport code where the aircraft is landing
    
    
        runway
        string
        Runway designator on which the aircraft landed
    
    
        has_ga
        string
        "True" if at least one GA was performed, otherwise "False"
    
    
        n_approaches
        integer
        Number of approaches identified for this flight
    
    
        n_rwy_approached
        integer
        Number of unique runways approached by this flight
    
    
        registration
        string
        Aircraft registration
    
    
        typecode
        string
        Aircraft ICAO typecode
    
    
        icaoaircrafttype
        string
        ICAO aircraft type
    
    
        wtc
        string
        ICAO wake turbulence category
    
    
        glide_slope_angle
        float
        Angle of the ILS glide slope in degrees
    
    
        has_intersection
    

    string

        Boolean that is true if the runway has an other runway intersecting it, otherwise false
    
    
        rwy_length
        float
        Length of the runway in kilometre
    
    
        airport_country
        string
        ISO Alpha-3 country code of the airport
    
    
        airport_region
        string
        Geographical region of the airport (either Europe, North America, South America, Asia, Africa, or Oceania)
    
    
        operator_country
        string
        ISO Alpha-3 country code of the operator
    
    
        operator_region
        string
        Geographical region of the operator of the aircraft (either Europe, North America, South America, Asia, Africa, or Oceania)
    
    
        wind_speed_knts
        integer
        METAR, surface wind speed in knots
    
    
        wind_dir_deg
        integer
        METAR, surface wind direction in degrees
    
    
        wind_gust_knts
        integer
        METAR, surface wind gust speed in knots
    
    
        visibility_m
        float
        METAR, visibility in m
    
    
        temperature_deg
        integer
        METAR, temperature in degrees Celsius
    
    
        press_sea_level_p
        float
        METAR, sea level pressure in hPa
    
    
        press_p
        float
        METAR, QNH in hPA
    
    
        weather_intensity
        list
        METAR, list of present weather codes: qualifier - intensity
    
    
        weather_precipitation
        list
        METAR, list of present weather codes: weather phenomena - precipitation
    
    
        weather_desc
        list
        METAR, list of present weather codes: qualifier - descriptor
    
    
        weather_obscuration
        list
        METAR, list of present weather codes: weather phenomena - obscuration
    
    
        weather_other
        list
        METAR, list of present weather codes: weather phenomena - other
    

    This data set is augmented with data from various public data sources. Aircraft related data is mostly from the OpenSky Network's aircraft data base, the METAR information is from the Iowa State University, and the rest is mostly scraped from different web sites. If you need help with the METAR information, you can consult the WMO's Aerodrom Reports and Forecasts handbook.

    go_arounds_agg.csv.gz

    Compressed CSV containing the aggregated data set. It contains a row for each airport-runway, i.e. every runway at every airport for which data is available. The data is structured in the following way:

        Column name
        Type
        Description
    
    
    
    
        airport
        string
        ICAO airport code where the aircraft is landing
    
    
        runway
        string
        Runway designator on which the aircraft landed
    
    
        n_landings
        integer
        Total number of landings observed on this runway in 2019
    
    
        ga_rate
        float
        Go-around rate, per 1000 landings
    
    
        glide_slope_angle
        float
        Angle of the ILS glide slope in degrees
    
    
        has_intersection
        string
        Boolean that is true if the runway has an other runway intersecting it, otherwise false
    
    
        rwy_length
        float
        Length of the runway in kilometres
    
    
        airport_country
        string
        ISO Alpha-3 country code of the airport
    
    
        airport_region
        string
        Geographical region of the airport (either Europe, North America, South America, Asia, Africa, or Oceania)
    

    This aggregated data set is used in the paper for the generalized linear regression model.

    Downloading the trajectories

    Users of this data set with access to OpenSky Network's Impala shell can download the historical trajectories from the historical data base with a few lines of Python code. For example, you want to get all the go-arounds of the 4th of January 2019 at London City Airport (EGLC). You can use the Traffic library for easy access to the database:

    import datetime from tqdm.auto import tqdm import pandas as pd from traffic.data import opensky from traffic.core import Traffic

    load minimum data set

    df = pd.read_csv("go_arounds_minimal.csv.gz", low_memory=False) df["time"] = pd.to_datetime(df["time"])

    select London City Airport, go-arounds, and 2019-01-04

    airport = "EGLC" start = datetime.datetime(year=2019, month=1, day=4).replace( tzinfo=datetime.timezone.utc ) stop = datetime.datetime(year=2019, month=1, day=5).replace( tzinfo=datetime.timezone.utc )

    df_selection = df.query("airport==@airport & has_ga & (@start <= time <= @stop)")

    iterate over flights and pull the data from OpenSky Network

    flights = [] delta_time = pd.Timedelta(minutes=10) for _, row in tqdm(df_selection.iterrows(), total=df_selection.shape[0]): # take at most 10 minutes before and 10 minutes after the landing or go-around start_time = row["time"] - delta_time stop_time = row["time"] + delta_time

    # fetch the data from OpenSky Network
    flights.append(
      opensky.history(
        start=start_time.strftime("%Y-%m-%d %H:%M:%S"),
        stop=stop_time.strftime("%Y-%m-%d %H:%M:%S"),
        callsign=row["callsign"],
        return_flight=True,
      )
    )
    

    The flights can be converted into a Traffic object

    Traffic.from_flights(flights)

    Additional files

    Additional files are available to check the quality of the classification into GA/not GA and the selection of the landing runway. These are:

    validation_table.xlsx: This Excel sheet was manually completed during the review of the samples for each runway in the data set. It provides an estimate of the false positive and false negative rate of the go-around classification. It also provides an estimate of the runway misclassification rate when the airport has two or more parallel runways. The columns with the headers highlighted in red were filled in manually, the rest is generated automatically.

    validation_sample.zip: For each runway, 8 batches of 500 randomly selected trajectories (or as many as available, if fewer than 4000) classified as not having a GA and up to 8 batches of 10 random landings, classified as GA, are plotted. This allows the interested user to visually inspect a random sample of the landings and go-arounds easily.

  18. n

    The Alaska Arctic Vegetation Archive (AVA-AK) - Dataset - iAOS Portal

    • portal-intaros.nersc.no
    Updated Sep 16, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). The Alaska Arctic Vegetation Archive (AVA-AK) - Dataset - iAOS Portal [Dataset]. https://portal-intaros.nersc.no/dataset/the-alaska-arctic-vegetation-archive-ava-ak
    Explore at:
    Dataset updated
    Sep 16, 2020
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Alaska, Arctic
    Description

    Walker, D.A., Breen, A.L., Druckenmiller, L.A., Wirth, L.w., Fisher, W, Raynolds, M.K, Sibik, J., Walker, M.D., Hennekens, S., Boggs, K., Boucher, T., Buchhorn, M., Bültmann, H., Cooper, D.J., Daniëls, F.J.A., Davidson, S.J., Ebersole, J.J. et al. 2016. The Alaska Arctic Vegetation Archive (AVA-AK). Phytocoenologia 46:221-229. doi:10.1127/phyto/2016/0128 The Alaska Arctic Vegetation Archive (AVA-AK, GIVD-ID: NA-US-014) is a free, publically available database archive of vegetation-plot data from the Arctic tundra region of northern Alaska. The archive currently contains 24 datasets with 3,026 non-overlapping plots. Of these, 74% have geolocation data with 25-m or better precision. Species cover data and header data are stored in a Turboveg database. A standardized Pan Arctic Species List provides a consistent nomenclature for vascular plants, bryophytes, and lichens in the archive. A web-based online Alaska Arctic Geoecological Atlas (AGA-AK) allows viewing and downloading the species data in a variety of formats, and provides access to a wide variety of ancillary data. We conducted a preliminary cluster analysis of the first 16 datasets (1,613 plots) to examine how the spectrum of derived clusters is related to the suite of datasets, habitat types, and environmental gradients. We present the contents of the archive, assess its strengths and weaknesses, and provide three supplementary files that include the data dictionary, a list of habitat types, an overview of the datasets, and details of the cluster analysis.

  19. The Automotive Visual Inspection Dataset (AutoVI): A Genuine Industrial...

    • zenodo.org
    • autovi.utc.fr
    • +1more
    bin, txt, zip
    Updated Jun 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philippe Carvalho; Philippe Carvalho; Meriem Lafou; Alexandre Durupt; Alexandre Durupt; Antoine Leblanc; Yves Grandvalet; Yves Grandvalet; Meriem Lafou; Antoine Leblanc (2024). The Automotive Visual Inspection Dataset (AutoVI): A Genuine Industrial Production Dataset for Unsupervised Anomaly Detection [Dataset]. http://doi.org/10.5281/zenodo.10459003
    Explore at:
    zip, txt, binAvailable download formats
    Dataset updated
    Jun 5, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Philippe Carvalho; Philippe Carvalho; Meriem Lafou; Alexandre Durupt; Alexandre Durupt; Antoine Leblanc; Yves Grandvalet; Yves Grandvalet; Meriem Lafou; Antoine Leblanc
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    See the official website: https://autovi.utc.fr

    Modern industrial production lines must be set up with robust defect inspection modules that are able to withstand high product variability. This means that in a context of industrial production, new defects that are not yet known may appear, and must therefore be identified.

    On industrial production lines, the typology of potential defects is vast (texture, part failure, logical defects, etc.). Inspection systems must therefore be able to detect non-listed defects, i.e. not-yet-observed defects upon the development of the inspection system. To solve this problem, research and development of unsupervised AI algorithms on real-world data is required.

    Renault Group and the Université de technologie de Compiègne (Roberval and Heudiasyc Laboratories) have jointly developed the Automotive Visual Inspection Dataset (AutoVI), the purpose of which is to be used as a scientific benchmark to compare and develop advanced unsupervised anomaly detection algorithms under real production conditions. The images were acquired on Renault Group's automotive production lines, in a genuine industrial production line environment, with variations in brightness and lighting on constantly moving components. This dataset is representative of actual data acquisition conditions on automotive production lines.

    The dataset contains 3950 images, split into 1530 training images and 2420 testing images.

    The evaluation code can be found at https://github.com/phcarval/autovi_evaluation_code.

    Disclaimer
    All defects shown were intentionally created on Renault Group's production lines for the purpose of producing this dataset. The images were examined and labeled by Renault Group experts, and all defects were corrected after shooting.

    License
    Copyright © 2023-2024 Renault Group

    This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. To view a copy of the license, visit https://creativecommons.org/licenses/by-nc-sa/4.0/.

    For using the data in a way that falls under the commercial use clause of the license, please contact us.

    Attribution
    Please use the following for citing the dataset in scientific work:

    Carvalho, P., Lafou, M., Durupt, A., Leblanc, A., & Grandvalet, Y. (2024). The Automotive Visual Inspection Dataset (AutoVI): A Genuine Industrial Production Dataset for Unsupervised Anomaly Detection [Dataset]. https://doi.org/10.5281/zenodo.10459003

    Contact
    If you have any questions or remarks about this dataset, please contact us at philippe.carvalho@utc.fr, meriem.lafou@renault.com, alexandre.durupt@utc.fr, antoine.leblanc@renault.com, yves.grandvalet@utc.fr.

    Changelog

    • v1.0.0
      • Cropped engine_wiring, pipe_clip and pipe_staple images
      • Reduced tank_screw, underbody_pipes and underbody_screw image sizes
    • v0.1.1
      • Added ground truth segmentation maps
      • Fixed categorization of some images
      • Added new defect categories
      • Removed tube_fastening and kitting_cart
      • Removed duplicates in pipe_clip
  20. d

    International Cigarette Consumption Database v1.3

    • search.dataone.org
    • borealisdata.ca
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Poirier, Mathieu JP; Guindon, G Emmanuel; Sritharan, Lathika; Hoffman, Steven J (2023). International Cigarette Consumption Database v1.3 [Dataset]. http://doi.org/10.5683/SP2/AOVUW7
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Poirier, Mathieu JP; Guindon, G Emmanuel; Sritharan, Lathika; Hoffman, Steven J
    Time period covered
    Jan 1, 1970 - Jan 1, 2015
    Description

    This database contains tobacco consumption data from 1970-2015 collected through a systematic search coupled with consultation with country and subject-matter experts. Data quality appraisal was conducted by at least two research team members in duplicate, with greater weight given to official government sources. All data was standardized into units of cigarettes consumed and a detailed accounting of data quality and sourcing was prepared. Data was found for 82 of 214 countries for which searches for national cigarette consumption data were conducted, representing over 95% of global cigarette consumption and 85% of the world’s population. Cigarette consumption fell in most countries over the past three decades but trends in country specific consumption were highly variable. For example, China consumed 2.5 million metric tonnes (MMT) of cigarettes in 2013, more than Russia (0.36 MMT), the United States (0.28 MMT), Indonesia (0.28 MMT), Japan (0.20 MMT), and the next 35 highest consuming countries combined. The US and Japan achieved reductions of more than 0.1 MMT from a decade earlier, whereas Russian consumption plateaued, and Chinese and Indonesian consumption increased by 0.75 MMT and 0.1 MMT, respectively. These data generally concord with modelled country level data from the Institute for Health Metrics and Evaluation and have the additional advantage of not smoothing year-over-year discontinuities that are necessary for robust quasi-experimental impact evaluations. Before this study, publicly available data on cigarette consumption have been limited—either inappropriate for quasi-experimental impact evaluations (modelled data), held privately by companies (proprietary data), or widely dispersed across many national statistical agencies and research organisations (disaggregated data). This new dataset confirms that cigarette consumption has decreased in most countries over the past three decades, but that secular country specific consumption trends are highly variable. The findings underscore the need for more robust processes in data reporting, ideally built into international legal instruments or other mandated processes. To monitor the impact of the WHO Framework Convention on Tobacco Control and other tobacco control interventions, data on national tobacco production, trade, and sales should be routinely collected and openly reported. The first use of this database for a quasi-experimental impact evaluation of the WHO Framework Convention on Tobacco Control is: Hoffman SJ, Poirier MJP, Katwyk SRV, Baral P, Sritharan L. Impact of the WHO Framework Convention on Tobacco Control on global cigarette consumption: quasi-experimental evaluations using interrupted time series analysis and in-sample forecast event modelling. BMJ. 2019 Jun 19;365:l2287. doi: https://doi.org/10.1136/bmj.l2287 Another use of this database was to systematically code and classify longitudinal cigarette consumption trajectories in European countries since 1970 in: Poirier MJ, Lin G, Watson LK, Hoffman SJ. Classifying European cigarette consumption trajectories from 1970 to 2015. Tobacco Control. 2022 Jan. DOI: 10.1136/tobaccocontrol-2021-056627. Statement of Contributions: Conceived the study: GEG, SJH Identified multi-country datasets: GEG, MP Extracted data from multi-country datasets: MP Quality assessment of data: MP, GEG Selection of data for final analysis: MP, GEG Data cleaning and management: MP, GL Internet searches: MP (English, French, Spanish, Portuguese), GEG (English, French), MYS (Chinese), SKA (Persian), SFK (Arabic); AG, EG, BL, MM, YM, NN, EN, HR, KV, CW, and JW (English), GL (English) Identification of key informants: GEG, GP Project Management: LS, JM, MP, SJH, GEG Contacts with Statistical Agencies: MP, GEG, MYS, SKA, SFK, GP, BL, MM, YM, NN, HR, KV, JW, GL Contacts with key informants: GEG, MP, GP, MYS, GP Funding: GEG, SJH SJH: Hoffman, SJ; JM: Mammone J; SRVK: Rogers Van Katwyk, S; LS: Sritharan, L; MT: Tran, M; SAK: Al-Khateeb, S; AG: Grjibovski, A.; EG: Gunn, E; SKA: Kamali-Anaraki, S; BL: Li, B; MM: Mahendren, M; YM: Mansoor, Y; NN: Natt, N; EN: Nwokoro, E; HR: Randhawa, H; MYS: Yunju Song, M; KV: Vercammen, K; CW: Wang, C; JW: Woo, J; MJPP: Poirier, MJP; GEG: Guindon, EG; GP: Paraje, G; GL Gigi Lin Key informants who provided data: Corne van Walbeek (South Africa, Jamaica) Frank Chaloupka (US) Ayda Yurekli (Turkey) Dardo Curti (Uruguay) Bungon Ritthiphakdee (Thailand) Jakub Lobaszewski (Poland) Guillermo Paraje (Chile, Argentina) Key informants who provided useful insights: Carlos Manuel Guerrero López (Mexico) Muhammad Jami Husain (Bangladesh) Nigar Nargis (Bangladesh) Rijo M John (India) Evan Blecher (Nigeria, Indonesia, Philippines, South Africa) Yagya Karki (Nepal) Anne CK Quah (Malaysia) Nery Suarez Lugo (Cuba) Agencies providing assistance: Irani... Visit https://dataone.org/datasets/sha256%3Aaa1b4aae69c3399c96bfbf946da54abd8f7642332d12ccd150c42ad400e9699b for complete metadata about this dataset.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista (2025). Sales of the H&M Group worldwide 2006-2024 [Dataset]. https://www.statista.com/statistics/252190/gross-sales-of-the-h-und-m-group-worldwide/
Organization logo

Sales of the H&M Group worldwide 2006-2024

Explore at:
10 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 2, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description

This statistic depicts the sales of the H&M Group worldwide from 2006 to 2024. In the fiscal year 2024, global net sales of the H&M Group amounted to about 234 billion Swedish kronor. H&MH&M offers a broad and varied range of fashion including collections for women, men, teenagers and children. The range also includes sportswear, underwear, shoes, accessories and cosmetics, as well as home textiles and decorations from H&M Home.Germany is H&M's largest market. In 2024, over 35 billion Swedish kronor were generated from that country alone. The company operates roughly 4,253 stores worldwide and employs approximately 97,710 people. H&M dropped out of the top ten most valuable apparel brands in the world as of 2023.H&M aims to be a more sustainable choice for today’s increasingly aware customers. To this end, H&M’s investments in social improvements and reduced environmental impact extend throughout the product life cycle – from responsible use of natural resources to ensuring good working conditions at suppliers’ factories. Sustainability work is thoroughly integrated into the business, not only because it is an investment in the customer offering,but also because it is vital to the group’s long-term growth and development. However, there have been questions raised as to how effective and trustworthy H&M's sustainability practices really are.

Search
Clear search
Close search
Google apps
Main menu