4 datasets found
  1. MASH: A Multiplatform Annotated Dataset for Societal Impact of Hurricane

    • zenodo.org
    Updated May 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous; Anonymous (2025). MASH: A Multiplatform Annotated Dataset for Societal Impact of Hurricane [Dataset]. http://doi.org/10.5281/zenodo.15401479
    Explore at:
    Dataset updated
    May 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anonymous; Anonymous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MASH: A Multiplatform Annotated Dataset for Societal Impact of Hurricane

    We present a Multiplatform Annotated Dataset for Societal Impact of Hurricane (MASH) that includes 98,662 relevant social media data posts from Reddit, X, TikTok, and YouTube.
    In addition, all relevant posts are annotated on three dimensions: Humanitarian Classes, Bias Classes, and Information Integrity Classes in a multi-modal approach that considers both textual and visual content (text, images, and videos), providing a rich labeled dataset for in-depth analysis.
    The dataset is also complemented by an Online Analytics Platform (https://hurricane.web.illinois.edu/) that not only allows users to view hurricane-related posts and articles, but also explores high-frequency keywords, user sentiment, and the locations where posts were made.
    To our best knowledge, MASH is the first large-scale, multi-platform, multimodal, and multi-dimensionally annotated hurricane dataset. We envision that MASH can contribute to the study of hurricanes' impact on society, such as disaster severity classification, event detections, public sentiment analysis, and bias identification.

    Usage Notice

    This dataset includes four annotation files:
    • reddit_anno_publish.csv
    • tiktok_anno_publish.csv
    • twitter_anno_publish.csv
    • youtube_anno_publish.csv
    Each file contains post IDs and corresponding annotations on three dimensions: Humanitarian Classes, Bias Classes, and Information Integrity Classes.
    To protect user privacy, only post IDs are released. We recommend retrieving the full post content via the official APIs of each platform, in accordance with their respective terms of service.

    Humanitarian Classes

    Each post is annotated with seven binary humanitarian classes. For each class, the label is either:
    • True – the post contains this humanitarian information
    • False – the post does not contain this information
    These seven humanitarian classes include:
    • Casualty: The post reports people or animals who are killed, injured, or missing during the hurricane.
    • Evacuation: The post describes the evacuation, relocation, rescue, or displacement of individuals or animals due to the hurricane.
    • Damage: The post reports damage to infrastructure or public utilities caused by the hurricane.
    • Advice: The post provides advice, guidance, or suggestions related to hurricanes, including how to stay safe, protect property, or prepare for the disaster.
    • Request: Request for help, support, or resources due to the hurricane
    • Assistance: This includes both physical aid and emotional or psychological support provided by individuals, communities, or organizations.
    • Recovery: The post describes efforts or activities related to the recovery and rebuilding process after the hurricane.
    Note: A single post may be labeled as True for multiple humanitarian categories.

    Bias Classes

    Each post is annotated with five binary bias classes. For each class, the label is either:
    • True – the post contains this bias information
    • False – the post does not contain this information
    These five bias classes include:
    • Linguistic Bias: The post contains biased, inappropriate, or offensive language, with a focus on word choice, tone, or expression.
    • Political Bias: The post expresses political ideology, showing favor or disapproval toward specific political actors, parties, or policies.
    • Gender Bias: The post contains biased, stereotypical, or discriminatory language or viewpoints related to gender.
    • Hate Speech: The post contains language that expresses hatred, hostility, or dehumanization toward a specific group or individual, especially those belonging to minority or marginalized communities.
    • Racial Bias: The post contains biased, discriminatory, or stereotypical statements directed toward one or more racial or ethnic groups.
    Note: A single post may be labeled as True for multiple bias categories.

    Information Integrity Classes

    Each post is also annotated with a single information integrity class, represented by an integer:
    • -1 → False information (i.e., misinformation or disinformation)
    • 0 → Unverifiable information (unclear or lacking sufficient evidence)
    • 1 → True information (verifiable and accurate)

    Key Notes

    1. This dataset is also available at https://huggingface.co/datasets/YRC10/MASH.
    2. Version 1 is no longer available.
  2. d

    Map of slope-failure locations in Puerto Rico after Hurricane María

    • catalog.data.gov
    • data.usgs.gov
    • +2more
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Map of slope-failure locations in Puerto Rico after Hurricane María [Dataset]. https://catalog.data.gov/dataset/map-of-slope-failure-locations-in-puerto-rico-after-hurricane-maria
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Puerto Rico
    Description

    In Puerto Rico, tens of thousands of landslides, slumps, debris flows, rock falls, and other slope failures were triggered by Hurricane María, which made landfall on 20 September 2017. “Landslide” is used here and below to represent all types of slope failures. This dataset is a point shapefile of landslide headscarps identified across Puerto Rico using georeferenced aerial and satellite imagery recorded following the hurricane. The imagery used includes publicly available aerial imagery obtained by the Federal Emergency Management Agency (FEMA; Quantum Spatial, Inc., 2017), aerial imagery obtained by the National Oceanic and Atmospheric Administration (NOAA; NOAA, 2017), and several WorldView satellite imagery datasets available from DigitalGlobe, Inc. The FEMA imagery was recorded by Sanborn and Quantum Spatial, Inc. between 25 September and 27 October 2017, has a pixel resolution of approximately 15 cm, and includes over 6,000 image tiles that cover approximately 97% of the large island and 100% of Vieques. The NOAA imagery was recorded 22-26 September 2017, also has a resolution of approximately 15 cm, and covers about 10% of the large island, 60% of Vieques, and 100% of Culebra. The DigitalGlobe imagery used in this project was recorded during September-November 2017, has a pixel resolution of approximately 50 cm, and covers approximately 99% of the large island and 35% of Vieques. DigitalGlobe images were acquired via the DigitalGlobe Open Data Program, the DigitalGlobe Foundation imagery grant, and via partnership with the U.S. Geological Survey. No imagery was examined for Desecheo, Mona, Monito, Caja de Muertos, or other smaller islands. The FEMA imagery was usually used first for landslide mapping due to its high resolution and more accurate georeferencing. For almost every location, there were multiple images available due to overlap in each dataset and overlap between different datasets. This overlap was helpful when clouds or shadows obscured the view of the ground surface in one or more images for a given location. Additional oblique and un-georeferenced aerial imagery recorded by the Civil Air Patrol (ArcGIS, 2017) was consulted, if needed. Comparing the post-event imagery with pre-event imagery available through the ESRI ArcGIS basemap layer and/or Google Earth was useful to accurately identify sites that failed during September 2017; such comparisons were made for landslides that appeared potentially older. Some landslides in our inventory may have occurred prior to Hurricane María—potentially triggered by Hurricane Irma which passed northeast of Puerto Rico two weeks earlier—or between the time of the hurricane and when photographs were taken. UTM Zone 19N projection with WGS 84 datum was used throughout the mapping process. The inventory process began with creation of a first draft by a team of 15 people. This draft was subsequently checked for quality and revised by the three leaders of the mapping effort. Each identified landslide is represented by a point located at the center of its headscarp. The horizontal position of headscarp points was carefully selected using multiple overlapping images (usually available) and other geospatial datasets including lidar acquired during 2015 and available from the U.S. Geological Survey 3DEP program, the U.S. Census Bureau TIGER road shapefile, and the National Hydrology Dataset flowline shapefile. Mapping was generally performed at 1:1000 scale. Given errors in georeferencing and landslides poorly resolved in imagery, we conclude that headscarp point locations are generally accurate within 3 m. Municipality (municipio) and barrio names in which each landslide occurred are included in the attribute table of the shapefile, as are the geographic coordinates of each point in decimal degrees (WGS 84 datum). Landslides were identified in 72 of the 78 municipalities of Puerto Rico. No landslides were documented on the island municipalities of Culebra or Vieques. On the main island of Puerto Rico, 64% of land experienced 0-3 landslides per square kilometer, 26% experienced 3-25 landslides per square kilometer, and 10% experienced more than 25 landslides per square kilometer. Concentrated zones of more than 100 landslides per square kilometer are in the municipalities of Maricao, Utuado, Jayuya, and Corozal. Of the ten barrios where more than 100 landslides per square kilometer were catalogued, eight are in Utuado. The drainage basins with the highest density of landslides are the Rio Grande de Arecibo and Rio Grande de Añasco watersheds, each with over 30 landslides per square kilometer. Six out of the seven sub-basins with more than 50 landslides per square kilometer are in the Rio Grande de Arecibo basin. We identified and mapped 71,431 landslides in total. The College of Arts and Sciences at the University of Puerto Rico in Mayagüez is thanked for providing release time to K.S. Hughes to permit partial development of this dataset. References ArcGIS, 2017, CAP Imagery – Hurricane Maria: https://www.arcgis.com/home/webmap/viewer.html?webmap=3218d1cb022d4534be0c7d6833c0adf1. Last accessed 18 June 2019. NOAA, 2017, Hurricane MARIA Imagery: https://storms.ngs.noaa.gov/storms/maria/index.html. Last accessed 18 June 2019. Quantum Spatial, Inc., 2017, FEMA PR Imagery: https://s3.amazonaws.com/fema-cap-imagery/Others/Maria. Last accessed 18 June 2019.

  3. Somalia Regions hit by 2020 Tropical Cyclone Gati - Dataset - SODMA Open...

    • sodma-dev.okfn.org
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sodma-dev.okfn.org (2025). Somalia Regions hit by 2020 Tropical Cyclone Gati - Dataset - SODMA Open Data Portal [Dataset]. https://sodma-dev.okfn.org/dataset/icpac-geonode-somalia-regions-hit-by-2020-tropical-cyclone-gati
    Explore at:
    Dataset updated
    May 23, 2025
    Dataset provided by
    Open Knowledge Foundationhttp://okfn.org/
    Somali Disaster Management Agencyhttps://sodma.gov.so/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Somalia
    Description

    Tropical Cyclone Gati originated from the Bay of Bengal. On 22 November 2020 at around 13:00 UTC, TC Gati became the strongest ever documented tropical storm to hit Somalia. Gati made landfall at Ras Hafun (Northeast of Somalia) with maximum sustained winds of 170Km/hr and was classified as a Category 2 storm (or category 2 hurricane on the Saffir-Simpson scale). This layers shows the occurence of TC Gati in two regions of Somali, Sanaag and Bari Regions. TC Gati primarily impacted parts of Bari region in Puntland State and then Sanaag region in Somaliland. Authorities reported rainfall totals from the storm to be greater than the amounts normally seen for the whole year. In a 24-hour period, Bosaso recorded 128mm of rain and Balidhidin 103mm. By 25th November 2020, TC Gati had dissipated but it left trail of destruction across Bari and Sanaag regions of Somalia. Authorities estimated 180,000 people (30,000 households) had been affected in Puntland Regional State, with 42,000 people (7,000 households) displaced and at least eight people killed and unknown number injured, with considerable damages reported to infrastructure, livelihoods, and social services (telecommunication, electricity, roads, schools). Resultant flooding bursted the sewerage system and increased risk of diseases among the affected population. By the end of December, the storm had killed 9 people and over 63 000 livestock (sheep and goats), and affected around 183 000 people. The heavy rains caused the Ceel-Daahir River in Puntland to flood, blocking vehicles from bringing crucial supplies. In some towns such as Xaafun, Hurdiye, and Karduush people reported losing their entire herds of livestock. There were also reports of the destruction of 120 fishing vessels in these areas resulting in the loss of livelihood for an estimated 460 fishermen.

  4. Hurricane Florence Storm Surge

    • dorian-disasterresponse.opendata.arcgis.com
    Updated Sep 13, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri’s Disaster Response Program (2018). Hurricane Florence Storm Surge [Dataset]. https://dorian-disasterresponse.opendata.arcgis.com/datasets/hurricane-florence-storm-surge
    Explore at:
    Dataset updated
    Sep 13, 2018
    Dataset provided by
    Esrihttp://esri.com/
    Authors
    Esri’s Disaster Response Program
    Description

    This is a preliminary impact map for Hurricane Florence. Click on the categories for more detailed information on who or what may be impacted.This analysis was completed using the Enrich Layer tool in ArcGIS Online. For more information on how data is summarized using GeoEnrichment see this explanation.This application presents the National Hurricane Center Storm Surge data that has been enriched to show the potential impact to people, housing, and businesses.From the National Hurricane Center: "This graphic displays areas that are under a storm surge watch/warning. A storm surge warning indicates there is a danger of life-threatening inundation from rising water moving inland from the shoreline somewhere within the specified area, generally within 36 hours. A storm surge watch indicates that life-threatening inundation is possible somewhere within the specified area, generally within 48 hours. All persons, regardless of whether or not they are in the highlighted areas shown in the graphic, should promptly follow evacuation orders and other instructions from local officials."

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Anonymous; Anonymous (2025). MASH: A Multiplatform Annotated Dataset for Societal Impact of Hurricane [Dataset]. http://doi.org/10.5281/zenodo.15401479
Organization logo

MASH: A Multiplatform Annotated Dataset for Societal Impact of Hurricane

Explore at:
Dataset updated
May 24, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Anonymous; Anonymous
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

MASH: A Multiplatform Annotated Dataset for Societal Impact of Hurricane

We present a Multiplatform Annotated Dataset for Societal Impact of Hurricane (MASH) that includes 98,662 relevant social media data posts from Reddit, X, TikTok, and YouTube.
In addition, all relevant posts are annotated on three dimensions: Humanitarian Classes, Bias Classes, and Information Integrity Classes in a multi-modal approach that considers both textual and visual content (text, images, and videos), providing a rich labeled dataset for in-depth analysis.
The dataset is also complemented by an Online Analytics Platform (https://hurricane.web.illinois.edu/) that not only allows users to view hurricane-related posts and articles, but also explores high-frequency keywords, user sentiment, and the locations where posts were made.
To our best knowledge, MASH is the first large-scale, multi-platform, multimodal, and multi-dimensionally annotated hurricane dataset. We envision that MASH can contribute to the study of hurricanes' impact on society, such as disaster severity classification, event detections, public sentiment analysis, and bias identification.

Usage Notice

This dataset includes four annotation files:
• reddit_anno_publish.csv
• tiktok_anno_publish.csv
• twitter_anno_publish.csv
• youtube_anno_publish.csv
Each file contains post IDs and corresponding annotations on three dimensions: Humanitarian Classes, Bias Classes, and Information Integrity Classes.
To protect user privacy, only post IDs are released. We recommend retrieving the full post content via the official APIs of each platform, in accordance with their respective terms of service.

Humanitarian Classes

Each post is annotated with seven binary humanitarian classes. For each class, the label is either:
• True – the post contains this humanitarian information
• False – the post does not contain this information
These seven humanitarian classes include:
• Casualty: The post reports people or animals who are killed, injured, or missing during the hurricane.
• Evacuation: The post describes the evacuation, relocation, rescue, or displacement of individuals or animals due to the hurricane.
• Damage: The post reports damage to infrastructure or public utilities caused by the hurricane.
• Advice: The post provides advice, guidance, or suggestions related to hurricanes, including how to stay safe, protect property, or prepare for the disaster.
• Request: Request for help, support, or resources due to the hurricane
• Assistance: This includes both physical aid and emotional or psychological support provided by individuals, communities, or organizations.
• Recovery: The post describes efforts or activities related to the recovery and rebuilding process after the hurricane.
Note: A single post may be labeled as True for multiple humanitarian categories.

Bias Classes

Each post is annotated with five binary bias classes. For each class, the label is either:
• True – the post contains this bias information
• False – the post does not contain this information
These five bias classes include:
• Linguistic Bias: The post contains biased, inappropriate, or offensive language, with a focus on word choice, tone, or expression.
• Political Bias: The post expresses political ideology, showing favor or disapproval toward specific political actors, parties, or policies.
• Gender Bias: The post contains biased, stereotypical, or discriminatory language or viewpoints related to gender.
• Hate Speech: The post contains language that expresses hatred, hostility, or dehumanization toward a specific group or individual, especially those belonging to minority or marginalized communities.
• Racial Bias: The post contains biased, discriminatory, or stereotypical statements directed toward one or more racial or ethnic groups.
Note: A single post may be labeled as True for multiple bias categories.

Information Integrity Classes

Each post is also annotated with a single information integrity class, represented by an integer:
• -1 → False information (i.e., misinformation or disinformation)
• 0 → Unverifiable information (unclear or lacking sufficient evidence)
• 1 → True information (verifiable and accurate)

Key Notes

  1. This dataset is also available at https://huggingface.co/datasets/YRC10/MASH.
  2. Version 1 is no longer available.
Search
Clear search
Close search
Google apps
Main menu