100+ datasets found
  1. SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest...

    • zenodo.org
    bin
    Updated Dec 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan (2024). SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part II) [Dataset]. http://doi.org/10.5281/zenodo.14525290
    Explore at:
    binAvailable download formats
    Dataset updated
    Dec 19, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This page only provides the drone-view image dataset.

    The dataset contains drone-view RGB images, depth maps and instance segmentation labels collected from different scenes. Data from each scene is stored in a separate .7z file, along with a color_palette.xlsx file, which contains the RGB_id and corresponding RGB values.

    All files follow the naming convention: {central_tree_id}_{timestamp}, where {central_tree_id} represents the ID of the tree centered in the image, which is typically in a prominent position, and timestamp indicates the time when the data was collected.

    Specifically, each 7z file includes the following folders:

    • rgb: This folder contains the RGB images (PNG) of the scenes and their metadata (TXT). The metadata describes the weather conditions and the world time when the image was captured. An example metadata entry is: Weather:Snow_Blizzard,Hour:10,Minute:56,Second:36.

    • depth_pfm: This folder contains absolute depth information of the scenes, which can be used to reconstruct the point cloud of the scene through reprojection.

    • instance_segmentation: This folder stores instance segmentation labels (PNG) for each tree in the scene, along with metadata (TXT) that maps tree_id to RGB_id. The tree_id can be used to look up detailed information about each tree in obj_info_final.xlsx, while the RGB_id can be matched to the corresponding RGB values in color_palette.xlsx. This mapping allows for identifying which tree corresponds to a specific color in the segmentation image.

    • obj_info_final.xlsx: This file contains detailed information about each tree in the scene, such as position, scale, species, and various parameters, including trunk diameter (in cm), tree height (in cm), and canopy diameter (in cm).

    • landscape_info.txt: This file contains the ground location information within the scene, sampled every 0.5 meters.

    For birch_forest, broadleaf_forest, redwood_forest and rainforest, we also provided COCO-format annotation files (.json). Two such files can be found in these datasets:

    • {name}_coco.json: This file contains the annotation of each tree in the scene.
    • {name}_filtered.json: This file is derived from the previous one, but filtering is applied to rule out overlapping instances.

    ⚠️: 7z files that begin with "!" indicate that the RGB values in the images within the instance_segmentation folder cannot be found in color_palette.xlsx. Consequently, this prevents matching the trees in the segmentation images to their corresponding tree information, which may hinder the application of the dataset to certain tasks. This issue is related to a bug in Colossium/AirSim, which has been reported in link1 and link2.

  2. Ask Bid Regression Data

    • kaggle.com
    zip
    Updated Oct 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Syed Asim Ali Shah (2022). Ask Bid Regression Data [Dataset]. https://www.kaggle.com/datasets/syedasimalishah/ask-bid-regression-data
    Explore at:
    zip(991411 bytes)Available download formats
    Dataset updated
    Oct 15, 2022
    Authors
    Syed Asim Ali Shah
    Description

    The bid–ask spread (also bid–offer or bid/ask and buy/sell in the case of a market maker) is the difference between the prices quoted (either by a single market maker or in a limit order book) for an immediate sale (ask) and an immediate purchase (bid) for stocks, futures contracts, options, or currency pairs in some auction scenario. The size of the bid–ask spread in a security is one measure of the liquidity of the market and of the size of the transaction cost.[1] If the spread is 0 then it is a frictionless asset.

  3. i

    Joint Flow Size and Spread Measurement Dataset

    • ieee-dataport.org
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Du (2025). Joint Flow Size and Spread Measurement Dataset [Dataset]. https://ieee-dataport.org/documents/joint-flow-size-and-spread-measurement-dataset
    Explore at:
    Dataset updated
    Nov 24, 2025
    Authors
    Yang Du
    Description

    dataset for joint flow size and spread measurement and anomaly detection

  4. Data from: Meat Price Spreads

    • catalog.data.gov
    • agdatacommons.nal.usda.gov
    • +2more
    Updated Apr 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Economic Research Service, Department of Agriculture (2025). Meat Price Spreads [Dataset]. https://catalog.data.gov/dataset/meat-price-spreads
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Economic Research Servicehttp://www.ers.usda.gov/
    Description

    This data set provides monthly average price values, and the differences among those values, at the farm, wholesale, and retail stages of the production and marketing chain for selected cuts of beef, pork, and broilers. In addition, retail prices are provided for beef and pork cuts, turkey, whole chickens, eggs, and dairy products. Price spreads are reported for last 6 years, 12 quarters, and 24 months. The retail price file provides monthly estimates for the last 6 months. The historical file provides data since 1970.

  5. ICC Spread - Dataset - Banco Central do Brasil Open Data Portal

    • opendata.bcb.gov.br
    Updated Jan 15, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bcb.gov.br (2018). ICC Spread - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/27443-icc-spread
    Explore at:
    Dataset updated
    Jan 15, 2018
    Dataset provided by
    Central Bank of Brazilhttp://www.bc.gov.br/
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Concept: Difference between average cost of outstanding loans (ICC) and its average funding cost. Comprises both earmarked and nonearmarked operations. Source: Central Bank of Brazil – Statistics Department 9c2ecd38-11e2-4399-8b1f-d16cc7bb31f6 27443-icc-spread

  6. Climate-Driven Disease Spread

    • kaggle.com
    zip
    Updated Mar 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ivan Tretyakov 叶仁安 (2025). Climate-Driven Disease Spread [Dataset]. https://www.kaggle.com/datasets/hopeofchange/climate-driven-disease-spread
    Explore at:
    zip(1412510 bytes)Available download formats
    Dataset updated
    Mar 27, 2025
    Authors
    Ivan Tretyakov 叶仁安
    Description

    Dataset Description This synthetic dataset combines climate, environmental, epidemiological, and socio-economic data from 120 countries over 24 years (2000-2023). The aim is to analyze the relationship between climate change, environmental pollution and the spread of infectious diseases (malaria, dengue fever, cholera, Lyme disease). The data is suitable for forecasting tasks, anomaly detection, and risk clustering of regions.

    Uniqueness:

    • The first dataset that combines long-term climate trends with medical and demographic statistics.

    • Rare parameters are included: the UV radiation index, the migration of disease vectors, and the cost of prevention.

    • Realistic anomalies (for example, dengue outbreaks in Europe by 2023).

  7. m

    Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First...

    • data.mendeley.com
    Updated Jul 20, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hasmot Ali (2020). Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First Case And First Death [Dataset]. http://doi.org/10.17632/vw427wzzkk.5
    Explore at:
    Dataset updated
    Jul 20, 2020
    Authors
    Hasmot Ali
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Contain informative data related to COVID-19 pandemic. Specially, figure out about the First Case and First Death information for every single country. The datasets mainly focus on two major fields first one is First Case which consists of information of Date of First Case(s), Number of confirm Case(s) at First Day, Age of the patient(s) of First Case, Last Visited Country and the other one First Death information consist of Date of First Death and Age of the Patient who died first for every Country mentioning corresponding Continent. The datasets also contain the Binary Matrix of spread chain among different country and region.

    *This is not a country. This is a ship. The name of the Cruise Ship was not given from the government.
    "N+": the age is not specified but greater than N
    “No Trace”: some data was not found
    “Unspecified”: not available from the authority
    “N/A”: for “Last Visited Country(s) of Confirmed Case(s)” column, “N/A” indicates that the confirmed case(s) of those countries do not have any travel history in recent past; in “Age of First Death(s)” column “N/A” indicates that those countries do not have may death case till May 16, 2020.

  8. c

    Witty Worm dataset

    • catalog.caida.org
    Updated Feb 25, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CAIDA (2021). Witty Worm dataset [Dataset]. https://catalog.caida.org/dataset/telescope_witty_worm
    Explore at:
    Dataset updated
    Feb 25, 2021
    Dataset authored and provided by
    CAIDA
    License

    https://www.caida.org/about/legal/aua/public_aua/https://www.caida.org/about/legal/aua/public_aua/

    https://www.caida.org/about/legal/aua/https://www.caida.org/about/legal/aua/

    Time period covered
    Mar 19, 2004 - Mar 25, 2004
    Description

    This dataset contains information useful for studying the spread of the Witty worm. Data were collected on the UCSD Network Telescope between Fri Mar 19 20:01:40 PST 2004 and Wed Mar 25 00:01:40 PST 2004. The dataset is exclusively available through Impact. Up until Feb 2014 the dataset was online in two portions, one public, one restricted. After Feb 2014 the whole Witty dataset was public until mid-2016. The publicly available set of files contains summarized information that does not individually identify infected computers. These are the files in witty/summaries/public. The restricted-access set of files that do contain more sensitive information, including packet traces containing complete IP and UDP headers and partial payload received from hosts spreading the Witty worm March 19-24, 2004. It also includes routing tables and summaries. These are the files in witty/data and witty/summaries/restricted/.

  9. Average spread of earmarked new credit operations - Households - Total -...

    • opendata.bcb.gov.br
    Updated Jul 31, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bcb.gov.br (2017). Average spread of earmarked new credit operations - Households - Total - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/20837-average-spread-of-earmarked-new-credit-operations---households---total
    Explore at:
    Dataset updated
    Jul 31, 2017
    Dataset provided by
    Central Bank of Brazilhttp://www.bc.gov.br/
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Concept: Difference (spread) between average interest rate on new credit operations in the relevant period in the National Financial System, which are under regulation by the National Monetary Council (CMN) or linked to budget funds, and corresponding average cost of funds. Refers to special financing operations which require proof of proper use of funds, linked to medium and long term production and investments projects. Funds origins are shares of checking accounts and savings accounts and funds from governmental programs. Source: Central Bank of Brazil – Statistics Department 2a7b340d-1ab4-4312-b00e-0969dcd949cf 20837-average-spread-of-earmarked-new-credit-operations---households---total

  10. Average spread of new credit operations - Total - Dataset - Banco Central do...

    • opendata.bcb.gov.br
    Updated Jul 31, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bcb.gov.br (2017). Average spread of new credit operations - Total - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/20783-average-spread-of-new-credit-operations---total
    Explore at:
    Dataset updated
    Jul 31, 2017
    Dataset provided by
    Central Bank of Brazilhttp://www.bc.gov.br/
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Concept: Difference (spread) between average interest rate on new credit operations in the reference period in the National Financial System and corresponding average cost of funds. Comprises both earmarked and nonearmarked operations. Source: Central Bank of Brazil – Statistics Department e2dc1628-d570-439b-b7dd-f89ce87d0ca6 20783-average-spread-of-new-credit-operations---total

  11. o

    Global Spread of Conflict by Country and Population - Dataset - Data Catalog...

    • data.opendata.am
    Updated Jul 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Global Spread of Conflict by Country and Population - Dataset - Data Catalog Armenia [Dataset]. https://data.opendata.am/dataset/dcwb0041070
    Explore at:
    Dataset updated
    Jul 7, 2023
    Area covered
    Armenia
    Description

    This dataset provides the spread of the conflict globally in terms of population and country for the years 2000-2016.

  12. Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised...

    • catalog.data.gov
    • agdatacommons.nal.usda.gov
    Updated Apr 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agricultural Research Service (2025). Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised Spatiotemporal Deep Neural Networks [Dataset]. https://catalog.data.gov/dataset/data-from-modeling-the-spread-of-a-livestock-disease-with-semi-supervised-spatiotemporal-d-bdd33
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Agricultural Research Servicehttps://www.ars.usda.gov/
    Description

    This dataset contains the spatiotemporal data used to train the spatiotemporal deep neural networks described in "Modeling the Spread of a Livestock Disease With Semi-Supervised Spatiotemporal Deep Neural Networks". The dataset consists of two sets of NumPy arrays. The first set: X_grid.npy and Y_grid.npy were used to train the convolutional LSTM, while the second set: X_graph.npy, Y_graph.npy, and edge_index.npy were used to train the graph convolutional LSTM. The data consists of spatiotemporally varying environmental and anthropogenic variables along with case reports of vesicular stomatitis. Resources in this dataset:Resource Title: NumPy Arrays of Spatiotemporal Features and VS Cases. File Name: vs_data.zipResource Description: This is a ZIP archive containing five NumPy arrays of spatiotemporal features and geotagged VS cases.Resource Software Recommended: NumPy,url: https://numpy.org/

  13. Next Day Wildfire Spread Subset Of Dataset

    • kaggle.com
    zip
    Updated Aug 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    zhiminV (2024). Next Day Wildfire Spread Subset Of Dataset [Dataset]. https://www.kaggle.com/datasets/zhiminv/next-day-wildfire-spread-subset-of-dataset
    Explore at:
    zip(2090151154 bytes)Available download formats
    Dataset updated
    Aug 12, 2024
    Authors
    zhiminV
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by zhiminV

    Released under Apache 2.0

    Contents

  14. c

    Movies and Tv Shows Dataset

    • crawlfeeds.com
    • kaggle.com
    csv, zip
    Updated Jul 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2025). Movies and Tv Shows Dataset [Dataset]. https://crawlfeeds.com/datasets/movies-and-tv-shows-dataset
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Jul 4, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy

    Description

    Explore our meticulously curated Movies dataset and TV shows dataset, designed to cater to diverse analytical and research needs. Whether you're a data scientist, a student, or a business professional, these datasets provide valuable insights into the entertainment industry.

    Key Features of the Movies Dataset:

    1. Extensive collection of global movies across various genres and languages.

    2. Detailed metadata, including titles, release dates, genres, directors, cast, and ratings.

    3. Regularly updated to ensure relevance and accuracy.

    Why Choose Our TV Shows Dataset?

    Our TV shows dataset is your gateway to understanding trends in episodic content. It includes:

    • Comprehensive details about popular and niche TV shows.

    • Information on episode counts, seasons, ratings, and networks.

    • Insights into audience preferences and regional programming.

    Applications of These Datasets

    These datasets are perfect for:

    • Machine learning models for recommendation systems.

    • Academic research on media trends and audience behavior.

    • Business strategies for entertainment platforms.

    Unlock the power of TV show data with our Crawl Feeds TV Shows Dataset. Start analyzing today and gain valuable insights into your favorite shows!

  15. Price Spreads from Farm to Consumer

    • catalog.data.gov
    • s.cnmilf.com
    • +3more
    Updated Apr 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Economic Research Service, Department of Agriculture (2025). Price Spreads from Farm to Consumer [Dataset]. https://catalog.data.gov/dataset/price-spreads-from-farm-to-consumer
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Economic Research Servicehttp://www.ers.usda.gov/
    Description

    USDA Economic Research Service (ERS) compares prices paid by consumers for food with prices received by farmers for corresponding commodities. This data set reports these comparisons for a variety of foods sold through retail food stores such as supermarkets and super centers. Comparisons are made for individual foods and groupings of individual foods-market baskets-that represent what a typical U.S. household buys at retail in a year. The retail costs of these baskets are compared with the money received by farmers for a corresponding basket of agricultural commodities.

  16. s

    Trees DLR 2019 - Dataset - data.smartdublin.ie

    • data.smartdublin.ie
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Trees DLR 2019 - Dataset - data.smartdublin.ie [Dataset]. https://data.smartdublin.ie/dataset/trees-dlr-2019
    Explore at:
    Dataset updated
    Feb 26, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A dataset containing trees in the DLR area in 2019. This dataset is only a partial representation of the actual tree count within DLR and contains fields such as Location, Species, Height, Spread, Trunk and Age. Please note this data is for information purposes only and may not be an exact representation of the infrastructure. Changes and upgrades occurring since then may not be represented.

  17. w

    Dataset of book subjects that contain Spread the joy

    • workwithdata.com
    Updated Nov 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2024). Dataset of book subjects that contain Spread the joy [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=j0-book&fop0=%3D&fval0=Spread+the+joy&j=1&j0=books
    Explore at:
    Dataset updated
    Nov 7, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book subjects. It has 2 rows and is filtered where the books is Spread the joy. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.

  18. w

    Dataset of books in the The spread of printing, Western hemisphere series

    • workwithdata.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books in the The spread of printing, Western hemisphere series [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=j0-book_series&fop0=%3D&fval0=The+spread+of+printing%2C+Western+hemisphere&j=1&j0=book_series
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 4 rows and is filtered where the book series is The spread of printing, Western hemisphere. It features 9 columns including author, publication date, language, and book publisher.

  19. Z

    Data from: A dataset of Covid-related misinformation videos and their spread...

    • data.niaid.nih.gov
    Updated Feb 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Knuutila, Aleksi (2021). A dataset of Covid-related misinformation videos and their spread on social media [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4557827
    Explore at:
    Dataset updated
    Feb 24, 2021
    Dataset provided by
    Oxford Internet Institute
    Authors
    Knuutila, Aleksi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains metadata about all Covid-related YouTube videos which circulated on public social media, but which YouTube eventually removed because they contained false information. It describes 8,122 videos that were shared between November 2019 and June 2020. The dataset contains unique identifiers for the videos and social media accounts that shared the videos, statistics on social media engagement and metadata such as video titles and view counts where they were recoverable. We publish the data alongside the code used to produce on Github. The dataset has reuse potential for research studying narratives related to the coronavirus, the impact of social media on knowledge about health and the politics of social media platforms.

  20. R

    Water Gauge Even Spread Dataset

    • universe.roboflow.com
    zip
    Updated Aug 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    water gauge (2025). Water Gauge Even Spread Dataset [Dataset]. https://universe.roboflow.com/water-gauge-og1mn/water-gauge-even-spread-gnuc1/model/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 6, 2025
    Dataset authored and provided by
    water gauge
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Water Level SbeH Polygons
    Description

    Water Gauge Even Spread

    ## Overview
    
    Water Gauge Even Spread is a dataset for instance segmentation tasks - it contains Water Level SbeH annotations for 654 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan (2024). SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part II) [Dataset]. http://doi.org/10.5281/zenodo.14525290
Organization logo

SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part II)

Explore at:
binAvailable download formats
Dataset updated
Dec 19, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This page only provides the drone-view image dataset.

The dataset contains drone-view RGB images, depth maps and instance segmentation labels collected from different scenes. Data from each scene is stored in a separate .7z file, along with a color_palette.xlsx file, which contains the RGB_id and corresponding RGB values.

All files follow the naming convention: {central_tree_id}_{timestamp}, where {central_tree_id} represents the ID of the tree centered in the image, which is typically in a prominent position, and timestamp indicates the time when the data was collected.

Specifically, each 7z file includes the following folders:

  • rgb: This folder contains the RGB images (PNG) of the scenes and their metadata (TXT). The metadata describes the weather conditions and the world time when the image was captured. An example metadata entry is: Weather:Snow_Blizzard,Hour:10,Minute:56,Second:36.

  • depth_pfm: This folder contains absolute depth information of the scenes, which can be used to reconstruct the point cloud of the scene through reprojection.

  • instance_segmentation: This folder stores instance segmentation labels (PNG) for each tree in the scene, along with metadata (TXT) that maps tree_id to RGB_id. The tree_id can be used to look up detailed information about each tree in obj_info_final.xlsx, while the RGB_id can be matched to the corresponding RGB values in color_palette.xlsx. This mapping allows for identifying which tree corresponds to a specific color in the segmentation image.

  • obj_info_final.xlsx: This file contains detailed information about each tree in the scene, such as position, scale, species, and various parameters, including trunk diameter (in cm), tree height (in cm), and canopy diameter (in cm).

  • landscape_info.txt: This file contains the ground location information within the scene, sampled every 0.5 meters.

For birch_forest, broadleaf_forest, redwood_forest and rainforest, we also provided COCO-format annotation files (.json). Two such files can be found in these datasets:

  • {name}_coco.json: This file contains the annotation of each tree in the scene.
  • {name}_filtered.json: This file is derived from the previous one, but filtering is applied to rule out overlapping instances.

⚠️: 7z files that begin with "!" indicate that the RGB values in the images within the instance_segmentation folder cannot be found in color_palette.xlsx. Consequently, this prevents matching the trees in the segmentation images to their corresponding tree information, which may hinder the application of the dataset to certain tasks. This issue is related to a bug in Colossium/AirSim, which has been reported in link1 and link2.

Search
Clear search
Close search
Google apps
Main menu