100+ datasets found
  1. Data from: Meat Price Spreads

    • catalog.data.gov
    • agdatacommons.nal.usda.gov
    • +3more
    Updated Apr 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Economic Research Service, Department of Agriculture (2025). Meat Price Spreads [Dataset]. https://catalog.data.gov/dataset/meat-price-spreads
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Economic Research Servicehttp://www.ers.usda.gov/
    Description

    This data set provides monthly average price values, and the differences among those values, at the farm, wholesale, and retail stages of the production and marketing chain for selected cuts of beef, pork, and broilers. In addition, retail prices are provided for beef and pork cuts, turkey, whole chickens, eggs, and dairy products. Price spreads are reported for last 6 years, 12 quarters, and 24 months. The retail price file provides monthly estimates for the last 6 months. The historical file provides data since 1970.

  2. ICC Spread - Dataset - Banco Central do Brasil Open Data Portal

    • opendata.bcb.gov.br
    Updated Jan 15, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bcb.gov.br (2018). ICC Spread - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/27443-icc-spread
    Explore at:
    Dataset updated
    Jan 15, 2018
    Dataset provided by
    Central Bank of Brazilhttp://www.bc.gov.br/
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Concept: Difference between average cost of outstanding loans (ICC) and its average funding cost. Comprises both earmarked and nonearmarked operations. Source: Central Bank of Brazil – Statistics Department 27443-icc-spread 27443-icc-spread

  3. m

    Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First...

    • data.mendeley.com
    Updated Jul 20, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hasmot Ali (2020). Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First Case And First Death [Dataset]. http://doi.org/10.17632/vw427wzzkk.4
    Explore at:
    Dataset updated
    Jul 20, 2020
    Authors
    Hasmot Ali
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Contain informative data related to COVID-19 pandemic. Specially, figure out about the First Case and First Death information for every single country. First Case information consist of Date of First Case(s), Number of confirm Case(s) at First Day, Age of the patient(s) of First Case, Last Visited Country and the First Death information consist of Date of First Death and Age of the Patient who died first for every Country mentioning corresponding Continent. The datasets also contain the Binary Matrix of spread chain among different country and region.

  4. Next Day Wildfire Spread

    • kaggle.com
    Updated Aug 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WildfireSpred (2024). Next Day Wildfire Spread [Dataset]. https://www.kaggle.com/datasets/wildfirespred/next-day-wildfire-spread
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 2, 2024
    Dataset provided by
    Kaggle
    Authors
    WildfireSpred
    Description

    This dataset extends the Next Day Wildfire Spread: North America 2012-2023 dataset with additional variables. The dataset includes information on meteorological conditions, vegetation types, geographical features, and wildfire spread dynamics. It aims to provide a comprehensive resource for studying and predicting wildfire behavior across diverse landscapes.

    The dataset includes:

    • Meteorological variables such as temperature, humidity, wind speed, and precipitation.
    • Geographical features including elevation, slope, and aspect.
    • Vegetation types and coverage metrics.
    • Wildfire spread metrics such as ignition points, burned areas, and spread rates.
    • Global water surface data.
  5. Parameterizing Spatial Models of Infectious Disease Transmission that...

    • plos.figshare.com
    pdf
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rajat Malik; Rob Deardon; Grace P. S. Kwong (2023). Parameterizing Spatial Models of Infectious Disease Transmission that Incorporate Infection Time Uncertainty Using Sampling-Based Likelihood Approximations [Dataset]. http://doi.org/10.1371/journal.pone.0146253
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Rajat Malik; Rob Deardon; Grace P. S. Kwong
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A class of discrete-time models of infectious disease spread, referred to as individual-level models (ILMs), are typically fitted in a Bayesian Markov chain Monte Carlo (MCMC) framework. These models quantify probabilistic outcomes regarding the risk of infection of susceptible individuals due to various susceptibility and transmissibility factors, including their spatial distance from infectious individuals. The infectious pressure from infected individuals exerted on susceptible individuals is intrinsic to these ILMs. Unfortunately, quantifying this infectious pressure for data sets containing many individuals can be computationally burdensome, leading to a time-consuming likelihood calculation and, thus, computationally prohibitive MCMC-based analysis. This problem worsens when using data augmentation to allow for uncertainty in infection times. In this paper, we develop sampling methods that can be used to calculate a fast, approximate likelihood when fitting such disease models. A simple random sampling approach is initially considered followed by various spatially-stratified schemes. We test and compare the performance of our methods with both simulated data and data from the 2001 foot-and-mouth disease (FMD) epidemic in the U.K. Our results indicate that substantial computation savings can be obtained—albeit, of course, with some information loss—suggesting that such techniques may be of use in the analysis of very large epidemic data sets.

  6. Z

    Data from: A dataset of Covid-related misinformation videos and their spread...

    • data.niaid.nih.gov
    Updated Feb 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Knuutila, Aleksi (2021). A dataset of Covid-related misinformation videos and their spread on social media [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4557827
    Explore at:
    Dataset updated
    Feb 24, 2021
    Dataset authored and provided by
    Knuutila, Aleksi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains metadata about all Covid-related YouTube videos which circulated on public social media, but which YouTube eventually removed because they contained false information. It describes 8,122 videos that were shared between November 2019 and June 2020. The dataset contains unique identifiers for the videos and social media accounts that shared the videos, statistics on social media engagement and metadata such as video titles and view counts where they were recoverable. We publish the data alongside the code used to produce on Github. The dataset has reuse potential for research studying narratives related to the coronavirus, the impact of social media on knowledge about health and the politics of social media platforms.

  7. Cover smart, do your part, slow the spread. My Stay-at-Home Lab Shows How...

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Jul 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). Cover smart, do your part, slow the spread. My Stay-at-Home Lab Shows How Face Coverings Can Slow the Spread of Disease [Dataset]. https://catalog.data.gov/dataset/cover-smart-do-your-part-slow-the-spread-my-stay-at-home-lab-shows-how-face-coverings-can--e4a9e
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    This dataset illustrates the fluid dynamics of human coughing and breathing by using schlieren imaging. This dataset was used to help inform the general public about the importance of face coverings during the COVID-19 global pandemic.

  8. d

    Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised...

    • catalog.data.gov
    • agdatacommons.nal.usda.gov
    Updated Apr 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agricultural Research Service (2025). Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised Spatiotemporal Deep Neural Networks [Dataset]. https://catalog.data.gov/dataset/data-from-modeling-the-spread-of-a-livestock-disease-with-semi-supervised-spatiotemporal-d-bdd33
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    Agricultural Research Service
    Description

    This dataset contains the spatiotemporal data used to train the spatiotemporal deep neural networks described in "Modeling the Spread of a Livestock Disease With Semi-Supervised Spatiotemporal Deep Neural Networks". The dataset consists of two sets of NumPy arrays. The first set: X_grid.npy and Y_grid.npy were used to train the convolutional LSTM, while the second set: X_graph.npy, Y_graph.npy, and edge_index.npy were used to train the graph convolutional LSTM. The data consists of spatiotemporally varying environmental and anthropogenic variables along with case reports of vesicular stomatitis. Resources in this dataset:Resource Title: NumPy Arrays of Spatiotemporal Features and VS Cases. File Name: vs_data.zipResource Description: This is a ZIP archive containing five NumPy arrays of spatiotemporal features and geotagged VS cases.Resource Software Recommended: NumPy,url: https://numpy.org/

  9. Worldwide COVID-19 Data from WHO (2025 Edition)

    • kaggle.com
    Updated Jul 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adil Shamim (2025). Worldwide COVID-19 Data from WHO (2025 Edition) [Dataset]. https://www.kaggle.com/datasets/adilshamim8/worldwide-covid-19-data-from-who
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 3, 2025
    Dataset provided by
    Kaggle
    Authors
    Adil Shamim
    Description

    Dataset Overview

    This dataset contains global COVID-19 case and death data by country, collected directly from the official World Health Organization (WHO) COVID-19 Dashboard. It provides a comprehensive view of the pandemic’s impact worldwide, covering the period up to 2025. The dataset is intended for researchers, analysts, and anyone interested in understanding the progression and global effects of COVID-19 through reliable, up-to-date information.

    Source Information

    • Website: WHO COVID-19 Dashboard
    • Organization: World Health Organization (WHO)
    • Data Coverage: Global (by country/territory)
    • Time Period: Up to 2025

    The World Health Organization is the United Nations agency responsible for international public health. The WHO COVID-19 Dashboard is a trusted source that aggregates official reports from countries and territories around the world, providing daily updates on cases, deaths, and other key metrics related to COVID-19.

    Dataset Contents

    • Country/Region: The name of the country or territory.
    • Date: Reporting date.
    • New Cases: Number of new confirmed COVID-19 cases.
    • Cumulative Cases: Total confirmed COVID-19 cases to date.
    • New Deaths: Number of new confirmed deaths due to COVID-19.
    • Cumulative Deaths: Total deaths reported to date.
    • Additional fields may include population, rates per 100,000, and more (see data files for details).

    How to Use

    This dataset can be used for: - Tracking the spread and trends of COVID-19 globally and by country - Modeling and forecasting pandemic progression - Comparative analysis of the pandemic’s impact across countries and regions - Visualization and reporting

    Data Reliability

    The data is sourced from the WHO, widely regarded as the most authoritative source for global health statistics. However, reporting practices and data completeness may vary by country and may be subject to revision as new information becomes available.

    Acknowledgements

    Special thanks to the WHO for making this data publicly available and to all those working to collect, verify, and report COVID-19 statistics.

  10. Spread of the ICC - Individuals - Dataset - Banco Central do Brasil Open...

    • opendata.bcb.gov.br
    Updated Jan 15, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bcb.gov.br (2018). Spread of the ICC - Individuals - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/27445-spread-of-the-icc---individuals
    Explore at:
    Dataset updated
    Jan 15, 2018
    Dataset provided by
    Central Bank of Brazilhttp://www.bc.gov.br/
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Concept: Difference between average cost of outstanding loans (ICC) and its average funding cost. Comprises both earmarked and nonearmarked operations. Source: Central Bank of Brazil – Statistics Department 27445-spread-of-the-icc---individuals 27445-spread-of-the-icc---individuals

  11. f

    Supporting dataset for the bachelor thesis: Simulating the Spread of...

    • figshare.com
    • data.4tu.nl
    mp4
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marko Boon; Nikki Steenbakkers; Bert Zwart (2023). Supporting dataset for the bachelor thesis: Simulating the Spread of COVID-19 in the Netherlands [Dataset]. http://doi.org/10.4121/13536614.v1
    Explore at:
    mp4Available download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    4TU.ResearchData
    Authors
    Marko Boon; Nikki Steenbakkers; Bert Zwart
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Area covered
    Netherlands
    Description

    These files are videos generated by a stochastic simulation that was created by Nikki Steenbakkers under the supervision of Marko Boon and Bert Zwart (all affiliated with Eindhoven University of Technology) for her bachelor final project "Simulating the Spread of COVID-19 in the Netherlands". The report can be found in the TU/e repository of bachelor project reports:https://research.tue.nl/en/studentTheses/simulating-the-spread-of-covid-19-in-the-netherlandsThe report contains more information about the project and the simulation. It explicitly refers to these files.

  12. o

    Global Spread of Conflict by Country and Population - Dataset - Data Catalog...

    • data.opendata.am
    Updated Jul 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Global Spread of Conflict by Country and Population - Dataset - Data Catalog Armenia [Dataset]. https://data.opendata.am/dataset/dcwb0041070
    Explore at:
    Dataset updated
    Jul 7, 2023
    Description

    This dataset provides the spread of the conflict globally in terms of population and country for the years 2000-2016.

  13. Movies and Tv Shows Dataset

    • crawlfeeds.com
    csv, zip
    Updated Jul 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2025). Movies and Tv Shows Dataset [Dataset]. https://crawlfeeds.com/datasets/movies-and-tv-shows-dataset
    Explore at:
    zip, csvAvailable download formats
    Dataset updated
    Jul 4, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy

    Description

    Explore our meticulously curated Movies dataset and TV shows dataset, designed to cater to diverse analytical and research needs. Whether you're a data scientist, a student, or a business professional, these datasets provide valuable insights into the entertainment industry.

    Key Features of the Movies Dataset:

    1. Extensive collection of global movies across various genres and languages.

    2. Detailed metadata, including titles, release dates, genres, directors, cast, and ratings.

    3. Regularly updated to ensure relevance and accuracy.

    Why Choose Our TV Shows Dataset?

    Our TV shows dataset is your gateway to understanding trends in episodic content. It includes:

    • Comprehensive details about popular and niche TV shows.

    • Information on episode counts, seasons, ratings, and networks.

    • Insights into audience preferences and regional programming.

    Applications of These Datasets

    These datasets are perfect for:

    • Machine learning models for recommendation systems.

    • Academic research on media trends and audience behavior.

    • Business strategies for entertainment platforms.

    Unlock the power of TV show data with our Crawl Feeds TV Shows Dataset. Start analyzing today and gain valuable insights into your favorite shows!

  14. SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest...

    • zenodo.org
    bin, txt
    Updated Nov 27, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan (2024). SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part III) [Dataset]. http://doi.org/10.5281/zenodo.14228467
    Explore at:
    bin, txtAvailable download formats
    Dataset updated
    Nov 27, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This page only provides point clouds.

    This dataset contains point clouds collected from different virtual forest scenes. Data from each scene is stored in a separate .7z file, along with a point_cloud_color_palette.txtfile, which contains the Tree_id and corresponding RGB values.

    Specifically, each 7z file includes the following folders:

    • tree: This folder contains the point cloud data of every single tree within the forest scene. Each tree is stored separately in a .ply file including both location and color infomation. For performance reasons, the maximum number of point clouds for each tree is limited to 10,000.

    • ground: This folder contains a landscape.ply describing the ground information. The color of the point cloud is set to [0,0,0].

    The unit of the point cloud is meters (m).

  15. NBA Betting Data | October 2007 to June 2025

    • kaggle.com
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    cviaxmiwnptr (2025). NBA Betting Data | October 2007 to June 2025 [Dataset]. https://www.kaggle.com/datasets/cviaxmiwnptr/nba-betting-data-october-2007-to-june-2024
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 24, 2025
    Dataset provided by
    Kaggle
    Authors
    cviaxmiwnptr
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Column Labels

    • season – Year the season ended. For example, the 2018-19 season is encoded as 2019.
    • date – Date of the game
    • regular – Regular season game (True or False)
    • playoffs – Playoff game (True or False)
    • away – Away team
    • home – Home team
    • score_away – Away team's score
    • score_home – Home team's score
    • q1_away – Away team's 1st quarter score
    • q2_away – Away team's 2nd quarter score
    • q3_away – Away team's 3rd quarter score
    • q4_away – Away team's 4th quarter score
    • ot_away – Away team's overtime score
    • q1_home – Home team's 1st quarter score
    • q2_home – Home team's 2nd quarter score
    • q3_home – Home team's 3rd quarter score
    • q4_home – Home team's 4th quarter score
    • ot_home – Home team's overtime score
    • whos_favored – Betting favorite (home or away)
    • spread – Point spread (always a positive number)
    • total – Over/Under
    • moneyline_away – American moneyline odds for away team
    • moneyline_home – American moneyline odds for home team
    • h2_spread – Second half point spread
    • h2_total – Second half over/under
    • id_spread – 1 if favorite covered, 0 if underdog covered. 2 if push
    • id_total – 1 if total went over, 0 if under, 2 if push

    Data Sources

    I scraped SportsbookReviewsOnline.com and fixed a few errors. They seem to have stopped updating the page so all future data will come from ESPN.

    Notes

    Seattle moved to Oklahoma City beginning in the 2008-09 season. I encode them as okc for consistency.

    New Jersey moved to Brooklyn beginning in the 2012-13 season. I encode them as bkn for consistency.

    2H and Moneyline odds are absent from the ESPN data (since Jan 2023). Note that ESPN uses non-integer values exclusively so there are no pushes.

  16. d

    Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends,...

    • datarade.ai
    .json, .csv
    Updated Aug 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataplex (2024). Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends, audience insights + more | Ideal for Interest-Based Segmentation [Dataset]. https://datarade.ai/data-products/dataplex-reddit-data-global-social-media-data-1-1m-mill-dataplex
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Aug 12, 2024
    Dataset authored and provided by
    Dataplex
    Area covered
    Jersey, Botswana, Martinique, Mexico, Chile, Christmas Island, Gambia, Macao, Holy See, Côte d'Ivoire
    Description

    The Reddit Subreddit Dataset by Dataplex offers a comprehensive and detailed view of Reddit’s vast ecosystem, now enhanced with appended AI-generated columns that provide additional insights and categorization. This dataset includes data from over 2.1 million subreddits, making it an invaluable resource for a wide range of analytical applications, from social media analysis to market research.

    Dataset Overview:

    This dataset includes detailed information on subreddit activities, user interactions, post frequency, comment data, and more. The inclusion of AI-generated columns adds an extra layer of analysis, offering sentiment analysis, topic categorization, and predictive insights that help users better understand the dynamics of each subreddit.

    2.1 Million Subreddits with Enhanced AI Insights: The dataset covers over 2.1 million subreddits and now includes AI-enhanced columns that provide: - Sentiment Analysis: AI-driven sentiment scores for posts and comments, allowing users to gauge community mood and reactions. - Topic Categorization: Automated categorization of subreddit content into relevant topics, making it easier to filter and analyze specific types of discussions. - Predictive Insights: AI models that predict trends, content virality, and user engagement, helping users anticipate future developments within subreddits.

    Sourced Directly from Reddit:

    All social media data in this dataset is sourced directly from Reddit, ensuring accuracy and authenticity. The dataset is updated regularly, reflecting the latest trends and user interactions on the platform. This ensures that users have access to the most current and relevant data for their analyses.

    Key Features:

    • Subreddit Metrics: Detailed data on subreddit activity, including the number of posts, comments, votes, and user participation.
    • User Engagement: Insights into how users interact with content, including comment threads, upvotes/downvotes, and participation rates.
    • Trending Topics: Track emerging trends and viral content across the platform, helping you stay ahead of the curve in understanding social media dynamics.
    • AI-Enhanced Analysis: Utilize AI-generated columns for sentiment analysis, topic categorization, and predictive insights, providing a deeper understanding of the data.

    Use Cases:

    • Social Media Analysis: Researchers and analysts can use this dataset to study online behavior, track the spread of information, and understand how content resonates with different audiences.
    • Market Research: Marketers can leverage the dataset to identify target audiences, understand consumer preferences, and tailor campaigns to specific communities.
    • Content Strategy: Content creators and strategists can use insights from the dataset to craft content that aligns with trending topics and user interests, maximizing engagement.
    • Academic Research: Academics can explore the dynamics of online communities, studying everything from the spread of misinformation to the formation of online subcultures.

    Data Quality and Reliability:

    The Reddit Subreddit Dataset emphasizes data quality and reliability. Each record is carefully compiled from Reddit’s vast database, ensuring that the information is both accurate and up-to-date. The AI-generated columns further enhance the dataset's value, providing automated insights that help users quickly identify key trends and sentiments.

    Integration and Usability:

    The dataset is provided in a format that is compatible with most data analysis tools and platforms, making it easy to integrate into existing workflows. Users can quickly import, analyze, and utilize the data for various applications, from market research to academic studies.

    User-Friendly Structure and Metadata:

    The data is organized for easy navigation and analysis, with metadata files included to help users identify relevant subreddits and data points. The AI-enhanced columns are clearly labeled and structured, allowing users to efficiently incorporate these insights into their analyses.

    Ideal For:

    • Data Analysts: Conduct in-depth analyses of subreddit trends, user engagement, and content virality. The dataset’s extensive coverage and AI-enhanced insights make it an invaluable tool for data-driven research.
    • Marketers: Use the dataset to better understand your target audience, tailor campaigns to specific interests, and track the effectiveness of marketing efforts across Reddit.
    • Researchers: Explore the social dynamics of online communities, analyze the spread of ideas and information, and study the impact of digital media on public discourse, all while leveraging AI-generated insights.

    This dataset is an essential resource for anyone looking to understand the intricacies of Reddit's vast ecosystem, offering the data and AI-enhanced insights needed to drive informed decisions and strategies across various fields. Whether you’re tracking emerging trends, analyzing user behavior, or conduc...

  17. A Twitter Dataset of 100+ million tweets related to COVID-19

    • zenodo.org
    application/gzip, csv +1
    Updated Apr 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juan M. Banda; Juan M. Banda; Ramya Tekumalla; Ramya Tekumalla; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding; Gerardo Chowell; Gerardo Chowell; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding (2023). A Twitter Dataset of 100+ million tweets related to COVID-19 [Dataset]. http://doi.org/10.5281/zenodo.3735274
    Explore at:
    application/gzip, tsv, csvAvailable download formats
    Dataset updated
    Apr 17, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Juan M. Banda; Juan M. Banda; Ramya Tekumalla; Ramya Tekumalla; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding; Gerardo Chowell; Gerardo Chowell; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding
    Description

    Due to the relevance of the COVID-19 global pandemic, we are releasing our dataset of tweets acquired from the Twitter Stream related to COVID-19 chatter. The first 9 weeks of data (from January 1st, 2020 to March 11th, 2020) contain very low tweet counts as we filtered other data we were collecting for other research purposes, however, one can see the dramatic increase as the awareness for the virus spread. Dedicated data gathering started from March 11th to March 30th which yielded over 4 million tweets a day. We have added additional data provided by our new collaborators from January 27th to February 27th, to provide extra longitudinal coverage.

    The data collected from the stream captures all languages, but the higher prevalence are: English, Spanish, and French. We release all tweets and retweets on the full_dataset.tsv file (101,400,452 unique tweets), and a cleaned version with no retweets on the full_dataset-clean.tsv file (20,244,746 unique tweets). There are several practical reasons for us to leave the retweets, tracing important tweets and their dissemination is one of them. For NLP tasks we provide the top 1000 frequent terms in frequent_terms.csv, the top 1000 bigrams in frequent_bigrams.csv, and the top 1000 trigrams in frequent_trigrams.csv. Some general statistics per day are included for both datasets in the statistics-full_dataset.tsv and statistics-full_dataset-clean.tsv files.

    More details can be found (and will be updated faster at: https://github.com/thepanacealab/covid19_twitter)

    As always, the tweets distributed here are only tweet identifiers (with date and time added) due to the terms and conditions of Twitter to re-distribute Twitter data. The need to be hydrated to be used.

  18. T

    BANK LENDING DEPOSIT SPREAD WB DATA.HTML by Country Dataset

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Jul 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). BANK LENDING DEPOSIT SPREAD WB DATA.HTML by Country Dataset [Dataset]. https://tradingeconomics.com/country-list/bank-lending-deposit-spread-wb-data.html/1000
    Explore at:
    excel, csv, xml, jsonAvailable download formats
    Dataset updated
    Jul 7, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    World
    Description

    This dataset provides values for BANK LENDING DEPOSIT SPREAD WB DATA.HTML reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

  19. Wildfire Risk to Communities Housing Unit Density (Image Service)

    • catalog.data.gov
    • resilience.climate.gov
    • +12more
    Updated Apr 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Forest Service (2025). Wildfire Risk to Communities Housing Unit Density (Image Service) [Dataset]. https://catalog.data.gov/dataset/wildfire-risk-to-communities-housing-unit-density-image-service-fac22
    Explore at:
    Dataset updated
    Apr 21, 2025
    Dataset provided by
    U.S. Department of Agriculture Forest Servicehttp://fs.fed.us/
    Description

    The data included in this publication depict components of wildfire risk specifically for populated areas in the United States. These datasets represent where people live in the United States and the in situ risk from wildfire, i.e., the risk at the location where the adverse effects take place.National wildfire hazard datasets of annual burn probability and fire intensity, generated by the USDA Forest Service, Rocky Mountain Research Station and Pyrologix LLC, form the foundation of the Wildfire Risk to Communities data. Vegetation and wildland fuels data from LANDFIRE 2020 (version 2.2.0) were used as input to two different but related geospatial fire simulation systems. Annual burn probability was produced with the USFS geospatial fire simulator (FSim) at a relatively coarse cell size of 270 meters (m). To bring the burn probability raster data down to a finer resolution more useful for assessing hazard and risk to communities, we upsampled them to the native 30 m resolution of the LANDFIRE fuel and vegetation data. In this upsampling process, we also spread values of modeled burn probability into developed areas represented in LANDFIRE fuels data as non-burnable. Burn probability rasters represent landscape conditions as of the end of 2020. Fire intensity characteristics were modeled at 30 m resolution using a process that performs a comprehensive set of FlamMap runs spanning the full range of weather-related characteristics that occur during a fire season and then integrates those runs into a variety of results based on the likelihood of those weather types occurring. Before the fire intensity modeling, the LANDFIRE 2020 data were updated to reflect fuels disturbances occurring in 2021 and 2022. As such, the fire intensity datasets represent landscape conditions as of the end of 2022. The data products in this publication that represent where people live, reflect 2021 estimates of housing unit and population counts from the U.S. Census Bureau, combined with building footprint data from Onegeo and USA Structures, both reflecting 2022 conditions.The specific raster datasets included in this publication include:Building Count: Building Count is a 30-m raster representing the count of buildings in the building footprint dataset located within each 30-m pixel.Building Density: Building Density is a 30-m raster representing the density of buildings in the building footprint dataset (buildings per square kilometer [km²]).Building Coverage: Building Coverage is a 30-m raster depicting the percentage of habitable land area covered by building footprints.Population Count (PopCount): PopCount is a 30-m raster with pixel values representing residential population count (persons) in each pixel.Population Density (PopDen): PopDen is a 30-m raster of residential population density (people/km²).Housing Unit Count (HUCount): HUCount is a 30-m raster representing the number of housing units in each pixel.Housing Unit Density (HUDen): HUDen is a 30-m raster of housing-unit density (housing units/km²).Housing Unit Exposure (HUExposure): HUExposure is a 30-m raster that represents the expected number of housing units within a pixel potentially exposed to wildfire in a year. This is a long-term annual average and not intended to represent the actual number of housing units exposed in any specific year.Housing Unit Impact (HUImpact): HUImpact is a 30-m raster that represents the relative potential impact of fire to housing units at any pixel, if a fire were to occur. It is an index that incorporates the general consequences of fire on a home as a function of fire intensity and uses flame length probabilities from wildfire modeling to capture likely intensity of fire.Housing Unit Risk (HURisk): HURisk is a 30-m raster that integrates all four primary elements of wildfire risk - likelihood, intensity, susceptibility, and exposure - on pixels where housing unit density is greater than zero.Additional methodology documentation is provided with the data publication download. Metadata and Downloads.Note: Pixel values in this image service have been altered from the original raster dataset due to data requirements in web services. The service is intended primarily for data visualization. Relative values and spatial patterns have been largely preserved in the service, but users are encouraged to download the source data for quantitative analysis.

  20. COVID-19 Internet Spread

    • kaggle.com
    Updated Apr 11, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bandee (2020). COVID-19 Internet Spread [Dataset]. https://www.kaggle.com/bandee/covid19-internet-spread/tasks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 11, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Bandee
    Description

    Context

    This dataset allows for a comparison between the offline and online spread of COVID-19.

    Content

    • COVID-19 confirmed cases, deaths, recoveries
    • Google Trends for COVID-19
    • Wikipedia page views for COVID-19 from different countries

    Acknowledgements

    The dataset was obtained using the following APIs: https://github.com/pomber/covid19 https://github.com/GeneralMills/pytrends https://wikitech.wikimedia.org/wiki/Analytics/AQS/Pageviews

    Inspiration

    Can internet traffic data help to understand the spread of the virus?

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Economic Research Service, Department of Agriculture (2025). Meat Price Spreads [Dataset]. https://catalog.data.gov/dataset/meat-price-spreads
Organization logo

Data from: Meat Price Spreads

Related Article
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
Economic Research Servicehttp://www.ers.usda.gov/
Description

This data set provides monthly average price values, and the differences among those values, at the farm, wholesale, and retail stages of the production and marketing chain for selected cuts of beef, pork, and broilers. In addition, retail prices are provided for beef and pork cuts, turkey, whole chickens, eggs, and dairy products. Price spreads are reported for last 6 years, 12 quarters, and 24 months. The retail price file provides monthly estimates for the last 6 months. The historical file provides data since 1970.

Search
Clear search
Close search
Google apps
Main menu