100+ datasets found

Data from: Meat Price Spreads
catalog.data.gov
agdatacommons.nal.usda.gov
+3more
Updated Apr 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Economic Research Service, Department of Agriculture (2025). Meat Price Spreads [Dataset]. https://catalog.data.gov/dataset/meat-price-spreads
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
Economic Research Servicehttp://www.ers.usda.gov/
Description
This data set provides monthly average price values, and the differences among those values, at the farm, wholesale, and retail stages of the production and marketing chain for selected cuts of beef, pork, and broilers. In addition, retail prices are provided for beef and pork cuts, turkey, whole chickens, eggs, and dairy products. Price spreads are reported for last 6 years, 12 quarters, and 24 months. The retail price file provides monthly estimates for the last 6 months. The historical file provides data since 1970.
ICC Spread - Dataset - Banco Central do Brasil Open Data Portal
opendata.bcb.gov.br
Updated Jan 15, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
bcb.gov.br (2018). ICC Spread - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/27443-icc-spread
Explore at:
Dataset updated
Jan 15, 2018
Dataset provided by
Central Bank of Brazilhttp://www.bc.gov.br/
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
Concept: Difference between average cost of outstanding loans (ICC) and its average funding cost. Comprises both earmarked and nonearmarked operations. Source: Central Bank of Brazil – Statistics Department 27443-icc-spread 27443-icc-spread
m
Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First...
data.mendeley.com
Updated Jul 20, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hasmot Ali (2020). Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First Case And First Death [Dataset]. http://doi.org/10.17632/vw427wzzkk.4
Explore at:
Unique identifier
https://doi.org/10.17632/vw427wzzkk.4
Dataset updated
Jul 20, 2020
Authors
Hasmot Ali
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Contain informative data related to COVID-19 pandemic. Specially, figure out about the First Case and First Death information for every single country. First Case information consist of Date of First Case(s), Number of confirm Case(s) at First Day, Age of the patient(s) of First Case, Last Visited Country and the First Death information consist of Date of First Death and Age of the Patient who died first for every Country mentioning corresponding Continent. The datasets also contain the Binary Matrix of spread chain among different country and region.
Next Day Wildfire Spread
kaggle.com
Updated Aug 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
WildfireSpred (2024). Next Day Wildfire Spread [Dataset]. https://www.kaggle.com/datasets/wildfirespred/next-day-wildfire-spread
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 2, 2024
Dataset provided by
Kaggle
Authors
WildfireSpred
Description
This dataset extends the Next Day Wildfire Spread: North America 2012-2023 dataset with additional variables. The dataset includes information on meteorological conditions, vegetation types, geographical features, and wildfire spread dynamics. It aims to provide a comprehensive resource for studying and predicting wildfire behavior across diverse landscapes.

The dataset includes:

Meteorological variables such as temperature, humidity, wind speed, and precipitation.

Geographical features including elevation, slope, and aspect.

Vegetation types and coverage metrics.

Wildfire spread metrics such as ignition points, burned areas, and spread rates.

Global water surface data.
Parameterizing Spatial Models of Infectious Disease Transmission that...
plos.figshare.com
pdf
Updated Jun 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rajat Malik; Rob Deardon; Grace P. S. Kwong (2023). Parameterizing Spatial Models of Infectious Disease Transmission that Incorporate Infection Time Uncertainty Using Sampling-Based Likelihood Approximations [Dataset]. http://doi.org/10.1371/journal.pone.0146253
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0146253
Dataset updated
Jun 4, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Rajat Malik; Rob Deardon; Grace P. S. Kwong
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
A class of discrete-time models of infectious disease spread, referred to as individual-level models (ILMs), are typically fitted in a Bayesian Markov chain Monte Carlo (MCMC) framework. These models quantify probabilistic outcomes regarding the risk of infection of susceptible individuals due to various susceptibility and transmissibility factors, including their spatial distance from infectious individuals. The infectious pressure from infected individuals exerted on susceptible individuals is intrinsic to these ILMs. Unfortunately, quantifying this infectious pressure for data sets containing many individuals can be computationally burdensome, leading to a time-consuming likelihood calculation and, thus, computationally prohibitive MCMC-based analysis. This problem worsens when using data augmentation to allow for uncertainty in infection times. In this paper, we develop sampling methods that can be used to calculate a fast, approximate likelihood when fitting such disease models. A simple random sampling approach is initially considered followed by various spatially-stratified schemes. We test and compare the performance of our methods with both simulated data and data from the 2001 foot-and-mouth disease (FMD) epidemic in the U.K. Our results indicate that substantial computation savings can be obtained—albeit, of course, with some information loss—suggesting that such techniques may be of use in the analysis of very large epidemic data sets.
Z
Data from: A dataset of Covid-related misinformation videos and their spread...
data.niaid.nih.gov
Updated Feb 24, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Knuutila, Aleksi (2021). A dataset of Covid-related misinformation videos and their spread on social media [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4557827
Explore at:
Dataset updated
Feb 24, 2021
Dataset authored and provided by
Knuutila, Aleksi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains metadata about all Covid-related YouTube videos which circulated on public social media, but which YouTube eventually removed because they contained false information. It describes 8,122 videos that were shared between November 2019 and June 2020. The dataset contains unique identifiers for the videos and social media accounts that shared the videos, statistics on social media engagement and metadata such as video titles and view counts where they were recoverable. We publish the data alongside the code used to produce on Github. The dataset has reuse potential for research studying narratives related to the coronavirus, the impact of social media on knowledge about health and the politics of social media platforms.
Cover smart, do your part, slow the spread. My Stay-at-Home Lab Shows How...
catalog.data.gov
datasets.ai
+1more
Updated Jul 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Standards and Technology (2022). Cover smart, do your part, slow the spread. My Stay-at-Home Lab Shows How Face Coverings Can Slow the Spread of Disease [Dataset]. https://catalog.data.gov/dataset/cover-smart-do-your-part-slow-the-spread-my-stay-at-home-lab-shows-how-face-coverings-can--e4a9e
Explore at:
Dataset updated
Jul 29, 2022
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Description
This dataset illustrates the fluid dynamics of human coughing and breathing by using schlieren imaging. This dataset was used to help inform the general public about the importance of face coverings during the COVID-19 global pandemic.
d
Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised...
catalog.data.gov
agdatacommons.nal.usda.gov
Updated Apr 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agricultural Research Service (2025). Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised Spatiotemporal Deep Neural Networks [Dataset]. https://catalog.data.gov/dataset/data-from-modeling-the-spread-of-a-livestock-disease-with-semi-supervised-spatiotemporal-d-bdd33
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
Agricultural Research Service
Description
This dataset contains the spatiotemporal data used to train the spatiotemporal deep neural networks described in "Modeling the Spread of a Livestock Disease With Semi-Supervised Spatiotemporal Deep Neural Networks". The dataset consists of two sets of NumPy arrays. The first set: X_grid.npy and Y_grid.npy were used to train the convolutional LSTM, while the second set: X_graph.npy, Y_graph.npy, and edge_index.npy were used to train the graph convolutional LSTM. The data consists of spatiotemporally varying environmental and anthropogenic variables along with case reports of vesicular stomatitis. Resources in this dataset:Resource Title: NumPy Arrays of Spatiotemporal Features and VS Cases. File Name: vs_data.zipResource Description: This is a ZIP archive containing five NumPy arrays of spatiotemporal features and geotagged VS cases.Resource Software Recommended: NumPy,url: https://numpy.org/
Worldwide COVID-19 Data from WHO (2025 Edition)
kaggle.com
Updated Jul 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Adil Shamim (2025). Worldwide COVID-19 Data from WHO (2025 Edition) [Dataset]. https://www.kaggle.com/datasets/adilshamim8/worldwide-covid-19-data-from-who
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 3, 2025
Dataset provided by
Kaggle
Authors
Adil Shamim
Description
Dataset Overview

This dataset contains global COVID-19 case and death data by country, collected directly from the official World Health Organization (WHO) COVID-19 Dashboard. It provides a comprehensive view of the pandemic’s impact worldwide, covering the period up to 2025. The dataset is intended for researchers, analysts, and anyone interested in understanding the progression and global effects of COVID-19 through reliable, up-to-date information.

Source Information

Website: WHO COVID-19 Dashboard

Organization: World Health Organization (WHO)

Data Coverage: Global (by country/territory)

Time Period: Up to 2025

The World Health Organization is the United Nations agency responsible for international public health. The WHO COVID-19 Dashboard is a trusted source that aggregates official reports from countries and territories around the world, providing daily updates on cases, deaths, and other key metrics related to COVID-19.

Dataset Contents

Country/Region: The name of the country or territory.

Date: Reporting date.

New Cases: Number of new confirmed COVID-19 cases.

Cumulative Cases: Total confirmed COVID-19 cases to date.

New Deaths: Number of new confirmed deaths due to COVID-19.

Cumulative Deaths: Total deaths reported to date.

Additional fields may include population, rates per 100,000, and more (see data files for details).

How to Use

This dataset can be used for: - Tracking the spread and trends of COVID-19 globally and by country - Modeling and forecasting pandemic progression - Comparative analysis of the pandemic’s impact across countries and regions - Visualization and reporting

Data Reliability

The data is sourced from the WHO, widely regarded as the most authoritative source for global health statistics. However, reporting practices and data completeness may vary by country and may be subject to revision as new information becomes available.

Acknowledgements

Special thanks to the WHO for making this data publicly available and to all those working to collect, verify, and report COVID-19 statistics.
Spread of the ICC - Individuals - Dataset - Banco Central do Brasil Open...
opendata.bcb.gov.br
Updated Jan 15, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
bcb.gov.br (2018). Spread of the ICC - Individuals - Dataset - Banco Central do Brasil Open Data Portal [Dataset]. https://opendata.bcb.gov.br/dataset/27445-spread-of-the-icc---individuals
Explore at:
Dataset updated
Jan 15, 2018
Dataset provided by
Central Bank of Brazilhttp://www.bc.gov.br/
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
Concept: Difference between average cost of outstanding loans (ICC) and its average funding cost. Comprises both earmarked and nonearmarked operations. Source: Central Bank of Brazil – Statistics Department 27445-spread-of-the-icc---individuals 27445-spread-of-the-icc---individuals
f
Supporting dataset for the bachelor thesis: Simulating the Spread of...
figshare.com
data.4tu.nl
mp4
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marko Boon; Nikki Steenbakkers; Bert Zwart (2023). Supporting dataset for the bachelor thesis: Simulating the Spread of COVID-19 in the Netherlands [Dataset]. http://doi.org/10.4121/13536614.v1
Explore at:
mp4Available download formats
Unique identifier
https://doi.org/10.4121/13536614.v1
Dataset updated
May 31, 2023
Dataset provided by
4TU.ResearchData
Authors
Marko Boon; Nikki Steenbakkers; Bert Zwart
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Area covered
Netherlands
Description
These files are videos generated by a stochastic simulation that was created by Nikki Steenbakkers under the supervision of Marko Boon and Bert Zwart (all affiliated with Eindhoven University of Technology) for her bachelor final project "Simulating the Spread of COVID-19 in the Netherlands". The report can be found in the TU/e repository of bachelor project reports:https://research.tue.nl/en/studentTheses/simulating-the-spread-of-covid-19-in-the-netherlandsThe report contains more information about the project and the simulation. It explicitly refers to these files.
o
Global Spread of Conflict by Country and Population - Dataset - Data Catalog...
data.opendata.am
Updated Jul 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Global Spread of Conflict by Country and Population - Dataset - Data Catalog Armenia [Dataset]. https://data.opendata.am/dataset/dcwb0041070
Explore at:
Dataset updated
Jul 7, 2023
Description
This dataset provides the spread of the conflict globally in terms of population and country for the years 2000-2016.
Movies and Tv Shows Dataset
crawlfeeds.com
csv, zip
Updated Jul 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Crawl Feeds (2025). Movies and Tv Shows Dataset [Dataset]. https://crawlfeeds.com/datasets/movies-and-tv-shows-dataset
Explore at:
zip, csvAvailable download formats
Dataset updated
Jul 4, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Description
Explore our meticulously curated Movies dataset and TV shows dataset, designed to cater to diverse analytical and research needs. Whether you're a data scientist, a student, or a business professional, these datasets provide valuable insights into the entertainment industry.

Key Features of the Movies Dataset:

Extensive collection of global movies across various genres and languages.

Detailed metadata, including titles, release dates, genres, directors, cast, and ratings.

Regularly updated to ensure relevance and accuracy.

Why Choose Our TV Shows Dataset?

Our TV shows dataset is your gateway to understanding trends in episodic content. It includes:

Comprehensive details about popular and niche TV shows.

Information on episode counts, seasons, ratings, and networks.

Insights into audience preferences and regional programming.

Applications of These Datasets

These datasets are perfect for:

Machine learning models for recommendation systems.

Academic research on media trends and audience behavior.

Business strategies for entertainment platforms.

Unlock the power of TV show data with our Crawl Feeds TV Shows Dataset. Start analyzing today and gain valuable insights into your favorite shows!
SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest...
zenodo.org
bin, txt
Updated Nov 27, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan (2024). SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part III) [Dataset]. http://doi.org/10.5281/zenodo.14228467
Explore at:
bin, txtAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14228467
Dataset updated
Nov 27, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Zhengpeng Feng; Yihang She; Keshav Srinivasan; Zhengpeng Feng; Yihang She; Keshav Srinivasan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This page only provides point clouds.

For the ground-level image dataset, please visit SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part I).

For the drone-view image dataset, please visit SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest Vision Tasks (Part II).

This dataset contains point clouds collected from different virtual forest scenes. Data from each scene is stored in a separate .7z file, along with a point_cloud_color_palette.txtfile, which contains the Tree_id and corresponding RGB values.

Specifically, each 7z file includes the following folders:

tree: This folder contains the point cloud data of every single tree within the forest scene. Each tree is stored separately in a .ply file including both location and color infomation. For performance reasons, the maximum number of point clouds for each tree is limited to 10,000.

ground: This folder contains a landscape.ply describing the ground information. The color of the point cloud is set to [0,0,0].

The unit of the point cloud is meters (m).
NBA Betting Data | October 2007 to June 2025
kaggle.com
Updated Jun 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
cviaxmiwnptr (2025). NBA Betting Data | October 2007 to June 2025 [Dataset]. https://www.kaggle.com/datasets/cviaxmiwnptr/nba-betting-data-october-2007-to-june-2024
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 24, 2025
Dataset provided by
Kaggle
Authors
cviaxmiwnptr
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Column Labels

season – Year the season ended. For example, the 2018-19 season is encoded as 2019.

date – Date of the game

regular – Regular season game (True or False)

playoffs – Playoff game (True or False)

away – Away team

home – Home team

score_away – Away team's score

score_home – Home team's score

q1_away – Away team's 1st quarter score

q2_away – Away team's 2nd quarter score

q3_away – Away team's 3rd quarter score

q4_away – Away team's 4th quarter score

ot_away – Away team's overtime score

q1_home – Home team's 1st quarter score

q2_home – Home team's 2nd quarter score

q3_home – Home team's 3rd quarter score

q4_home – Home team's 4th quarter score

ot_home – Home team's overtime score

whos_favored – Betting favorite (home or away)

spread – Point spread (always a positive number)

total – Over/Under

moneyline_away – American moneyline odds for away team

moneyline_home – American moneyline odds for home team

h2_spread – Second half point spread

h2_total – Second half over/under

id_spread – 1 if favorite covered, 0 if underdog covered. 2 if push

id_total – 1 if total went over, 0 if under, 2 if push

Data Sources

October 30, 2007 to January 16, 2023: Sportsbook Reviews Online

Since January 17, 2023: ESPN

I scraped SportsbookReviewsOnline.com and fixed a few errors. They seem to have stopped updating the page so all future data will come from ESPN.

Notes

Seattle moved to Oklahoma City beginning in the 2008-09 season. I encode them as okc for consistency.

New Jersey moved to Brooklyn beginning in the 2012-13 season. I encode them as bkn for consistency.

2H and Moneyline odds are absent from the ESPN data (since Jan 2023). Note that ESPN uses non-integer values exclusively so there are no pushes.
d
Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends,...
datarade.ai
.json, .csv
Updated Aug 12, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dataplex (2024). Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends, audience insights + more | Ideal for Interest-Based Segmentation [Dataset]. https://datarade.ai/data-products/dataplex-reddit-data-global-social-media-data-1-1m-mill-dataplex
Explore at:
.json, .csvAvailable download formats
Dataset updated
Aug 12, 2024
Dataset authored and provided by
Dataplex
Area covered
Jersey, Botswana, Martinique, Mexico, Chile, Christmas Island, Gambia, Macao, Holy See, Côte d'Ivoire
Description
The Reddit Subreddit Dataset by Dataplex offers a comprehensive and detailed view of Reddit’s vast ecosystem, now enhanced with appended AI-generated columns that provide additional insights and categorization. This dataset includes data from over 2.1 million subreddits, making it an invaluable resource for a wide range of analytical applications, from social media analysis to market research.

Dataset Overview:

This dataset includes detailed information on subreddit activities, user interactions, post frequency, comment data, and more. The inclusion of AI-generated columns adds an extra layer of analysis, offering sentiment analysis, topic categorization, and predictive insights that help users better understand the dynamics of each subreddit.

2.1 Million Subreddits with Enhanced AI Insights: The dataset covers over 2.1 million subreddits and now includes AI-enhanced columns that provide: - Sentiment Analysis: AI-driven sentiment scores for posts and comments, allowing users to gauge community mood and reactions. - Topic Categorization: Automated categorization of subreddit content into relevant topics, making it easier to filter and analyze specific types of discussions. - Predictive Insights: AI models that predict trends, content virality, and user engagement, helping users anticipate future developments within subreddits.

Sourced Directly from Reddit:

All social media data in this dataset is sourced directly from Reddit, ensuring accuracy and authenticity. The dataset is updated regularly, reflecting the latest trends and user interactions on the platform. This ensures that users have access to the most current and relevant data for their analyses.

Key Features:

Subreddit Metrics: Detailed data on subreddit activity, including the number of posts, comments, votes, and user participation.

User Engagement: Insights into how users interact with content, including comment threads, upvotes/downvotes, and participation rates.

Trending Topics: Track emerging trends and viral content across the platform, helping you stay ahead of the curve in understanding social media dynamics.

AI-Enhanced Analysis: Utilize AI-generated columns for sentiment analysis, topic categorization, and predictive insights, providing a deeper understanding of the data.

Use Cases:

Social Media Analysis: Researchers and analysts can use this dataset to study online behavior, track the spread of information, and understand how content resonates with different audiences.

Market Research: Marketers can leverage the dataset to identify target audiences, understand consumer preferences, and tailor campaigns to specific communities.

Content Strategy: Content creators and strategists can use insights from the dataset to craft content that aligns with trending topics and user interests, maximizing engagement.

Academic Research: Academics can explore the dynamics of online communities, studying everything from the spread of misinformation to the formation of online subcultures.

Data Quality and Reliability:

The Reddit Subreddit Dataset emphasizes data quality and reliability. Each record is carefully compiled from Reddit’s vast database, ensuring that the information is both accurate and up-to-date. The AI-generated columns further enhance the dataset's value, providing automated insights that help users quickly identify key trends and sentiments.

Integration and Usability:

The dataset is provided in a format that is compatible with most data analysis tools and platforms, making it easy to integrate into existing workflows. Users can quickly import, analyze, and utilize the data for various applications, from market research to academic studies.

User-Friendly Structure and Metadata:

The data is organized for easy navigation and analysis, with metadata files included to help users identify relevant subreddits and data points. The AI-enhanced columns are clearly labeled and structured, allowing users to efficiently incorporate these insights into their analyses.

Ideal For:

Data Analysts: Conduct in-depth analyses of subreddit trends, user engagement, and content virality. The dataset’s extensive coverage and AI-enhanced insights make it an invaluable tool for data-driven research.

Marketers: Use the dataset to better understand your target audience, tailor campaigns to specific interests, and track the effectiveness of marketing efforts across Reddit.

Researchers: Explore the social dynamics of online communities, analyze the spread of ideas and information, and study the impact of digital media on public discourse, all while leveraging AI-generated insights.

This dataset is an essential resource for anyone looking to understand the intricacies of Reddit's vast ecosystem, offering the data and AI-enhanced insights needed to drive informed decisions and strategies across various fields. Whether you’re tracking emerging trends, analyzing user behavior, or conduc...
A Twitter Dataset of 100+ million tweets related to COVID-19
zenodo.org
application/gzip, csv +1
Updated Apr 17, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Juan M. Banda; Juan M. Banda; Ramya Tekumalla; Ramya Tekumalla; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding; Gerardo Chowell; Gerardo Chowell; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding (2023). A Twitter Dataset of 100+ million tweets related to COVID-19 [Dataset]. http://doi.org/10.5281/zenodo.3735274
Explore at:
application/gzip, tsv, csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3735274
Dataset updated
Apr 17, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Juan M. Banda; Juan M. Banda; Ramya Tekumalla; Ramya Tekumalla; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding; Gerardo Chowell; Gerardo Chowell; Guanyu Wang; Jingyuan Yu; Tuo Liu; Yuning Ding
Description
Due to the relevance of the COVID-19 global pandemic, we are releasing our dataset of tweets acquired from the Twitter Stream related to COVID-19 chatter. The first 9 weeks of data (from January 1st, 2020 to March 11th, 2020) contain very low tweet counts as we filtered other data we were collecting for other research purposes, however, one can see the dramatic increase as the awareness for the virus spread. Dedicated data gathering started from March 11th to March 30th which yielded over 4 million tweets a day. We have added additional data provided by our new collaborators from January 27th to February 27th, to provide extra longitudinal coverage.

The data collected from the stream captures all languages, but the higher prevalence are: English, Spanish, and French. We release all tweets and retweets on the full_dataset.tsv file (101,400,452 unique tweets), and a cleaned version with no retweets on the full_dataset-clean.tsv file (20,244,746 unique tweets). There are several practical reasons for us to leave the retweets, tracing important tweets and their dissemination is one of them. For NLP tasks we provide the top 1000 frequent terms in frequent_terms.csv, the top 1000 bigrams in frequent_bigrams.csv, and the top 1000 trigrams in frequent_trigrams.csv. Some general statistics per day are included for both datasets in the statistics-full_dataset.tsv and statistics-full_dataset-clean.tsv files.

More details can be found (and will be updated faster at: https://github.com/thepanacealab/covid19_twitter)

As always, the tweets distributed here are only tweet identifiers (with date and time added) due to the terms and conditions of Twitter to re-distribute Twitter data. The need to be hydrated to be used.
T
BANK LENDING DEPOSIT SPREAD WB DATA.HTML by Country Dataset
tradingeconomics.com
csv, excel, json, xml
Updated Jul 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2025). BANK LENDING DEPOSIT SPREAD WB DATA.HTML by Country Dataset [Dataset]. https://tradingeconomics.com/country-list/bank-lending-deposit-spread-wb-data.html/1000
Explore at:
excel, csv, xml, jsonAvailable download formats
Dataset updated
Jul 7, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2025
Area covered
World
Description
This dataset provides values for BANK LENDING DEPOSIT SPREAD WB DATA.HTML reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
Wildfire Risk to Communities Housing Unit Density (Image Service)
catalog.data.gov
resilience.climate.gov
+12more
Updated Apr 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Forest Service (2025). Wildfire Risk to Communities Housing Unit Density (Image Service) [Dataset]. https://catalog.data.gov/dataset/wildfire-risk-to-communities-housing-unit-density-image-service-fac22
Explore at:
Dataset updated
Apr 21, 2025
Dataset provided by
U.S. Department of Agriculture Forest Servicehttp://fs.fed.us/
Description
The data included in this publication depict components of wildfire risk specifically for populated areas in the United States. These datasets represent where people live in the United States and the in situ risk from wildfire, i.e., the risk at the location where the adverse effects take place.National wildfire hazard datasets of annual burn probability and fire intensity, generated by the USDA Forest Service, Rocky Mountain Research Station and Pyrologix LLC, form the foundation of the Wildfire Risk to Communities data. Vegetation and wildland fuels data from LANDFIRE 2020 (version 2.2.0) were used as input to two different but related geospatial fire simulation systems. Annual burn probability was produced with the USFS geospatial fire simulator (FSim) at a relatively coarse cell size of 270 meters (m). To bring the burn probability raster data down to a finer resolution more useful for assessing hazard and risk to communities, we upsampled them to the native 30 m resolution of the LANDFIRE fuel and vegetation data. In this upsampling process, we also spread values of modeled burn probability into developed areas represented in LANDFIRE fuels data as non-burnable. Burn probability rasters represent landscape conditions as of the end of 2020. Fire intensity characteristics were modeled at 30 m resolution using a process that performs a comprehensive set of FlamMap runs spanning the full range of weather-related characteristics that occur during a fire season and then integrates those runs into a variety of results based on the likelihood of those weather types occurring. Before the fire intensity modeling, the LANDFIRE 2020 data were updated to reflect fuels disturbances occurring in 2021 and 2022. As such, the fire intensity datasets represent landscape conditions as of the end of 2022. The data products in this publication that represent where people live, reflect 2021 estimates of housing unit and population counts from the U.S. Census Bureau, combined with building footprint data from Onegeo and USA Structures, both reflecting 2022 conditions.The specific raster datasets included in this publication include:Building Count: Building Count is a 30-m raster representing the count of buildings in the building footprint dataset located within each 30-m pixel.Building Density: Building Density is a 30-m raster representing the density of buildings in the building footprint dataset (buildings per square kilometer [km²]).Building Coverage: Building Coverage is a 30-m raster depicting the percentage of habitable land area covered by building footprints.Population Count (PopCount): PopCount is a 30-m raster with pixel values representing residential population count (persons) in each pixel.Population Density (PopDen): PopDen is a 30-m raster of residential population density (people/km²).Housing Unit Count (HUCount): HUCount is a 30-m raster representing the number of housing units in each pixel.Housing Unit Density (HUDen): HUDen is a 30-m raster of housing-unit density (housing units/km²).Housing Unit Exposure (HUExposure): HUExposure is a 30-m raster that represents the expected number of housing units within a pixel potentially exposed to wildfire in a year. This is a long-term annual average and not intended to represent the actual number of housing units exposed in any specific year.Housing Unit Impact (HUImpact): HUImpact is a 30-m raster that represents the relative potential impact of fire to housing units at any pixel, if a fire were to occur. It is an index that incorporates the general consequences of fire on a home as a function of fire intensity and uses flame length probabilities from wildfire modeling to capture likely intensity of fire.Housing Unit Risk (HURisk): HURisk is a 30-m raster that integrates all four primary elements of wildfire risk - likelihood, intensity, susceptibility, and exposure - on pixels where housing unit density is greater than zero.Additional methodology documentation is provided with the data publication download. Metadata and Downloads.Note: Pixel values in this image service have been altered from the original raster dataset due to data requirements in web services. The service is intended primarily for data visualization. Relative values and spatial patterns have been largely preserved in the service, but users are encouraged to download the source data for quantitative analysis.
COVID-19 Internet Spread
kaggle.com
Updated Apr 11, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bandee (2020). COVID-19 Internet Spread [Dataset]. https://www.kaggle.com/bandee/covid19-internet-spread/tasks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 11, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Bandee
Description
Context

This dataset allows for a comparison between the offline and online spread of COVID-19.

Content

COVID-19 confirmed cases, deaths, recoveries

Google Trends for COVID-19

Wikipedia page views for COVID-19 from different countries

Acknowledgements

The dataset was obtained using the following APIs: https://github.com/pomber/covid19 https://github.com/GeneralMills/pytrends https://wikitech.wikimedia.org/wiki/Analytics/AQS/Pageviews

Inspiration

Can internet traffic data help to understand the spread of the virus?

Facebook

Twitter

Click to copy link

Link copied

Cite

Economic Research Service, Department of Agriculture (2025). Meat Price Spreads [Dataset]. https://catalog.data.gov/dataset/meat-price-spreads

Data from: Meat Price Spreads

Explore at:

Dataset updated

Apr 21, 2025

Dataset provided by

Economic Research Servicehttp://www.ers.usda.gov/

Description

This data set provides monthly average price values, and the differences among those values, at the farm, wholesale, and retail stages of the production and marketing chain for selected cuts of beef, pork, and broilers. In addition, retail prices are provided for beef and pork cuts, turkey, whole chickens, eggs, and dairy products. Price spreads are reported for last 6 years, 12 quarters, and 24 months. The retail price file provides monthly estimates for the last 6 months. The historical file provides data since 1970.

Clear search

Close search

Google apps

Main menu

Data from: Meat Price Spreads

ICC Spread - Dataset - Banco Central do Brasil Open Data Portal

Data for: COVID-19 Dataset: Worldwide Spread Log Including Countries First...

Next Day Wildfire Spread

Parameterizing Spatial Models of Infectious Disease Transmission that...

Data from: A dataset of Covid-related misinformation videos and their spread...

Cover smart, do your part, slow the spread. My Stay-at-Home Lab Shows How...

Data from: Modeling the Spread of a Livestock Disease With Semi-Supervised...

Worldwide COVID-19 Data from WHO (2025 Edition)

Dataset Overview

Source Information

Dataset Contents

How to Use

Data Reliability

Acknowledgements

Spread of the ICC - Individuals - Dataset - Banco Central do Brasil Open...

Supporting dataset for the bachelor thesis: Simulating the Spread of...

Global Spread of Conflict by Country and Population - Dataset - Data Catalog...

Movies and Tv Shows Dataset

Key Features of the Movies Dataset:

Why Choose Our TV Shows Dataset?

Applications of These Datasets

SPREAD: A Large-scale, High-fidelity Synthetic Dataset for Multiple Forest...

NBA Betting Data | October 2007 to June 2025

Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends,...

A Twitter Dataset of 100+ million tweets related to COVID-19

BANK LENDING DEPOSIT SPREAD WB DATA.HTML by Country Dataset

Wildfire Risk to Communities Housing Unit Density (Image Service)

COVID-19 Internet Spread

Context

Content

Acknowledgements

Inspiration

Data from: Meat Price SpreadsSee More Versions

Data from: Meat Price Spreads