100+ datasets found
  1. f

    Supplementary file 1_Modeling the dynamics of misinformation spread: a...

    • frontiersin.figshare.com
    zip
    Updated Oct 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kurunandan Jain; Krishnashree Achuthan (2025). Supplementary file 1_Modeling the dynamics of misinformation spread: a multi-scenario analysis incorporating user awareness and generative AI impact.zip [Dataset]. http://doi.org/10.3389/fcomp.2025.1570085.s001
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 3, 2025
    Dataset provided by
    Frontiers
    Authors
    Kurunandan Jain; Krishnashree Achuthan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The proliferation of misinformation on social media threatens public trust, public health, and democratic processes. We propose three models that analyze fake news propagation and evaluate intervention strategies. Grounded in epidemiological dynamics, the models include: (1) a baseline Awareness Spread Model (ASM), (2) an Extended Model with fact-checking (EM), and (3) a Generative AI-Influenced Spread model (GIFS). Each incorporates user behavior, platform-specific dynamics, and cognitive biases such as confirmation bias and emotional contagion. We simulate six distinct scenarios: (1) Accurate Content Environment, (2) Peer Network Dynamics, (3) Emotional Engagement, (4) Belief Alignment, (5) Source Trust, and (6) Platform Intervention. All models converge to a single, stable equilibrium. Sensitivity analysis across key parameters confirms model robustness and generalizability. In the ASM, forwarding rates were lowest in scenarios 1, 4, and 6 (1.47%, 3.41%, 2.95%) and significantly higher in 2, 3, and 5 (19.67%, 56.52%, 29.47%). The EM showed that fact-checking reduced spread to as low as 0.73%, with scenario-based variation from 1.16 to 17.47%. The GIFS model revealed that generative AI amplified spread by 5.7%–37.8%, depending on context. ASM highlights the importance of awareness; EM demonstrates the effectiveness of fact-checking mechanisms; GIFS underscores the amplifying impact of generative AI tools. Early intervention, coupled with targeted platform moderation (scenarios 1, 4, 6), consistently yields the lowest misinformation spread, while emotionally resonant content (scenario 3) consistently drives the highest propagation.

  2. Raw frequency data specifying the number and percentage of COVID-19 patients...

    • plos.figshare.com
    xlsx
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joseph R. Larsen; Margaret R. Martin; John D. Martin; James B. Hicks; Peter Kuhn (2023). Raw frequency data specifying the number and percentage of COVID-19 patients that experienced discernible symptoms from clinical datasets from China [20] and USA [17]. [Dataset]. http://doi.org/10.1371/journal.pcbi.1009629.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Joseph R. Larsen; Margaret R. Martin; John D. Martin; James B. Hicks; Peter Kuhn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    These frequencies were used to simulate patients to find the likeliest paths of symptom onset for discernible symptoms of COVID-19. The dataset from China contains 55,924 patients, and the dataset from USA contains 373,883 patients. (XLSX)

  3. q

    1-111-SpreadOfInformation-ModelingScenario

    • qubeshub.org
    Updated May 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeff Pettit (2022). 1-111-SpreadOfInformation-ModelingScenario [Dataset]. http://doi.org/10.25334/QA42-WD34
    Explore at:
    Dataset updated
    May 22, 2022
    Dataset provided by
    QUBES
    Authors
    Jeff Pettit
    Description

    Students perform experiments to model spread of information within a population. Students collect data, determine essential components and parameters and build a mathematical model culminating with a separable linear first order differential equation.

  4. o

    Data and Code for: The Family Origin of the Math Gender Gap is a White...

    • openicpsr.org
    Updated Apr 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaia Dossi; David N. Figlio; Paola Giuliano; Paola Sapienza (2021). Data and Code for: The Family Origin of the Math Gender Gap is a White Affluent Phenomenon [Dataset]. http://doi.org/10.3886/E139121V1
    Explore at:
    Dataset updated
    Apr 28, 2021
    Dataset provided by
    American Economic Association
    Authors
    Gaia Dossi; David N. Figlio; Paola Giuliano; Paola Sapienza
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Previous research has shown that norms around the role of women in society could help explain the gender gap in mathematics, and that these norms could be transmitted within the family. Using data from the Florida Department of Education combined with birth certificates we uncover important heterogeneity in the transmission of gender biases within the family. We find that gender role norms can explain the lower performance of girls in mathematics only in relatively affluent White families, whereas they do not apparently matter for the performance of Black girls.

  5. q

    2012-Arvind_Kumar_Misra-A simple mathematical model for the spread of two...

    • qubeshub.org
    Updated Apr 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arvind Misra (2023). 2012-Arvind_Kumar_Misra-A simple mathematical model for the spread of two political parties [Dataset]. http://doi.org/10.25334/SR44-PR13
    Explore at:
    Dataset updated
    Apr 7, 2023
    Dataset provided by
    QUBES
    Authors
    Arvind Misra
    Description

    In this paper, a non-linear mathematical model for the spread of two political parties has been proposed and analyzed by using epidemiological approach.

  6. Appendix A. Technical note on testing random walk hypotheses, derivation of...

    • wiley.figshare.com
    html
    Updated Jun 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Perry de Valpine; Kim Cuddington; Martha F. Hoopes; Julie L. Lockwood (2023). Appendix A. Technical note on testing random walk hypotheses, derivation of dynamics under "spread regulation" of new-site ratios, models used for simulated spread scenarios, and mathematical approximation showing approximate "spread regulation" expected slope of -1. [Dataset]. http://doi.org/10.6084/m9.figshare.3529799.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jun 12, 2023
    Dataset provided by
    Wileyhttps://www.wiley.com/
    Authors
    Perry de Valpine; Kim Cuddington; Martha F. Hoopes; Julie L. Lockwood
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Technical note on testing random walk hypotheses, derivation of dynamics under "spread regulation" of new-site ratios, models used for simulated spread scenarios, and mathematical approximation showing approximate "spread regulation" expected slope of -1.

  7. q

    1972-R_C_Rothermel-A Mathematical Model for Predicting Fire Spread in...

    • qubeshub.org
    Updated Mar 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    R models. (2023). 1972-R_C_Rothermel-A Mathematical Model for Predicting Fire Spread in Wildland Fuels [Dataset]. http://doi.org/10.25334/RP5B-5S10
    Explore at:
    Dataset updated
    Mar 23, 2023
    Dataset provided by
    QUBES
    Authors
    R models.
    Description

    The development of a mathematical model for predicting rate of fire spread and intensity applicable to a wide range of wildland fuels is presented from the conceptual stage through evaluation and demonstration of results to hypothetical fuel models.

  8. f

    Data_Sheet_1_Sexual Contact Patterns in High-Income Countries—A Comparative...

    • frontiersin.figshare.com
    • figshare.com
    docx
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Damilola Victoria Tomori; Johannes Horn; Nicole Rübsamen; Sven Kleine Bardenhorst; Christoph Kröger; Veronika K. Jaeger; André Karch; Rafael Mikolajczyk (2023). Data_Sheet_1_Sexual Contact Patterns in High-Income Countries—A Comparative Analysis Using Data From Germany, the United Kingdom, and the United States.docx [Dataset]. http://doi.org/10.3389/fepid.2022.858789.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Frontiers
    Authors
    Damilola Victoria Tomori; Johannes Horn; Nicole Rübsamen; Sven Kleine Bardenhorst; Christoph Kröger; Veronika K. Jaeger; André Karch; Rafael Mikolajczyk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States, Germany, United Kingdom
    Description

    Sexual contact patterns determine the spread of sexually transmitted infections and are a central input parameter for mathematical models in this field. We evaluated the importance of country-specific sexual contact pattern parametrization for high-income countries with similar cultural backgrounds by comparing data from two independent studies (HaBIDS and SBG) in Germany, a country without systematic sexual contact pattern data, with data from the National Survey of Sexual Attitudes and Lifestyles (Natsal) in the UK, and the National Survey of Family Growth (NSFG) in the US, the two longest running sexual contact studies in high-income countries. We investigated differences in the distribution of the reported number of opposite-sex partners, same-sex partners and both-sex partners using weighted negative binomial regression adjusted for age and sex (as well as stratified by age). In our analyses, UK and US participants reported a substantially higher number of lifetime opposite-sex sexual partners compared to both German studies. The difference in lifetime partners was caused by a higher proportion of individuals with many partners in the young age group (

  9. d

    The role of geospatial hotspots in the spatial spread of tuberculosis in...

    • datadryad.org
    zip
    Updated Sep 11, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Debebe Shaweno; James M. Trauer; Justin T. Denholm; Emma S. McBryde (2018). The role of geospatial hotspots in the spatial spread of tuberculosis in rural Ethiopia: a mathematical modelling [Dataset]. http://doi.org/10.5061/dryad.fg3js19
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 11, 2018
    Dataset provided by
    Dryad
    Authors
    Debebe Shaweno; James M. Trauer; Justin T. Denholm; Emma S. McBryde
    Time period covered
    Jun 8, 2018
    Area covered
    Ethiopia
    Description

    spatial_mathematical_modelThe file contains 1) a system of ordinary differential equations used in the model and 2). a model runner that calls the function

  10. r

    MATLAB code and output files for integral, mean and covariance of the...

    • researchdata.edu.au
    Updated 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adams Matthew (2022). MATLAB code and output files for integral, mean and covariance of the simplex-truncated multivariate normal distribution [Dataset]. http://doi.org/10.25912/RDF_1660176734022
    Explore at:
    Dataset updated
    2022
    Dataset provided by
    Queensland University of Technology
    Authors
    Adams Matthew
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 14, 2022 - Jul 22, 2022
    Description

    Compositional data, which is data consisting of fractions or probabilities, is common in many fields including ecology, economics, physical science and political science. If these data would otherwise be normally distributed, their spread can be conveniently represented by a multivariate normal distribution truncated to the non-negative space under a unit simplex. Here this distribution is called the simplex-truncated multivariate normal distribution. For calculations on truncated distributions, it is often useful to obtain rapid estimates of their integral, mean and covariance; these quantities characterising the truncated distribution will generally possess different values to the corresponding non-truncated distribution.

    In the paper "Adams, Matthew (2022) Integral, mean and covariance of the simplex-truncated multivariate normal distribution. PLoS One, 17(7), Article number: e0272014. ", three different approaches that can estimate the integral, mean and covariance of any simplex-truncated multivariate normal distribution are described and compared. These three approaches are (1) naive rejection sampling, (2) a method described by Gessner et al. that unifies subset simulation and the Holmes-Diaconis-Ross algorithm with an analytical version of elliptical slice sampling, and (3) a semi-analytical method that expresses the integral, mean and covariance in terms of integrals of hyperrectangularly-truncated multivariate normal distributions, the latter of which are readily computed in modern mathematical and statistical packages. Strong agreement is demonstrated between all three approaches, but the most computationally efficient approach depends strongly both on implementation details and the dimension of the simplex-truncated multivariate normal distribution.

    This dataset consists of all code and results for the associated article.

  11. d

    Data from: Systematic shifts in the variation among host individuals must be...

    • datadryad.org
    • data.niaid.nih.gov
    zip
    Updated Nov 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joseph Mihaljevic; David Paez (2024). Systematic shifts in the variation among host individuals must be considered in climate-disease theory [Dataset]. http://doi.org/10.5061/dryad.f1vhhmh60
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 27, 2024
    Dataset provided by
    Dryad
    Authors
    Joseph Mihaljevic; David Paez
    Time period covered
    Nov 5, 2024
    Description

    R scripts and data used to generate figures and supplementary materials for manuscript.

  12. d

    Data from: Principles Governing Establishment versus Collapse of HIV-1...

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Jun 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jason Hataye; Joseph Casazza; Katharine Best; C. Jason Liang; Taina Immonen; David Ambrozak; Samuel Darko; Amy Henry; Farida Laboune; Frank Maldarelli; Daniel Douek; Nicolas Hengartner; Takuya Yamamoto; Brandon Keele; Alan Perelson; Richard Koup (2025). Principles Governing Establishment versus Collapse of HIV-1 Cellular Spread [Dataset]. http://doi.org/10.5061/dryad.wdbrv15j3
    Explore at:
    Dataset updated
    Jun 23, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Jason Hataye; Joseph Casazza; Katharine Best; C. Jason Liang; Taina Immonen; David Ambrozak; Samuel Darko; Amy Henry; Farida Laboune; Frank Maldarelli; Daniel Douek; Nicolas Hengartner; Takuya Yamamoto; Brandon Keele; Alan Perelson; Richard Koup
    Time period covered
    Jan 1, 2019
    Description

    A population at low census might go extinct, or instead transition into exponential growth to become firmly established. Whether this pivotal event occurs for a within-host pathogen can be the difference between health and illness. Here we define the principles governing whether HIV-1 spread among cells fails or becomes established, by coupling stochastic modeling with laboratory experiments. Following ex vivo activation of latently-infected CD4 T cells without de novo infection, stochastic cell division and death contributes to high variability in the magnitude of initial virus release. Transition to exponential HIV-1 spread often fails due to release of an insufficient amount of replication-competent virus. Establishment of exponential growth occurs when virus produced from multiple infected cells exceeds a critical population size. We quantitatively define the crucial transition to exponential viral spread. Thwarting this process would prevent HIV transmission or rebound from the lat...

  13. q

    2006-Ousmane_Mousa_Tessa-Mathematical model for control of measles by...

    • qubeshub.org
    Updated Apr 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ousmane Tessa (2023). 2006-Ousmane_Mousa_Tessa-Mathematical model for control of measles by vaccination [Dataset]. http://doi.org/10.25334/624G-E549
    Explore at:
    Dataset updated
    Apr 5, 2023
    Dataset provided by
    QUBES
    Authors
    Ousmane Tessa
    Description

    In this article, we use a compartmental mathematical model of the dynamics of measles spread within a population with variable size to provide this framework.

  14. Raw frequency data specifying the number and percentage of COVID-19 patients...

    • figshare.com
    xlsx
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joseph R. Larsen; Margaret R. Martin; John D. Martin; James B. Hicks; Peter Kuhn (2023). Raw frequency data specifying the number and percentage of COVID-19 patients that experienced discernible symptoms from clinical datasets from Hong Kong [23] and Brazil [24]. [Dataset]. http://doi.org/10.1371/journal.pcbi.1009629.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Joseph R. Larsen; Margaret R. Martin; John D. Martin; James B. Hicks; Peter Kuhn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These frequencies were used to simulate patients to find the likeliest paths of symptom onset for discernible symptoms of COVID-19. The dataset from Hong Kong contains 59 patients, and the dataset from Brazil contains at least 50,000 patients. (XLSX)

  15. d

    Replication Data for: Transient oral human cytomegalovirus infections...

    • dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mayer, Bryan (2023). Replication Data for: Transient oral human cytomegalovirus infections indicate inefficient viral spread from very few initially infected cells [Dataset]. http://doi.org/10.7910/DVN/XFXIFO
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Mayer, Bryan
    Description

    These are the results from 10000 simulations of the CMV stochastic ODE model. Replication code and analysis available on github at: https://github.com/bryanmayer/CMV-Transient-Infections

  16. q

    1-053-SlimeSpread-ModelingScenario

    • qubeshub.org
    Updated May 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brian Winkel (2022). 1-053-SlimeSpread-ModelingScenario [Dataset]. http://doi.org/10.25334/YX14-TK04
    Explore at:
    Dataset updated
    May 4, 2022
    Dataset provided by
    QUBES
    Authors
    Brian Winkel
    Description

    We offer a video showing real time spread of a cylinder of slime and challenge students to build a mathematical model for this phenomenon.

  17. q

    2014-Bozkurt-Peker-Mathematical modelling of HIV epidemic and stability...

    • qubeshub.org
    Updated Mar 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fatma Bozhut (2023). 2014-Bozkurt-Peker-Mathematical modelling of HIV epidemic and stability analysis [Dataset]. http://doi.org/10.25334/VX6J-Y280
    Explore at:
    Dataset updated
    Mar 31, 2023
    Dataset provided by
    QUBES
    Authors
    Fatma Bozhut
    Description

    A nonlinear mathematical model of differential equations with piecewise constant arguments is proposed. This model is analyzed by using the theory of both differential and difference equations to show the spread of HIV in a homogeneous population.

  18. Raw frequency data specifying the number and percentage of COVID-19 patients...

    • plos.figshare.com
    xlsx
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joseph R. Larsen; Margaret R. Martin; John D. Martin; James B. Hicks; Peter Kuhn (2023). Raw frequency data specifying the number and percentage of COVID-19 patients that experienced discernible symptoms from clinical datasets in Japan before [27] and after [28] D614G mutation. [Dataset]. http://doi.org/10.1371/journal.pcbi.1009629.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Joseph R. Larsen; Margaret R. Martin; John D. Martin; James B. Hicks; Peter Kuhn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Japan
    Description

    These frequencies were used to simulate patients to find the likeliest paths of symptom onset for discernible symptoms of COVID-19. The dataset from Japan before the outbreak of the D614G variant contains 244 patients, and the dataset from Japan after the outbreak of the D614G variant reports symptoms of 2,636 patients, except for cough, where only 2,634 of the patients were recorded. (XLSX)

  19. d

    Assessing the impact of mutations and NPI interventions for curbing Covid...

    • search.dataone.org
    Updated Dec 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prithwish Ghosh; Arindom Chakraborty; Meghna Banerjee (2023). Assessing the impact of mutations and NPI interventions for curbing Covid infection using renewal process [Dataset]. http://doi.org/10.7910/DVN/54L4UR
    Explore at:
    Dataset updated
    Dec 16, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Prithwish Ghosh; Arindom Chakraborty; Meghna Banerjee
    Description

    The COVID-19 pandemic, stemming from the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), initially emerged in December 2019 in Wuhan city, Hubei province, China, and swiftly spread worldwide. India, too, faced a significant impact from this viral disease. In response to the escalating number of infections and fatalities, and with the aim of ensuring the healthcare system's capacity to treat severe cases, the Government of India, like many other nations, implemented, or is in the process of implementing, measures to curb the spread of the virus.

  20. COVID-19 global forecast : SIR JHU TimeSeries fit

    • kaggle.com
    zip
    Updated Apr 5, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dmitry A. Grechka (2020). COVID-19 global forecast : SIR JHU TimeSeries fit [Dataset]. https://www.kaggle.com/datasets/dgrechka/covid19-global-forecast-sir-jhu-timeseries-fit
    Explore at:
    zip(15833213 bytes)Available download formats
    Dataset updated
    Apr 5, 2020
    Authors
    Dmitry A. Grechka
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is obsolete, superseded by this one and will not be updated anymore..

    Fitted on data points as on 3 April 2020

    Context

    This dataset is created as a part of covid-19 global forecasting challenge. It contains parameters for the SIR model for different locations worldwide.

    The model is defined as ODE system as follows: https://wikimedia.org/api/rest_v1/media/math/render/svg/29728a7d4bebe8197dca7d873d81b9dce954522e" alt="SIR ODE equations">

    The models are fitted on John Hopkins University data (time series) using several runs of Nelder-Mead simplex optimization method (best run is taken) starting at different initial locations and RMSE as a loss.

    What parameters are fitted (estimated) per country/province: * the day when the infection emerged in the country * the initial infected count on the first day of the infection * beta - an average number of contacts (sufficient to spread the disease) per day each infected individual has * gamma - fixed fraction of the infected group that will recover during any given day * R0 - how many susceptible people are infected (on average) by single infected individual. Equals beta/gamma * initial susceptible population (e.g. init suscept pop in the figures) - how many people are susceptible with regards to the quarantine measures at the modelled location

    How to read the figures. * points are real observed data provided by Johns Hopkins University * curves are model prediction

    • blue is susceptible population - people that are not yet infected but can get the infection
    • red is infected population
    • green is removed population (recovered or dead). people that are not susceptible any more as they came through the infection.

    Content

    The dataset contains 3 data portions:

    1. Fitted SIR model parameters for different locations worldwide.
    2. Figures that visually show how the fitted parameters match the data points.
    3. CSV files with prediction for one year in the future for each individual location.

    Warning

    Always do visual check of the model fit (per_location_figures directory) for quality control before start to use the corresponding parameter values in your analysis.

    Acknowledgements

    Thanks a lot Kaggle for organizing data sharing and challenges that make the world better.

    Also many thanks to John Hopkins University for their hard work of gathering COVID-19 statistics worldwide.

    Inspiration

    You can try to find correlation between model parameters (e.g. gamma - patient recovery rate) and other properties of the modelled locations worldwide (e.g. weather, population density, level of medical care, etc.)

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Kurunandan Jain; Krishnashree Achuthan (2025). Supplementary file 1_Modeling the dynamics of misinformation spread: a multi-scenario analysis incorporating user awareness and generative AI impact.zip [Dataset]. http://doi.org/10.3389/fcomp.2025.1570085.s001

Supplementary file 1_Modeling the dynamics of misinformation spread: a multi-scenario analysis incorporating user awareness and generative AI impact.zip

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Oct 3, 2025
Dataset provided by
Frontiers
Authors
Kurunandan Jain; Krishnashree Achuthan
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The proliferation of misinformation on social media threatens public trust, public health, and democratic processes. We propose three models that analyze fake news propagation and evaluate intervention strategies. Grounded in epidemiological dynamics, the models include: (1) a baseline Awareness Spread Model (ASM), (2) an Extended Model with fact-checking (EM), and (3) a Generative AI-Influenced Spread model (GIFS). Each incorporates user behavior, platform-specific dynamics, and cognitive biases such as confirmation bias and emotional contagion. We simulate six distinct scenarios: (1) Accurate Content Environment, (2) Peer Network Dynamics, (3) Emotional Engagement, (4) Belief Alignment, (5) Source Trust, and (6) Platform Intervention. All models converge to a single, stable equilibrium. Sensitivity analysis across key parameters confirms model robustness and generalizability. In the ASM, forwarding rates were lowest in scenarios 1, 4, and 6 (1.47%, 3.41%, 2.95%) and significantly higher in 2, 3, and 5 (19.67%, 56.52%, 29.47%). The EM showed that fact-checking reduced spread to as low as 0.73%, with scenario-based variation from 1.16 to 17.47%. The GIFS model revealed that generative AI amplified spread by 5.7%–37.8%, depending on context. ASM highlights the importance of awareness; EM demonstrates the effectiveness of fact-checking mechanisms; GIFS underscores the amplifying impact of generative AI tools. Early intervention, coupled with targeted platform moderation (scenarios 1, 4, 6), consistently yields the lowest misinformation spread, while emotionally resonant content (scenario 3) consistently drives the highest propagation.

Search
Clear search
Close search
Google apps
Main menu