95 datasets found
  1. Additional file 3 of Modelling count, bounded and skewed continuous outcomes...

    • springernature.figshare.com
    txt
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Akram; Ester Cerin; Karen E. Lamb; Simon R. White (2023). Additional file 3 of Modelling count, bounded and skewed continuous outcomes in physical activity research: beyond linear regression models [Dataset]. http://doi.org/10.6084/m9.figshare.22774300.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    figshare
    Authors
    Muhammad Akram; Ester Cerin; Karen E. Lamb; Simon R. White
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary Material 3: A supplementary file with examples of SAS script for all models that have been fitted in this paper.

  2. d

    Annual peak-flow data and PeakFQ output files for selected streamflow gaging...

    • catalog.data.gov
    • data.usgs.gov
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Annual peak-flow data and PeakFQ output files for selected streamflow gaging stations operated by the U.S. Geological Survey in the New England region that were used to estimate regional skewness of annual peak flows [Dataset]. https://catalog.data.gov/dataset/annual-peak-flow-data-and-peakfq-output-files-for-selected-streamflow-gaging-stations-oper
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    New England
    Description

    "NewEngland_pkflows.PRT" is a text file that contains results of flood-frequency analysis of annual peak flows from 186 selected streamflow gaging stations (streamgages) operated by the U.S. Geological Survey (USGS) in the New England region (Maine, Connecticut, Massachusetts, Rhode Island, New York, New Hampshire, and Vermont). Only streamgages in the region that were also in the USGS "GAGES II" database (https://water.usgs.gov/GIS/metadata/usgswrd/XML/gagesII_Sept2011.xml) were considered for use in the study. The file was generated by combining PeakFQ output (.PRT) files created using version 7.0 of USGS software PeakFQ (https://water.usgs.gov/software/PeakFQ/; Veilleux and others, 2014) to conduct flood-frequency analyses using the Expected Moments Algorithm (England and others, 2018). The peak-flow files used as input to PeakFQ were obtained from the USGS National Water Information System (NWIS) database (https://nwis.waterdata.usgs.gov/usa/nwis/peak) and contained annual peak flows ending in water year 2011. Results of the flood-frequency analyses were used to estimate skewness of annual peak flows in the New England region using Bayesian Weighted Least Squares / Bayesian Generalized Least Squares (B-WLS / B-GLS) regression (Veilleux and others, 2019).

  3. d

    Flood Region A

    • catalog.data.gov
    • data.usgs.gov
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Flood Region A [Dataset]. https://catalog.data.gov/dataset/flood-region-a
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    U.S. Geological Survey
    Description

    To improve flood-frequency estimates at rural streams in Mississippi, annual exceedance probability (AEP) flows at gaged streams in Mississippi and regional-regression equations, used to estimate annual exceedance probability flows for ungaged streams in Mississippi, were developed by using current geospatial data, additional statistical methods, and annual peak-flow data through the 2013 water year. The regional-regression equations were derived from statistical analyses of peak-flow data, basin characteristics associated with 281 streamgages, the generalized skew from Bulletin 17B (Interagency Advisory Committee on Water Data, 1982), and a newly developed study-specific skew for select four-digit hydrologic unit code (HUC4) watersheds in Mississippi. Four flood regions were identified based on residuals from the regional-regression analyses. No analysis was conducted for streams in the Mississippi Alluvial Plain flood region because of a lack of long-term streamflow data and poorly defined basin characteristics. Flood regions containing sites with similar basin and climatic characteristics yielded better regional-regression equations with lower error percentages. The generalized least squares method was used to develop the final regression models for each flood region for annual exceedance probability flows. The peak-flow statistics were estimated by fitting a log-Pearson type III distribution to records of annual peak flows and then applying two additional statistical methods: (1) the expected moments algorithm to help describe uncertainty in annual peak flows and to better represent missing and historical record; and (2) the generalized multiple Grubbs-Beck test to screen out potentially influential low outliers and to better fit the upper end of the peak-flow distribution. Standard errors of prediction of the generalized least-squares models ranged from 28 to 46 percent. Pseudo coefficients of determination of the models ranged from 91 to 96 percent. Flood Region A, located in north-central Mississippi, contained 27 streamgages with drainage areas that ranged from 1.41 to 612 square miles. The 1% annual exceedance probability had a standard error of prediction of 31 percent which was lower than the prediction errors in Flood Regions B and C.

  4. d

    Replication Data for: Accounting for Skewed or One-Sided Measurement Error...

    • dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Millimet, Daniel; Parmeter, Christopher (2023). Replication Data for: Accounting for Skewed or One-Sided Measurement Error in the Dependent Variable [Dataset]. http://doi.org/10.7910/DVN/IKSE2O
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Millimet, Daniel; Parmeter, Christopher
    Description

    While classical measurement error in the dependent variable in a linear regression framework results only in a loss of precision, nonclassical measurement error can lead to estimates which are biased and inference which lacks power. Here, we consider a particular type of nonclassical measurement error: skewed errors. Unfortunately, skewed measurement error is likely to be a relatively common feature of many out- comes of interest in political science research. This study highlights the bias that can result even from relatively "small" amounts of skewed measurement error, particularly if the measurement error is heteroskedastic. We also assess potential solutions to this problem, focusing on the stochastic frontier model and nonlinear least squares. Simulations and three replications highlight the importance of thinking carefully about skewed measurement error, as well as appropriate solutions.

  5. d

    Alternative technical efficiency measures: Skew, bias and scale (replication...

    • b2find.dkrz.de
    Updated Mar 11, 2010
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2010). Alternative technical efficiency measures: Skew, bias and scale (replication data) - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/15b90c3b-464e-57a7-9d9f-9cb4502aad3c
    Explore at:
    Dataset updated
    Mar 11, 2010
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the fixed-effects stochastic frontier model an efficiency measure relative to the best firm in the sample is universally employed. This paper considers a new measure relative to the worst firm in the sample. We find that estimates of this measure have smaller bias than those of the traditional measure when the sample consists of many firms near the efficient frontier. Moreover, a two-sided measure relative to both the best and the worst firms is proposed. Simulations suggest that the new measures may be preferred depending on the skewness of the inefficiency distribution and the scale of efficiency differences.

  6. f

    AUC of ROC curves of the simulation on data in skew-normal distribution.

    • plos.figshare.com
    xls
    Updated Jun 9, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yao Wang; Guang Sun; Zhaohua Ji; Chong Xing; Yanchun Liang (2023). AUC of ROC curves of the simulation on data in skew-normal distribution. [Dataset]. http://doi.org/10.1371/journal.pone.0029860.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 9, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yao Wang; Guang Sun; Zhaohua Ji; Chong Xing; Yanchun Liang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    AUC of ROC curves of the simulation on data in skew-normal distribution.

  7. Data from: SkewDB: A comprehensive database of GC and 10 other skews for...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    bin, bz2
    Updated Jun 5, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bert Hubert; Bert Hubert (2022). SkewDB: A comprehensive database of GC and 10 other skews for over 28,000 chromosomes and plasmids [Dataset]. http://doi.org/10.5061/dryad.g4f4qrfr6
    Explore at:
    bin, bz2Available download formats
    Dataset updated
    Jun 5, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Bert Hubert; Bert Hubert
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff's second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.

    Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew.
    This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included.

    The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.

  8. Skew-T Plots: Denver

    • data.ucar.edu
    image
    Updated Dec 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UCAR/NCAR - Earth Observing Laboratory (2024). Skew-T Plots: Denver [Dataset]. http://doi.org/10.26023/ZA9N-4BZ5-NM0J
    Explore at:
    imageAvailable download formats
    Dataset updated
    Dec 26, 2024
    Dataset provided by
    University Corporation for Atmospheric Research
    Authors
    UCAR/NCAR - Earth Observing Laboratory
    Time period covered
    Mar 17, 2010 - Apr 19, 2010
    Area covered
    Description

    This dataset contains upper air Skew-T Log-P charts taken at Denver during the HIPPO-3 project. The imagery are in GIF format. The imagery cover the time span from 2010-03-17 00:00:00 to 2010-04-19 12:00:00.

  9. Data from: Improving structured population models with more realistic...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    Updated Jun 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Megan L. Peterson; William Morris; Cristina Linares; Daniel Doak; Megan L. Peterson; William Morris; Cristina Linares; Daniel Doak (2022). Data from: Improving structured population models with more realistic representations of non-normal growth [Dataset]. http://doi.org/10.5061/dryad.t6c3573
    Explore at:
    Dataset updated
    Jun 1, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Megan L. Peterson; William Morris; Cristina Linares; Daniel Doak; Megan L. Peterson; William Morris; Cristina Linares; Daniel Doak
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description
    1. Structured population models are among the most widely used tools in ecology and evolution. Integral projection models (IPMs) use continuous representations of how survival, reproduction, and growth change as functions of state variables such as size, requiring fewer parameters to be estimated than projection matrix models (PPMs). Yet almost all published IPMs make an important assumption: that size-dependent growth transitions are or can be transformed to be normally distributed. In fact, many organisms exhibit highly skewed size transitions. Small individuals can grow more than they can shrink, and large individuals may often shrink more dramatically than they can grow. Yet the implications of such skew for inference from IPMs has not been explored, nor have general methods been developed to incorporate skewed size transitions into IPMs, or deal with other aspects of real growth rates, including bounds on possible growth or shrinkage. 2. Here we develop a flexible approach to modeling skewed growth data using a modified beta regression model. We propose that sizes first be converted to a (0,1) interval by estimating size-dependent minimum and maximum sizes through quantile regression. Transformed data can then be modeled using beta regression with widely available statistical tools. We demonstrate the utility of this approach using demographic data for a long-lived plant, gorgonians, and an epiphytic lichen. Specifically, we compare inferences of population parameters from discrete PPMs to those from IPMs that either assume normality or incorporate skew using beta regression or, alternatively, a skewed normal model. 3. The beta and skewed normal distributions accurately capture the mean, variance, and skew of real growth distributions. Incorporating skewed growth into IPMs decreases population growth and estimated lifespan relative to IPMs that assume normally-distributed growth, and more closely approximate the parameters of PPMs that do not assume a particular growth distribution. A bounded distribution, such as the beta, also avoids the eviction problem caused by predicting some growth outside the modeled size range. 4. Incorporating biologically relevant skew in growth data has important consequences for inference from IPMs. The approaches we outline here are flexible and easy to implement with existing statistical tools.
  10. Reaction times and other skewed distributions: problems with the mean and...

    • figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guillaume Rousselet; Rand Wilcox (2023). Reaction times and other skewed distributions: problems with the mean and the median [Dataset]. http://doi.org/10.6084/m9.figshare.6911924.v4
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Authors
    Guillaume Rousselet; Rand Wilcox
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Reproducibility package for the article:Reaction times and other skewed distributions: problems with the mean and the medianGuillaume A. Rousselet & Rand R. Wilcoxpreprint: https://psyarxiv.com/3y54rdoi: 10.31234/osf.io/3y54rThis package contains all the code and data to reproduce the figures and analyses in the article.

  11. d

    Data from: Selection on skewed characters and the paradox of stasis

    • datadryad.org
    • zenodo.org
    zip
    Updated Sep 8, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suzanne Bonamour; Céline Teplitsky; Anne Charmantier; Pierre-André Crochet; Luis-Miguel Chevin (2017). Selection on skewed characters and the paradox of stasis [Dataset]. http://doi.org/10.5061/dryad.pt07g
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 8, 2017
    Dataset provided by
    Dryad
    Authors
    Suzanne Bonamour; Céline Teplitsky; Anne Charmantier; Pierre-André Crochet; Luis-Miguel Chevin
    Time period covered
    2017
    Description

    Observed phenotypic responses to selection in the wild often differ from predictions based on measurements of selection and genetic variance. An overlooked hypothesis to explain this paradox of stasis is that a skewed phenotypic distribution affects natural selection and evolution. We show through mathematical modelling that, when a trait selected for an optimum phenotype has a skewed distribution, directional selection is detected even at evolutionary equilibrium, where it causes no change in the mean phenotype. When environmental effects are skewed, Lande and Arnold’s (1983) directional gradient is in the direction opposite to the skew. In contrast, skewed breeding values can displace the mean phenotype from the optimum, causing directional selection in the direction of the skew. These effects can be partitioned out using alternative selection estimates based on average derivatives of individual relative fitness, or additive genetic covariances between relative fitness and trait (Robe...

  12. Randomized Battery Usage 7: Low-Temperature Left-Skewed Random Walk

    • data.nasa.gov
    • data.staging.idas-ds1.appdat.jsc.nasa.gov
    • +1more
    application/rdfxml +5
    Updated Oct 19, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Randomized Battery Usage 7: Low-Temperature Left-Skewed Random Walk [Dataset]. https://data.nasa.gov/Raw-Data/Randomized-Battery-Usage-7-Low-Temperature-Left-Sk/sb48-rsbc
    Explore at:
    csv, json, xml, application/rdfxml, application/rssxml, tsvAvailable download formats
    Dataset updated
    Oct 19, 2022
    Dataset authored and provided by
    PCoE
    License

    https://www.usa.gov/government-workshttps://www.usa.gov/government-works

    Description

    This dataset is part of a series of datasets, where batteries are continuously cycled with randomly generated current profiles. Reference charging and discharging cycles are also performed after a fixed interval of randomized usage to provide reference benchmarks for battery state of health.

    In this dataset, four 18650 Li-ion batteries (Identified as RW13, RW14, RW15 and RW16) were continuously operated by repeatedly charging them to 4.2V and then discharging them to 3.2V using a randomized sequence of discharging currents between 0.5A and 5A. This type of discharging profile is referred to here as random walk (RW) discharging. A customized probability distribution is used in this experiment to select a new load setpoint every 1 minute during RW discharging operation. The custom probability distribution was designed to be skewed towards selecting lower currents.

  13. J

    Skewness Risk and Bond Prices (replication data)

    • journaldata.zbw.eu
    • jda-test.zbw.eu
    txt, zip
    Updated Dec 7, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francisco J. Ruge-Murcia; Francisco J. Ruge-Murcia (2022). Skewness Risk and Bond Prices (replication data) [Dataset]. http://doi.org/10.15456/jae.2022326.0702100188
    Explore at:
    txt(5000), zip(37213)Available download formats
    Dataset updated
    Dec 7, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Francisco J. Ruge-Murcia; Francisco J. Ruge-Murcia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This paper uses extreme value theory to study the implications of skewness risk for nominal loan contracts in a production economy. Productivity and inflation innovations are drawn from generalized extreme value distributions. The model is solved using a third-order perturbation and estimated by the simulated method of moments. Results show that the data reject the hypothesis that innovations are drawn from normal distributions and favor instead the alternative that they are drawn from asymmetric distributions. Estimates indicate that skewness risk accounts for 12% of the risk premia and reduces bond yields by approximately 55 basis points. For a bond that pays 1 dollar at maturity, the adjustment factor associated with skewness risk ranges from 0.15 cents for a 3?month bond to 2.05 cents for a 5?year bond.

  14. J

    CONDITIONALLY HETEROSKEDASTIC FACTOR MODELS WITH SKEWNESS AND LEVERAGE...

    • journaldata.zbw.eu
    • jda-test.zbw.eu
    pdf, txt
    Updated Dec 7, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prosper Dovonon; Prosper Dovonon (2022). CONDITIONALLY HETEROSKEDASTIC FACTOR MODELS WITH SKEWNESS AND LEVERAGE EFFECTS (replication data) [Dataset]. http://doi.org/10.15456/jae.2022321.0712481009
    Explore at:
    pdf(116501), txt(1780985), txt(1491)Available download formats
    Dataset updated
    Dec 7, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Prosper Dovonon; Prosper Dovonon
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Conditional heteroskedasticity, skewness and leverage effects are well-known features of financial returns. The literature on factor models has often made assumptions that preclude the three effects to occur simultaneously. In this paper I propose a conditionally heteroskedastic factor model that takes into account the presence of both the conditional skewness and leverage effects. This model is specified in terms of conditional moment restrictions and unconditional moment conditions are proposed allowing inference by the generalized method of moments (GMM). The model is also shown to be closed under temporal aggregation. An application to daily excess returns on sectorial indices from the UK stock market provides strong evidence for dynamic conditional skewness and leverage with a sharp efficiency gain resulting from accounting for both effects. The estimated volatilitypersistence from the proposed model is lower than that estimated from models that rule out such effects. I also find that the longer the returns' horizon, the fewer conditionally heteroskedastic factors may be required for suitable modeling and the less strong is the evidence for dynamic leverage. Some of these results are in line with the main findings of Harvey and Siddique (1999) and Jondeau and Rockinger (2003), namely that accounting for conditional skewness impacts the persistence in the conditional variance of the return process.

  15. Data from: Outgrowing the Procrustean Bed of Normality: The Utility of...

    • osf.io
    Updated Sep 6, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stephen R. Martin; Donald R. Williams (2017). Outgrowing the Procrustean Bed of Normality: The Utility of Bayesian Modeling for Asymmetrical Data Analysis [Dataset]. http://doi.org/10.17605/OSF.IO/PMH6G
    Explore at:
    Dataset updated
    Sep 6, 2017
    Dataset provided by
    Center for Open Sciencehttps://cos.io/
    Authors
    Stephen R. Martin; Donald R. Williams
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Psychological data often violate the normality assumptions made by commonly used statistical methods. These violations are addressed in a variety of ways such as transformations or assuming the employed method is robust to violations. Here we argue that data transformations are unnecessary at best and severely misleading at worst. An alternative approach is to use a Bayesian model that permits skewness and other perturbations to classical assumptions (e.g., heteroskedasticity). Through simulation, we demonstrate that a Bayesian skew-normal model has optimal frequentist properties (i.e., "type 1" error, "power", unbiasedness) compared to normal-assumptive models with or without transformation. Furthermore, the Bayesian skew-normal model has greater predictive utility, as indicated by posterior predictive checking and approximate leave-one-out cross-validation. After an applied example, we discuss practical implications of our findings for psychological science in general, and specifically how Bayesian modeling can improve reproducibility in psychology.

  16. Data from: Uneven missing data skew phylogenomic relationships within the...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Jul 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brian Smith; Brian Smith; William M. Mauck III; Brett W. Benz; Michael J. Andersen; William M. Mauck III; Brett W. Benz; Michael J. Andersen (2022). Data from: Uneven missing data skew phylogenomic relationships within the lories and lorikeets [Dataset]. http://doi.org/10.5061/dryad.n5tb2rbsp
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 21, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Brian Smith; Brian Smith; William M. Mauck III; Brett W. Benz; Michael J. Andersen; William M. Mauck III; Brett W. Benz; Michael J. Andersen
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Inlcuded is the supplementary data for Smith, B. T., Mauck, W. M., Benz, B., & Andersen, M. J. (2018). Uneven missing data skews phylogenomic relationships within the lories and lorikeets. BioRxiv, 398297.

    The resolution of the Tree of Life has accelerated with advances in DNA sequencing technology. To achieve dense taxon sampling, it is often necessary to obtain DNA from historical museum specimens to supplement modern genetic samples. However, DNA from historical material is generally degraded, which presents various challenges. In this study, we evaluated how the coverage at variant sites and missing data among historical and modern samples impacts phylogenomic inference. We explored these patterns in the brush-tongued parrots (lories and lorikeets) of Australasia by sampling ultraconserved elements in 105 taxa. Trees estimated with low coverage characters had several clades where relationships appeared to be influenced by whether the sample came from historical or modern specimens, which were not observed when more stringent filtering was applied. To assess if the topologies were affected by missing data, we performed an outlier analysis of sites and loci, and a data reduction approach where we excluded sites based on data completeness. Depending on the outlier test, 0.15% of total sites or 38% of loci were driving the topological differences among trees, and at these sites, historical samples had 10.9x more missing data than modern ones. In contrast, 70% data completeness was necessary to avoid spurious relationships. Predictive modeling found that outlier analysis scores were correlated with parsimony informative sites in the clades whose topologies changed the most by filtering. After accounting for biased loci and understanding the stability of relationships, we inferred a more robust phylogenetic hypothesis for lories and lorikeets.

  17. J

    Modelling dependence using skew t copulas: Bayesian inference and...

    • jda-test.zbw.eu
    • journaldata.zbw.eu
    txt, zip
    Updated Nov 4, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael S. Smith; Quan Gan; Robert Kohn; Michael S. Smith; Quan Gan; Robert Kohn (2022). Modelling dependence using skew t copulas: Bayesian inference and applications (replication data) [Dataset]. https://jda-test.zbw.eu/dataset/modelling-dependence-using-skew-t-copulas-bayesian-inference-and-applications
    Explore at:
    zip(76187), txt(2436)Available download formats
    Dataset updated
    Nov 4, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Michael S. Smith; Quan Gan; Robert Kohn; Michael S. Smith; Quan Gan; Robert Kohn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We construct a copula from the skew t distribution of Sahu et al. (2003). This copula can capture asymmetric and extreme dependence between variables, and is one of the few copulas that can do so and still be used in high dimensions effectively. However, it is difficult to estimate the copula model by maximum likelihood when the multivariate dimension is high, or when some or all of the marginal distributions are discrete-valued, or when the parameters in the marginal distributions and copula are estimated jointly. We therefore propose a Bayesian approach that overcomes all these problems. The computations are undertaken using a Markov chain Monte Carlo simulation method which exploits the conditionally Gaussian representation of the skew t distribution. We employ the approach in two contemporary econometric studies. The first is the modelling of regional spot prices in the Australian electricity market. Here, we observe complex non-Gaussian margins and nonlinear inter-regional dependence. Accurate characterization of this dependence is important for the study of market integration and risk management purposes. The second is the modelling of ordinal exposure measures for 15 major websites. Dependence between websites is important when measuring the impact of multi-site advertising campaigns. In both cases the skew t copula substantially outperforms symmetric elliptical copula alternatives, demonstrating that the skew t copula is a powerful modelling tool when coupled with Bayesian inference.

  18. o

    Data from: Decomposing phenotypic skew and its effects on the predicted...

    • osf.io
    Updated Mar 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joel L Pick; Jarrod Hadfield (2022). Decomposing phenotypic skew and its effects on the predicted response to strong selection [Dataset]. https://osf.io/7qyp4
    Explore at:
    Dataset updated
    Mar 2, 2022
    Dataset provided by
    Center For Open Science
    Authors
    Joel L Pick; Jarrod Hadfield
    Description

    No description was included in this Dataset collected from the OSF

  19. H

    Replication data for: Aggregate Skewness and the Business Cycle

    • dataverse.harvard.edu
    Updated Sep 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Iseringhausen; Ivan Petrella; Konstantinos Theodoridis (2023). Replication data for: Aggregate Skewness and the Business Cycle [Dataset]. http://doi.org/10.7910/DVN/W90WVL
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 14, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Martin Iseringhausen; Ivan Petrella; Konstantinos Theodoridis
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Review of Economics and Statistics: Forthcoming.

  20. f

    Table S1 - The Affective Impact of Financial Skewness on Neural Activity and...

    • plos.figshare.com
    doc
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Charlene C. Wu; Peter Bossaerts; Brian Knutson (2023). Table S1 - The Affective Impact of Financial Skewness on Neural Activity and Choice [Dataset]. http://doi.org/10.1371/journal.pone.0016838.s001
    Explore at:
    docAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Charlene C. Wu; Peter Bossaerts; Brian Knutson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Regressors of interest Z-scores and Talaraich coordinates for peak activation foci. Variance contrast compared high variance versus low variance gambles (High-Variance + Positive-Skew + Negative-Skew > Low-Variance). Skewness contrast compared skewed versus symmetric gambles of equal variance (Positive-Skew + Negative-Skew > High-Variance). Positive Skewness contrast compared positively skewed versus negatively skewed gambles (Positive-Skew > Negative-Skew). Regions surpassed threshold of Z>3.28 (p

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Muhammad Akram; Ester Cerin; Karen E. Lamb; Simon R. White (2023). Additional file 3 of Modelling count, bounded and skewed continuous outcomes in physical activity research: beyond linear regression models [Dataset]. http://doi.org/10.6084/m9.figshare.22774300.v1
Organization logo

Additional file 3 of Modelling count, bounded and skewed continuous outcomes in physical activity research: beyond linear regression models

Related Article
Explore at:
txtAvailable download formats
Dataset updated
Jun 2, 2023
Dataset provided by
figshare
Authors
Muhammad Akram; Ester Cerin; Karen E. Lamb; Simon R. White
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Supplementary Material 3: A supplementary file with examples of SAS script for all models that have been fitted in this paper.

Search
Clear search
Close search
Google apps
Main menu