67 datasets found
  1. d

    Replication Data for: Accounting for Skewed or One-Sided Measurement Error...

    • dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Millimet, Daniel; Parmeter, Christopher (2023). Replication Data for: Accounting for Skewed or One-Sided Measurement Error in the Dependent Variable [Dataset]. http://doi.org/10.7910/DVN/IKSE2O
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Millimet, Daniel; Parmeter, Christopher
    Description

    While classical measurement error in the dependent variable in a linear regression framework results only in a loss of precision, nonclassical measurement error can lead to estimates which are biased and inference which lacks power. Here, we consider a particular type of nonclassical measurement error: skewed errors. Unfortunately, skewed measurement error is likely to be a relatively common feature of many out- comes of interest in political science research. This study highlights the bias that can result even from relatively "small" amounts of skewed measurement error, particularly if the measurement error is heteroskedastic. We also assess potential solutions to this problem, focusing on the stochastic frontier model and nonlinear least squares. Simulations and three replications highlight the importance of thinking carefully about skewed measurement error, as well as appropriate solutions.

  2. U

    Annual peak-flow data and PeakFQ output files for selected streamflow gaging...

    • data.usgs.gov
    • catalog.data.gov
    Updated Feb 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Annual peak-flow data and PeakFQ output files for selected streamflow gaging stations operated by the U.S. Geological Survey in the New England region that were used to estimate regional skewness of annual peak flows [Dataset]. https://data.usgs.gov/datacatalog/data/USGS:5d0ba669e4b0e3d3116203d8
    Explore at:
    Dataset updated
    Feb 24, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Authors
    Daniel Wagner; Andrea Veilleux
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    Sep 30, 2011
    Area covered
    New England
    Description

    "NewEngland_pkflows.PRT" is a text file that contains results of flood-frequency analysis of annual peak flows from 186 selected streamflow gaging stations (streamgages) operated by the U.S. Geological Survey (USGS) in the New England region (Maine, Connecticut, Massachusetts, Rhode Island, New York, New Hampshire, and Vermont). Only streamgages in the region that were also in the USGS "GAGES II" database (https://water.usgs.gov/GIS/metadata/usgswrd/XML/gagesII_Sept2011.xml) were considered for use in the study. The file was generated by combining PeakFQ output (.PRT) files created using version 7.0 of USGS software PeakFQ (https://water.usgs.gov/software/PeakFQ/; Veilleux and others, 2014) to conduct flood-frequency analyses using the Expected Moments Algorithm (England and others, 2018). The peak-flow files used as input to PeakFQ were obtained from the USGS National Water Information System (NWIS) database (https://nwis.waterdata.usgs.gov/usa/nwis/peak) and contained annual ...

  3. Additional file 3 of Modelling count, bounded and skewed continuous outcomes...

    • springernature.figshare.com
    txt
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Akram; Ester Cerin; Karen E. Lamb; Simon R. White (2023). Additional file 3 of Modelling count, bounded and skewed continuous outcomes in physical activity research: beyond linear regression models [Dataset]. http://doi.org/10.6084/m9.figshare.22774300.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    figshare
    Authors
    Muhammad Akram; Ester Cerin; Karen E. Lamb; Simon R. White
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary Material 3: A supplementary file with examples of SAS script for all models that have been fitted in this paper.

  4. d

    Data from: Selection on skewed characters and the paradox of stasis

    • datadryad.org
    • zenodo.org
    zip
    Updated Sep 8, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suzanne Bonamour; Céline Teplitsky; Anne Charmantier; Pierre-André Crochet; Luis-Miguel Chevin (2017). Selection on skewed characters and the paradox of stasis [Dataset]. http://doi.org/10.5061/dryad.pt07g
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 8, 2017
    Dataset provided by
    Dryad
    Authors
    Suzanne Bonamour; Céline Teplitsky; Anne Charmantier; Pierre-André Crochet; Luis-Miguel Chevin
    Time period covered
    2017
    Description

    Observed phenotypic responses to selection in the wild often differ from predictions based on measurements of selection and genetic variance. An overlooked hypothesis to explain this paradox of stasis is that a skewed phenotypic distribution affects natural selection and evolution. We show through mathematical modelling that, when a trait selected for an optimum phenotype has a skewed distribution, directional selection is detected even at evolutionary equilibrium, where it causes no change in the mean phenotype. When environmental effects are skewed, Lande and Arnold’s (1983) directional gradient is in the direction opposite to the skew. In contrast, skewed breeding values can displace the mean phenotype from the optimum, causing directional selection in the direction of the skew. These effects can be partitioned out using alternative selection estimates based on average derivatives of individual relative fitness, or additive genetic covariances between relative fitness and trait (Robe...

  5. Randomized Battery Usage 4: 40C Right-Skewed Random Walk

    • data.nasa.gov
    • datasets.ai
    • +2more
    application/rdfxml +5
    Updated Oct 20, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Randomized Battery Usage 4: 40C Right-Skewed Random Walk [Dataset]. https://data.nasa.gov/Raw-Data/Randomized-Battery-Usage-4-40C-Right-Skewed-Random/gah6-q2es
    Explore at:
    xml, tsv, csv, application/rdfxml, json, application/rssxmlAvailable download formats
    Dataset updated
    Oct 20, 2022
    Dataset authored and provided by
    PCoE
    License

    https://www.usa.gov/government-workshttps://www.usa.gov/government-works

    Description

    This dataset is part of a series of datasets, where batteries are continuously cycled with randomly generated current profiles. Reference charging and discharging cycles are also performed after a fixed interval of randomized usage to provide reference benchmarks for battery state of health.

    In this dataset, four 18650 Li-ion batteries (Identified as RW25, RW26, RW27 and RW28) were continuously operated by repeatedly charging them to 4.2V and then discharging them to 3.2V using a randomized sequence of discharging currents between 0.5A and 5A. This type of discharging profile is referred to here as random walk (RW) discharging. A customized probability distribution is used in this experiment to select a new load setpoint every 1 minute during RW discharging operation. The custom probability distribution was designed to be skewed towards selecting higher currents. The ambient temperature at which the batteries are cycled was held at approximately 40C for these experiments.

  6. Reaction times and other skewed distributions: problems with the mean and...

    • figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guillaume Rousselet; Rand Wilcox (2023). Reaction times and other skewed distributions: problems with the mean and the median [Dataset]. http://doi.org/10.6084/m9.figshare.6911924.v4
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Authors
    Guillaume Rousselet; Rand Wilcox
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Reproducibility package for the article:Reaction times and other skewed distributions: problems with the mean and the medianGuillaume A. Rousselet & Rand R. Wilcoxpreprint: https://psyarxiv.com/3y54rdoi: 10.31234/osf.io/3y54rThis package contains all the code and data to reproduce the figures and analyses in the article.

  7. Experimental results for birth-death trees with the diameter of 2 and edge...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seunghwa Kang; Jijun Tang; Stephen W. Schaeffer; David A. Bader (2023). Experimental results for birth-death trees with the diameter of 2 and edge lengths in a skewed distribution. [Dataset]. http://doi.org/10.1371/journal.pone.0022483.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Seunghwa Kang; Jijun Tang; Stephen W. Schaeffer; David A. Bader
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We generate 10 model trees for a given number of genomes (). The number of false positives (FP), the number of false negatives (FN), and the execution time (time) in a cell are the average of the finished computations (finished: the number of finished computations within 24 hours) out of 10 trials using 10 different model trees. , , and in the tables are hours, minutes, and seconds, respectively. is the number of genes in a genome, which is 100 in our experiments.

  8. Data from: Improving structured population models with more realistic...

    • data.niaid.nih.gov
    • datadryad.org
    • +1more
    zip
    Updated Jun 14, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Megan L. Peterson; William Morris; Cristina Linares; Daniel Doak (2019). Improving structured population models with more realistic representations of non-normal growth [Dataset]. http://doi.org/10.5061/dryad.t6c3573
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 14, 2019
    Dataset provided by
    Duke University
    University of Colorado Boulder
    Universitat de Barcelona
    Authors
    Megan L. Peterson; William Morris; Cristina Linares; Daniel Doak
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Area covered
    NW Mediterranean Sea, Niwot Ridge, USA, Alaska, Kennicott Valley, Colorado
    Description
    1. Structured population models are among the most widely used tools in ecology and evolution. Integral projection models (IPMs) use continuous representations of how survival, reproduction, and growth change as functions of state variables such as size, requiring fewer parameters to be estimated than projection matrix models (PPMs). Yet almost all published IPMs make an important assumption: that size-dependent growth transitions are or can be transformed to be normally distributed. In fact, many organisms exhibit highly skewed size transitions. Small individuals can grow more than they can shrink, and large individuals may often shrink more dramatically than they can grow. Yet the implications of such skew for inference from IPMs has not been explored, nor have general methods been developed to incorporate skewed size transitions into IPMs, or deal with other aspects of real growth rates, including bounds on possible growth or shrinkage. 2. Here we develop a flexible approach to modeling skewed growth data using a modified beta regression model. We propose that sizes first be converted to a (0,1) interval by estimating size-dependent minimum and maximum sizes through quantile regression. Transformed data can then be modeled using beta regression with widely available statistical tools. We demonstrate the utility of this approach using demographic data for a long-lived plant, gorgonians, and an epiphytic lichen. Specifically, we compare inferences of population parameters from discrete PPMs to those from IPMs that either assume normality or incorporate skew using beta regression or, alternatively, a skewed normal model. 3. The beta and skewed normal distributions accurately capture the mean, variance, and skew of real growth distributions. Incorporating skewed growth into IPMs decreases population growth and estimated lifespan relative to IPMs that assume normally-distributed growth, and more closely approximate the parameters of PPMs that do not assume a particular growth distribution. A bounded distribution, such as the beta, also avoids the eviction problem caused by predicting some growth outside the modeled size range. 4. Incorporating biologically relevant skew in growth data has important consequences for inference from IPMs. The approaches we outline here are flexible and easy to implement with existing statistical tools.
  9. J

    Skewness Risk and Bond Prices (replication data)

    • journaldata.zbw.eu
    • jda-test.zbw.eu
    txt, zip
    Updated Dec 7, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francisco J. Ruge-Murcia; Francisco J. Ruge-Murcia (2022). Skewness Risk and Bond Prices (replication data) [Dataset]. http://doi.org/10.15456/jae.2022326.0702100188
    Explore at:
    txt(5000), zip(37213)Available download formats
    Dataset updated
    Dec 7, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Francisco J. Ruge-Murcia; Francisco J. Ruge-Murcia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This paper uses extreme value theory to study the implications of skewness risk for nominal loan contracts in a production economy. Productivity and inflation innovations are drawn from generalized extreme value distributions. The model is solved using a third-order perturbation and estimated by the simulated method of moments. Results show that the data reject the hypothesis that innovations are drawn from normal distributions and favor instead the alternative that they are drawn from asymmetric distributions. Estimates indicate that skewness risk accounts for 12% of the risk premia and reduces bond yields by approximately 55 basis points. For a bond that pays 1 dollar at maturity, the adjustment factor associated with skewness risk ranges from 0.15 cents for a 3?month bond to 2.05 cents for a 5?year bond.

  10. Data from: SkewDB: A comprehensive database of GC and 10 other skews for...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    bin, bz2
    Updated Jun 5, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bert Hubert; Bert Hubert (2022). SkewDB: A comprehensive database of GC and 10 other skews for over 28,000 chromosomes and plasmids [Dataset]. http://doi.org/10.5061/dryad.g4f4qrfr6
    Explore at:
    bin, bz2Available download formats
    Dataset updated
    Jun 5, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Bert Hubert; Bert Hubert
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff's second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand.

    Most bacteria also show the analogous TA skew. Different phyla show different kinds of skew and differing relations between TA and GC skew.
    This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 28,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included.

    The SkewDB database can be used to generate or verify hypotheses. Since the origins of both the second parity rule, as well as GC skew itself, are not yet satisfactorily explained, such a database may enhance our understanding of microbial DNA.

  11. Z

    Open Science for Social Sciences and Humanities: Open Access availability...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Aug 18, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seyedali Ghasempouri (2023). Open Science for Social Sciences and Humanities: Open Access availability and distribution across disciplines and Countries in OpenCitations Meta - RESULTS DATASET (with Mega Journals) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8250857
    Explore at:
    Dataset updated
    Aug 18, 2023
    Dataset provided by
    Maddalena Ghiotto
    Seyedali Ghasempouri
    Sebastiano Giacomini
    License

    Attribution 1.0 (CC BY 1.0)https://creativecommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    The dataset contains all the data produced running the research software for the study:"Open Science for Social Sciences and Humanities: Open Access availability and distribution across disciplines and Countries in OpenCitations Meta".

    Disclaimer: these results are not considered to be representative, because we have fount that Mega Journals skewed significantly some of the data. The result datasets without Mega Journals are published here.

    Description of datasets:

    SSH_Publications_in_OC_Meta_and_Open_Access_status.csv: containing information about OpenCitations Meta coverage of ERIH PLUS Journals as well as their Open Access availability. In this dataset, every row holds data for a Journal of ERIH PLUS also covered by OpenCitations Meta database. It is structured with the following columns: "EP_id", the internal ERIH PLUS identifier; "Publications_in_venue", the numbers of Publications counted in each venue; "OC_omid", the internal OpenCitations Meta identifier for the venue; "issn", numbers of publications in each venue; "Open Access", a value to represent if the journal is OA or not, either "True" or "Unknown".

    SSH_Publications_by_Discipline.csv: containing information about number of publications per discipline (in addition, number of journals per discipline are also included). The dataset has three columns, the first, labeled "Discipline", contains single disciplines of the ERIH classificaton, the second and the third, labeled "Journal_count" and "Publication_count", respectively, the number of Journals and the number of Publications counted for each discipline.

    SSH_Publications_and_Journals_by_Country: containing information about number of publications and journals per country. The dataset has three columns, the first, labeled "Country", contains single countries of the ERIH classificaton, the second and the third, labeled "Journal_count" and "Publication_count", respectively, the number of Journals and the number of Publications counted for each discipline.

    result_disciplines.json: the dictionary containing all disciplines as key and a list of related ERIH PLUS venue identifiers as value.

    result_countries.json: the dictionary containing all countries as key and a list of related ERIH PLUS venue identifiers as value.

    duplicate_omids.csv: a dataset containing the duplicated Journal entries in OpenCitations Meta, structured with two columns: "OC_omid", the internal OC Meta identifier; "issn", the issn values associated to that identifier

    eu_data.csv: contains the data specific for European countries' SSH Journals covered in OCMeta. It is structured with the following columns: "EP_id", the internal ERIH PLUS identifier; "Publications_in_venue", the numbers of Publications counted in each venue; "Original_Title", "Country_of_Publication","ERIH_PLUS_Disciplines", "disc_count", the number of disciplines per Journal.

    eu_disciplines_count.csv: containing information about number of publications per discipline and number of journals per discipline of european countries. The dataset has three columns, the first, labeled "Discipline", contains single disciplines of the ERIH classificaton, the second and the third, labeled "Journal_count" and "Publication_count", respectively, the number of Journals and the number of Publications counted for each discipline.

    meta_coverage_eu.csv: contains the data specific for European countries' SSH Journals covered in OCMeta. It is structured with the following columns: "EP_id", the internal ERIH PLUS identifier; "Publications_in_venue", the numbers of Publications counted in each venue; "OC_omid", the internal OpenCitations Meta identifier for the venue; "issn", numbers of publications in each venue; "Open Access", a value to represent if the journal is OA or not, either "True" or "Unknown".

    us_data.csv: contains the data specific for the United States' SSH Journals covered in OCMeta. It is structured with the following columns: "EP_id", the internal ERIH PLUS identifier; "Publications_in_venue", the numbers of Publications counted in each venue; "Original_Title", "Country_of_Publication","ERIH_PLUS_Disciplines", "disc_count", the number of disciplines per Journal.

    us_disciplines_count.csv: containing information about number of publications per discipline and number of journals per discipline of the United States. The dataset has three columns, the first, labeled "Discipline", contains single disciplines of the ERIH classificaton, the second and the third, labeled "Journal_count" and "Publication_count", respectively, the number of Journals and the number of Publications counted for each discipline.

    meta_coverage_us.csv: contains the data specific for the United States' SSH Journals covered in OCMeta. It is structured with the following columns: "EP_id", the internal ERIH PLUS identifier; "Publications_in_venue", the numbers of Publications counted in each venue; "OC_omid", the internal OpenCitations Meta identifier for the venue; "issn", numbers of publications in each venue; "Open Access", a value to represent if the journal is OA or not, either "True" or "Unknown".

    Abstract of the research:

    Purpose: this study aims to investigate the representation and distribution of Social Science and Humanities (SSH) journals within the OpenCitations Meta database, with a particular emphasis on their Open Access (OA) status, as well as their spread across different disciplines and countries. The underlying premise is that open infrastructures play a pivotal role in promoting transparency, reproducibility, and trust in scientific research. Study Design and Methodology: the study is grounded on the premise that open infrastructures are crucial for ensuring transparency, reproducibility, and fostering trust in scientific research. The research methodology involved the use of secondary data sources, namely the OpenCitations Meta database, the ERIH PLUS bibliographic index, and the DOAJ index. A custom research software was developed in Python to facilitate the processing and analysis of the data. Findings: the results reveal that 78.1% of SSH journals listed in the European Reference Index for the Humanities (ERIH-PLUS) are included in the OpenCitations Meta database. The discipline of Psychology has the highest number of publications. The United States and the United Kingdom are the leading contributors in terms of the number of publications. However, the study also uncovers that only 38% of the SSH journals in the OpenCitations Meta database are OA. Originality: this research adds to the existing body of knowledge by providing insights into the representation of SSH in open bibliographic databases and the role of open access in this domain. The study highlights the necessity for advocating OA practices within SSH and the significance of open data for bibliometric studies. It further encourages additional research into the impact of OA on various facets of citation patterns and the factors leading to disparity across disciplinary representation.

    Related resources:

    Ghasempouri S., Ghiotto M., & Giacomini S. (2023). Open Science for Social Sciences and Humanities: Open Access availability and distribution across disciplines and Countries in OpenCitations Meta - RESEARCH ARTICLE. https://doi.org/10.5281/zenodo.8263908

    Ghasempouri, S., Ghiotto, M., Giacomini, S., (2023). Open Science for Social Sciences and Humanities: Open Access availability and distribution across disciplines and Countries in OpenCitations Meta - DATA MANAGEMENT PLAN (Version 4). Zenodo. https://doi.org/10.5281/zenodo.8174644

    Ghasempouri, S., Ghiotto, M., Giacomini, S. (2023e). Open Science for Social Sciences and Humanities: Open Access availability and distribution across disciplines and Countries in OpenCitations Meta - PROTOCOL. V.5. (https://dx.doi.org/10.17504/protocols.io.5jyl8jo1rg2w/v5)

  12. d

    Data from: Sons do not take advantage of a head start: parity in herring...

    • search.dataone.org
    • explore.openaire.eu
    • +3more
    Updated Sep 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David N. Bonter; Michelle C. Moglia; Luke E. DeFisher (2023). Sons do not take advantage of a head start: parity in herring gull offspring sex ratios despite greater initial investment in males [Dataset]. http://doi.org/10.5061/dryad.1s2p7
    Explore at:
    Dataset updated
    Sep 12, 2023
    Dataset provided by
    Dryad Digital Repository
    Authors
    David N. Bonter; Michelle C. Moglia; Luke E. DeFisher
    Time period covered
    Jan 1, 2015
    Description

    Skewed adult sex ratios sometimes occur in populations of free-living animals yet the proximate mechanisms, timing of sex-biases, and the selective agents contributing to skew remain a source of debate with contradictory evidence from different systems. We investigated potential mechanisms contributing to sex biases in a population of herring gulls with an apparent female skew in the adult population. Theory predicts that skewed adult sex ratios will adaptively lead to skewed offspring sex ratios to restore balance in the effective breeding population. Parents may also adaptively bias offspring sex ratios to increase their own fitness in response to environmental factors. Therefore, we expected to detect skewed sex ratios either at hatching or at fledging as parents invest differentially in offspring of different sexes. We sampled complete clutches (n = 336 chicks) at hatching to quantify potential skews in sex ratios by position in the hatch order, time of season, year, and nesting con...

  13. Dealing with highly skewed hospital length of stay distributions: The use of...

    • plos.figshare.com
    docx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eva Williford; Valerie Haley; Louise-Anne McNutt; Victoria Lazariu (2023). Dealing with highly skewed hospital length of stay distributions: The use of Gamma mixture models to study delivery hospitalizations [Dataset]. http://doi.org/10.1371/journal.pone.0231825
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Eva Williford; Valerie Haley; Louise-Anne McNutt; Victoria Lazariu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The increased focus on addressing severe maternal morbidity and maternal mortality has led to studies investigating patient and hospital characteristics associated with longer hospital stays. Length of stay (LOS) for delivery hospitalizations has a strongly skewed distribution with the vast majority of LOS lasting two to three days in the United States. Prior studies typically focused on common LOSs and dealt with the long LOS distribution tail in ways to fit conventional statistical analyses (e.g., log transformation, trimming). This study demonstrates the use of Gamma mixture models to analyze the skewed LOS distribution. Gamma mixture models are flexible and, do not require data transformation or removal of outliers to accommodate many outcome distribution shapes, these models allow for the analysis of patients staying in the hospital for a longer time, which often includes those women experiencing worse outcomes. Random effects are included in the model to account for patients being treated within the same hospitals. Further, the role and influence of differing placements of covariates on the results is discussed in the context of distinct model specifications of the Gamma mixture regression model. The application of these models shows that they are robust to the placement of covariates and random effects. Using New York State data, the models showed that longer LOS for childbirth hospitalizations were more common in hospitals designated to accept more complicated deliveries, across hospital types, and among Black women. Primary insurance also was associated with LOS. Substantial variation between hospitals suggests the need to investigate protocols to standardize evidence-based medical care.

  14. o

    Data from: Memory for Positive Information during Skewed Decision Making

    • osf.io
    Updated Aug 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kendra Leigh Seaman; Colleen C Frank; Sharada Sateesh Pillai (2024). Memory for Positive Information during Skewed Decision Making [Dataset]. https://osf.io/jm3dx
    Explore at:
    Dataset updated
    Aug 15, 2024
    Dataset provided by
    Center For Open Science
    Authors
    Kendra Leigh Seaman; Colleen C Frank; Sharada Sateesh Pillai
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    One common strategy to test the positivity effect in emotional information processing is to examine memory for positive information. Participants with better recall or recognition of positive (compared to negative or neutral) information may have attended more closely to positive information during the task, leading to better encoding and retrieval. Memory probes are periodic trials in which participants are asked to recognize information from a previous trial and can be used to examine which specific information is attended to in a decision-making paradigm. Participants will complete a variant of the skewed gambling task to explore the role of memory for positive information in decision making. Interspersed among the 90 gambles will be 24 recognition probe trials. Participants will be asked in recognition memory probe trials if the amount (or probability) of a gain (or a loss) was part of the gamble on the previous trial. Probe trials will be distributed equally after positively- and negatively-skewed gambles and half of the probes will be correct and half will be incorrect. Thus, memory for each type of information (gain amount, loss amount, gain probability, or loss probability) will be probed 6 times.

  15. J

    CONDITIONALLY HETEROSKEDASTIC FACTOR MODELS WITH SKEWNESS AND LEVERAGE...

    • journaldata.zbw.eu
    • jda-test.zbw.eu
    pdf, txt
    Updated Dec 7, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prosper Dovonon; Prosper Dovonon (2022). CONDITIONALLY HETEROSKEDASTIC FACTOR MODELS WITH SKEWNESS AND LEVERAGE EFFECTS (replication data) [Dataset]. http://doi.org/10.15456/jae.2022321.0712481009
    Explore at:
    pdf(116501), txt(1780985), txt(1491)Available download formats
    Dataset updated
    Dec 7, 2022
    Dataset provided by
    ZBW - Leibniz Informationszentrum Wirtschaft
    Authors
    Prosper Dovonon; Prosper Dovonon
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Conditional heteroskedasticity, skewness and leverage effects are well-known features of financial returns. The literature on factor models has often made assumptions that preclude the three effects to occur simultaneously. In this paper I propose a conditionally heteroskedastic factor model that takes into account the presence of both the conditional skewness and leverage effects. This model is specified in terms of conditional moment restrictions and unconditional moment conditions are proposed allowing inference by the generalized method of moments (GMM). The model is also shown to be closed under temporal aggregation. An application to daily excess returns on sectorial indices from the UK stock market provides strong evidence for dynamic conditional skewness and leverage with a sharp efficiency gain resulting from accounting for both effects. The estimated volatilitypersistence from the proposed model is lower than that estimated from models that rule out such effects. I also find that the longer the returns' horizon, the fewer conditionally heteroskedastic factors may be required for suitable modeling and the less strong is the evidence for dynamic leverage. Some of these results are in line with the main findings of Harvey and Siddique (1999) and Jondeau and Rockinger (2003), namely that accounting for conditional skewness impacts the persistence in the conditional variance of the return process.

  16. U

    Annual peak-flow data and results of flood-frequency analysis for 76...

    • data.usgs.gov
    • catalog.data.gov
    Updated Sep 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Wagner; Jon Voss; Roger D.; David Heimann (2024). Annual peak-flow data and results of flood-frequency analysis for 76 selected streamflow gaging stations operated by the U.S. Geological Survey in the upper White River basin, Missouri and Arkansas, computed using an updated generalized (regional) flood skew [Dataset]. http://doi.org/10.5066/P9C3L7IN
    Explore at:
    Dataset updated
    Sep 3, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Authors
    Daniel Wagner; Jon Voss; Roger D.; David Heimann
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    1904 - 2020
    Area covered
    Arkansas, Missouri
    Description

    This dataset contains site information, basin characteristics, results of flood-frequency analysis, and a generalized (regional) flood skew for 76 selected streamgages operated by the U.S. Geological Survey (USGS) in the upper White River basin (4-digit hydrologic unit 1101) in southern Missouri and northern Arkansas. The Little Rock District U.S. Army Corps of Engineers (USACE) needed updated estimates of streamflows corresponding to selected annual exceedance probabilities (AEPs) and a basin-specific regional flood skew. USGS selected 111 candidate streamgages in the study area that had 20 or more years of gaged annual peak-flow data available through the 2020 water year. After screening for regulation, urbanization, redundant/nested basins, drainage areas greater than 2,500 square miles, and streamgage basins located in the Mississippi Alluvial Plain (8-digit hydrologic unit 11010013), 77 candidate streamgages remained. After conducting the initial flood-frequency analysis ...

  17. I

    Data from: Geographically skewed recruitment and COVID-19 seroprevalence...

    • data.niaid.nih.gov
    • immport.org
    url
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Geographically skewed recruitment and COVID-19 seroprevalence estimates: a cross-sectional serosurveillance study and mathematical modelling analysis [Dataset]. http://doi.org/10.21430/M3RQBC30KW
    Explore at:
    urlAvailable download formats
    Dataset updated
    Feb 29, 2024
    License

    https://www.immport.org/agreementhttps://www.immport.org/agreement

    Description

    Objectives: Convenience sampling is an imperfect but important tool for seroprevalence studies. For COVID-19, local geographic variation in cases or vaccination can confound studies that rely on the geographically skewed recruitment inherent to convenience sampling. The objectives of this study were: (1) quantifying how geographically skewed recruitment influences SARS-CoV-2 seroprevalence estimates obtained via convenience sampling and (2) developing new methods that employ Global Positioning System (GPS)-derived foot traffic data to measure and minimise bias and uncertainty due to geographically skewed recruitment. Design: We used data from a local convenience-sampled seroprevalence study to map the geographic distribution of study participants' reported home locations and compared this to the geographic distribution of reported COVID-19 cases across the study catchment area. Using a numerical simulation, we quantified bias and uncertainty in SARS-CoV-2 seroprevalence estimates obtained using different geographically skewed recruitment scenarios. We employed GPS-derived foot traffic data to estimate the geographic distribution of participants for different recruitment locations and used this data to identify recruitment locations that minimise bias and uncertainty in resulting seroprevalence estimates. Results: The geographic distribution of participants in convenience-sampled seroprevalence surveys can be strongly skewed towards individuals living near the study recruitment location. Uncertainty in seroprevalence estimates increased when neighbourhoods with higher disease burden or larger populations were undersampled. Failure to account for undersampling or oversampling across neighbourhoods also resulted in biased seroprevalence estimates. GPS-derived foot traffic data correlated with the geographic distribution of serosurveillance study participants. Conclusions: Local geographic variation in seropositivity is an important concern in SARS-CoV-2 serosurveillance studies that rely on geographically skewed recruitment strategies. Using GPS-derived foot traffic data to select recruitment sites and recording participants' home locations can improve study design and interpretation.

  18. Z

    Data from: Queen-worker ratio affects reproductive skew in a socially...

    • data.niaid.nih.gov
    Updated May 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Walter, Bartosz (2022). Data from: Queen-worker ratio affects reproductive skew in a socially polymorphic ant [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5013467
    Explore at:
    Dataset updated
    May 30, 2022
    Dataset provided by
    Heize, Jürgen
    Walter, Bartosz
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The partitioning of reproduction among individuals in communally breeding animals varies greatly among species, from the monopolization of reproduction (high reproductive skew) to similar contribution to the offspring in others (low skew). Reproductive skew models explain how relatedness or ecological constraints affect the magnitude of reproductive skew. They typically assume that individuals are capable of flexibly reacting to social and environmental changes. Most models predict a decrease of skew when benefits of staying in the group are reduced. In the ant Leptothorax acervorum, queens in colonies from marginal habitats form dominance hierarchies and only the top-ranking queen lays eggs ("functional monogyny"). In contrast, queens in colonies from extended coniferous forests throughout the Palaearctic rarely interact aggressively and all lay eggs ("polygyny"). An experimental increase of queen:worker ratios in colonies from low-skew populations elicits queen–queen aggression similar to that in functionally monogynous populations. Here, we show that this manipulation also results in increased reproductive inequalities among queens. Queens from natural overwintering colonies differed in the number of developing oocytes in their ovaries. These differences were greatly augmented in queens from colonies with increased queen:worker ratios relative to colonies with a low queen:worker ratio. As assumed by models of reproductive skew, L. acervorum colonies thus appear to be capable of flexibly adjusting reproductive skew to social conditions, yet in the opposite way than predicted by most models.

  19. Z

    Data from: Optimists or realists? How ants allocate resources in making...

    • data.niaid.nih.gov
    • zenodo.org
    Updated May 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enzmann, Brittany L. (2022). Data from: Optimists or realists? How ants allocate resources in making reproductive investments. [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4946732
    Explore at:
    Dataset updated
    May 28, 2022
    Dataset provided by
    Enzmann, Brittany L.
    Nonacs, Peter
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description
    1. Parents often face an investment trade-off between either producing many small or fewer large offspring. When environments vary predictably, the fittest parental solution matches available resources by varying only number of offspring and never optimal individual size. However when mismatches occur often between parental expectations and true resource levels, dynamic models like multifaceted parental investment (MFPI) and parental optimism (PO) both predict offspring size can vary significantly. MFPI is a "realist" strategy: parents assume future environments of average richness. When resources exceed expectations and it is too late to add more offspring, the best-case solution increases investment per individual. Brood size distributions therefore track the degree of mismatch from right-skewed around an optimal size (slight underestimation of resources), to left-skewed around a maximal size (gross underestimation). Conversely, PO is an "optimist" strategy: parents assume maximally good resource futures and match numbers to that situation. Normal or lean years do not affect "core" brood as costs primarily fall on excess "marginal" siblings who die or experience stunted growth (producing left-skewed distributions). 2. Investment patterns supportive of both MFPI and PO models have been observed in nature, but studies that directly manipulate food resources in order to test predictions are lacking. Ant colonies produce many offspring per reproductive cycle, and are amenable to experimental manipulation in ways that can differentiate between MFPI and PO investment strategies. 3. Colonies in a natural population of a harvester ant (Pogonomymex salinus) were protein-supplemented over two years and mature sexual offspring were collected annually prior to their nuptial flight. 4. Several results support either MFPI or PO in terms of patterns in offspring size distributions and how protein differentially affected male and female production. Unpredicted by either model, however, is that supplementation affected distributions more strongly across years than within (e.g., small females are significantly rarer in the year after colonies receive protein). 5. Parental investment strategies in P. salinus vary dynamically across years and conditions. Finding that past conditions can more strongly affect reproductive decisions than current ones, however, is not addressed by models of parental investment.
  20. o

    Data from: Decomposing phenotypic skew and its effects on the predicted...

    • osf.io
    Updated Mar 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joel L Pick; Jarrod Hadfield (2022). Decomposing phenotypic skew and its effects on the predicted response to strong selection [Dataset]. https://osf.io/7qyp4
    Explore at:
    Dataset updated
    Mar 2, 2022
    Dataset provided by
    Center For Open Science
    Authors
    Joel L Pick; Jarrod Hadfield
    Description

    No description was included in this Dataset collected from the OSF

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Millimet, Daniel; Parmeter, Christopher (2023). Replication Data for: Accounting for Skewed or One-Sided Measurement Error in the Dependent Variable [Dataset]. http://doi.org/10.7910/DVN/IKSE2O

Replication Data for: Accounting for Skewed or One-Sided Measurement Error in the Dependent Variable

Related Article
Explore at:
Dataset updated
Nov 22, 2023
Dataset provided by
Harvard Dataverse
Authors
Millimet, Daniel; Parmeter, Christopher
Description

While classical measurement error in the dependent variable in a linear regression framework results only in a loss of precision, nonclassical measurement error can lead to estimates which are biased and inference which lacks power. Here, we consider a particular type of nonclassical measurement error: skewed errors. Unfortunately, skewed measurement error is likely to be a relatively common feature of many out- comes of interest in political science research. This study highlights the bias that can result even from relatively "small" amounts of skewed measurement error, particularly if the measurement error is heteroskedastic. We also assess potential solutions to this problem, focusing on the stochastic frontier model and nonlinear least squares. Simulations and three replications highlight the importance of thinking carefully about skewed measurement error, as well as appropriate solutions.

Search
Clear search
Close search
Google apps
Main menu