100+ datasets found
  1. INTERVAL

    • web.dev.hdruk.cloud
    • healthdatagateway.org
    unknown
    Updated Aug 10, 2024
    Cite
    INTERVAL must be acknowledged in all publications using these data. Further details will be issued through the Data Access Committee. (2024). INTERVAL [Dataset]. https://web.dev.hdruk.cloud/dataset/201
    Explore at:
    Available download formats: unknown
    Dataset updated
    Aug 10, 2024
    Dataset authored and provided by
    INTERVAL must be acknowledged in all publications using these data. Further details will be issued through the Data Access Committee.
    License

    http://www.donorhealth-btru.nihr.ac.uk/wp-content/uploads/2020/04/Data-Access-Policy-v1.0-14Apr2020.pdf

    Description

    In over 100 years of blood donation practice, INTERVAL is the first randomised controlled trial to assess the impact of varying the frequency of blood donation on donor health and the blood supply. It provided policy-makers with evidence that collecting blood more frequently than current intervals can be implemented over two years without impacting on donor health, allowing better management of the supply to the NHS of units of blood with in-demand blood groups. INTERVAL was designed to deliver a multi-purpose strategy: an initial purpose related to blood donation research aiming to improve NHS Blood and Transplant’s core services and a longer-term purpose related to the creation of a comprehensive resource that will enable detailed studies of health-related questions.

    Approximately 50,000 generally healthy blood donors were recruited between June 2012 and June 2014 from 25 NHS Blood Donation centres across England. Approximately equal numbers of men and women; aged from 18-80; ~93% white ancestry. All participants completed brief online questionnaires at baseline and gave blood samples for research purposes. Participants were randomised to giving blood every 8/10/12 weeks (for men) and 12/14/16 weeks (for women) over a 2-year period. ~30,000 participants returned after 2 years and completed a brief online questionnaire and gave further blood samples for research purposes.

    The baseline questionnaire includes brief lifestyle information (smoking, alcohol consumption, etc), iron-related questions (e.g., red meat consumption), self-reported height and weight, etc. The SF-36 questionnaire was completed online at baseline and 2-years, with a 6-monthly SF-12 questionnaire between baseline and 2-years.

    All participants have had the Affymetrix Axiom UK Biobank genotyping array assayed and then imputed to 1000G+UK10K combined reference panel (80M variants in total). 4,000 participants have 50X whole-exome sequencing and 12,000 participants have 15X whole-genome sequencing. Whole-blood RNA sequencing has commenced in ~5,000 participants.

    The dataset also contains data on clinical chemistry biomarkers, blood cell traits, >200 lipoproteins, metabolomics (Metabolon HD4), lipidomics, and proteomics (SomaLogic, Olink), either cohort-wide or in large subsets of the cohort.

  2. Data from: A Statistical Inference Course Based on p-Values

    • figshare.com
    • tandf.figshare.com
    txt
    Updated May 30, 2023
    Cite
    Ryan Martin (2023). A Statistical Inference Course Based on p-Values [Dataset]. http://doi.org/10.6084/m9.figshare.3494549.v2
    Explore at:
    Available download formats: txt
    Dataset updated
    May 30, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Ryan Martin
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Introductory statistical inference texts and courses treat the point estimation, hypothesis testing, and interval estimation problems separately, with primary emphasis on large-sample approximations. Here, I present an alternative approach to teaching this course, built around p-values, emphasizing provably valid inference for all sample sizes. Details about computation and marginalization are also provided, with several illustrative examples, along with a course outline. Supplementary materials for this article are available online.

  3. Season and interval of burning and cattle exclusion in the southern Blue...

    • catalog.data.gov
    • agdatacommons.nal.usda.gov
    • +4more
    Updated Jun 21, 2023
    + more versions
    Cite
    U.S. Forest Service (2023). Season and interval of burning and cattle exclusion in the southern Blue Mountains, Oregon: Understory vegetation attributes [Dataset]. https://catalog.data.gov/dataset/season-and-interval-of-burning-and-cattle-exclusion-in-the-southern-blue-mountains-oregon--c0a88
    Explore at:
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    U.S. Department of Agriculture Forest Service (http://fs.fed.us/)
    Area covered
    Oregon, Blue Mountains
    Description

    These data document understory vegetation cover, richness and regeneration tree counts for a prescribed burning study with unburned controls on the Malheur National Forest in the southern Blue Mountains of Oregon. The original prescribed fires were conducted in the fall of 1997 and spring of 1998 and were repeated at two intervals, five and fifteen years. Five year interval reburns have been repeated three times (four burns total) and the fifteen year interval a single time (two burns total). Data include vegetation conditions prior to and following the last reburns and include understory vegetation cover; graminoid (grass and sedge) cover as well as leafing and flowering culm height, and flowering culm count data; shrub cover; conifer regeneration count data; presence/absence of all vascular species; and plant functional group information, descriptions and associated species.

  4. Evaluating interval forecasts of high-frequency financial data (replication...

    • b2find.dkrz.de
    Updated Oct 24, 2023
    + more versions
    Cite
    (2023). Evaluating interval forecasts of high-frequency financial data (replication data) - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/36f0a57c-230d-5f78-aa93-6ad36cd3651f
    Explore at:
    Dataset updated
    Oct 24, 2023
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A number of methods of evaluating the validity of interval forecasts of financial data are analysed, and illustrated using intraday FTSE100 index futures returns. Some existing interval forecast evaluation techniques, such as the Markov chain approach of Christoffersen (1998), are shown to be inappropriate in the presence of periodic heteroscedasticity. Instead, we consider a regression-based test, and a modified version of Christoffersen's Markov chain test for independence, and analyse their properties when the financial time series exhibit periodic volatility. These approaches lead to different conclusions when interval forecasts of FTSE100 index futures returns generated by various GARCH(1,1) and periodic GARCH(1,1) models are evaluated.

  5. Wind Generation Time Interval Exploration Data

    • data.ca.gov
    • data.cnra.ca.gov
    • +4more
    Updated Jan 19, 2024
    Cite
    California Energy Commission (2024). Wind Generation Time Interval Exploration Data [Dataset]. https://data.ca.gov/dataset/wind-generation-time-interval-exploration-data
    Explore at:
    Available download formats: zip, gpkg, gdb, arcgis geoservices rest api, kml, geojson, csv, html, xlsx, txt
    Dataset updated
    Jan 19, 2024
    Dataset authored and provided by
    California Energy Commission (http://www.energy.ca.gov/)
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the data set behind the Wind Generation Interactive Query Tool created by the CEC. The visualization tool interactively displays wind generation over different time intervals in three-dimensional space. The viewer can look across the state to understand generation patterns of regions with concentrations of wind power plants. The tool aids in understanding high and low periods of generation. Operation of the electric grid requires that generation and demand are balanced in each period.



    The height and color of columns at wind generation areas are scaled and shaded to represent capacity factors (CFs) of the areas in a specific time interval. Capacity factor is the ratio of the energy produced to the amount of energy that could ideally have been produced in the same period using the rated nameplate capacity. Due to natural variations in wind speeds, higher factors tend to be seen over short time periods, with lower factors over longer periods. The capacity used is the reported nameplate capacity from the Quarterly Fuel and Energy Report, CEC-1304A. CFs are based on wind plants in service in the wind generation areas.
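
    The capacity factor definition above can be illustrated with a minimal sketch. The function name, parameter names, and the example figures are assumptions made for illustration; they are not taken from the CEC tool or the CEC-1304A report.

```python
def capacity_factor(energy_mwh: float, nameplate_mw: float, hours: float) -> float:
    """Ratio of energy actually produced to the energy the rated nameplate
    capacity could ideally have produced over the same period."""
    if nameplate_mw <= 0 or hours <= 0:
        raise ValueError("nameplate_mw and hours must be positive")
    return energy_mwh / (nameplate_mw * hours)


# Hypothetical figures: a wind area with 100 MW of nameplate capacity
# that produced 840 MWh over a 24-hour interval.
cf_day = capacity_factor(energy_mwh=840.0, nameplate_mw=100.0, hours=24.0)
print(f"capacity factor: {cf_day:.2f}")
```

    Computing the same ratio over shorter or longer intervals is how the variation described above (higher factors over short periods, lower over long ones) would show up.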

    Renewable energy resources like wind facilities vary in size and geographic distribution within each state. Resource planning, land use constraints, climate zones, and weather patterns limit availability of these resources and where they can be developed. National, state, and local policies also set limits on energy generation and use. An example of resource planning in California is the Desert Renewable Energy Conservation Plan.

    By exploring the visualization, a viewer can gain a three-dimensional understanding of temporal variation in generation CFs, along with how the wind generation areas compare to one another. The viewer can observe that areas peak in generation in different periods. The large range in CFs is also visible.



  6. Data from: Season and interval of burning in the southern Blue Mountains,...

    • s.cnmilf.com
    • agdatacommons.nal.usda.gov
    • +6more
    Updated Jun 21, 2023
    + more versions
    Cite
    U.S. Forest Service (2023). Season and interval of burning in the southern Blue Mountains, Oregon: Surface fuels [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/season-and-interval-of-burning-in-the-southern-blue-mountains-oregon-surface-fuels-214c9
    Explore at:
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    U.S. Department of Agriculture Forest Service (http://fs.fed.us/)
    Description

    These data document surface fuels data for a prescribed burning study with unburned controls on the Malheur National Forest in the southern Blue Mountains of Oregon. The original prescribed fires were conducted in the fall of 1997 and spring of 1998 and were repeated at two intervals, five and fifteen years. Five year interval reburns have been repeated three times (four burns total) and the fifteen year interval a single time (two burns total). These data document fuels prior to (2012) and following the last reburns including 1-hour (0 to 0.64 centimeter [cm] diameter), 10-hour (0.64 to 2.54 cm diameter), 100-hour (2.54 to 7.62 cm diameter) and 1000-hour fuels (> 7.62 cm diameter); average combined litter and duff depth; and surface fuel height.
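
    The fuel size classes listed above can be expressed as a small lookup. This is a sketch only: the function name is hypothetical, and assigning a boundary diameter (for example exactly 0.64 cm) to the smaller class is an assumption, since the description does not say which class owns the boundary.

```python
def fuel_time_lag_class(diameter_cm: float) -> str:
    """Map a fuel particle diameter in centimeters to its time-lag class.

    Assumption: boundary values fall into the smaller class.
    """
    if diameter_cm < 0:
        raise ValueError("diameter_cm must be non-negative")
    if diameter_cm <= 0.64:
        return "1-hour"
    if diameter_cm <= 2.54:
        return "10-hour"
    if diameter_cm <= 7.62:
        return "100-hour"
    return "1000-hour"


# Hypothetical diameters, one per class.
classes = [fuel_time_lag_class(d) for d in (0.5, 1.0, 5.0, 10.0)]
print(classes)
```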

  7. Life expectancy, abridged life table, at birth and at age 65

    • www150.statcan.gc.ca
    • ouvert.canada.ca
    • +2more
    Updated Feb 20, 2017
    + more versions
    Cite
    Government of Canada, Statistics Canada (2017). Life expectancy, abridged life table, at birth and at age 65 [Dataset]. http://doi.org/10.25318/1310003201-eng
    Explore at:
    Dataset updated
    Feb 20, 2017
    Dataset provided by
    Statistics Canada (https://statcan.gc.ca/en)
    Area covered
    Canada
    Description

    Abridged life tables showing life expectancy at birth and at age 65, low 95% confidence interval, high 95% confidence interval, and coefficients of variation for life expectancy, by sex, 1990 to 2006.

  8. League of Legends Match Data at Various Time Intervals

    • zenodo.org
    • data.niaid.nih.gov
    csv
    Updated Aug 31, 2023
    Cite
    Jailson Barros da Silva Junior; Claudio Campelo (2023). League of Legends Match Data at Various Time Intervals [Dataset]. http://doi.org/10.5281/zenodo.8303397
    Explore at:
    Available download formats: csv
    Dataset updated
    Aug 31, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Jailson Barros da Silva Junior; Claudio Campelo
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset comprises comprehensive information from ranked matches played in the game League of Legends, spanning the time frame between January 12, 2023, and May 18, 2023. The matches cover a wide range of skill levels, specifically from the Iron tier to the Diamond tier.

    The dataset is structured based on time intervals, presenting game data at various percentages of elapsed game time, including 20%, 40%, 60%, 80%, and 100%. For each interval, detailed match statistics, player performance metrics, objective control, gold distribution, and other vital in-game information are provided.

    This collection of data not only offers insights into how matches evolve and strategies change over different phases of the game but also enables the exploration of player behavior and decision-making as matches progress. Researchers and analysts in the field of esports and game analytics will find this dataset valuable for studying trends, developing predictive models, and gaining a deeper understanding of the dynamics within ranked League of Legends matches across different skill tiers.

  9. Interval censored regression with fixed effects (replication data) - Dataset...

    • b2find.dkrz.de
    Updated Oct 24, 2023
    + more versions
    Cite
    (2023). Interval censored regression with fixed effects (replication data) - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/6a992bb4-c518-57ae-8eff-5f281a525cb3
    Explore at:
    Dataset updated
    Oct 24, 2023
    Description

    This paper considers identification and estimation of a fixed-effects model with an interval-censored dependent variable. In each time period, the researcher observes the interval (with known endpoints) in which the dependent variable lies but not the value of the dependent variable itself. Two versions of the model are considered: a parametric model with logistic errors and a semiparametric model with errors having an unspecified distribution. In both cases, the error disturbances can be heteroskedastic over cross-sectional units as long as they are stationary within a cross-sectional unit; the semiparametric model also allows for serial correlation of the error disturbances. A conditional-logit-type composite likelihood estimator is proposed for the logistic fixed-effects model, and a composite maximum-score-type estimator is proposed for the semiparametric model. In general, the scale of the coefficient parameters is identified by these estimators, meaning that the causal effects of interest are estimated directly in cases where the latent dependent variable is of primary interest (e.g., pure data-coding situations). Monte Carlo simulations and an empirical application to birthweight outcomes illustrate the performance of the parametric estimator.

  10. Clouds - height and coverage at a 10 minute interval

    • data.europa.eu
    • ckan.mobidatalab.eu
    • +4more
    Updated Nov 19, 2018
    Cite
    (2018). Clouds - height and coverage at a 10 minute interval [Dataset]. https://data.europa.eu/data/datasets/d789ceee-4998-4972-bedc-684d3b996991?locale=lt
    Explore at:
    Dataset updated
    Nov 19, 2018
    Description

    This dataset is constructed using measurements from cloud-height sensors and an algorithm for coverage. The dataset has not been validated, and missing values have not been filled in.

  11. Radiation - BSRN irradiance data at 1 minute interval at Cabauw

    • dataplatform.knmi.nl
    Updated Nov 12, 2009
    + more versions
    Cite
    dataplatform.knmi.nl (2009). Radiation - BSRN irradiance data at 1 minute interval at Cabauw [Dataset]. https://dataplatform.knmi.nl/dataset/cesar-bsrn-irraddown-la1-t1-v1-0
    Explore at:
    Dataset updated
    Nov 12, 2009
    Dataset provided by
    Royal Netherlands Meteorological Institute (http://www.knmi.nl/)
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Cabauw
    Description

    Dataset contains direct, diffuse, global and downward longwave irradiances at 60-second time resolution. Dataset also contains air temperature, relative humidity and air pressure at instrument height.

  12. 10,000 RR Interval Data (9500NAF & 500PAF) from 24 h Holter recordings used...

    • figshare.com
    zip
    Updated Dec 13, 2024
    Cite
    Fan Lin; Xiaoyun Yang; Peng Zhang (2024). 10,000 RR Interval Data (9500NAF & 500PAF) from 24 h Holter recordings used for atrial fibrillation detection [Dataset]. http://doi.org/10.6084/m9.figshare.28000112.v2
    Explore at:
    Available download formats: zip
    Dataset updated
    Dec 13, 2024
    Dataset provided by
    figshare
    Authors
    Fan Lin; Xiaoyun Yang; Peng Zhang
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This RR interval dataset is derived from 10,000 cases of 24-hour Holter monitoring data sampled at 128 Hz. Among the cases, 9,500 are labeled as non-atrial fibrillation (NAF), and 500 as paroxysmal atrial fibrillation (PAF). These data have been used in the article "Clinician-AI Collaboration: A Win-Win solution for Efficiency and Reliability in Atrial Fibrillation Diagnosis".

    The dataset, formatted as CSV files, consists of two columns:

    • rr_interval: the interval between consecutive R-peaks, measured in milliseconds.
    • label: categorical label for the beat, where 1 indicates AF, 0 indicates NAF, and -1 indicates noise or artifacts.

    Each case is named based on its category. NAF cases are labeled NAF0001.csv through NAF9500.csv, and PAF cases PAF0001.csv through PAF0500.csv. For any questions, please contact hustzp@hust.edu.cn.
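
    A minimal sketch of reading one case file in the two-column layout described above. The column names rr_interval and label and the label codes come from the description; the presence of a header row, the sample values, and the helper name summarize are assumptions for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical sample in the described layout; a header row is assumed.
SAMPLE = """rr_interval,label
812,0
790,0
455,1
1020,1
30,-1
"""

LABEL_NAMES = {1: "AF", 0: "NAF", -1: "noise"}


def summarize(csv_text: str) -> dict:
    """Count beats per label and average the RR interval over non-noise beats."""
    counts = Counter()
    clean_rr = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        label = int(row["label"])
        counts[LABEL_NAMES[label]] += 1
        if label != -1:  # skip beats flagged as noise or artifacts
            clean_rr.append(float(row["rr_interval"]))
    mean_rr = sum(clean_rr) / len(clean_rr) if clean_rr else float("nan")
    return {"counts": dict(counts), "mean_rr_ms": mean_rr}


summary = summarize(SAMPLE)
print(summary)
```

    For a real case file such as NAF0001.csv, the same function could be fed the file's text; whether those files actually carry a header row should be checked first.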

  13. Bowen tide gauge—predicted interval data

    • data.qld.gov.au
    • researchdata.edu.au
    • +1more
    csv
    Updated Jan 6, 2025
    + more versions
    Cite
    Transport and Main Roads (2025). Bowen tide gauge—predicted interval data [Dataset]. https://www.data.qld.gov.au/dataset/bowen-tide-gauge-predicted-interval-data
    Explore at:
    Available download formats: csv(1572864), csv
    Dataset updated
    Jan 6, 2025
    Authors
    Transport and Main Roads
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Bowen
    Description

    Predicted water level heights at Bowen at regular time intervals.

  14. Data from: Season and interval of burning and cattle exclusion in the...

    • catalog.data.gov
    • agdatacommons.nal.usda.gov
    • +5more
    Updated Jun 21, 2023
    + more versions
    Cite
    U.S. Forest Service (2023). Season and interval of burning and cattle exclusion in the southern Blue Mountains, Oregon: Environmental attributes [Dataset]. https://catalog.data.gov/dataset/season-and-interval-of-burning-and-cattle-exclusion-in-the-southern-blue-mountains-oregon--c5161
    Explore at:
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    U.S. Department of Agriculture Forest Service (http://fs.fed.us/)
    Area covered
    Oregon, Blue Mountains
    Description

    These data document environmental variables including overstory canopy cover, O horizon depth, ground cover and soils for a prescribed burning study with unburned controls on the Malheur National Forest in the southern Blue Mountains of Oregon. The original prescribed fires were conducted in the fall of 1997 and spring of 1998 and were repeated at two intervals, five and fifteen years. Five year interval reburns have been repeated three times (four burns total) and the fifteen year interval a single time (two burns total). Data include environmental conditions prior to and following the last reburns except for soils data which were collected prior to the last reburns only. Specifically, this data publication includes overstory tree canopy cover data from the 10-meter radius plots, ground cover data (litter, rock, bare soil and coarse woody debris) from the 1 x 1 meter quadrats, O horizon depth data from the 10-meter radius plots, and soils data (e.g. carbon, nitrogen and phosphorus concentrations, pH and bulk density) from 2012.

  15. Port Alma tide gauge—predicted interval data

    • data.qld.gov.au
    • data.wu.ac.at
    csv
    Updated Jan 6, 2025
    Cite
    Transport and Main Roads (2025). Port Alma tide gauge—predicted interval data [Dataset]. https://www.data.qld.gov.au/dataset/port-alma-tide-gauge-predicted-interval-data
    Explore at:
    Available download formats: csv(1048576), csv(1572864), csv
    Dataset updated
    Jan 6, 2025
    Authors
    Transport and Main Roads
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Port Alma
    Description

    Predicted water level heights at Port Alma at regular time intervals.

  16. High-Resolution Georeferenced Major Rivers Point Data, Spaced in 150m...

    • catalog.data.gov
    • s.cnmilf.com
    • +1more
    Updated Jun 15, 2024
    Cite
    Climate Adaptation Science Centers (2024). High-Resolution Georeferenced Major Rivers Point Data, Spaced in 150m intervals [Dataset]. https://catalog.data.gov/dataset/high-resolution-georeferenced-major-rivers-point-data-spaced-in-150m-intervals
    Explore at:
    Dataset updated
    Jun 15, 2024
    Dataset provided by
    Climate Adaptation Science Centers
    Description

    The Global River Points dataset is a high-resolution vector file geodatabase of 73 rivers world-wide. Each river is represented by a series of points spaced 150 meters apart and each point has attached environmental attributes extracted from multiple data sets. The attributes include physical information (slope, elevation, temperature, precipitation, river width and discharge) and landscape variables (human influence, fishing pressure, and organic load). The dataset also incorporates the river classification data from the Global River Reach Classifications GloRiC Version 1.0 dataset.

  17. Counts of Influenza reported in UNITED STATES OF AMERICA: 1919-1951

    • zenodo.org
    • data.niaid.nih.gov
    json, xml, zip
    Updated Jun 3, 2024
    + more versions
    Cite
    Willem Van Panhuis; Anne Cross; Donald Burke (2024). Counts of Influenza reported in UNITED STATES OF AMERICA: 1919-1951 [Dataset]. http://doi.org/10.25337/t7/ptycho.v2.0/us.6142004
    Explore at:
    Available download formats: json, xml, zip
    Dataset updated
    Jun 3, 2024
    Dataset provided by
    Project Tycho
    Authors
    Willem Van Panhuis; Anne Cross; Donald Burke
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Oct 26, 1919 - Dec 8, 1951
    Area covered
    United States
    Description

    Project Tycho datasets contain case counts for reported disease conditions for countries around the world. The Project Tycho data curation team extracts these case counts from various reputable sources, typically from national or international health authorities, such as the US Centers for Disease Control or the World Health Organization. These original data sources include both open- and restricted-access sources. For restricted-access sources, the Project Tycho team has obtained permission for redistribution from data contributors. All datasets contain case count data that are identical to counts published in the original source and no counts have been modified in any way by the Project Tycho team. The Project Tycho team has pre-processed datasets by adding new variables, such as standard disease and location identifiers, that improve data interpretability. We also formatted the data into a standard data format.

    Each Project Tycho dataset contains case counts for a specific condition (e.g. measles) and for a specific country (e.g. The United States). Case counts are reported per time interval. In addition to case counts, datasets include information about these counts (attributes), such as the location, age group, subpopulation, diagnostic certainty, place of acquisition, and the source from which we extracted case counts. One dataset can include many series of case count time intervals, such as "US measles cases as reported by CDC", or "US measles cases reported by WHO", or "US measles cases that originated abroad", etc.

    Depending on the intended use of a dataset, we recommend a few data processing steps before analysis:

    • Analyze missing data: Project Tycho datasets do not include time intervals for which no case count was reported (for many datasets, time series of case counts are incomplete, due to incompleteness of source documents) and users will need to add time intervals for which no count value is available. Project Tycho datasets do include time intervals for which a case count value of zero was reported.
    • Separate cumulative from non-cumulative time interval series. Case count time series in Project Tycho datasets can be "cumulative" or "fixed-intervals". Cumulative case count time series consist of overlapping case count intervals starting on the same date, but ending on different dates. For example, each interval in a cumulative count time series can start on January 1st, but end on January 7th, 14th, 21st, etc. It is common practice among public health agencies to report cases for cumulative time intervals. Case count series with fixed time intervals consist of mutually exclusive time intervals that all start and end on different dates and all have identical length (day, week, month, year). Given the different nature of these two types of case count data, we indicated this with an attribute for each count value, named "PartOfCumulativeCountSeries".
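
    The two processing steps above can be sketched as follows. Only the attribute name "PartOfCumulativeCountSeries" comes from the description; the other keys (start, end, count), the 0/1 integer encoding of the flag, the weekly interval length, and the sample records are assumptions made for illustration.

```python
from datetime import date, timedelta

FLAG = "PartOfCumulativeCountSeries"

# Hypothetical records; real files may use different field names and encodings.
records = [
    {"start": date(1920, 1, 4), "end": date(1920, 1, 10), "count": 12, FLAG: 0},
    {"start": date(1920, 1, 18), "end": date(1920, 1, 24), "count": 0, FLAG: 0},
    {"start": date(1920, 1, 1), "end": date(1920, 1, 7), "count": 5, FLAG: 1},
    {"start": date(1920, 1, 1), "end": date(1920, 1, 14), "count": 9, FLAG: 1},
]


def split_series(rows):
    """Separate fixed-interval rows from cumulative rows using the flag."""
    fixed = [r for r in rows if not r[FLAG]]
    cumulative = [r for r in rows if r[FLAG]]
    return fixed, cumulative


def fill_missing_weeks(fixed_rows):
    """Add explicit rows (count=None) for weekly intervals with no report.

    Reported zeros are kept as zeros; only absent intervals become None.
    """
    if not fixed_rows:
        return []
    by_start = {r["start"]: r for r in fixed_rows}
    current, last = min(by_start), max(by_start)
    filled = []
    while current <= last:
        row = by_start.get(current)
        if row is None:
            row = {"start": current, "end": current + timedelta(days=6),
                   "count": None, FLAG: 0}
        filled.append(row)
        current += timedelta(days=7)
    return filled


fixed, cumulative = split_series(records)
weekly = fill_missing_weeks(fixed)
print([(r["start"].isoformat(), r["count"]) for r in weekly])
```

    Keeping missing intervals as None rather than 0 preserves the distinction the description draws between "no count reported" and "a count of zero reported".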

  18. The database of indices computed from RR-intervals of length 512 of 46...

    • mostwiedzy.pl
    zip
    Updated Jun 24, 2021
    + more versions
    Cite
    Grzegorz Graff; Paweł Pilarczyk; Beata Graff (2021). The database of indices computed from RR-intervals of length 512 of 46 healthy subjects at rest [Dataset]. http://doi.org/10.34808/578y-0t55
    Explore at:
    Available download formats: zip(69247)
    Dataset updated
    Jun 24, 2021
    Authors
    Grzegorz Graff; Paweł Pilarczyk; Beata Graff
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    This dataset contains the data that was a basis for the results discussed in the paper “Persistent homology as a new method of the assessment of heart rate variability” by Grzegorz Graff, Beata Graff, Paweł Pilarczyk, Grzegorz Jabłoński, Dariusz Gąsecki, Krzysztof Narkiewicz, Plos One (2021), DOI: 10.1371/journal.pone.0253851.

  19. Data from: New Variable Selection Method Using Interval Segmentation Purity...

    • figshare.com
    • acs.figshare.com
    xls
    Updated Jun 1, 2023
    Cite
    Li-Juan Tang; Wen Du; Hai-Yan Fu; Jian-Hui Jiang; Hai-Long Wu; Guo-Li Shen; Ru-Qin Yu (2023). New Variable Selection Method Using Interval Segmentation Purity with Application to Blockwise Kernel Transform Support Vector Machine Classification of High-Dimensional Microarray Data [Dataset]. http://doi.org/10.1021/ci900032q.s001
    Explore at:
    Available download formats: xls
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    ACS Publications
    Authors
    Li-Juan Tang; Wen Du; Hai-Yan Fu; Jian-Hui Jiang; Hai-Long Wu; Guo-Li Shen; Ru-Qin Yu
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    One problem with discriminant analysis of microarray data is representation of each sample by a large number of genes that are possibly irrelevant, insignificant, or redundant. Methods of variable selection are, therefore, of great significance in microarray data analysis. A new method for key gene selection has been proposed on the basis of interval segmentation purity that is defined as the purity of samples belonging to a certain class in intervals segmented by a mode search algorithm. This method identifies key variables most discriminative for each class, which offers possibility of unraveling the biological implication of selected genes. A salient advantage of the new strategy over existing methods is the capability of selecting genes that, though possibly exhibit a multimodal distribution, are the most discriminative for the classes of interest, considering that the expression levels of some genes may reflect systematic difference in within-class samples derived from different pathogenic mechanisms. On the basis of the key genes selected for individual classes, a support vector machine with block-wise kernel transform is developed for the classification of different classes. The combination of the proposed gene mining approach with support vector machine is demonstrated in cancer classification using two public data sets. The results reveal that significant genes have been identified for each class, and the classification model shows satisfactory performance in training and prediction for both data sets.

  20. m

    Clustering Interval Time Series by Elizabeth Ann Maharaj, Paulo Teles, Paula...

    • bridges.monash.edu
    • researchdata.edu.au
    • +1more
    pdf
    Updated Jan 10, 2019
    Cite
    Elizabeth Ann Maharaj; Paulo Teles; Paula Brito (2019). Clustering Interval Time Series by Elizabeth Ann Maharaj, Paulo Teles, Paula Brito [Dataset]. http://doi.org/10.26180/5c372a47334a6
    Available download formats: pdf
    Dataset updated
    Jan 10, 2019
    Dataset provided by
    Monash University
    Authors
    Elizabeth Ann Maharaj; Paulo Teles; Paula Brito
    License

    Public Domain Mark 1.0: https://creativecommons.org/publicdomain/mark/1.0/
    License information was derived automatically

    Description

    Supplementary Material
    Data files
    Figures A1 - A8: Simulations Boxplots
    Figures B1 - B16: Application Dendrograms
    Software: R and Matlab

Cite
INTERVAL must be acknowledged in all publications using these data. Further details will be issued through the Data Access Committee. (2024). INTERVAL [Dataset]. https://web.dev.hdruk.cloud/dataset/201

INTERVAL

Available download formats: unknown
Dataset updated
Aug 10, 2024
Dataset authored and provided by
INTERVAL must be acknowledged in all publications using these data. Further details will be issued through the Data Access Committee.
License

http://www.donorhealth-btru.nihr.ac.uk/wp-content/uploads/2020/04/Data-Access-Policy-v1.0-14Apr2020.pdf

Description

In over 100 years of blood donation practice, INTERVAL is the first randomised controlled trial to assess the impact of varying the frequency of blood donation on donor health and the blood supply. It provided policy-makers with evidence that collecting blood more frequently than current intervals can be implemented over two years without impacting on donor health, allowing better management of the supply to the NHS of units of blood with in-demand blood groups. INTERVAL was designed to deliver a multi-purpose strategy: an initial purpose related to blood donation research aiming to improve NHS Blood and Transplant’s core services and a longer-term purpose related to the creation of a comprehensive resource that will enable detailed studies of health-related questions.

Approximately 50,000 generally healthy blood donors were recruited between June 2012 and June 2014 from 25 NHS Blood Donation centres across England. Approximately equal numbers of men and women; aged from 18-80; ~93% white ancestry. All participants completed brief online questionnaires at baseline and gave blood samples for research purposes. Participants were randomised to giving blood every 8/10/12 weeks (for men) and 12/14/16 weeks (for women) over a 2-year period. ~30,000 participants returned after 2 years and completed a brief online questionnaire and gave further blood samples for research purposes.

The baseline questionnaire includes brief lifestyle information (smoking, alcohol consumption, etc.), iron-related questions (e.g., red meat consumption), and self-reported height and weight. The SF-36 questionnaire was completed online at baseline and at 2 years, with an SF-12 questionnaire every 6 months in between.

All participants have been genotyped on the Affymetrix Axiom UK Biobank genotyping array, with imputation to the combined 1000G+UK10K reference panel (80M variants in total). 4,000 participants have 50X whole-exome sequencing and 12,000 participants have 15X whole-genome sequencing. Whole-blood RNA sequencing has commenced in ~5,000 participants.

The dataset also contains data on clinical chemistry biomarkers, blood cell traits, >200 lipoproteins, metabolomics (Metabolon HD4), lipidomics, and proteomics (SomaLogic, Olink), either cohort-wide or in large subsets of the cohort.
