100+ datasets found
  1. n

    Data from: Continuous-time spatially explicit capture-recapture models, with...

    • data.niaid.nih.gov
    • dataone.org
    • +2more
    zip
    Updated Apr 21, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rebecca Foster; Bart Harmsen; Lorenzo Milazzo; Greg Distiller; David Borchers (2014). Continuous-time spatially explicit capture-recapture models, with an application to a jaguar camera-trap survey [Dataset]. http://doi.org/10.5061/dryad.mg5kv
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 21, 2014
    Dataset provided by
    University of Cape Town
    University of Cambridge
    University of St Andrews
    University of Belize
    Authors
    Rebecca Foster; Bart Harmsen; Lorenzo Milazzo; Greg Distiller; David Borchers
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Area covered
    Belize, Cockscomb Basin Wildlife Sanctuary
    Description

    Many capture-recapture surveys of wildlife populations operate in continuous time but detections are typically aggregated into occasions for analysis, even when exact detection times are available. This discards information and introduces subjectivity, in the form of decisions about occasion definition. We develop a spatio-temporal Poisson process model for spatially explicit capture-recapture (SECR) surveys that operate continuously and record exact detection times. We show that, except in some special cases (including the case in which detection probability does not change within occasion), temporally aggregated data do not provide sufficient statistics for density and related parameters, and that when detection probability is constant over time our continuous-time (CT) model is equivalent to an existing model based on detection frequencies. We use the model to estimate jaguar density from a camera-trap survey and conduct a simulation study to investigate the properties of a CT estimator and discrete-occasion estimators with various levels of temporal aggregation. This includes investigation of the effect on the estimators of spatio-temporal correlation induced by animal movement. The CT estimator is found to be unbiased and more precise than discrete-occasion estimators based on binary capture data (rather than detection frequencies) when there is no spatio-temporal correlation. It is also found to be only slightly biased when there is correlation induced by animal movement, and to be more robust to inadequate detector spacing, while discrete-occasion estimators with binary data can be sensitive to occasion length, particularly in the presence of inadequate detector spacing. Our model includes as a special case a discrete-occasion estimator based on detection frequencies, and at the same time lays a foundation for the development of more sophisticated CT models and estimators. It allows modelling within-occasion changes in detectability, readily accommodates variation in detector effort, removes subjectivity associated with user-defined occasions, and fully utilises CT data. We identify a need for developing CT methods that incorporate spatio-temporal dependence in detections and see potential for CT models being combined with telemetry-based animal movement models to provide a richer inference framework.

  2. Poisson Distribution - Discrete Data

    • kaggle.com
    zip
    Updated Jan 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alberto Marini (2025). Poisson Distribution - Discrete Data [Dataset]. https://www.kaggle.com/datasets/albertomarini88/poisson-process
    Explore at:
    zip(36993 bytes)Available download formats
    Dataset updated
    Jan 30, 2025
    Authors
    Alberto Marini
    License

    https://www.usa.gov/government-works/https://www.usa.gov/government-works/

    Description

    The Poisson Process file concerns the solution of an exercise from the fourth module of the Statistics and Applied Data Analysis Specialization course at the University of Colorado Boulder that I took. In these notes, I intend to explain the most important steps.

  3. Mutual Information between Discrete and Continuous Data Sets

    • plos.figshare.com
    txt
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brian C. Ross (2023). Mutual Information between Discrete and Continuous Data Sets [Dataset]. http://doi.org/10.1371/journal.pone.0087357
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Brian C. Ross
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mutual information (MI) is a powerful method for detecting relationships between data sets. There are accurate methods for estimating MI that avoid problems with “binning” when both data sets are discrete or when both data sets are continuous. We present an accurate, non-binning MI estimator for the case of one discrete data set and one continuous data set. This case applies when measuring, for example, the relationship between base sequence and gene expression level, or the effect of a cancer drug on patient survival time. We also show how our method can be adapted to calculate the Jensen–Shannon divergence of two or more data sets.

  4. DISCRETE DATA of student marks

    • kaggle.com
    zip
    Updated Jul 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. H. Khalil (2023). DISCRETE DATA of student marks [Dataset]. https://www.kaggle.com/datasets/drhkhalil/discrete-data-of-student-marks
    Explore at:
    zip(8075 bytes)Available download formats
    Dataset updated
    Jul 3, 2023
    Authors
    Dr. H. Khalil
    Description

    Dataset

    This dataset was created by Dr. H. Khalil

    Contents

  5. Data from: Multilevel Modeling in the ‘Wide Format’ Approach with Discrete...

    • tandf.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M.T. Barendse; Y. Rosseel (2023). Multilevel Modeling in the ‘Wide Format’ Approach with Discrete Data: A Solution for Small Cluster Sizes [Dataset]. http://doi.org/10.6084/m9.figshare.11505561.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    M.T. Barendse; Y. Rosseel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In multilevel data, units at level 1 are nested in clusters at level 2, which in turn may be nested in even larger clusters at level 3, and so on. For continuous data, several authors have shown how to model multilevel data in a ‘wide’ or ‘multivariate’ format approach. We provide a general framework to analyze random intercept multilevel SEM in the ‘wide format’ (WF) and extend this approach for discrete data. In a simulation study, we vary response scale (binary, four response options), covariate presence (no, between-level, within-level), design (balanced, unbalanced), model misspecification (present, not present), and the number of clusters (small, large) to determine accuracy and efficiency of the estimated model parameters. With a small number of observations in a cluster, results indicate that the WF approach is a preferable approach to estimate multilevel data with discrete response options.

  6. d

    Hydrographic and chemical profile discrete data obtained during the R/V...

    • catalog.data.gov
    Updated Dec 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact) (2025). Hydrographic and chemical profile discrete data obtained during the R/V Meteor cruise M85/1 (EXPOCODE 06M320110624) in the North Atlantic Ocean from 2011-06-24 to 2011-08-02 (NCEI Accession 0170894) [Dataset]. https://catalog.data.gov/dataset/hydrographic-and-chemical-profile-discrete-data-obtained-during-the-r-v-meteor-cruise-m85-1-exp
    Explore at:
    Dataset updated
    Dec 2, 2025
    Dataset provided by
    (Point of Contact)
    Area covered
    Atlantic Ocean
    Description

    This dataset includes profile discrete measurements of temperature, salinity, oxygen and CFCs obtained During the R/V Meteor cruise M85/1 (EXPOCODE 06M320110624) in the North Atlantic Ocean from 2011-06-24 to 2011-08-02. R/V Meteor Cruise M85, leg 1, was funded by the German Federal Ministry of Education and Research (BMBF) as part of the cooperative research program "North Atlantic".

  7. n

    Data from: Discrete choice modelling of natal dispersal: "choosing" where to...

    • data.niaid.nih.gov
    • datadryad.org
    zip
    Updated Apr 29, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michalis Vardakis; Peter Goos; Frank Adriaensen; Erik Matthysen (2016). Discrete choice modelling of natal dispersal: "choosing" where to breed from a finite set of available areas [Dataset]. http://doi.org/10.5061/dryad.ft702
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 29, 2016
    Dataset provided by
    University of Antwerp
    Authors
    Michalis Vardakis; Peter Goos; Frank Adriaensen; Erik Matthysen
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Area covered
    Boshoek, Europe, Antwerp, Belgium, Flanders
    Description
    1. Classic natal dispersal studies focus mainly on distance travelled. Although distances capture some of the main selective pressures related to dispersal, this approach cannot easily incorporate the properties of the actual destination vs. the available alternatives. Recently, movement ecology studies have addressed questions on movement decisions in relation to availability of resources and/or availability of suitable habitats through the use of discrete choice models (DCMs), a widely used type of models within econometrics, which explains individual choices as a function of the properties of a finite number of alternatives. 2. In this contribution, we show how the dispersal discrete choice model (DDCM) can be used for analysing natal dispersal data in patchy environments given that the natal and the breeding area of the disperser are observed. We test this method using a case study on Great Tits (Parus major) in an archipelago of small woodlots. 3. Our results show that DDCMs are able to capture the results of classic distance-based approaches and simultaneously allow testing hypotheses on how departure and settlement are affected by variables that characterize the disperser, the natal patch and the breeding area, as well as their interactions. 4. DDCMs can be applied to any other species and system that uses some form of discrete breeding location or a certain degree of discretization can be applied.
  8. D

    Genomic Results Discrete Data Integration Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Genomic Results Discrete Data Integration Market Research Report 2033 [Dataset]. https://dataintelo.com/report/genomic-results-discrete-data-integration-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Oct 1, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Genomic Results Discrete Data Integration Market Outlook



    According to our latest research, the global Genomic Results Discrete Data Integration market size reached USD 2.18 billion in 2024, reflecting a robust expansion driven by the rapid adoption of precision medicine and the increasing integration of multi-omics data in healthcare and research. The market is projected to grow at a CAGR of 13.6% from 2025 to 2033, reaching an estimated USD 6.47 billion by 2033. This remarkable growth is primarily fueled by technological advancements in bioinformatics, an upsurge in clinical applications of genomics, and a growing demand for actionable insights from complex biological datasets.




    One of the primary growth factors propelling the Genomic Results Discrete Data Integration market is the exponential increase in genomic data generated by next-generation sequencing (NGS) technologies. As the cost of sequencing continues to decrease, the volume of genomic, transcriptomic, proteomic, and metabolomic data being produced is rising dramatically. This surge necessitates advanced data integration solutions capable of transforming raw, heterogeneous datasets into structured, clinically relevant information. The ability to harmonize and standardize disparate data sources is crucial for supporting clinical diagnostics, personalized medicine, and drug discovery, all of which rely on robust data integration platforms to drive informed decisions and improve patient outcomes.




    Another significant driver is the growing emphasis on personalized medicine and targeted therapeutics. Healthcare providers and pharmaceutical companies are increasingly leveraging discrete data integration platforms to correlate genomic variants with phenotypic outcomes, enabling more precise disease stratification and individualized treatment strategies. The integration of multi-omics data not only enhances the understanding of disease mechanisms but also accelerates the identification of novel therapeutic targets. This trend is further reinforced by regulatory agencies and reimbursement bodies that are placing greater value on the clinical utility of integrated genomic data, thereby incentivizing investments in advanced integration technologies.




    Furthermore, the adoption of cloud-based solutions and artificial intelligence (AI) in genomic data integration is revolutionizing the market landscape. Cloud platforms offer scalable storage, computational power, and collaborative environments, making it feasible for institutions of all sizes to process and analyze vast datasets efficiently. AI-driven analytics are enhancing the extraction of actionable insights from integrated data, supporting applications across clinical diagnostics, research, and drug development. The convergence of these technologies is not only improving the speed and accuracy of data interpretation but also expanding the accessibility of genomic insights to a broader range of end-users, including hospitals, research institutes, and biotechnology companies.




    Regionally, North America dominated the Genomic Results Discrete Data Integration market in 2024, accounting for the largest revenue share due to its advanced healthcare infrastructure, high adoption of precision medicine, and significant investments in genomics research. Europe followed closely, driven by strong government support and collaborative research initiatives. The Asia Pacific region is emerging as a high-growth market, propelled by increasing healthcare expenditure, expanding genomics research capabilities, and rising awareness of personalized medicine. Latin America and the Middle East & Africa are also witnessing gradual adoption, supported by international collaborations and capacity-building efforts. The regional outlook remains optimistic, with all major regions expected to contribute significantly to the market’s overall expansion through 2033.



    Component Analysis



    The Genomic Results Discrete Data Integration market by component is segmented into software, hardware, and services, each playing a pivotal role in enabling seamless integration and interpretation of complex biological data. Software solutions represent the largest share, driven by the need for sophisticated algorithms that can harmonize, standardize, and analyze multi-omics datasets. These platforms facilitate data interoperability, support regulatory compliance, and enable advanced analytics, making them indispensable for both clinical and research applications. Key sof

  9. G

    Genomic Results Discrete Data Integration Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Genomic Results Discrete Data Integration Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/genomic-results-discrete-data-integration-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Oct 7, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Genomic Results Discrete Data Integration Market Outlook



    As per our latest research, the global Genomic Results Discrete Data Integration market size reached USD 1.45 billion in 2024, demonstrating robust momentum driven by the increasing adoption of precision medicine and advanced data analytics in genomics. The market is projected to expand at a CAGR of 13.2% during the forecast period, reaching an estimated USD 4.14 billion by 2033. This impressive growth trajectory is fueled by the convergence of high-throughput sequencing technologies, the rising demand for integrated healthcare data, and the need for actionable insights from complex genomic datasets.



    A primary growth factor in the Genomic Results Discrete Data Integration market is the exponential rise in genomic data generation, propelled by advancements in next-generation sequencing (NGS) and other high-throughput technologies. As the cost of sequencing continues to decline, the volume of raw genomic data produced by research laboratories, clinical settings, and biopharmaceutical companies has surged. However, the true value of this data is only realized when disparate datasets—spanning genomics, transcriptomics, proteomics, and metabolomics—are seamlessly integrated and analyzed. The integration of discrete genomic results enables researchers and clinicians to uncover complex biological relationships, identify novel biomarkers, and support the development of targeted therapies, thus driving widespread adoption of data integration platforms and solutions.



    Another significant driver is the increasing focus on personalized medicine, which relies heavily on the integration of multi-omics data to tailor medical treatments to individual patients. Healthcare providers and pharmaceutical companies are leveraging integrated genomic data to stratify patient populations, predict disease susceptibility, and optimize therapeutic interventions. This shift toward data-driven healthcare is further supported by regulatory agencies encouraging the use of real-world evidence and integrated datasets for drug approval and post-market surveillance. Consequently, the demand for robust, scalable, and interoperable data integration solutions is surging, as stakeholders seek to harness the full potential of genomic and related datasets for clinical and research applications.



    Furthermore, the Genomic Results Discrete Data Integration market benefits from technological innovations in artificial intelligence (AI), machine learning (ML), and cloud computing. These technologies facilitate the efficient aggregation, harmonization, and analysis of massive and heterogeneous datasets, overcoming traditional barriers to data integration such as data silos, format inconsistencies, and security concerns. The adoption of AI-driven analytics and cloud-based integration platforms is accelerating, enabling real-time data sharing, collaborative research, and scalable storage solutions. These advancements are not only enhancing the accuracy and speed of data interpretation but also democratizing access to integrated genomic insights across diverse healthcare and research environments.



    From a regional perspective, North America continues to dominate the Genomic Results Discrete Data Integration market, accounting for the largest share in 2024, followed by Europe and Asia Pacific. The region’s leadership is attributed to its advanced healthcare infrastructure, significant investments in genomics research, and the presence of leading biopharmaceutical and technology companies. Meanwhile, Asia Pacific is emerging as the fastest-growing region, propelled by expanding genomic research initiatives, increasing healthcare expenditure, and government support for precision medicine. Europe also demonstrates steady growth, driven by collaborative research projects and strong regulatory frameworks supporting data integration. Latin America and Middle East & Africa represent nascent but promising markets, with growing awareness and gradual adoption of integrated genomic solutions.





    Component Analysis



    The Com

  10. d

    Data from: How to use discrete choice experiments to capture stakeholder...

    • search.dataone.org
    • data.niaid.nih.gov
    • +2more
    Updated Jul 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alan R. Ellis; Qiana R. Cryer-Coupet; Bridget E. Weller; Kirsten Howard; Rakhee Raghunandan; Kathleen C. Thomas (2025). How to use discrete choice experiments to capture stakeholder preferences in social work research [Dataset]. http://doi.org/10.5061/dryad.z612jm6m0
    Explore at:
    Dataset updated
    Jul 31, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Alan R. Ellis; Qiana R. Cryer-Coupet; Bridget E. Weller; Kirsten Howard; Rakhee Raghunandan; Kathleen C. Thomas
    Description

    The primary article (cited below under "Related works") introduces social work researchers to discrete choice experiments (DCEs) for studying stakeholder preferences. The article includes an online supplement with a worked example demonstrating DCE design and analysis with realistic simulated data. The worked example focuses on caregivers' priorities in choosing treatment for children with attention deficit hyperactivity disorder. This dataset includes the scripts (and, in some cases, Excel files) that we used to identify appropriate experimental designs, simulate population and sample data, estimate sample size requirements for the multinomial logit (MNL, also known as conditional logit) and random parameter logit (RPL) models, estimate parameters using the MNL and RPL models, and analyze attribute importance, willingness to pay, and predicted uptake. It also includes the associated data files (experimental designs, data generation parameters, simulated population data and parameters, ..., In the worked example, we used simulated data to examine caregiver preferences for 7 treatment attributes (medication administration, therapy location, school accommodation, caregiver behavior training, provider communication, provider specialty, and monthly out-of-pocket costs) identified by dosReis and colleagues in a previous DCE. We employed an orthogonal design with 1 continuous variable (cost) and 12 dummy-coded variables (representing the levels of the remaining attributes, which were categorical). Using the parameter estimates published by dosReis et al., with slight adaptations, we simulated utility values for a population of 100,000 people, then selected a sample of 500 for analysis. Relying on random utility theory, we used the mlogit package in R to estimate the MNL and RPL models, using 5,000 Halton draws for simulated maximum likelihood estimation of the RPL model. In addition to estimating the utility parameters, we measured the relative importance of each attribute, esti..., , # Data from: How to Use Discrete Choice Experiments to Capture Stakeholder Preferences in Social Work Research

    Access this dataset on Dryad

    This dataset supports the worked example in:

    Ellis, A. R., Cryer-Coupet, Q. R., Weller, B. E., Howard, K., Raghunandan, R., & Thomas, K. C. (2024). How to use discrete choice experiments to capture stakeholder preferences in social work research. Journal of the Society for Social Work and Research. Advance online publication. https://doi.org/10.1086/731310

    The referenced article introduces social work researchers to discrete choice experiments (DCEs) for studying stakeholder preferences. In a DCE, researchers ask participants to complete a series of choice tasks: hypothetical situations in which each participant is presented with alternative scenarios and selects one or more. For example, social work researchers may want to know how parents and other caregivers pr...

  11. Discrete Data of Alzheimer's OASIS

    • kaggle.com
    zip
    Updated Aug 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DANIYAL KHAN (2024). Discrete Data of Alzheimer's OASIS [Dataset]. https://www.kaggle.com/medaniyalkhan/discrete-data-of-alzheimers-oasis
    Explore at:
    zip(4415 bytes)Available download formats
    Dataset updated
    Aug 1, 2024
    Authors
    DANIYAL KHAN
    Description

    Dataset

    This dataset was created by DANIYAL KHAN

    Contents

  12. Environmental Monitoring Program: Discrete Water Quality Data

    • data.ca.gov
    • data.cnra.ca.gov
    • +1more
    csv, pdf
    Updated Jul 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Water Resources (2025). Environmental Monitoring Program: Discrete Water Quality Data [Dataset]. https://data.ca.gov/dataset/environmental-monitoring-program-discrete-water-quality-data
    Explore at:
    csv, pdfAvailable download formats
    Dataset updated
    Jul 2, 2025
    Dataset authored and provided by
    California Department of Water Resourceshttp://www.water.ca.gov/
    License

    Attribution-NonCommercial 2.0 (CC BY-NC 2.0)https://creativecommons.org/licenses/by-nc/2.0/
    License information was derived automatically

    Description

    The Interagency Ecological Program’s (IEP) Environmental Monitoring Program (EMP) was initiated in compliance with the Water Right Decision D-1379 (now mandated by Water Right Decision D-1641) and has monitored discrete water quality and nutrients in the upper San Francisco Estuary since 1975. The objectives of the EMP are to obtain consistent and accurate monthly data at established monitoring stations, provide and document information necessary to achieve compliance with salinity, flow, and dissolved oxygen standards, and to report this information for the purpose of management and conservation of the upper San Francisco Estuary. While the EMP also collects biological data, this dataset only includes the discrete water quality and nutrient data collected by the EMP from 1975-2021. Links to other EMP datasets can be found here.

    Data is also accessible via the Environmental Data Initiative.

  13. f

    Data from: Family-Wise Error Rate Controlling Procedures for Discrete Data

    • figshare.com
    application/gzip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yalin Zhu; Wenge Guo (2023). Family-Wise Error Rate Controlling Procedures for Discrete Data [Dataset]. http://doi.org/10.6084/m9.figshare.9545174.v2
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Yalin Zhu; Wenge Guo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In applications such as clinical safety analysis, the data of the experiments usually consist of frequency counts. In the analysis of such data, researchers often face the problem of multiple testing based on discrete test statistics, aimed at controlling family-wise error rate (FWER). Most existing FWER controlling procedures are developed for continuous data, which are often conservative when analyzing discrete data. By using minimal attainable p-values, several FWER controlling procedures have been specifically developed for discrete data in the literature. In this article, by using known marginal distributions of true null p-values, three more powerful stepwise procedures are developed, which are modified versions of the conventional Bonferroni, Holm and Hochberg procedures, respectively. It is shown that the first two procedures strongly control the FWER under arbitrary dependence and are more powerful than the existing Tarone-type procedures, while the last one only ensures control of the FWER in special settings. Through extensive simulation studies, we provide numerical evidence of superior performance of the proposed procedures in terms of the FWER control and minimal power. A real clinical safety data are used to demonstrate applications of our proposed procedures. An R package “MHTdiscrete” and a web application are developed for implementing the proposed procedures.

  14. d

    Data from: Harmonized discrete and continuous water quality data in support...

    • catalog.data.gov
    • data.usgs.gov
    • +2more
    Updated Nov 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Harmonized discrete and continuous water quality data in support of modeling harmful algal blooms in the Illinois River Basin, 2005 - 2020 [Dataset]. https://catalog.data.gov/dataset/harmonized-discrete-and-continuous-water-quality-data-in-support-of-modeling-harmful-2005-
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    Harmful algal blooms (HABs) are overgrowths of algae or cyanobacteria in water and can be harmful to humans and animals directly via toxin exposure or indirectly via changes in water quality and related impacts to ecosystems services, drinking water characteristics, and recreation. While HABs occur frequently throughout the United States, the driving conditions behind them are not well understood, especially in flowing waters. In order to facilitate future model development and characterization of HABs in the Illinois River Basin, this data release publishes a synthesized and cleaned collection of HABs-related water quality and quantity data for river and stream sites in the basin. It includes nutrients, major ions, sediment, physical properties, streamflow, chlorophyll and other types of water data. This data release contains files of harmonized data from the USGS National Water Information System (NWIS), the U.S. Army Corps of Engineers (USACE), the Illinois Environmental Protection Agency (IEPA), and a USGS Open File Report (OFR) containing toxin data in Illinois (Terrio and others, 2013: https://pubs.usgs.gov/of/2013/1019/pdf/ofr2013-1019.pdf). Both discrete data and continuous sensor data for 142 parameters (44 of which returned data) between October 1, 2015 and December 31, 2022 were downloaded from NWIS programmatically. All data were harmonized into a shared format (see files named data_{parameter_group}combined.csv). The USGS NWIS data went through additional cleaning and were also grouped by generic parameters (see pcode_group_xwalk.csv to see what parameter codes are mapped to which generic parameters). Any data not from USGS NWIS were kept outside of the parameter grouping files. Additional streamflow data for select locations was retrieved from the USACE and are available in data_usace_00060_combined.csv. Additional algal toxin data provided by the IEPA and in a USGS OFR report (Terrio and others, 2013), which include some lake sites, are available in data_algaltoxins_combined.csv. We also provide collapsed datasets of daily metrics for each water quality (“generic parameter”) group of USGS NWIS data (files named daily_metrics{parameter_group}.csv). Lastly, we include a site_metadata.csv containing site identification and location information for all sites with water quality and quantity data, and mappings to the National Hydrography Dataset flowlines where available. This work was completed as part of the USGS Proxies Project, an effort supported by the Water Mission Area (WMA) Water Quality Processes program to develop estimation methods for PFAS, harmful algal blooms, and metals, at multiple spatial and temporal scales.

  15. d

    Discrete sample surface-water data for the Sacramento-San Joaquin River...

    • catalog.data.gov
    • data.usgs.gov
    Updated Nov 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Discrete sample surface-water data for the Sacramento-San Joaquin River Delta [Dataset]. https://catalog.data.gov/dataset/discrete-sample-surface-water-data-for-the-sacramento-san-joaquin-river-delta
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    San Joaquin River, Sacramento-San Joaquin Delta
    Description

    The goal of this study was to develop a suite of inter-related water quality monitoring approaches capable of modeling and estimating the spatial and temporal gradients of particulate and dissolved total mercury (THg) concentration, and particulate and dissolved methyl mercury (MeHg), concentration, in surface waters across the Sacramento / San Joaquin River Delta (SSJRD). This suite of monitoring approaches included: a) data collection at fixed continuous monitoring stations (CMS) outfitted with in-situ sensors, b) spatial mapping using boat-mounted flow-through sensors, and c) satellite-based remote sensing. The focus of this specific Child Page is to present all field and laboratory-based data associated with discrete surface water samples collected as part of the CMS and boat mapping components of the study. The data provided in the table herein constitute a collection of field-based and laboratory-based measurements that coincide with the timestamps of samples collected at 33 sites across the Delta. Laboratory-based measurements presented herein were conducted by the U.S. Geological Survey (USGS) Organic Matter Research Laboratory (OMRL) in Sacramento, CA, the USGS Earths System Processes Division (ESPD) microbial biogeochemistry laboratory in Menlo Park, CA, the USGS Reston Stable Isotope Laboratory (RSIL) in Reston, VA and the USGS National Water Quality Laboratory (NWQL) in Denver, CO. The machine-readable (comma separated value, *.csv) file presented herein includes laboratory-based measurements for discrete samples collected from 33 established field sites (sampled repeatedly). In addition, field-based sensor data from continuous measurement platforms (CMS locations or as part of the mapping boat flow-through system) are also included in this discrete sample dataset by ensuring that the field sensor measurements were both spatially and temporally coincident with the physically discrete water sample collected for laboratory analysis.

  16. d

    Data from: Discrete and high frequency water quality data for Allequash...

    • datasets.ai
    • data.usgs.gov
    • +1more
    55
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of the Interior (2023). Discrete and high frequency water quality data for Allequash Creek, Wisconsin, WY 2019-2021 [Dataset]. https://datasets.ai/datasets/discrete-and-high-frequency-water-quality-data-for-allequash-creek-wisconsin-wy-2019-2021
    Explore at:
    55Available download formats
    Dataset updated
    Jun 1, 2023
    Dataset authored and provided by
    Department of the Interior
    Area covered
    Wisconsin, Allequash Creek
    Description

    This data set is a compilation of discrete and high frequency water quality data from sites on Allequash Creek in Wisconsin, and within the Allequash Creek watershed, for the water years (WY) 2019-2021.

  17. d

    Data from: Discrete and high-frequency chloride (Cl) and specific...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Oct 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Discrete and high-frequency chloride (Cl) and specific conductance (SC) data sets and Cl-SC regression equations used for analysis of 93 USGS water quality monitoring stations in the eastern United States [Dataset]. https://catalog.data.gov/dataset/discrete-and-high-frequency-chloride-cl-and-specific-conductance-sc-data-sets-and-cl-sc-re
    Explore at:
    Dataset updated
    Oct 30, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    United States
    Description

    High frequency estimated chloride (Cl) and observed specific conductance (SC) data sets, along with response variables derived from those data sets, were used in an analysis to quantify the extent to which deicer applications in winter affect water quality in 93 U.S. Geological Survey water quality monitoring stations across the eastern United States. The analysis was documented in the following publication: Moore, J., R. Fanelli, and A. Sekellick. In review. High-frequency data reveal deicing salts drive elevated conductivity and chloride along with pervasive and frequent exceedances of the EPA aquatic life criteria for chloride in urban streams. Submitted to Environmental Science and Technology. This data release contains five child items: 1) Input datasets of discrete specific conductance (SC) and chloride (Cl) observations used to develop regression models describing the relationship between chloride and SC 2) The predicted chloride concentrations generated by applying the sites-specific and regional regression models to high-frequency SC datasets 3) The regression equations for 56 USGS water quality monitoring stations across the eastern Unite States, as well as three regions 4) Response variables describing temporal patterns in SC and chloride, calculated by using the estimated high-frequency chloride time series datasets and high-frequency SC datasets 5) Watershed characteristics describing the land use, geology, climate, and deicer application rates for the 93 watersheds included in the Moore et. al (in review) study.

  18. U

    Discrete and daily-aligned groundwater levels, metadata, and other...

    • data.usgs.gov
    • s.cnmilf.com
    • +1more
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Angela Robinson; Erik Wojtylko; William Asquith; Ronald Seanor; Courtney Killian; Virginia McGuire, Discrete and daily-aligned groundwater levels, metadata, and other attributes useful for statistical modeling for the Mississippi River Valley Alluvial aquifer, Mississippi Alluvial Plain, 1980–2019 [Dataset]. http://doi.org/10.5066/P9O3XGBK
    Explore at:
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Authors
    Angela Robinson; Erik Wojtylko; William Asquith; Ronald Seanor; Courtney Killian; Virginia McGuire
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    Jan 1, 1980 - Dec 31, 2019
    Area covered
    Mississippi River, Mississippi River Alluvial Plain
    Description

    A combination of discrete and daily-aligned groundwater levels for the Mississippi River Valley alluvial aquifer clipped to the Mississippi Alluvial Plain, as defined by Painter and Westerman (2018), with corresponding metadata are based on processing of U.S. Geological Survey National Water Information System (NWIS) (U.S. Geological Survey, 2020) data. The processing was made after retrieval using aggregation and filtering through the infoGW2visGWDB software (Asquith and Seanor, 2019). The nomenclature GWmaster mimics that of the output from infoGW2visGWDB. Two separate data retrievals for NWIS were made. First, the discrete data were retrieved, and second, continuous records from recorder sites with daily-mean or other daily statistics codes were retrieved. Each dataset was separately passed through the infoGW2visGWDB software to create a "GWmaster discrete" and "GWmaster continuous" and these tables were combined and then sorted on the site identifier and date to form the data ...

  19. Analyzing Music with Discrete Mathematics

    • figshare.com
    pdf
    Updated Aug 29, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jorge Blanco (2018). Analyzing Music with Discrete Mathematics [Dataset]. http://doi.org/10.6084/m9.figshare.7021478.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Aug 29, 2018
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Jorge Blanco
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    his project intends to find a correlation between the songs "Moonlight Sonata" and "Because" with the use of discrete mathematics. With music theory, the field of using math to analyze music, lacking exploration, our primary goal was to draw conclusions from what was found: analyze the data collected to better understand the behind the scenes mathematics in music. This was done by translating the sheet music into numbers using modular arithmetic (Mod 12) system; then, the numbers collected were classified into pitch class sets, normal form, and prime form. We also used matrices to test for the property of invariance. After thorough examination, it was clear that the songs were very similar in nature, which fundamentally demonstrates that a mathematical interpretation can be used to prove what is heard.

  20. b

    Discrete data on hydrography (from CTD casts) and chemical analyses of...

    • bco-dmo.org
    • search.dataone.org
    csv
    Updated Mar 10, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brian Roberts (2021). Discrete data on hydrography (from CTD casts) and chemical analyses of dissolved nutrients and organic carbon on the Louisiana-Texas shelf in the Gulf of Mexico from R/V Acadiana cruise 18-21 September 2017 [Dataset]. http://doi.org/10.26008/1912/bco-dmo.844709.1
    Explore at:
    csv(10.82 KB)Available download formats
    Dataset updated
    Mar 10, 2021
    Dataset provided by
    Biological and Chemical Data Management Office
    Authors
    Brian Roberts
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Sep 18, 2017 - Sep 21, 2017
    Area covered
    Variables measured
    DOC, NH4, NO2, PO4, TDN, Date, SiO2, Time, Type, Depth, and 14 more
    Measurement technique
    CTD Sea-Bird SBE SEACAT 19plus, Flow Injection Analyzer, Total Organic Carbon Analyzer, Niskin bottle
    Description

    Usage note: Time zone information was not received by BCO-DMO so it is unknown if the dates and times in this dataset are local or UTC. Please contact the dataset PI if you have questions about this.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rebecca Foster; Bart Harmsen; Lorenzo Milazzo; Greg Distiller; David Borchers (2014). Continuous-time spatially explicit capture-recapture models, with an application to a jaguar camera-trap survey [Dataset]. http://doi.org/10.5061/dryad.mg5kv

Data from: Continuous-time spatially explicit capture-recapture models, with an application to a jaguar camera-trap survey

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Apr 21, 2014
Dataset provided by
University of Cape Town
University of Cambridge
University of St Andrews
University of Belize
Authors
Rebecca Foster; Bart Harmsen; Lorenzo Milazzo; Greg Distiller; David Borchers
License

https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

Area covered
Belize, Cockscomb Basin Wildlife Sanctuary
Description

Many capture-recapture surveys of wildlife populations operate in continuous time but detections are typically aggregated into occasions for analysis, even when exact detection times are available. This discards information and introduces subjectivity, in the form of decisions about occasion definition. We develop a spatio-temporal Poisson process model for spatially explicit capture-recapture (SECR) surveys that operate continuously and record exact detection times. We show that, except in some special cases (including the case in which detection probability does not change within occasion), temporally aggregated data do not provide sufficient statistics for density and related parameters, and that when detection probability is constant over time our continuous-time (CT) model is equivalent to an existing model based on detection frequencies. We use the model to estimate jaguar density from a camera-trap survey and conduct a simulation study to investigate the properties of a CT estimator and discrete-occasion estimators with various levels of temporal aggregation. This includes investigation of the effect on the estimators of spatio-temporal correlation induced by animal movement. The CT estimator is found to be unbiased and more precise than discrete-occasion estimators based on binary capture data (rather than detection frequencies) when there is no spatio-temporal correlation. It is also found to be only slightly biased when there is correlation induced by animal movement, and to be more robust to inadequate detector spacing, while discrete-occasion estimators with binary data can be sensitive to occasion length, particularly in the presence of inadequate detector spacing. Our model includes as a special case a discrete-occasion estimator based on detection frequencies, and at the same time lays a foundation for the development of more sophisticated CT models and estimators. It allows modelling within-occasion changes in detectability, readily accommodates variation in detector effort, removes subjectivity associated with user-defined occasions, and fully utilises CT data. We identify a need for developing CT methods that incorporate spatio-temporal dependence in detections and see potential for CT models being combined with telemetry-based animal movement models to provide a richer inference framework.

Search
Clear search
Close search
Google apps
Main menu