100+ datasets found
  1. Data from: Comparison of NSDUH Mental Health Data and Methods with Other...

    • catalog.data.gov
    • data.virginia.gov
    Updated Sep 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Substance Abuse and Mental Health Services Administration (2025). Comparison of NSDUH Mental Health Data and Methods with Other Data Sources [Dataset]. https://catalog.data.gov/dataset/comparison-of-nsduh-mental-health-data-and-methods-with-other-data-sources
    Explore at:
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    Substance Abuse and Mental Health Services Administrationhttps://www.samhsa.gov/
    Description

    This report compares estimates of adult mental health from the 2009 National Survey on Drug Use and Health (NSDUH) with estimates of similar measures from 2001 to 2003 National Comorbidity Survey Replication (NCS-R), 2001 to 2002 National Epidemiologic Survey on Alcohol and Related Conditions (NESARC), 2007 Behavioral Risk Factor Surveillance System (BRFSS), 2008 National Health Interview Survey (NHIS), 2008 Medical Expenditure Panel Survey (MEPS), and 2008 Uniform Reporting System (URS).

  2. f

    Data from: Analysis of Commercial and Public Bioactivity Databases

    • acs.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pekka Tiikkainen; Lutz Franke (2023). Analysis of Commercial and Public Bioactivity Databases [Dataset]. http://doi.org/10.1021/ci2003126.s003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    ACS Publications
    Authors
    Pekka Tiikkainen; Lutz Franke
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Activity data for small molecules are invaluable in chemoinformatics. Various bioactivity databases exist containing detailed information of target proteins and quantitative binding data for small molecules extracted from journals and patents. In the current work, we have merged several public and commercial bioactivity databases into one bioactivity metabase. The molecular presentation, target information, and activity data of the vendor databases were standardized. The main motivation of the work was to create a single relational database which allows fast and simple data retrieval by in-house scientists. Second, we wanted to know the amount of overlap between databases by commercial and public vendors to see whether the former contain data complementing the latter. Third, we quantified the degree of inconsistency between data sources by comparing data points derived from the same scientific article cited by more than one vendor. We found that each data source contains unique data which is due to different scientific articles cited by the vendors. When comparing data derived from the same article we found that inconsistencies between the vendors are common. In conclusion, using databases of different vendors is still useful since the data overlap is not complete. It should be noted that this can be partially explained by the inconsistencies and errors in the source data.

  3. Supplementary material from "Visual comparison of two data sets: Do people...

    • figshare.com
    xlsx
    Updated Mar 14, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robin Kramer; Caitlin Telfer; Alice Towler (2017). Supplementary material from "Visual comparison of two data sets: Do people use the means and the variability?" [Dataset]. http://doi.org/10.6084/m9.figshare.4751095.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 14, 2017
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Robin Kramer; Caitlin Telfer; Alice Towler
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In our everyday lives, we are required to make decisions based upon our statistical intuitions. Often, these involve the comparison of two groups, such as luxury versus family cars and their suitability. Research has shown that the mean difference affects judgements where two sets of data are compared, but the variability of the data has only a minor influence, if any at all. However, prior research has tended to present raw data as simple lists of values. Here, we investigated whether displaying data visually, in the form of parallel dot plots, would lead viewers to incorporate variability information. In Experiment 1, we asked a large sample of people to compare two fictional groups (children who drank ‘Brain Juice’ versus water) in a one-shot design, where only a single comparison was made. Our results confirmed that only the mean difference between the groups predicted subsequent judgements of how much they differed, in line with previous work using lists of numbers. In Experiment 2, we asked each participant to make multiple comparisons, with both the mean difference and the pooled standard deviation varying across data sets they were shown. Here, we found that both sources of information were correctly incorporated when making responses. Taken together, we suggest that increasing the salience of variability information, through manipulating this factor across items seen, encourages viewers to consider this in their judgements. Such findings may have useful applications for best practices when teaching difficult concepts like sampling variation.

  4. Data from: A consensus compound/bioactivity dataset for data-driven drug...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated May 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laura Isigkeit; Laura Isigkeit; Apirat Chaikuad; Apirat Chaikuad; Daniel Merk; Daniel Merk (2022). A consensus compound/bioactivity dataset for data-driven drug design and chemogenomics [Dataset]. http://doi.org/10.5281/zenodo.6320761
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 13, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Laura Isigkeit; Laura Isigkeit; Apirat Chaikuad; Apirat Chaikuad; Daniel Merk; Daniel Merk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Information

    The diverse publicly available compound/bioactivity databases constitute a key resource for data-driven applications in chemogenomics and drug design. Analysis of their coverage of compound entries and biological targets revealed considerable differences, however, suggesting benefit of a consensus dataset. Therefore, we have combined and curated information from five esteemed databases (ChEMBL, PubChem, BindingDB, IUPHAR/BPS and Probes&Drugs) to assemble a consensus compound/bioactivity dataset comprising 1144803 compounds with 10915362 bioactivities on 5613 targets (including defined macromolecular targets as well as cell-lines and phenotypic readouts). It also provides simplified information on assay types underlying the bioactivity data and on bioactivity confidence by comparing data from different sources. We have unified the source databases, brought them into a common format and combined them, enabling an ease for generic uses in multiple applications such as chemogenomics and data-driven drug design.

    The consensus dataset provides increased target coverage and contains a higher number of molecules compared to the source databases which is also evident from a larger number of scaffolds. These features render the consensus dataset a valuable tool for machine learning and other data-driven applications in (de novo) drug design and bioactivity prediction. The increased chemical and bioactivity coverage of the consensus dataset may improve robustness of such models compared to the single source databases. In addition, semi-automated structure and bioactivity annotation checks with flags for divergent data from different sources may help data selection and further accurate curation.

    Structure and content of the dataset

    Dataset structure

    ChEMBL

    ID

    PubChem

    ID

    IUPHAR

    ID

    Target

    Activity

    type

    Assay typeUnitMean C (0)...Mean PC (0)...Mean B (0)...Mean I (0)...Mean PD (0)...Activity check annotationLigand namesCanonical SMILES C...Structure checkSource

    The dataset was created using the Konstanz Information Miner (KNIME) (https://www.knime.com/) and was exported as a CSV-file and a compressed CSV-file.

    Except for the canonical SMILES columns, all columns are filled with the datatype ‘string’. The datatype for the canonical SMILES columns is the smiles-format. We recommend the File Reader node for using the dataset in KNIME. With the help of this node the data types of the columns can be adjusted exactly. In addition, only this node can read the compressed format.

    Column content:

    • ChEMBL ID, PubChem ID, IUPHAR ID: chemical identifier of the databases
    • Target: biological target of the molecule expressed as the HGNC gene symbol
    • Activity type: for example, pIC50
    • Assay type: Simplification/Classification of the assay into cell-free, cellular, functional and unspecified
    • Unit: unit of bioactivity measurement
    • Mean columns of the databases: mean of bioactivity values or activity comments denoted with the frequency of their occurrence in the database, e.g. Mean C = 7.5 *(15) -> the value for this compound-target pair occurs 15 times in ChEMBL database
    • Activity check annotation: a bioactivity check was performed by comparing values from the different sources and adding an activity check annotation to provide automated activity validation for additional confidence
      • no comment: bioactivity values are within one log unit;
      • check activity data: bioactivity values are not within one log unit;
      • only one data point: only one value was available, no comparison and no range calculated;
      • no activity value: no precise numeric activity value was available;
      • no log-value could be calculated: no negative decadic logarithm could be calculated, e.g., because the reported unit was not a compound concentration
    • Ligand names: all unique names contained in the five source databases are listed
    • Canonical SMILES columns: Molecular structure of the compound from each database
    • Structure check: To denote matching or differing compound structures in different source databases
      • match: molecule structures are the same between different sources;
      • no match: the structures differ;
      • 1 source: no structure comparison is possible, because the molecule comes from only one source database.
    • Source: From which databases the data come from

  5. d

    Dataset for: Same Question, Different Answers? An Empirical Comparison of...

    • demo-b2find.dkrz.de
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Dataset for: Same Question, Different Answers? An Empirical Comparison of Web Data and Traditional Data - Dataset - B2FIND [Dataset]. http://demo-b2find.dkrz.de/dataset/bc684dad-c657-5013-b2d4-cc35b4a2e7ee
    Explore at:
    Dataset updated
    Sep 22, 2025
    Description

    Psychological scientists increasingly study web data, such as user ratings or social media postings. However, whether research relying on such web data leads to the same conclusions as research based on traditional data is largely unknown. To test this, we (re)analyzed three datasets, thereby comparing web data with lab and online survey data. We calculated correlations across these different datasets (Study 1) and investigated identical, illustrative research questions in each dataset (Studies 2 to 4). Our results suggest that web and traditional data are not fundamentally different and usually lead to similar conclusions, but also that it is important to consider differences between data types such as populations and research settings. Web data can be a valuable tool for psychologists when accounting for such differences, as it allows for testing established research findings in new contexts, complementing them with insights from novel data sources.

  6. Data from: Comparison of NSDUH Health and Health Care Utilization Estimates...

    • catalog.data.gov
    • data.virginia.gov
    Updated Sep 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Substance Abuse and Mental Health Services Administration (2025). Comparison of NSDUH Health and Health Care Utilization Estimates to Other National Data Sources [Dataset]. https://catalog.data.gov/dataset/comparison-of-nsduh-health-and-health-care-utilization-estimates-to-other-national-data-so
    Explore at:
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    Substance Abuse and Mental Health Services Administrationhttps://www.samhsa.gov/
    Description

    This report compares specific health conditions, overall health, and health care utilization prevalence estimates from the 2006 National Survey on Drug Use and Health (NSDUH) and other national data sources. Methodological differences among these data sources that may contribute to differences in estimates are described. In addition to NSDUH, three of the data sources use respondent self-reports to measure health characteristics and service utilization: the National Health Interview Survey (NHIS), the Behavioral Risk Factor Surveillance System (BRFSS), and the Medical Expenditure Panel Survey (MEPS). One survey, the National Health and Nutrition Examination Survey (NHANES), conducts initial interviews in respondents\' homes, collecting further data at nearby locations. Five data sources provide health care utilization data extracted from hospital records; these sources include the National Hospital Discharge Survey (NHDS), the Nationwide Inpatient Sample (NIS), the Nationwide Emergency Department Sample (NEDS), the National Health and Ambulatory Medical Care Survey (NHAMCS), and the Drug Abuse Warning Network (DAWN). Several methodological differences that could cause differences in estimates are discussed, including type and mode of data collection; weighting and representativeness of the sample; question placement, wording, and format; and use of proxy reporting for adolescents.

  7. Data from: Comparison of NSDUH Mental Health Data and Methods with Other...

    • odgavaprod.ogopendata.com
    • catalog.data.gov
    html
    Updated Sep 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Substance Abuse and Mental Health Services Administration (2025). Comparison of NSDUH Mental Health Data and Methods with Other Data Sources [Dataset]. https://odgavaprod.ogopendata.com/dataset/comparison-of-nsduh-mental-health-data-and-methods-with-other-data-sources1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    Substance Abuse and Mental Health Services Administrationhttps://www.samhsa.gov/
    Description

    This report compares adult mental health prevalence estimates generated from the 2009 National Survey on Drug Use and Health (NSDUH) with estimates of similar measures generated from other national data sources. It also describes the methodologies of the different data sources and discusses the differences in survey design and estimation that may contribute to differences among these estimates. The other data systems discussed include the 2001 to 2003 National Comorbidity Survey Replication (NCS-R), 2001 to 2002 National Epidemiologic Survey on Alcohol and Related Conditions (NESARC), 2007 Behavioral Risk Factor Surveillance System (BRFSS), 2008 National Health Interview Survey (NHIS), 2008 Medical Expenditure Panel Survey (MEPS), and 2008 Uniform Reporting System (URS).

  8. f

    P-values for comparison of relative risks for study outcomes assessed using...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Aug 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harding, Jane E.; Milne, Barry; Shahbaz, Mohammad; von Randow, Martin; Gamble, Greg D.; Walters, Anthony (2024). P-values for comparison of relative risks for study outcomes assessed using different data sources alone or in combination. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001424576
    Explore at:
    Dataset updated
    Aug 7, 2024
    Authors
    Harding, Jane E.; Milne, Barry; Shahbaz, Mohammad; von Randow, Martin; Gamble, Greg D.; Walters, Anthony
    Description

    P-values for comparison of relative risks for study outcomes assessed using different data sources alone or in combination.

  9. 2016-2017 NSDUH: Other Sources of State-Level Data

    • healthdata.gov
    • data.virginia.gov
    • +1more
    csv, xlsx, xml
    Updated Jul 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). 2016-2017 NSDUH: Other Sources of State-Level Data [Dataset]. https://healthdata.gov/d/yewh-wy4j
    Explore at:
    xml, xlsx, csvAvailable download formats
    Dataset updated
    Jul 14, 2025
    Description

    This document briefly describes one of these other data systems that publish state estimates and presents selected comparisons with NSDUH results: Behavioral Risk Factor Surveillance System (BRFSS) and Youth Risk Behavior Survey (YRBS).

  10. H

    Training Webinars on China Research Data: Sources, Tools and Applications

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Feb 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Spatial Data Lab (2024). Training Webinars on China Research Data: Sources, Tools and Applications [Dataset]. http://doi.org/10.7910/DVN/LN6OHH
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 20, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Spatial Data Lab
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    China
    Description

    This webinar series introduce some research data with a focus on China and discuss the difference from the US data. Each webinar will cover the following topics: (1) data sources, data collection, data category, definition, description, and interpretation; (2) alternative data and derivable data from other data sources, especially some big data sources; (3) comparison of data difference between the US and China; (4) available tools for efficient data analysis; (5) discussions on pros and cons; and (6) data applications in research and teaching.

  11. Z

    Data from: Large-scale comparison of bibliographic data sources: Scopus, Web...

    • data.niaid.nih.gov
    Updated Aug 17, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Visser, Martijn; Van Eck, Nees Jan; Waltman, Ludo (2020). Large-scale comparison of bibliographic data sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3837427
    Explore at:
    Dataset updated
    Aug 17, 2020
    Dataset provided by
    Centre for Science and Technology Studies, Leiden University
    Authors
    Visser, Martijn; Van Eck, Nees Jan; Waltman, Ludo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This data set contains supplementary material for the paper 'Large-scale comparison of bibliographic data sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic' by Martijn Visser, Nees Jan van Eck, and Ludo Waltman. The data set provides the statistics presented in the figures in the paper.

  12. Linked Open Data Management Services: A Comparison

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    Updated Sep 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robert Nasarek; Robert Nasarek; Lozana Rossenova; Lozana Rossenova (2023). Linked Open Data Management Services: A Comparison [Dataset]. http://doi.org/10.5281/zenodo.7738424
    Explore at:
    Dataset updated
    Sep 18, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Robert Nasarek; Robert Nasarek; Lozana Rossenova; Lozana Rossenova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Thanks to a variety of software services, it has never been easier to produce, manage and publish Linked Open Data. But until now, there has been a lack of an accessible overview to help researchers make the right choice for their use case. This dataset release will be regularly updated to reflect the latest data published in a comparison table developed in Google Sheets [1]. The comparison table includes the most commonly used LOD management software tools from NFDI4Culture to illustrate what functionalities and features a service should offer for the long-term management of FAIR research data, including:

    • ConedaKOR
    • LinkedDataHub
    • Metaphacts
    • Omeka S
    • ResearchSpace
    • Vitro
    • Wikibase
    • WissKI

    The table presents two views based on a comparison system of categories developed iteratively during workshops with expert users and developers from the respective tool communities. First, a short overview with field values coming from controlled vocabularies and multiple-choice options; and a second sheet allowing for more descriptive free text additions. The table and corresponding dataset releases for each view mode are designed to provide a well-founded basis for evaluation when deciding on a LOD management service. The Google Sheet table will remain open to collaboration and community contribution, as well as updates with new data and potentially new tools, whereas the datasets released here are meant to provide stable reference points with version control.

    The research for the comparison table was first presented as a paper at DHd2023, Open Humanities – Open Culture, 13-17.03.2023, Trier and Luxembourg [2].

    [1] Non-editing access is available here: docs.google.com/spreadsheets/d/1FNU8857JwUNFXmXAW16lgpjLq5TkgBUuafqZF-yo8_I/edit?usp=share_link To get editing access contact the authors.

    [2] Full paper will be made available open access in the conference proceedings.

  13. Data from: Comparing and Evaluating Substance Use Treatment Utilization...

    • catalog.data.gov
    • data.virginia.gov
    Updated Sep 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Substance Abuse and Mental Health Services Administration (2025). Comparing and Evaluating Substance Use Treatment Utilization Estimates from the National Survey on Drug Use and Health and Other Data Sources [Dataset]. https://catalog.data.gov/dataset/comparing-and-evaluating-substance-use-treatment-utilization-estimates-from-the-national-s
    Explore at:
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    Substance Abuse and Mental Health Services Administrationhttps://www.samhsa.gov/
    Description

    This report presents an evaluation of the coverage, overlap, biases, strengths, and weaknesses of three sources of data on the receipt of specialty substance use treatment: the National Survey on Drug Use and Health (NSDUH), the National Survey of Substance Abuse Treatment Services (N-SSATS), and the Treatment Episode Data Set (TEDS). Specialty substance use treatment measures compared include numbers and characteristics of persons treated in a given year, single-day treatment counts, numbers of admissions in a given year, and estimates of the numbers of persons who needed substance use treatment but did not receive it. This report includes data from the 2005 through 2010 NSDUHs; 2007 through 2009 N-SSATS; and 2007 through 2009 TEDS. Results are show by substance treated, age, race/ethnicity, and employment status,

  14. Comparison of labour market data sources

    • gov.uk
    • s3.amazonaws.com
    Updated Dec 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2020). Comparison of labour market data sources [Dataset]. https://www.gov.uk/government/statistics/reconciliation-of-labour-market-statistics
    Explore at:
    Dataset updated
    Dec 3, 2020
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Office for National Statistics
    Description

    Official statistics are produced impartially and free from political influence.

  15. Data from: Datasets associated with: Comparing temperature data sources for...

    • zenodo.org
    • repository.uantwerpen.be
    bin, txt
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonas Lembrechts; Jonas Lembrechts (2020). Datasets associated with: Comparing temperature data sources for use in species distribution models: From in-situ logging to remote sensing. Global Ecology and Biogeography [Dataset]. http://doi.org/10.5281/zenodo.3302639
    Explore at:
    txt, binAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jonas Lembrechts; Jonas Lembrechts
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data associated with the paper 'Comparing temperature data sources for use in species distribution models: From in-situ logging to remote sensing. Global Ecology and Biogeography' by Lembrechts JJ et al., published in Global Ecology and Biogeography.

    Contains a dataset containing all extracted and measured temperature variables for all 106 measurement plots (climatedata), as well as the climate and species data used in the Species Distribution Models (SDMs).

    For details on the content of the table, see the readme-file, for details on methodology, see the original paper.

  16. Comparing U.S. cropland expansion estimates from the LCMAP with three other...

    • catalog.data.gov
    Updated Dec 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2024). Comparing U.S. cropland expansion estimates from the LCMAP with three other sources [Dataset]. https://catalog.data.gov/dataset/comparing-u-s-cropland-expansion-estimates-from-the-lcmap-with-three-other-sources
    Explore at:
    Dataset updated
    Dec 16, 2024
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    This dataset contains tabular data and scripts used to analyze and produce figures for the manuscript Martin et al. entitled "Tracking cropland transitions: a comparative analysis of U.S. land cover change data."

  17. b

    Comparison of Global Agricultural Supply and Demand Estimates from different...

    • bonndata.uni-bonn.de
    • daten.zef.de
    csv, jpeg, png, txt +1
    Updated Sep 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Brockhaus; Jan Brockhaus (2023). Comparison of Global Agricultural Supply and Demand Estimates from different Sources covering Wheat, Rice, Corn, and Soybeans. Raw Data from AMIS, 2011-2015 [Dataset]. http://doi.org/10.60507/FK2/AYRWKW
    Explore at:
    png(7086), xml(26021), txt(961), csv(167863), jpeg(9499)Available download formats
    Dataset updated
    Sep 18, 2023
    Dataset provided by
    bonndata
    Authors
    Jan Brockhaus; Jan Brockhaus
    License

    https://bonndata.uni-bonn.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.60507/FK2/AYRWKWhttps://bonndata.uni-bonn.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.60507/FK2/AYRWKW

    Time period covered
    Jan 1, 2011 - Jun 30, 2015
    Area covered
    World
    Description

    Global agricultural supply and demand estimates from the FAO, USDA, and IGC are compared to analyze data quality, documentation, and differences in the sources as well as the underlying reasons for these differences. All data is collected via AMIS (www.amis-outlook.org). The data originates from the AMIS Market Monitors No. 2 to 29. Quality/Lineage: See linked doctoral thesis by Jan Brockhaus for more details on data quality and methods which were used for analyzing the data. In general, the data seems to be of high quality. Few mistakes were found and corrected. Differences between the sources exist. For underlying reasons, see linked doctoral thesis. Purpose: Understanding why different sources provide different estimates.

  18. f

    Data from: Using Data Analysis To Evaluate and Compare Chemical Syntheses

    • acs.figshare.com
    zip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dustin Kaiser; Jianbo Yang; Georg Wuitschik (2023). Using Data Analysis To Evaluate and Compare Chemical Syntheses [Dataset]. http://doi.org/10.1021/acs.oprd.8b00199.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    ACS Publications
    Authors
    Dustin Kaiser; Jianbo Yang; Georg Wuitschik
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    We present ChemPager, a freely available tool for systematically evaluating chemical syntheses. By processing and visualizing chemical data, the impact of past changes is uncovered and future work guided. The tool calculates commonly used metrics such as process mass intensity (PMI), Volume–Time Output, and production costs. Also, a set of scores is introduced aiming to measure crucial but elusive characteristics such as process robustness, design, and safety. Our tool employs a hierarchical data layout built on common software for data entry (Excel, Google Sheets, etc.) and visualization (Spotfire). With all project data being stored in one place, cross-project comparison and data aggregation becomes possible as well as cross-linking with other data sources or visualizations.

  19. D

    Data from: Data sources for trait databases: comparing the phenomic content...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    • +1more
    Updated May 20, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dechecchi, T. Alex; Mabee, Paula M.; Dececchi, T. Alex; Blackburn, David C. (2016). Data sources for trait databases: comparing the phenomic content of monographs and evolutionary matrices [Dataset]. http://doi.org/10.5061/dryad.dc8nn
    Explore at:
    Dataset updated
    May 20, 2016
    Authors
    Dechecchi, T. Alex; Mabee, Paula M.; Dececchi, T. Alex; Blackburn, David C.
    Description

    Databases of organismal traits that aggregate information from one or multiple sources can be leveraged for large-scale analyses in biology. Yet the differences among these data streams and how well they capture trait diversity have never been explored. We present the first analysis of the differences between phenotypes captured in free text of descriptive publications (‘monographs’) and those used in phylogenetic analyses (‘matrices’). We focus our analysis on osteological phenotypes of the limbs of four extinct vertebrate taxa critical to our understanding of the fin-to-limb transition. We find that there is low overlap between the anatomical entities used in these two sources of phenotype data, indicating that phenotypes represented in matrices are not simply a subset of those found in monographic descriptions. Perhaps as expected, compared to characters found in matrices, phenotypes in monographs tend to emphasize descriptive and positional morphology, be somewhat more complex, and relate to fewer additional taxa. While based on a small set of focal taxa, these qualitative and quantitative data suggest that either source of phenotypes alone will result in incomplete knowledge of variation for a given taxon. As a broader community develops to use and expand databases characterizing organismal trait diversity, it is important to recognize the limitations of the data sources and develop strategies to more fully characterize variation both within species and across the tree of life.

  20. COVID-19 Global Case and Death Data

    • kaggle.com
    zip
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). COVID-19 Global Case and Death Data [Dataset]. https://www.kaggle.com/datasets/thedevastator/covid-19-global-case-and-death-data
    Explore at:
    zip(81724234 bytes)Available download formats
    Dataset updated
    Dec 4, 2023
    Authors
    The Devastator
    Description

    COVID-19 Global Case and Death Data

    Global COVID-19 Cases and Deaths Over Time

    By Coronavirus (COVID-19) Data Hub [source]

    About this dataset

    The COVID-19 Global Time Series Case and Death Data is a comprehensive collection of global COVID-19 case and death information recorded over time. This dataset includes data from various sources such as JHU CSSE COVID-19 Data and The New York Times.

    The dataset consists of several columns providing detailed information on different aspects of the COVID-19 situation. The COUNTRY_SHORT_NAME column represents the short name of the country where the data is recorded, while the Data_Source column indicates the source from which the data was obtained.

    Other important columns include Cases, which denotes the number of COVID-19 cases reported, and Difference, which indicates the difference in case numbers compared to the previous day. Additionally, there are columns such as CONTINENT_NAME, DATA_SOURCE_NAME, COUNTRY_ALPHA_3_CODE, COUNTRY_ALPHA_2_CODE that provide additional details about countries and continents.

    Furthermore, this dataset also includes information on deaths related to COVID-19. The column PEOPLE_DEATH_NEW_COUNT shows the number of new deaths reported on a specific date.

    To provide more context to the data, certain columns offer demographic details about locations. For instance, Population_Count provides population counts for different areas. Moreover,**FIPS** code is available for provincial/state regions for identification purposes.

    It is important to note that this dataset covers both confirmed cases (Case_Type: confirmed) as well as probable cases (Case_Type: probable). These classifications help differentiate between various types of COVID-19 infections.

    Overall, this dataset offers a comprehensive picture of global COVID-19 situations by providing accurate and up-to-date information on cases, deaths, demographic details like population count or FIPS code), source references (such as JHU CSSE or NY Times), geographical information (country names coded with ALPHA codes) , etcetera making it useful for researchers studying patterns and trends associated with this pandemic

    How to use the dataset

    • Understanding the Dataset Structure:

      • The dataset is available in two files: COVID-19 Activity.csv and COVID-19 Cases.csv.
      • Both files contain different columns that provide information about the COVID-19 cases and deaths.
      • Some important columns to look out for are: a. PEOPLE_POSITIVE_CASES_COUNT: The total number of confirmed positive COVID-19 cases. b. COUNTY_NAME: The name of the county where the data is recorded. c. PROVINCE_STATE_NAME: The name of the province or state where the data is recorded. d. REPORT_DATE: The date when the data was reported. e. CONTINENT_NAME: The name of the continent where the data is recorded. f. DATA_SOURCE_NAME: The name of the data source. g. PEOPLE_DEATH_NEW_COUNT: The number of new deaths reported on a specific date. h.COUNTRY_ALPHA_3_CODE :The three-letter alpha code represents country f.Lat,Long :latitude and longitude coordinates represent location i.Country_Region or COUNTRY_SHORT_NAME:The country or region where cases were reported.
    • Choosing Relevant Columns: It's important to determine which columns are relevant to your analysis or research question before proceeding with further analysis.

    • Exploring Data Patterns: Use various statistical techniques like summarizing statistics, creating visualizations (e.g., bar charts, line graphs), etc., to explore patterns in different variables over time or across regions/countries.

    • Filtering Data: You can filter your dataset based on specific criteria using column(s) such as COUNTRY_SHORT_NAME, CONTINENT_NAME, or PROVINCE_STATE_NAME to focus on specific countries, continents, or regions of interest.

    • Combining Data: You can combine data from different sources (e.g., COVID-19 cases and deaths) to perform advanced analysis or create insightful visualizations.

    • Analyzing Trends: Use the dataset to analyze and identify trends in COVID-19 cases and deaths over time. You can examine factors such as population count, testing count, hospitalization count, etc., to gain deeper insights into the impact of the virus.

    • Comparing Countries/Regions: Compare COVID-19

    Research Ideas

    • Trend Analysis: This dataset can be used to analyze and track the trends of COVID-19 cases and deaths over time. It provides comprehensive global data, allowing researchers and po...
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Substance Abuse and Mental Health Services Administration (2025). Comparison of NSDUH Mental Health Data and Methods with Other Data Sources [Dataset]. https://catalog.data.gov/dataset/comparison-of-nsduh-mental-health-data-and-methods-with-other-data-sources
Organization logo

Data from: Comparison of NSDUH Mental Health Data and Methods with Other Data Sources

Related Article
Explore at:
Dataset updated
Sep 6, 2025
Dataset provided by
Substance Abuse and Mental Health Services Administrationhttps://www.samhsa.gov/
Description

This report compares estimates of adult mental health from the 2009 National Survey on Drug Use and Health (NSDUH) with estimates of similar measures from 2001 to 2003 National Comorbidity Survey Replication (NCS-R), 2001 to 2002 National Epidemiologic Survey on Alcohol and Related Conditions (NESARC), 2007 Behavioral Risk Factor Surveillance System (BRFSS), 2008 National Health Interview Survey (NHIS), 2008 Medical Expenditure Panel Survey (MEPS), and 2008 Uniform Reporting System (URS).

Search
Clear search
Close search
Google apps
Main menu