100+ datasets found
  1. The primary data types collected.

    • figshare.com
    • plos.figshare.com
    xls
    Updated Jun 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid (2023). The primary data types collected. [Dataset]. http://doi.org/10.1371/journal.pone.0265257.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 15, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The primary data types collected.

  2. w

    Global Electronic Data Capture Tool Market Research Report: By Application...

    • wiseguyreports.com
    Updated Oct 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Global Electronic Data Capture Tool Market Research Report: By Application (Clinical Trials, Pharmaceutical Research, Market Research, Survey Data Collection), By Deployment Type (Cloud-Based, On-Premises, Hybrid), By End User (Healthcare Organizations, Academic Institutions, Market Research Firms, Government Bodies), By Data Type (Qualitative Data, Quantitative Data, Mixed Data) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2035 [Dataset]. https://www.wiseguyreports.com/reports/electronic-data-capture-tool-market
    Explore at:
    Dataset updated
    Oct 14, 2025
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Time period covered
    Oct 25, 2025
    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2023
    REGIONS COVEREDNorth America, Europe, APAC, South America, MEA
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 20244.37(USD Billion)
    MARKET SIZE 20254.71(USD Billion)
    MARKET SIZE 203510.0(USD Billion)
    SEGMENTS COVEREDApplication, Deployment Type, End User, Data Type, Regional
    COUNTRIES COVEREDUS, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA
    KEY MARKET DYNAMICSgrowing demand for clinical trials, increasing adoption of electronic solutions, rising regulatory compliance needs, need for real-time data access, growing focus on patient engagement
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDCastor, SAS Institute, CureClick, Veeva Systems, RedCap, WCG Clinical, ERT, Medidata Solutions, Parexel International, Appian, Advarra, CRF Health, IBM, ClinCapture, Oracle
    MARKET FORECAST PERIOD2025 - 2035
    KEY MARKET OPPORTUNITIESIncreased demand for remote trials, Adoption of cloud-based solutions, Growing need for regulatory compliance, Rising use in clinical research, Expansion in emerging markets
    COMPOUND ANNUAL GROWTH RATE (CAGR) 7.8% (2025 - 2035)
  3. t

    Which of the Five Types of Data Science Does Your Startup Need? - Data...

    • tomtunguz.com
    Updated Oct 2, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomasz Tunguz (2013). Which of the Five Types of Data Science Does Your Startup Need? - Data Analysis [Dataset]. https://tomtunguz.com/data-science-types/
    Explore at:
    Dataset updated
    Oct 2, 2013
    Dataset provided by
    Theory Ventures
    Authors
    Tomasz Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Discover the 5 distinct types of data scientists your startup needs, from quantitative PhDs to operational analysts. Learn which role best fits your company's growth stage.

  4. o

    Individual Level Data - Quantitative Metadata - baselinesurvey_oh

    • hcs.chrr.ohio-state.edu
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The citation is currently not available for this dataset.
    Explore at:
    Dataset updated
    Sep 22, 2025
    Description

    Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Questionnaire Temporal Features: One-Time Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Behavioral Data, Demographics Topics: Drug Use, Fentanyl Test Strips

  5. d

    Dataplex: FDA Medical Device Data | 24M+ Rows of Key Device Product Data for...

    • datarade.ai
    .csv
    Updated Aug 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataplex (2024). Dataplex: FDA Medical Device Data | 24M+ Rows of Key Device Product Data for Research & Analysis [Dataset]. https://datarade.ai/data-products/dataplex-fda-medical-device-data-24m-rows-of-key-device-i-dataplex
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Aug 12, 2024
    Dataset authored and provided by
    Dataplex
    Area covered
    United States of America
    Description

    The FDA Device Dataset by Dataplex provides comprehensive access to over 24 million rows of detailed information, covering 9 key data types essential for anyone involved in the medical device industry. Sourced directly from the U.S. Food and Drug Administration (FDA), this dataset is a critical resource for regulatory compliance, market analysis, and product safety assessment regarding.

    Dataset Overview:

    This dataset includes data on medical device registrations, approvals, recalls, and adverse events, among other crucial aspects. The dataset is meticulously cleaned and structured to ensure that it meets the needs of researchers, regulatory professionals, and market analysts.

    24 Million Rows of Data:

    With over 24 million rows, this dataset offers an extensive view of the regulatory landscape for medical devices. It includes data types such as classification, event, enforcement, 510k, registration listings, recall, PMA, UDI, and covid19 serology. This wide range of data types allows users to perform granular analysis on a broad spectrum of device-related topics.

    Sourced from the FDA:

    All data in this dataset is sourced directly from the FDA, ensuring that it is accurate, up-to-date, and reliable. Regular updates ensure that the dataset remains current, reflecting the latest in device approvals, clearances, and safety reports.

    Key Features:

    • Comprehensive Coverage: Includes 9 key device data types, such as 510(k) clearances, premarket approvals, device classifications, and adverse event reports.

    • Regulatory Compliance: Provides detailed information necessary for tracking compliance with FDA regulations, including device recalls and enforcement actions.

    • Market Analysis: Analysts can utilize the dataset to assess market trends, monitor competitor activities, and track the introduction of new devices.

    • Product Safety Analysis: Researchers can analyze adverse event reports and device recalls to evaluate the safety and performance of medical devices.

    Use Cases: - Regulatory Compliance: Ensure your devices meet FDA standards, monitor compliance trends, and stay informed about regulatory changes.

    • Market Research: Identify trends in the medical device market, track new device approvals, and analyze competitive landscapes with up-to-date and historical data.

    • Product Safety: Assess the safety and performance of medical devices by examining detailed adverse event reports and recall data.

    Data Quality and Reliability:

    The FDA Device Dataset prioritizes data quality and reliability. Each record is meticulously sourced from the FDA's official databases, ensuring that the information is both accurate and up-to-date. This makes the dataset a trusted resource for critical applications, where data accuracy is vital.

    Integration and Usability:

    The dataset is provided in CSV format, making it compatible with most data analysis tools and platforms. Users can easily import, analyze, and utilize the data for various applications, from regulatory reporting to market analysis.

    User-Friendly Structure and Metadata:

    The data is organized for easy navigation, with clear metadata files included to help users identify relevant records. The dataset is structured by device type, approval and clearance processes, and adverse event reports, allowing for efficient data retrieval and analysis.

    Ideal For:

    • Regulatory Professionals: Monitor FDA compliance, track regulatory changes, and prepare for audits with comprehensive and up-to-date product data.

    • Market Analysts: Conduct detailed research on market trends, assess new device entries, and analyze competitive dynamics with extensive FDA data.

    • Healthcare Researchers: Evaluate the safety and efficacy of medical devices product data, identify potential risks, and contribute to improved patient outcomes through detailed analysis.

    This dataset is an indispensable resource for anyone involved in the medical device industry, providing the data and insights necessary to drive informed decisions and ensure compliance with FDA regulations.

  6. q

    Intro to Data Types and Graphing Lab

    • qubeshub.org
    Updated Oct 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stephanie Spera (2020). Intro to Data Types and Graphing Lab [Dataset]. http://doi.org/10.25334/1XYA-TF48
    Explore at:
    Dataset updated
    Oct 12, 2020
    Dataset provided by
    QUBES
    Authors
    Stephanie Spera
    Description

    This is the third lab in an Introductory Physical Geography/Environmental Studies course. It introduces students to different data types (qualitative vs quantitative), basic statistical analyses (correlation analysis s, t-test), and graphing techniques.

  7. o

    Individual Level Data - Quantitative Metadata - screenersurvey_oh

    • hcs.chrr.ohio-state.edu
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The citation is currently not available for this dataset.
    Explore at:
    Dataset updated
    Sep 22, 2025
    Description

    Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Questionnaire Temporal Features: One-Time Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Participant Screening Topics: Drug Use, Fentanyl Test Strips

  8. Z

    Conceptualization of public data ecosystems

    • data.niaid.nih.gov
    Updated Sep 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anastasija, Nikiforova; Martin, Lnenicka (2024). Conceptualization of public data ecosystems [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13842001
    Explore at:
    Dataset updated
    Sep 26, 2024
    Dataset provided by
    University of Tartu
    University of Hradec Králové
    Authors
    Anastasija, Nikiforova; Martin, Lnenicka
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains data collected during a study "Understanding the development of public data ecosystems: from a conceptual model to a six-generation model of the evolution of public data ecosystems" conducted by Martin Lnenicka (University of Hradec Králové, Czech Republic), Anastasija Nikiforova (University of Tartu, Estonia), Mariusz Luterek (University of Warsaw, Warsaw, Poland), Petar Milic (University of Pristina - Kosovska Mitrovica, Serbia), Daniel Rudmark (Swedish National Road and Transport Research Institute, Sweden), Sebastian Neumaier (St. Pölten University of Applied Sciences, Austria), Karlo Kević (University of Zagreb, Croatia), Anneke Zuiderwijk (Delft University of Technology, Delft, the Netherlands), Manuel Pedro Rodríguez Bolívar (University of Granada, Granada, Spain).

    As there is a lack of understanding of the elements that constitute different types of value-adding public data ecosystems and how these elements form and shape the development of these ecosystems over time, which can lead to misguided efforts to develop future public data ecosystems, the aim of the study is: (1) to explore how public data ecosystems have developed over time and (2) to identify the value-adding elements and formative characteristics of public data ecosystems. Using an exploratory retrospective analysis and a deductive approach, we systematically review 148 studies published between 1994 and 2023. Based on the results, this study presents a typology of public data ecosystems and develops a conceptual model of elements and formative characteristics that contribute most to value-adding public data ecosystems, and develops a conceptual model of the evolutionary generation of public data ecosystems represented by six generations called Evolutionary Model of Public Data Ecosystems (EMPDE). Finally, three avenues for a future research agenda are proposed.

    This dataset is being made public both to act as supplementary data for "Understanding the development of public data ecosystems: from a conceptual model to a six-generation model of the evolution of public data ecosystems ", Telematics and Informatics*, and its Systematic Literature Review component that informs the study.

    Description of the data in this data set

    PublicDataEcosystem_SLR provides the structure of the protocol

    Spreadsheet#1 provides the list of results after the search over three indexing databases and filtering out irrelevant studies

    Spreadsheets #2 provides the protocol structure.

    Spreadsheets #3 provides the filled protocol for relevant studies.

    The information on each selected study was collected in four categories:(1) descriptive information,(2) approach- and research design- related information,(3) quality-related information,(4) HVD determination-related information

    Descriptive Information

    Article number

    A study number, corresponding to the study number assigned in an Excel worksheet

    Complete reference

    The complete source information to refer to the study (in APA style), including the author(s) of the study, the year in which it was published, the study's title and other source information.

    Year of publication

    The year in which the study was published.

    Journal article / conference paper / book chapter

    The type of the paper, i.e., journal article, conference paper, or book chapter.

    Journal / conference / book

    Journal article, conference, where the paper is published.

    DOI / Website

    A link to the website where the study can be found.

    Number of words

    A number of words of the study.

    Number of citations in Scopus and WoS

    The number of citations of the paper in Scopus and WoS digital libraries.

    Availability in Open Access

    Availability of a study in the Open Access or Free / Full Access.

    Keywords

    Keywords of the paper as indicated by the authors (in the paper).

    Relevance for our study (high / medium / low)

    What is the relevance level of the paper for our study

    Approach- and research design-related information

    Approach- and research design-related information

    Objective / Aim / Goal / Purpose & Research Questions

    The research objective and established RQs.

    Research method (including unit of analysis)

    The methods used to collect data in the study, including the unit of analysis that refers to the country, organisation, or other specific unit that has been analysed such as the number of use-cases or policy documents, number and scope of the SLR etc.

    Study’s contributions

    The study’s contribution as defined by the authors

    Qualitative / quantitative / mixed method

    Whether the study uses a qualitative, quantitative, or mixed methods approach?

    Availability of the underlying research data

    Whether the paper has a reference to the public availability of the underlying research data e.g., transcriptions of interviews, collected data etc., or explains why these data are not openly shared?

    Period under investigation

    Period (or moment) in which the study was conducted (e.g., January 2021-March 2022)

    Use of theory / theoretical concepts / approaches? If yes, specify them

    Does the study mention any theory / theoretical concepts / approaches? If yes, what theory / concepts / approaches? If any theory is mentioned, how is theory used in the study? (e.g., mentioned to explain a certain phenomenon, used as a framework for analysis, tested theory, theory mentioned in the future research section).

    Quality-related information

    Quality concerns

    Whether there are any quality concerns (e.g., limited information about the research methods used)?

    Public Data Ecosystem-related information

    Public data ecosystem definition

    How is the public data ecosystem defined in the paper and any other equivalent term, mostly infrastructure. If an alternative term is used, how is the public data ecosystem called in the paper?

    Public data ecosystem evolution / development

    Does the paper define the evolution of the public data ecosystem? If yes, how is it defined and what factors affect it?

    What constitutes a public data ecosystem?

    What constitutes a public data ecosystem (components & relationships) - their "FORM / OUTPUT" presented in the paper (general description with more detailed answers to further additional questions).

    Components and relationships

    What components does the public data ecosystem consist of and what are the relationships between these components? Alternative names for components - element, construct, concept, item, helix, dimension etc. (detailed description).

    Stakeholders

    What stakeholders (e.g., governments, citizens, businesses, Non-Governmental Organisations (NGOs) etc.) does the public data ecosystem involve?

    Actors and their roles

    What actors does the public data ecosystem involve? What are their roles?

    Data (data types, data dynamism, data categories etc.)

    What data do the public data ecosystem cover (is intended / designed for)? Refer to all data-related aspects, including but not limited to data types, data dynamism (static data, dynamic, real-time data, stream), prevailing data categories / domains / topics etc.

    Processes / activities / dimensions, data lifecycle phases

    What processes, activities, dimensions and data lifecycle phases (e.g., locate, acquire, download, reuse, transform, etc.) does the public data ecosystem involve or refer to?

    Level (if relevant)

    What is the level of the public data ecosystem covered in the paper? (e.g., city, municipal, regional, national (=country), supranational, international).

    Other elements or relationships (if any)

    What other elements or relationships does the public data ecosystem consist of?

    Additional comments

    Additional comments (e.g., what other topics affected the public data ecosystems and their elements, what is expected to affect the public data ecosystems in the future, what were important topics by which the period was characterised etc.).

    New papers

    Does the study refer to any other potentially relevant papers?

    Additional references to potentially relevant papers that were found in the analysed paper (snowballing).

    Format of the file.xls, .csv (for the first spreadsheet only), .docx

    Licenses or restrictionsCC-BY

    For more info, see README.txt

  9. n

    Data from: The species versus subspecies conundrum: quantitative...

    • data.niaid.nih.gov
    • datadryad.org
    zip
    Updated Dec 16, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jen-Pan Huang; L. Lacey Knowles (2015). The species versus subspecies conundrum: quantitative delimitation from integrating multiple data types within a single Bayesian approach in Hercules beetles [Dataset]. http://doi.org/10.5061/dryad.8p6m0
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 16, 2015
    Dataset provided by
    University of Michigan
    Authors
    Jen-Pan Huang; L. Lacey Knowles
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    With the recent attention and focus on quantitative methods for species delimitation, an overlooked but equally important issue regards what has actually been delimited. This study investigates the apparent arbitrariness of some taxonomic distinctions, and in particular how species and subspecies are assigned. Specifically, we use a recently developed Bayesian model-based approach to show that in the Hercules beetles (genus Dynastes) there is no statistical difference in the probability that putative taxa represent different species, irrespective of whether they were given species or subspecies designations. By considering multiple data types, as opposed to relying exclusively on genetic data alone, we also show that both previously recognized species and subspecies represent a variety of points along the speciation spectrum (i.e., previously recognized species are not systematically further along the continuum than subspecies). For example, based on evolutionary models of divergence, some taxa are statistically distinguishable on more than one axis of differentiation (e.g., along both phenotypic and genetic dimensions), whereas other taxa can only be delimited statistically from a single data type. Because both phenotypic and genetic data are analyzed in a common Bayesian framework, our study provides a framework for investigating whether disagreements in species boundaries among data types reflect (i) actual discordance with the actual history of lineage splitting, or instead (ii) differences among data types in the amount of time required for differentiation to become apparent among the delimited taxa. We discuss what the answers to these questions imply about what characters are used to delimit species, as well as the diverse processes involved in the origin and maintenance of species boundaries. With this in mind, we then reflect more generally on how quantitative methods for species delimitation are used to assign taxonomic status.

  10. Clustering of samples and variables with mixed-type data

    • plos.figshare.com
    tiff
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuela Hummel; Dominic Edelmann; Annette Kopp-Schneider (2023). Clustering of samples and variables with mixed-type data [Dataset]. http://doi.org/10.1371/journal.pone.0188274
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Manuela Hummel; Dominic Edelmann; Annette Kopp-Schneider
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of data measured on different scales is a relevant challenge. Biomedical studies often focus on high-throughput datasets of, e.g., quantitative measurements. However, the need for integration of other features possibly measured on different scales, e.g. clinical or cytogenetic factors, becomes increasingly important. The analysis results (e.g. a selection of relevant genes) are then visualized, while adding further information, like clinical factors, on top. However, a more integrative approach is desirable, where all available data are analyzed jointly, and where also in the visualization different data sources are combined in a more natural way. Here we specifically target integrative visualization and present a heatmap-style graphic display. To this end, we develop and explore methods for clustering mixed-type data, with special focus on clustering variables. Clustering of variables does not receive as much attention in the literature as does clustering of samples. We extend the variables clustering methodology by two new approaches, one based on the combination of different association measures and the other on distance correlation. With simulation studies we evaluate and compare different clustering strategies. Applying specific methods for mixed-type data proves to be comparable and in many cases beneficial as compared to standard approaches applied to corresponding quantitative or binarized data. Our two novel approaches for mixed-type variables show similar or better performance than the existing methods ClustOfVar and bias-corrected mutual information. Further, in contrast to ClustOfVar, our methods provide dissimilarity matrices, which is an advantage, especially for the purpose of visualization. Real data examples aim to give an impression of various kinds of potential applications for the integrative heatmap and other graphical displays based on dissimilarity matrices. We demonstrate that the presented integrative heatmap provides more information than common data displays about the relationship among variables and samples. The described clustering and visualization methods are implemented in our R package CluMix available from https://cran.r-project.org/web/packages/CluMix.

  11. o

    Individual Level Data - Quantitative Metadata - weeklysurveys_oh - Dataset -...

    • hcs.chrr.ohio-state.edu
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The citation is currently not available for this dataset.
    Explore at:
    Dataset updated
    Sep 22, 2025
    Description

    Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Questionnaire Temporal Features: Other Repeated Measures Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Behavioral Data Topics: Drug Use, Fentanyl Test Strips

  12. o

    Individual Level Data - Quantitative Metadata - consort_oh - Dataset - CHRR...

    • hcs.chrr.ohio-state.edu
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The citation is currently not available for this dataset.
    Explore at:
    Dataset updated
    Sep 22, 2025
    Description

    Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Tracker Temporal Features: One-Time Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Participant Screening Topics: Drug Use, Fentanyl Test Strips

  13. D

    Replication data for: Measure schematicity through information content: A...

    • dataverse.no
    • search.dataone.org
    csv, txt
    Updated Sep 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Liulin Zhang; Liulin Zhang (2025). Replication data for: Measure schematicity through information content: A quantitative approach to grammaticalization [Dataset]. http://doi.org/10.18710/APTUHA
    Explore at:
    csv(2222436), txt(9134), csv(4911848)Available download formats
    Dataset updated
    Sep 3, 2025
    Dataset provided by
    DataverseNO
    Authors
    Liulin Zhang; Liulin Zhang
    License

    https://dataverse.no/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18710/APTUHAhttps://dataverse.no/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18710/APTUHA

    Time period covered
    0700 - 0900
    Area covered
    China
    Description

    This is a study to propose a quantitative method to compute the schematicity of constructions, which is a key indicator of the level of grammaticalization of morphemes. In this method, to estimate the schematicity of a schema made up of two morphemes, i.e., X_ (X is the target morpheme and _ represents an open slot), we need to know the total token frequency of all types of X_, and the token frequencies of all kinds of elements occurring in the open slot. For example, if we are interested in the schematicity of “_ment”. We need to know the total token frequency of “_ment”, which is the sum of the frequencies of “shipment”, “equipment”, “employment”, “appointment” … (all types of “_ment”). We also need to know the token frequencies of “ship”, “equip”, “employ”, “appoint” … (all types of elements occurring in the open slot). Therefore, the data are morpheme bigrams (2-gram) generated from the English and Chinese corpora showing what morphemes can each morpheme combine with, together with the token frequency of each bigram, and the token frequencies of its two components respectively.

  14. d

    Intertidal species inventory and quantitative data collected on Oahu and...

    • catalog.data.gov
    • search.dataone.org
    • +1more
    Updated Mar 10, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    US National Oceanographic Data Center (Point of Contact) (2021). Intertidal species inventory and quantitative data collected on Oahu and Maui, Islands of Hawaii during 2001 - 2005 (NODC Accession 0002447) [Dataset]. https://catalog.data.gov/dataset/intertidal-species-inventory-and-quantitative-data-collected-on-oahu-and-maui-islands-of-h
    Explore at:
    Dataset updated
    Mar 10, 2021
    Dataset provided by
    US National Oceanographic Data Center (Point of Contact)
    Area covered
    Maui, Hawaiian Islands, Hawaii, O‘ahu
    Description

    Two types of data sets generated by our project: species inventories and quantitative counts of key organisms. The species inventories are a compilation of data collected by Chela Zabin of the Department of Zoology of the University of Hawaii in 2001 and by Zabin with the assistance of Erin Baumgartner's 9th grade Marine Science class at the Education Laboratory School in 2003, 2004 and 2005, through a National Science Foundation Graduate Teaching Fellowship. Each site was visited only once each year: by 50 students in 2003 and by 25 students in 2004 and 2005.

  15. Josh Richards' Tweets

    • kaggle.com
    zip
    Updated Dec 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2022). Josh Richards' Tweets [Dataset]. https://www.kaggle.com/datasets/thedevastator/josh-richards-twitter-performance-content-engage
    Explore at:
    zip(128323 bytes)Available download formats
    Dataset updated
    Dec 21, 2022
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Josh Richards' Tweets

    A Quantitative Study

    By Twitter [source]

    About this dataset

    This dataset presents an unprecedented look into the digital communication habits of Josh Richards - one of Twitter's most popular and influential users. Analyzing his tweets from the last several weeks, we can gain a comprehensive understanding of how he shapes content, engages with his followers, and links to outside sources. The data covers everything from the type of media posted to the level of engagement generated by each tweet - making it an invaluable resource for anyone interested in exploring how Josh Richards crafts his online presence. Uncover the strategies behind his remarkable impact on social media by taking a closer look at this exciting dataset!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset provides an in-depth look into the posts of Josh Richards, a well-known social media celebrity, on Twitter. The dataset includes columns such as the post's content, the type of media used (images, videos and text), metrics related to engagement (likes, retweets and a measure of engagement rate) and external links shared by Josh. To glean insights from this data you can conduct descriptive analysis on all columns to get an overview of what type of content he typically posts on Twitter. Additionally, you can perform correlation analysis to identify any relationships between different variables or formats (e.g. does Josh receive more likes when he uses images or videos?). Moreover, you can also use these data for predictive purposes by attempting to predict what type of content will engage his audience most based on past performance metrics such as likes and RTs per post

    Research Ideas

    • One clever idea that can be used with this dataset is to identify the types of content and media that are most effective in engaging Josh Richards' followers. Through analyzing the level of engagement with different post types and media, marketers can then use this information to craft more effective campaigns when targeting similar audiences.
    • This dataset could be used to analyze how successful external links shared by Josh Richards are in driving traffic to other websites. By studying whether certain types of links (e.g., posts linking to video content) tend to generate more clicks than others, marketers could adjust their strategies accordingly when crafting content for similar audiences.
    • A further use for this dataset involves studying the impact that posting frequency has on engagement levels with Josh Richard's posts, as well as on external link click-through activity from his followers. This knowledge could provide valuable insights into how consistently social media accounts must post content in order to maximize user engagement and drive traffic from followers elsewhere online

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

    Columns

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Twitter.

  16. Data from: BiasCorrector: fast and accurate correction of all types of...

    • zenodo.org
    zip
    Updated May 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lorenz A. Kapsner; Lorenz A. Kapsner; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev (2023). BiasCorrector: fast and accurate correction of all types of experimental biases in quantitative DNA methylation data derived by different technologies [Dataset]. http://doi.org/10.5281/zenodo.4310973
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 16, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Lorenz A. Kapsner; Lorenz A. Kapsner; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset includes the data of the experiments used for the manuscript "BiasCorrector: fast and accurate correction of all types of experimental biases in quantitative DNA methylation data derived by different technologies".

  17. d

    Data from: Qualitative and quantitative surveys of native freshwater mussels...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Oct 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Qualitative and quantitative surveys of native freshwater mussels in the upper and middle Delaware River (2000-2002) [Dataset]. https://catalog.data.gov/dataset/qualitative-and-quantitative-surveys-of-native-freshwater-mussels-in-the-upper-and-mi-2000
    Explore at:
    Dataset updated
    Oct 22, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Delaware River
    Description

    Data is included for two types of field surveys conducted for freshwater mussels in the mainstem of the middle and upper Delaware River in the Mid-Atlantic region of the United States from 2000-2002. Timed search (qualitative) surveys were conducted during 2000-2001 from a point at the confluence of the East and West Branches of the Delaware River near Hancock, NY continuously downstream to a point at the mouth of the Paulins Kill River near Columbia, NJ. In this qualitative survey, mussel species and counts were collected in the field catch-per-unit-effort (CPUE) data was determined for all mussel species within each of 1,095 consecutive stream sections ~200 m in length. Subsequent quantitative surveys were conducted in select 200-m sections of river using quadrats during 2002 in order to estimate abundance and density of mussel present in these sections. One Excel file contains data from qualitative surveys, and a second excel file contains data from quantitative quadrat surveys.

  18. Data from: A Quantitative Comparison of Cell-Type-Specific Microarray Gene...

    • scholarworks.brandeis.edu
    • plos.figshare.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Benjamin W Okaty; Ken Sugino; Sacha Nelson, A Quantitative Comparison of Cell-Type-Specific Microarray Gene Expression Profiling Methods in the Mouse Brain [Dataset]. https://scholarworks.brandeis.edu/esploro/outputs/dataset/A-Quantitative-Comparison-of-Cell-Type-Specific-Microarray/9924086330201921
    Explore at:
    Dataset provided by
    figshare
    Authors
    Benjamin W Okaty; Ken Sugino; Sacha Nelson
    Time period covered
    Jan 1, 2016
    Description

    Expression profiling of restricted neural populations using microarrays can facilitate neuronal classification and provide insight into the molecular bases of cellular phenotypes. Due to the formidable heterogeneity of intermixed cell types that make up the brain, isolating cell types prior to microarray processing poses steep technical challenges that have been met in various ways. These methodological differences have the potential to distort cell-type-specific gene expression profiles insofar as they may insufficiently filter out contaminating mRNAs or induce aberrant cellular responses not normally present in vivo. Thus we have compared the repeatability, susceptibility to contamination from off-target cell-types, and evidence for stress-responsive gene expression of five different purification methods - Laser Capture Microdissection (LCM), Translating Ribosome Affinity Purification (TRAP), Immunopanning (PAN), Fluorescence Activated Cell Sorting (FACS), and manual sorting of fluorescently labeled cells (Manual). We found that all methods obtained comparably high levels of repeatability, however, data from LCM and TRAP showed significantly higher levels of contamination than the other methods. While PAN samples showed higher activation of apoptosis-related, stress-related and immediate early genes, samples from FACS and Manual studies, which also require dissociated cells, did not. Given that TRAP targets actively translated mRNAs, whereas other methods target all transcribed mRNAs, observed differences may also reflect translational regulation.

  19. Z

    Dataset: Publication cultures and Dutch research output: a quantitative...

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kramer, Bianca; Bosman, Jeroen (2020). Dataset: Publication cultures and Dutch research output: a quantitative assessment [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2643366
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Utrecht University Library
    Authors
    Kramer, Bianca; Bosman, Jeroen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset belonging to the report: Publication cultures and Dutch research output: a quantitative assessment

    On the report:

    Research into publication cultures commissioned by VSNU and carried out by Utrecht University Library has detailed university output beyond just journal articles, as well as the possibilities to assess open access levels of these other output types. For all four main fields reported on, the use of publication types other than journal articles is indeed substantial. For Social Sciences and Arts & Humanities in particular (with over 40% and over 60% of output respectively not being regular journal articles) looking at journal articles only ignores a significant share of their contribution to research and society. This is not only about books and book chapters, either: book reviews, conference papers, reports, case notes (in law) and all kinds of web publications are also significant parts of university output.

    Analyzing all these publication forms and especially determining to what extent they are open access is currently not easy. Even combining some the largest citation databases (Web of Science, Scopus and Dimensions) leaves out a lot of non-article content and in some fields even journal articles are only partly covered. Lacking metadata like affiliations and DOIs (either in the original documents or in the scholarly search engines) makes it even harder to analyze open access levels by institution and field. Using repository-harvesting databases like BASE and NARCIS in addition to the main citation databases improves understanding of open access of non-article output, but these routes also have limitations. The report has recommendations for stakeholders, mostly to improve metadata and coverage and apply persistent identifiers.

  20. o

    Data from: CLARISSA Cash Plus Social Protection intervention: quantitative...

    • ordo.open.ac.uk
    png
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neil Howard; Keetie Roelen; Giel Ton; Mauricio Espinoza; Afrin Aktar; Saklain Al Mamum (2024). CLARISSA Cash Plus Social Protection intervention: quantitative and qualitative data [Dataset]. http://doi.org/10.21954/ou.rd.26061106.v3
    Explore at:
    pngAvailable download formats
    Dataset updated
    Dec 16, 2024
    Dataset provided by
    The Open University
    Authors
    Neil Howard; Keetie Roelen; Giel Ton; Mauricio Espinoza; Afrin Aktar; Saklain Al Mamum
    License

    Attribution-ShareAlike 2.0 (CC BY-SA 2.0)https://creativecommons.org/licenses/by-sa/2.0/
    License information was derived automatically

    Description

    The CLARISSA Cash Plus intervention represented an innovative social protection scheme for tackling social ills, including the worst forms of child labour (WFCL). A universal and unconditional ‘cash plus’ programme, it combined community mobilisation, case work, and cash transfers (CTs). It was implemented in a high-density, low-income neighbourhood in Dhaka to build individual, family, and group capacities to meet needs. This, in turn, was expected to lead to a corresponding decrease in deprivation and community-identified social issues that negatively affect wellbeing, including WFCL. Four principles underpinned the intervention: Unconditionality, Universality, Needs-centred and people-led, and Emergent and open-ended.The intervention took place in Dhaka – North Gojmohol – over a 27-month period, between October 2021 and December 2023, to test and study the impact of providing unconditional and people‑led support to everyone in a community. Cash transfers were provided between January and June 2023 in monthly instalments, plus one investment transfer in September 2023. A total of 1,573 households received cash, through the Upay mobile financial service. Cash was complemented by a ‘plus’ component, implemented between October 2021 and December 2023. Referred to as relational needs-based community organising (NBCO), a team of 20 community mobilisers (CMs) delivered case work at the individual and family level and community mobilisation at the group level. The intervention was part of the wider CLARISSA programme, led by the Institute of Development Studies (IDS) and funded by UK’s Foreign, Commonwealth & Development Office (FCDO). The intervention was implemented by Terre des hommes (Tdh) in Bangladesh and evaluated in collaboration with the BRAC Institute of Governance and Development (BIGD) and researchers from the University of Bath and the Open University, UK.The evaluation of the CLARISSA Social Protection pilot was rooted in contribution analysis that combined multiple methods over more than three years in line with emerging best practice guidelines for mixed methods research on children, work, and wellbeing. Quantitative research included bi-monthly monitoring surveys administered by the project’s community mobilisers (CMs), including basic questions about wellbeing, perceived economic resilience, school attendance, etc. This was complimented by baseline, midline, and endline surveys, which collected information about key outcome indicators within the sphere of influence of the intervention, such as children’s engagement with different forms of work and working conditions, with schooling and other activities, household living conditions and sources of income, and respondents’ perceptions of change. Qualitative tools were used to probe topics and results of interest, as well as impact pathways. These included reflective diaries written by the community mobilisers; three rounds of focus group discussions (FGDs) with community members; three rounds of key informant interviews (KIIs) with members of case study households; and long-term ethnographic observation.Quantitative DataThe quantitative evaluation of the CLARISSA Cash Plus intervention involved several data collection methods to gather information about household living standards, children’s education and work, and social dynamics. The data collection included a pre-intervention census, four periodic surveys, and 13 rounds of bi-monthly monitoring surveys, all conducted between late 2020 and late 2023. Details of each instrument are as follows:Census: Conducted in October/November 2020 in the target neighbourhood of North Gojmohol (n=1,832) and the comparison neighbourhood of Balurmath (n=2,365)Periodic surveys: Baseline (February 2021, n=752 in North Gojmohol), Midline 1 (before cash) (October 2022, n=771 in North Gojmohol), Midline 2 (after 6 rounds of cash) (July 2023, n=769 in North Gojmohol), and Endline (December 2023, n=750 in North Gojmohol and n=773 in Balumath)Bi-monthly monitoring data (13 rounds): Conducted between December 2021 and December 2023 in North Gojmohol (average of 1,400 households per round)The present repository summarizes this information, organized as follows:1.1 Bimonthly survey (household): Panel dataset comprising 13 rounds of bi-monthly monitoring data at the household level (average of 1,400 households per round, total of 18,379 observations)1.2 Bimonthly survey (child): Panel dataset comprising 13 rounds of bi-monthly monitoring data at the child level (aged 5 to 16 at census) (average of 940 children per round, total of 12,213 observations)2.1 Periodic survey (household): Panel dataset comprising 5 periodic surveys (census, baseline, midline 1, midline 2, endline) at the household level (average of 750 households per period, total of 3,762 observations)2.2 Periodic survey (child): Panel dataset comprising 4 periodic surveys (baseline, midline 1, midline 2, endline) at the child level (average of 3,100 children per period, total of 12,417 observations)3.0 Balurmat - North Gojmohol panel: Balanced panel dataset comprising 558 households in North Gojmohol and 773 households in Balurmath, observed both at 2020 census and 2023 endline (total of 2,662 observations)4.0 Questionnaires: Original questionnaires for all datasetsAll datasets are provided in Stata format (.dta) and Excel format (.xlsx) and are accompanied by their respective dictionary in Excel format (.xlsx).Qualitative DataThe qualitative study was conducted in three rounds: the first round of IDIs and FGDs took place between December 2022 and January 2023; the second round took place from April to May 2023; and the third round took place from November to December 2023. KIIs were taken during the 2nd round of study in May 2023.The sample size by round and instrument type is shown below:RoundsIDIs with childrenIDIs with parentsIDIs with CMsFGDsKIIs1st Round (12/2022 – 01/2023)3026-06-2nd Round ( 04/2023 – 05/2023)3023-06053rd Round (11/2023 – 12/2023)26250307-The files in this archive contain the qualitative data and include six types of transcripts:· 1.1 Interviews with children in case study households (IDI): 30 families in round 1, 30 in round 2, and 26 in round 3· 1.2 Interviews with parents in case study households (IDI): 26 families in round 1, 23 in round 2, and 25 in round 3· 1.3 Interviews with community mobiliser (IDI): 3 CM in round 3· 2.0 Key informant interviews (KII): 5 in round 2· 3.0 Focus group discussions (FGD): 6 in round 1, 6 in round 2, and 7 in round 3· 4.0 Community mobiliser micro-narratives (556 cases)Additionally, this repository includes a comprehensive list of all qualitative data files ("List of all qualitative data+MC.xlsx").

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid (2023). The primary data types collected. [Dataset]. http://doi.org/10.1371/journal.pone.0265257.t002
Organization logo

The primary data types collected.

Related Article
Explore at:
7 scholarly articles cite this dataset (View in Google Scholar)
xlsAvailable download formats
Dataset updated
Jun 15, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The primary data types collected.

Search
Clear search
Close search
Google apps
Main menu