100+ datasets found

The primary data types collected.
figshare.com
plos.figshare.com
xls
Updated Jun 15, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid (2023). The primary data types collected. [Dataset]. http://doi.org/10.1371/journal.pone.0265257.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0265257.t002
Dataset updated
Jun 15, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The primary data types collected.

Global Electronic Data Capture Tool Market Research Report: By Application...

wiseguyreports.com

Updated Oct 14, 2025

+ more versions

Facebook

Twitter

Click to copy link

Link copied

Cite

(2025). Global Electronic Data Capture Tool Market Research Report: By Application (Clinical Trials, Pharmaceutical Research, Market Research, Survey Data Collection), By Deployment Type (Cloud-Based, On-Premises, Hybrid), By End User (Healthcare Organizations, Academic Institutions, Market Research Firms, Government Bodies), By Data Type (Qualitative Data, Quantitative Data, Mixed Data) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2035 [Dataset]. https://www.wiseguyreports.com/reports/electronic-data-capture-tool-market

Explore at:

Dataset updated

Oct 14, 2025

License

https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

Time period covered

Oct 25, 2025

Area covered

Global

Description

BASE YEAR	2024
HISTORICAL DATA	2019 - 2023
REGIONS COVERED	North America, Europe, APAC, South America, MEA
REPORT COVERAGE	Revenue Forecast, Competitive Landscape, Growth Factors, and Trends
MARKET SIZE 2024	4.37(USD Billion)
MARKET SIZE 2025	4.71(USD Billion)
MARKET SIZE 2035	10.0(USD Billion)
SEGMENTS COVERED	Application, Deployment Type, End User, Data Type, Regional
COUNTRIES COVERED	US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA
KEY MARKET DYNAMICS	growing demand for clinical trials, increasing adoption of electronic solutions, rising regulatory compliance needs, need for real-time data access, growing focus on patient engagement
MARKET FORECAST UNITS	USD Billion
KEY COMPANIES PROFILED	Castor, SAS Institute, CureClick, Veeva Systems, RedCap, WCG Clinical, ERT, Medidata Solutions, Parexel International, Appian, Advarra, CRF Health, IBM, ClinCapture, Oracle
MARKET FORECAST PERIOD	2025 - 2035
KEY MARKET OPPORTUNITIES	Increased demand for remote trials, Adoption of cloud-based solutions, Growing need for regulatory compliance, Rising use in clinical research, Expansion in emerging markets
COMPOUND ANNUAL GROWTH RATE (CAGR)	7.8% (2025 - 2035)

t
Which of the Five Types of Data Science Does Your Startup Need? - Data...
tomtunguz.com
Updated Oct 2, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tomasz Tunguz (2013). Which of the Five Types of Data Science Does Your Startup Need? - Data Analysis [Dataset]. https://tomtunguz.com/data-science-types/
Explore at:
Dataset updated
Oct 2, 2013
Dataset provided by
Theory Ventures
Authors
Tomasz Tunguz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Discover the 5 distinct types of data scientists your startup needs, from quantitative PhDs to operational analysts. Learn which role best fits your company's growth stage.
o
Individual Level Data - Quantitative Metadata - baselinesurvey_oh
hcs.chrr.ohio-state.edu
Updated Sep 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The citation is currently not available for this dataset.
Explore at:
Dataset updated
Sep 22, 2025
Description
Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Questionnaire Temporal Features: One-Time Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Behavioral Data, Demographics Topics: Drug Use, Fentanyl Test Strips
d
Dataplex: FDA Medical Device Data | 24M+ Rows of Key Device Product Data for...
datarade.ai
.csv
Updated Aug 12, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dataplex (2024). Dataplex: FDA Medical Device Data | 24M+ Rows of Key Device Product Data for Research & Analysis [Dataset]. https://datarade.ai/data-products/dataplex-fda-medical-device-data-24m-rows-of-key-device-i-dataplex
Explore at:
.csvAvailable download formats
Dataset updated
Aug 12, 2024
Dataset authored and provided by
Dataplex
Area covered
United States of America
Description
The FDA Device Dataset by Dataplex provides comprehensive access to over 24 million rows of detailed information, covering 9 key data types essential for anyone involved in the medical device industry. Sourced directly from the U.S. Food and Drug Administration (FDA), this dataset is a critical resource for regulatory compliance, market analysis, and product safety assessment regarding.

Dataset Overview:

This dataset includes data on medical device registrations, approvals, recalls, and adverse events, among other crucial aspects. The dataset is meticulously cleaned and structured to ensure that it meets the needs of researchers, regulatory professionals, and market analysts.

24 Million Rows of Data:

With over 24 million rows, this dataset offers an extensive view of the regulatory landscape for medical devices. It includes data types such as classification, event, enforcement, 510k, registration listings, recall, PMA, UDI, and covid19 serology. This wide range of data types allows users to perform granular analysis on a broad spectrum of device-related topics.

Sourced from the FDA:

All data in this dataset is sourced directly from the FDA, ensuring that it is accurate, up-to-date, and reliable. Regular updates ensure that the dataset remains current, reflecting the latest in device approvals, clearances, and safety reports.

Key Features:

Comprehensive Coverage: Includes 9 key device data types, such as 510(k) clearances, premarket approvals, device classifications, and adverse event reports.

Regulatory Compliance: Provides detailed information necessary for tracking compliance with FDA regulations, including device recalls and enforcement actions.

Market Analysis: Analysts can utilize the dataset to assess market trends, monitor competitor activities, and track the introduction of new devices.

Product Safety Analysis: Researchers can analyze adverse event reports and device recalls to evaluate the safety and performance of medical devices.

Use Cases: - Regulatory Compliance: Ensure your devices meet FDA standards, monitor compliance trends, and stay informed about regulatory changes.

Market Research: Identify trends in the medical device market, track new device approvals, and analyze competitive landscapes with up-to-date and historical data.

Product Safety: Assess the safety and performance of medical devices by examining detailed adverse event reports and recall data.

Data Quality and Reliability:

The FDA Device Dataset prioritizes data quality and reliability. Each record is meticulously sourced from the FDA's official databases, ensuring that the information is both accurate and up-to-date. This makes the dataset a trusted resource for critical applications, where data accuracy is vital.

Integration and Usability:

The dataset is provided in CSV format, making it compatible with most data analysis tools and platforms. Users can easily import, analyze, and utilize the data for various applications, from regulatory reporting to market analysis.

User-Friendly Structure and Metadata:

The data is organized for easy navigation, with clear metadata files included to help users identify relevant records. The dataset is structured by device type, approval and clearance processes, and adverse event reports, allowing for efficient data retrieval and analysis.

Ideal For:

Regulatory Professionals: Monitor FDA compliance, track regulatory changes, and prepare for audits with comprehensive and up-to-date product data.

Market Analysts: Conduct detailed research on market trends, assess new device entries, and analyze competitive dynamics with extensive FDA data.

Healthcare Researchers: Evaluate the safety and efficacy of medical devices product data, identify potential risks, and contribute to improved patient outcomes through detailed analysis.

This dataset is an indispensable resource for anyone involved in the medical device industry, providing the data and insights necessary to drive informed decisions and ensure compliance with FDA regulations.
q
Intro to Data Types and Graphing Lab
qubeshub.org
Updated Oct 12, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stephanie Spera (2020). Intro to Data Types and Graphing Lab [Dataset]. http://doi.org/10.25334/1XYA-TF48
Explore at:
Unique identifier
https://doi.org/10.25334/1XYA-TF48
Dataset updated
Oct 12, 2020
Dataset provided by
QUBES
Authors
Stephanie Spera
Description
This is the third lab in an Introductory Physical Geography/Environmental Studies course. It introduces students to different data types (qualitative vs quantitative), basic statistical analyses (correlation analysis s, t-test), and graphing techniques.
o
Individual Level Data - Quantitative Metadata - screenersurvey_oh
hcs.chrr.ohio-state.edu
Updated Sep 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The citation is currently not available for this dataset.
Explore at:
Dataset updated
Sep 22, 2025
Description
Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Questionnaire Temporal Features: One-Time Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Participant Screening Topics: Drug Use, Fentanyl Test Strips
Z
Conceptualization of public data ecosystems
data.niaid.nih.gov
Updated Sep 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anastasija, Nikiforova; Martin, Lnenicka (2024). Conceptualization of public data ecosystems [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13842001
Explore at:
Dataset updated
Sep 26, 2024
Dataset provided by
University of Tartu
University of Hradec Králové
Authors
Anastasija, Nikiforova; Martin, Lnenicka
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains data collected during a study "Understanding the development of public data ecosystems: from a conceptual model to a six-generation model of the evolution of public data ecosystems" conducted by Martin Lnenicka (University of Hradec Králové, Czech Republic), Anastasija Nikiforova (University of Tartu, Estonia), Mariusz Luterek (University of Warsaw, Warsaw, Poland), Petar Milic (University of Pristina - Kosovska Mitrovica, Serbia), Daniel Rudmark (Swedish National Road and Transport Research Institute, Sweden), Sebastian Neumaier (St. Pölten University of Applied Sciences, Austria), Karlo Kević (University of Zagreb, Croatia), Anneke Zuiderwijk (Delft University of Technology, Delft, the Netherlands), Manuel Pedro Rodríguez Bolívar (University of Granada, Granada, Spain).

As there is a lack of understanding of the elements that constitute different types of value-adding public data ecosystems and how these elements form and shape the development of these ecosystems over time, which can lead to misguided efforts to develop future public data ecosystems, the aim of the study is: (1) to explore how public data ecosystems have developed over time and (2) to identify the value-adding elements and formative characteristics of public data ecosystems. Using an exploratory retrospective analysis and a deductive approach, we systematically review 148 studies published between 1994 and 2023. Based on the results, this study presents a typology of public data ecosystems and develops a conceptual model of elements and formative characteristics that contribute most to value-adding public data ecosystems, and develops a conceptual model of the evolutionary generation of public data ecosystems represented by six generations called Evolutionary Model of Public Data Ecosystems (EMPDE). Finally, three avenues for a future research agenda are proposed.

This dataset is being made public both to act as supplementary data for "Understanding the development of public data ecosystems: from a conceptual model to a six-generation model of the evolution of public data ecosystems ", Telematics and Informatics*, and its Systematic Literature Review component that informs the study.

Description of the data in this data set

PublicDataEcosystem_SLR provides the structure of the protocol

Spreadsheet#1 provides the list of results after the search over three indexing databases and filtering out irrelevant studies

Spreadsheets #2 provides the protocol structure.

Spreadsheets #3 provides the filled protocol for relevant studies.

The information on each selected study was collected in four categories:(1) descriptive information,(2) approach- and research design- related information,(3) quality-related information,(4) HVD determination-related information

Descriptive Information

Article number

A study number, corresponding to the study number assigned in an Excel worksheet

Complete reference

The complete source information to refer to the study (in APA style), including the author(s) of the study, the year in which it was published, the study's title and other source information.

Year of publication

The year in which the study was published.

Journal article / conference paper / book chapter

The type of the paper, i.e., journal article, conference paper, or book chapter.

Journal / conference / book

Journal article, conference, where the paper is published.

DOI / Website

A link to the website where the study can be found.

Number of words

A number of words of the study.

Number of citations in Scopus and WoS

The number of citations of the paper in Scopus and WoS digital libraries.

Availability in Open Access

Availability of a study in the Open Access or Free / Full Access.

Keywords

Keywords of the paper as indicated by the authors (in the paper).

Relevance for our study (high / medium / low)

What is the relevance level of the paper for our study

Approach- and research design-related information

Approach- and research design-related information

Objective / Aim / Goal / Purpose & Research Questions

The research objective and established RQs.

Research method (including unit of analysis)

The methods used to collect data in the study, including the unit of analysis that refers to the country, organisation, or other specific unit that has been analysed such as the number of use-cases or policy documents, number and scope of the SLR etc.

Study’s contributions

The study’s contribution as defined by the authors

Qualitative / quantitative / mixed method

Whether the study uses a qualitative, quantitative, or mixed methods approach?

Availability of the underlying research data

Whether the paper has a reference to the public availability of the underlying research data e.g., transcriptions of interviews, collected data etc., or explains why these data are not openly shared?

Period under investigation

Period (or moment) in which the study was conducted (e.g., January 2021-March 2022)

Use of theory / theoretical concepts / approaches? If yes, specify them

Does the study mention any theory / theoretical concepts / approaches? If yes, what theory / concepts / approaches? If any theory is mentioned, how is theory used in the study? (e.g., mentioned to explain a certain phenomenon, used as a framework for analysis, tested theory, theory mentioned in the future research section).

Quality-related information

Quality concerns

Whether there are any quality concerns (e.g., limited information about the research methods used)?

Public Data Ecosystem-related information

Public data ecosystem definition

How is the public data ecosystem defined in the paper and any other equivalent term, mostly infrastructure. If an alternative term is used, how is the public data ecosystem called in the paper?

Public data ecosystem evolution / development

Does the paper define the evolution of the public data ecosystem? If yes, how is it defined and what factors affect it?

What constitutes a public data ecosystem?

What constitutes a public data ecosystem (components & relationships) - their "FORM / OUTPUT" presented in the paper (general description with more detailed answers to further additional questions).

Components and relationships

What components does the public data ecosystem consist of and what are the relationships between these components? Alternative names for components - element, construct, concept, item, helix, dimension etc. (detailed description).

Stakeholders

What stakeholders (e.g., governments, citizens, businesses, Non-Governmental Organisations (NGOs) etc.) does the public data ecosystem involve?

Actors and their roles

What actors does the public data ecosystem involve? What are their roles?

Data (data types, data dynamism, data categories etc.)

What data do the public data ecosystem cover (is intended / designed for)? Refer to all data-related aspects, including but not limited to data types, data dynamism (static data, dynamic, real-time data, stream), prevailing data categories / domains / topics etc.

Processes / activities / dimensions, data lifecycle phases

What processes, activities, dimensions and data lifecycle phases (e.g., locate, acquire, download, reuse, transform, etc.) does the public data ecosystem involve or refer to?

Level (if relevant)

What is the level of the public data ecosystem covered in the paper? (e.g., city, municipal, regional, national (=country), supranational, international).

Other elements or relationships (if any)

What other elements or relationships does the public data ecosystem consist of?

Additional comments

Additional comments (e.g., what other topics affected the public data ecosystems and their elements, what is expected to affect the public data ecosystems in the future, what were important topics by which the period was characterised etc.).

New papers

Does the study refer to any other potentially relevant papers?

Additional references to potentially relevant papers that were found in the analysed paper (snowballing).

Format of the file.xls, .csv (for the first spreadsheet only), .docx

Licenses or restrictionsCC-BY

For more info, see README.txt
n
Data from: The species versus subspecies conundrum: quantitative...
data.niaid.nih.gov
datadryad.org
zip
Updated Dec 16, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jen-Pan Huang; L. Lacey Knowles (2015). The species versus subspecies conundrum: quantitative delimitation from integrating multiple data types within a single Bayesian approach in Hercules beetles [Dataset]. http://doi.org/10.5061/dryad.8p6m0
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.8p6m0
Dataset updated
Dec 16, 2015
Dataset provided by
University of Michigan
Authors
Jen-Pan Huang; L. Lacey Knowles
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
With the recent attention and focus on quantitative methods for species delimitation, an overlooked but equally important issue regards what has actually been delimited. This study investigates the apparent arbitrariness of some taxonomic distinctions, and in particular how species and subspecies are assigned. Specifically, we use a recently developed Bayesian model-based approach to show that in the Hercules beetles (genus Dynastes) there is no statistical difference in the probability that putative taxa represent different species, irrespective of whether they were given species or subspecies designations. By considering multiple data types, as opposed to relying exclusively on genetic data alone, we also show that both previously recognized species and subspecies represent a variety of points along the speciation spectrum (i.e., previously recognized species are not systematically further along the continuum than subspecies). For example, based on evolutionary models of divergence, some taxa are statistically distinguishable on more than one axis of differentiation (e.g., along both phenotypic and genetic dimensions), whereas other taxa can only be delimited statistically from a single data type. Because both phenotypic and genetic data are analyzed in a common Bayesian framework, our study provides a framework for investigating whether disagreements in species boundaries among data types reflect (i) actual discordance with the actual history of lineage splitting, or instead (ii) differences among data types in the amount of time required for differentiation to become apparent among the delimited taxa. We discuss what the answers to these questions imply about what characters are used to delimit species, as well as the diverse processes involved in the origin and maintenance of species boundaries. With this in mind, we then reflect more generally on how quantitative methods for species delimitation are used to assign taxonomic status.
Clustering of samples and variables with mixed-type data
plos.figshare.com
tiff
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Manuela Hummel; Dominic Edelmann; Annette Kopp-Schneider (2023). Clustering of samples and variables with mixed-type data [Dataset]. http://doi.org/10.1371/journal.pone.0188274
Explore at:
tiffAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0188274
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Manuela Hummel; Dominic Edelmann; Annette Kopp-Schneider
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of data measured on different scales is a relevant challenge. Biomedical studies often focus on high-throughput datasets of, e.g., quantitative measurements. However, the need for integration of other features possibly measured on different scales, e.g. clinical or cytogenetic factors, becomes increasingly important. The analysis results (e.g. a selection of relevant genes) are then visualized, while adding further information, like clinical factors, on top. However, a more integrative approach is desirable, where all available data are analyzed jointly, and where also in the visualization different data sources are combined in a more natural way. Here we specifically target integrative visualization and present a heatmap-style graphic display. To this end, we develop and explore methods for clustering mixed-type data, with special focus on clustering variables. Clustering of variables does not receive as much attention in the literature as does clustering of samples. We extend the variables clustering methodology by two new approaches, one based on the combination of different association measures and the other on distance correlation. With simulation studies we evaluate and compare different clustering strategies. Applying specific methods for mixed-type data proves to be comparable and in many cases beneficial as compared to standard approaches applied to corresponding quantitative or binarized data. Our two novel approaches for mixed-type variables show similar or better performance than the existing methods ClustOfVar and bias-corrected mutual information. Further, in contrast to ClustOfVar, our methods provide dissimilarity matrices, which is an advantage, especially for the purpose of visualization. Real data examples aim to give an impression of various kinds of potential applications for the integrative heatmap and other graphical displays based on dissimilarity matrices. We demonstrate that the presented integrative heatmap provides more information than common data displays about the relationship among variables and samples. The described clustering and visualization methods are implemented in our R package CluMix available from https://cran.r-project.org/web/packages/CluMix.
o
Individual Level Data - Quantitative Metadata - weeklysurveys_oh - Dataset -...
hcs.chrr.ohio-state.edu
Updated Sep 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The citation is currently not available for this dataset.
Explore at:
Dataset updated
Sep 22, 2025
Description
Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Questionnaire Temporal Features: Other Repeated Measures Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Behavioral Data Topics: Drug Use, Fentanyl Test Strips
o
Individual Level Data - Quantitative Metadata - consort_oh - Dataset - CHRR...
hcs.chrr.ohio-state.edu
Updated Sep 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The citation is currently not available for this dataset.
Explore at:
Dataset updated
Sep 22, 2025
Description
Individual-level, quantitative data from PWUD participants in the Stay Safe Study. It covers screener, baseline, and repeated weekly surveys over a 4-week observation period. Data Type: Tracker Temporal Features: One-Time Primary Unit of Analysis: Community Members, People Who Use Drugs Counties: All or any Ohio counties Study Component: Stay Safe Study Data Primary Data Purpose: Participant Screening Topics: Drug Use, Fentanyl Test Strips
D
Replication data for: Measure schematicity through information content: A...
dataverse.no
search.dataone.org
csv, txt
Updated Sep 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Liulin Zhang; Liulin Zhang (2025). Replication data for: Measure schematicity through information content: A quantitative approach to grammaticalization [Dataset]. http://doi.org/10.18710/APTUHA
Explore at:
csv(2222436), txt(9134), csv(4911848)Available download formats
Unique identifier
https://doi.org/10.18710/APTUHA
Dataset updated
Sep 3, 2025
Dataset provided by
DataverseNO
Authors
Liulin Zhang; Liulin Zhang
License
https://dataverse.no/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18710/APTUHAhttps://dataverse.no/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18710/APTUHA
Time period covered
0700 - 0900
Area covered
China
Description
This is a study to propose a quantitative method to compute the schematicity of constructions, which is a key indicator of the level of grammaticalization of morphemes. In this method, to estimate the schematicity of a schema made up of two morphemes, i.e., X_ (X is the target morpheme and _ represents an open slot), we need to know the total token frequency of all types of X_, and the token frequencies of all kinds of elements occurring in the open slot. For example, if we are interested in the schematicity of “_ment”. We need to know the total token frequency of “_ment”, which is the sum of the frequencies of “shipment”, “equipment”, “employment”, “appointment” … (all types of “_ment”). We also need to know the token frequencies of “ship”, “equip”, “employ”, “appoint” … (all types of elements occurring in the open slot). Therefore, the data are morpheme bigrams (2-gram) generated from the English and Chinese corpora showing what morphemes can each morpheme combine with, together with the token frequency of each bigram, and the token frequencies of its two components respectively.
d
Intertidal species inventory and quantitative data collected on Oahu and...
catalog.data.gov
search.dataone.org
+1more
Updated Mar 10, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
US National Oceanographic Data Center (Point of Contact) (2021). Intertidal species inventory and quantitative data collected on Oahu and Maui, Islands of Hawaii during 2001 - 2005 (NODC Accession 0002447) [Dataset]. https://catalog.data.gov/dataset/intertidal-species-inventory-and-quantitative-data-collected-on-oahu-and-maui-islands-of-h
Explore at:
Dataset updated
Mar 10, 2021
Dataset provided by
US National Oceanographic Data Center (Point of Contact)
Area covered
Maui, Hawaiian Islands, Hawaii, O‘ahu
Description
Two types of data sets generated by our project: species inventories and quantitative counts of key organisms. The species inventories are a compilation of data collected by Chela Zabin of the Department of Zoology of the University of Hawaii in 2001 and by Zabin with the assistance of Erin Baumgartner's 9th grade Marine Science class at the Education Laboratory School in 2003, 2004 and 2005, through a National Science Foundation Graduate Teaching Fellowship. Each site was visited only once each year: by 50 students in 2003 and by 25 students in 2004 and 2005.
Josh Richards' Tweets
kaggle.com
zip
Updated Dec 21, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2022). Josh Richards' Tweets [Dataset]. https://www.kaggle.com/datasets/thedevastator/josh-richards-twitter-performance-content-engage
Explore at:
zip(128323 bytes)Available download formats
Dataset updated
Dec 21, 2022
Authors
The Devastator
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Josh Richards' Tweets

A Quantitative Study

By Twitter [source]

About this dataset

This dataset presents an unprecedented look into the digital communication habits of Josh Richards - one of Twitter's most popular and influential users. Analyzing his tweets from the last several weeks, we can gain a comprehensive understanding of how he shapes content, engages with his followers, and links to outside sources. The data covers everything from the type of media posted to the level of engagement generated by each tweet - making it an invaluable resource for anyone interested in exploring how Josh Richards crafts his online presence. Uncover the strategies behind his remarkable impact on social media by taking a closer look at this exciting dataset!

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset provides an in-depth look into the posts of Josh Richards, a well-known social media celebrity, on Twitter. The dataset includes columns such as the post's content, the type of media used (images, videos and text), metrics related to engagement (likes, retweets and a measure of engagement rate) and external links shared by Josh. To glean insights from this data you can conduct descriptive analysis on all columns to get an overview of what type of content he typically posts on Twitter. Additionally, you can perform correlation analysis to identify any relationships between different variables or formats (e.g. does Josh receive more likes when he uses images or videos?). Moreover, you can also use these data for predictive purposes by attempting to predict what type of content will engage his audience most based on past performance metrics such as likes and RTs per post

Research Ideas

One clever idea that can be used with this dataset is to identify the types of content and media that are most effective in engaging Josh Richards' followers. Through analyzing the level of engagement with different post types and media, marketers can then use this information to craft more effective campaigns when targeting similar audiences.

This dataset could be used to analyze how successful external links shared by Josh Richards are in driving traffic to other websites. By studying whether certain types of links (e.g., posts linking to video content) tend to generate more clicks than others, marketers could adjust their strategies accordingly when crafting content for similar audiences.

A further use for this dataset involves studying the impact that posting frequency has on engagement levels with Josh Richard's posts, as well as on external link click-through activity from his followers. This knowledge could provide valuable insights into how consistently social media accounts must post content in order to maximize user engagement and drive traffic from followers elsewhere online

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

Acknowledgements

If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Twitter.
Data from: BiasCorrector: fast and accurate correction of all types of...
zenodo.org
zip
Updated May 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lorenz A. Kapsner; Lorenz A. Kapsner; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev (2023). BiasCorrector: fast and accurate correction of all types of experimental biases in quantitative DNA methylation data derived by different technologies [Dataset]. http://doi.org/10.5281/zenodo.4310973
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4310973
Dataset updated
May 16, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Lorenz A. Kapsner; Lorenz A. Kapsner; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev; Mikhail G. Zavgorodnij; Svetlana P. Majorova; Oleg V. Kolychev; Igor N. Lebedev; Jörg D. Hoheisel; Arndt Hartmann; Sebastian Mate; Hans-Ulrich Prokosch; Florian Haller; Evgeny A. Moskalev
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The dataset includes the data of the experiments used for the manuscript "BiasCorrector: fast and accurate correction of all types of experimental biases in quantitative DNA methylation data derived by different technologies".
d
Data from: Qualitative and quantitative surveys of native freshwater mussels...
catalog.data.gov
data.usgs.gov
+1more
Updated Oct 22, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2025). Qualitative and quantitative surveys of native freshwater mussels in the upper and middle Delaware River (2000-2002) [Dataset]. https://catalog.data.gov/dataset/qualitative-and-quantitative-surveys-of-native-freshwater-mussels-in-the-upper-and-mi-2000
Explore at:
Dataset updated
Oct 22, 2025
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Delaware River
Description
Data is included for two types of field surveys conducted for freshwater mussels in the mainstem of the middle and upper Delaware River in the Mid-Atlantic region of the United States from 2000-2002. Timed search (qualitative) surveys were conducted during 2000-2001 from a point at the confluence of the East and West Branches of the Delaware River near Hancock, NY continuously downstream to a point at the mouth of the Paulins Kill River near Columbia, NJ. In this qualitative survey, mussel species and counts were collected in the field catch-per-unit-effort (CPUE) data was determined for all mussel species within each of 1,095 consecutive stream sections ~200 m in length. Subsequent quantitative surveys were conducted in select 200-m sections of river using quadrats during 2002 in order to estimate abundance and density of mussel present in these sections. One Excel file contains data from qualitative surveys, and a second excel file contains data from quantitative quadrat surveys.
Data from: A Quantitative Comparison of Cell-Type-Specific Microarray Gene...
scholarworks.brandeis.edu
plos.figshare.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Benjamin W Okaty; Ken Sugino; Sacha Nelson, A Quantitative Comparison of Cell-Type-Specific Microarray Gene Expression Profiling Methods in the Mouse Brain [Dataset]. https://scholarworks.brandeis.edu/esploro/outputs/dataset/A-Quantitative-Comparison-of-Cell-Type-Specific-Microarray/9924086330201921
Explore at:
Dataset provided by
figshare
Authors
Benjamin W Okaty; Ken Sugino; Sacha Nelson
Time period covered
Jan 1, 2016
Description
Expression profiling of restricted neural populations using microarrays can facilitate neuronal classification and provide insight into the molecular bases of cellular phenotypes. Due to the formidable heterogeneity of intermixed cell types that make up the brain, isolating cell types prior to microarray processing poses steep technical challenges that have been met in various ways. These methodological differences have the potential to distort cell-type-specific gene expression profiles insofar as they may insufficiently filter out contaminating mRNAs or induce aberrant cellular responses not normally present in vivo. Thus we have compared the repeatability, susceptibility to contamination from off-target cell-types, and evidence for stress-responsive gene expression of five different purification methods - Laser Capture Microdissection (LCM), Translating Ribosome Affinity Purification (TRAP), Immunopanning (PAN), Fluorescence Activated Cell Sorting (FACS), and manual sorting of fluorescently labeled cells (Manual). We found that all methods obtained comparably high levels of repeatability, however, data from LCM and TRAP showed significantly higher levels of contamination than the other methods. While PAN samples showed higher activation of apoptosis-related, stress-related and immediate early genes, samples from FACS and Manual studies, which also require dissociated cells, did not. Given that TRAP targets actively translated mRNAs, whereas other methods target all transcribed mRNAs, observed differences may also reflect translational regulation.
Z
Dataset: Publication cultures and Dutch research output: a quantitative...
data.niaid.nih.gov
data-staging.niaid.nih.gov
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kramer, Bianca; Bosman, Jeroen (2020). Dataset: Publication cultures and Dutch research output: a quantitative assessment [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2643366
Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
Utrecht University Library
Authors
Kramer, Bianca; Bosman, Jeroen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset belonging to the report: Publication cultures and Dutch research output: a quantitative assessment

On the report:

Research into publication cultures commissioned by VSNU and carried out by Utrecht University Library has detailed university output beyond just journal articles, as well as the possibilities to assess open access levels of these other output types. For all four main fields reported on, the use of publication types other than journal articles is indeed substantial. For Social Sciences and Arts & Humanities in particular (with over 40% and over 60% of output respectively not being regular journal articles) looking at journal articles only ignores a significant share of their contribution to research and society. This is not only about books and book chapters, either: book reviews, conference papers, reports, case notes (in law) and all kinds of web publications are also significant parts of university output.

Analyzing all these publication forms and especially determining to what extent they are open access is currently not easy. Even combining some the largest citation databases (Web of Science, Scopus and Dimensions) leaves out a lot of non-article content and in some fields even journal articles are only partly covered. Lacking metadata like affiliations and DOIs (either in the original documents or in the scholarly search engines) makes it even harder to analyze open access levels by institution and field. Using repository-harvesting databases like BASE and NARCIS in addition to the main citation databases improves understanding of open access of non-article output, but these routes also have limitations. The report has recommendations for stakeholders, mostly to improve metadata and coverage and apply persistent identifiers.
o
Data from: CLARISSA Cash Plus Social Protection intervention: quantitative...
ordo.open.ac.uk
png
Updated Dec 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neil Howard; Keetie Roelen; Giel Ton; Mauricio Espinoza; Afrin Aktar; Saklain Al Mamum (2024). CLARISSA Cash Plus Social Protection intervention: quantitative and qualitative data [Dataset]. http://doi.org/10.21954/ou.rd.26061106.v3
Explore at:
pngAvailable download formats
Unique identifier
https://doi.org/10.21954/ou.rd.26061106.v3
Dataset updated
Dec 16, 2024
Dataset provided by
The Open University
Authors
Neil Howard; Keetie Roelen; Giel Ton; Mauricio Espinoza; Afrin Aktar; Saklain Al Mamum
License
Attribution-ShareAlike 2.0 (CC BY-SA 2.0)https://creativecommons.org/licenses/by-sa/2.0/
License information was derived automatically
Description
The CLARISSA Cash Plus intervention represented an innovative social protection scheme for tackling social ills, including the worst forms of child labour (WFCL). A universal and unconditional ‘cash plus’ programme, it combined community mobilisation, case work, and cash transfers (CTs). It was implemented in a high-density, low-income neighbourhood in Dhaka to build individual, family, and group capacities to meet needs. This, in turn, was expected to lead to a corresponding decrease in deprivation and community-identified social issues that negatively affect wellbeing, including WFCL. Four principles underpinned the intervention: Unconditionality, Universality, Needs-centred and people-led, and Emergent and open-ended.The intervention took place in Dhaka – North Gojmohol – over a 27-month period, between October 2021 and December 2023, to test and study the impact of providing unconditional and people‑led support to everyone in a community. Cash transfers were provided between January and June 2023 in monthly instalments, plus one investment transfer in September 2023. A total of 1,573 households received cash, through the Upay mobile financial service. Cash was complemented by a ‘plus’ component, implemented between October 2021 and December 2023. Referred to as relational needs-based community organising (NBCO), a team of 20 community mobilisers (CMs) delivered case work at the individual and family level and community mobilisation at the group level. The intervention was part of the wider CLARISSA programme, led by the Institute of Development Studies (IDS) and funded by UK’s Foreign, Commonwealth & Development Office (FCDO). The intervention was implemented by Terre des hommes (Tdh) in Bangladesh and evaluated in collaboration with the BRAC Institute of Governance and Development (BIGD) and researchers from the University of Bath and the Open University, UK.The evaluation of the CLARISSA Social Protection pilot was rooted in contribution analysis that combined multiple methods over more than three years in line with emerging best practice guidelines for mixed methods research on children, work, and wellbeing. Quantitative research included bi-monthly monitoring surveys administered by the project’s community mobilisers (CMs), including basic questions about wellbeing, perceived economic resilience, school attendance, etc. This was complimented by baseline, midline, and endline surveys, which collected information about key outcome indicators within the sphere of influence of the intervention, such as children’s engagement with different forms of work and working conditions, with schooling and other activities, household living conditions and sources of income, and respondents’ perceptions of change. Qualitative tools were used to probe topics and results of interest, as well as impact pathways. These included reflective diaries written by the community mobilisers; three rounds of focus group discussions (FGDs) with community members; three rounds of key informant interviews (KIIs) with members of case study households; and long-term ethnographic observation.Quantitative DataThe quantitative evaluation of the CLARISSA Cash Plus intervention involved several data collection methods to gather information about household living standards, children’s education and work, and social dynamics. The data collection included a pre-intervention census, four periodic surveys, and 13 rounds of bi-monthly monitoring surveys, all conducted between late 2020 and late 2023. Details of each instrument are as follows:Census: Conducted in October/November 2020 in the target neighbourhood of North Gojmohol (n=1,832) and the comparison neighbourhood of Balurmath (n=2,365)Periodic surveys: Baseline (February 2021, n=752 in North Gojmohol), Midline 1 (before cash) (October 2022, n=771 in North Gojmohol), Midline 2 (after 6 rounds of cash) (July 2023, n=769 in North Gojmohol), and Endline (December 2023, n=750 in North Gojmohol and n=773 in Balumath)Bi-monthly monitoring data (13 rounds): Conducted between December 2021 and December 2023 in North Gojmohol (average of 1,400 households per round)The present repository summarizes this information, organized as follows:1.1 Bimonthly survey (household): Panel dataset comprising 13 rounds of bi-monthly monitoring data at the household level (average of 1,400 households per round, total of 18,379 observations)1.2 Bimonthly survey (child): Panel dataset comprising 13 rounds of bi-monthly monitoring data at the child level (aged 5 to 16 at census) (average of 940 children per round, total of 12,213 observations)2.1 Periodic survey (household): Panel dataset comprising 5 periodic surveys (census, baseline, midline 1, midline 2, endline) at the household level (average of 750 households per period, total of 3,762 observations)2.2 Periodic survey (child): Panel dataset comprising 4 periodic surveys (baseline, midline 1, midline 2, endline) at the child level (average of 3,100 children per period, total of 12,417 observations)3.0 Balurmat - North Gojmohol panel: Balanced panel dataset comprising 558 households in North Gojmohol and 773 households in Balurmath, observed both at 2020 census and 2023 endline (total of 2,662 observations)4.0 Questionnaires: Original questionnaires for all datasetsAll datasets are provided in Stata format (.dta) and Excel format (.xlsx) and are accompanied by their respective dictionary in Excel format (.xlsx).Qualitative DataThe qualitative study was conducted in three rounds: the first round of IDIs and FGDs took place between December 2022 and January 2023; the second round took place from April to May 2023; and the third round took place from November to December 2023. KIIs were taken during the 2nd round of study in May 2023.The sample size by round and instrument type is shown below:RoundsIDIs with childrenIDIs with parentsIDIs with CMsFGDsKIIs1st Round (12/2022 – 01/2023)3026-06-2nd Round ( 04/2023 – 05/2023)3023-06053rd Round (11/2023 – 12/2023)26250307-The files in this archive contain the qualitative data and include six types of transcripts:· 1.1 Interviews with children in case study households (IDI): 30 families in round 1, 30 in round 2, and 26 in round 3· 1.2 Interviews with parents in case study households (IDI): 26 families in round 1, 23 in round 2, and 25 in round 3· 1.3 Interviews with community mobiliser (IDI): 3 CM in round 3· 2.0 Key informant interviews (KII): 5 in round 2· 3.0 Focus group discussions (FGD): 6 in round 1, 6 in round 2, and 7 in round 3· 4.0 Community mobiliser micro-narratives (556 cases)Additionally, this repository includes a comprehensive list of all qualitative data files ("List of all qualitative data+MC.xlsx").

Facebook

Twitter

Click to copy link

Link copied

Cite

Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid (2023). The primary data types collected. [Dataset]. http://doi.org/10.1371/journal.pone.0265257.t002

The primary data types collected.

Explore at:

7 scholarly articles cite this dataset (View in Google Scholar)

xlsAvailable download formats

Unique identifier

https://doi.org/10.1371/journal.pone.0265257.t002

Dataset updated

Jun 15, 2023

Dataset provided by

PLOShttp://plos.org/

Authors

Setiawan Priatmoko; Billal Hossain; Wuri Rahmawati; Susilo Budi Winarno; Lóránt Dénes Dávid

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The primary data types collected.

Clear search

Close search

Google apps

Main menu

The primary data types collected.

Global Electronic Data Capture Tool Market Research Report: By Application...

Which of the Five Types of Data Science Does Your Startup Need? - Data...

Individual Level Data - Quantitative Metadata - baselinesurvey_oh

Dataplex: FDA Medical Device Data | 24M+ Rows of Key Device Product Data for...

Intro to Data Types and Graphing Lab

Individual Level Data - Quantitative Metadata - screenersurvey_oh

Conceptualization of public data ecosystems

Data from: The species versus subspecies conundrum: quantitative...

Clustering of samples and variables with mixed-type data

Individual Level Data - Quantitative Metadata - weeklysurveys_oh - Dataset -...

Individual Level Data - Quantitative Metadata - consort_oh - Dataset - CHRR...

Replication data for: Measure schematicity through information content: A...

Intertidal species inventory and quantitative data collected on Oahu and...

Josh Richards' Tweets

Josh Richards' Tweets

A Quantitative Study

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

Acknowledgements

Data from: BiasCorrector: fast and accurate correction of all types of...

Data from: Qualitative and quantitative surveys of native freshwater mussels...

Data from: A Quantitative Comparison of Cell-Type-Specific Microarray Gene...

Dataset: Publication cultures and Dutch research output: a quantitative...

Data from: CLARISSA Cash Plus Social Protection intervention: quantitative...

The primary data types collected.See More Versions

The primary data types collected.