This data package contains datasets on clinical trials conducted in the United States. Diseases include cervical cancer, diabetes, acute respiratory infection as well as stress. This data package also includes clinical trials registry and results database.
The goal of the Clinical Trials track is to focus research on the clinical trials matching problem: given a free text summary of a patient health record, find suitable clinical trials for that patient.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Update — December 7, 2014. – Evidence-based medicine (EBM) is not working for many reasons, for example: 1. Incorrect in their foundations (paradox): hierarchical levels of evidence are supported by opinions (i.e., lowest strength of evidence according to EBM) instead of real data collected from different types of study designs (i.e., evidence). http://dx.doi.org/10.6084/m9.figshare.1122534 2. The effect of criminal practices by pharmaceutical companies is only possible because of the complicity of others: healthcare systems, professional associations, governmental and academic institutions. Pharmaceutical companies also corrupt at the personal level, politicians and political parties are on their payroll, medical professionals seduced by different types of gifts in exchange of prescriptions (i.e., bribery) which very likely results in patients not receiving the proper treatment for their disease, many times there is no such thing: healthy persons not needing pharmacological treatments of any kind are constantly misdiagnosed and treated with unnecessary drugs. Some medical professionals are converted in K.O.L. which is only a puppet appearing on stage to spread lies to their peers, a person supposedly trained to improve the well-being of others, now deceits on behalf of pharmaceutical companies. Probably the saddest thing is that many honest doctors are being misled by these lies created by the rules of pharmaceutical marketing instead of scientific, medical, and ethical principles. Interpretation of EBM in this context was not anticipated by their creators. “The main reason we take so many drugs is that drug companies don’t sell drugs, they sell lies about drugs.” ―Peter C. Gøtzsche “doctors and their organisations should recognise that it is unethical to receive money that has been earned in part through crimes that have harmed those people whose interests doctors are expected to take care of. Many crimes would be impossible to carry out if doctors weren’t willing to participate in them.” —Peter C Gøtzsche, The BMJ, 2012, Big pharma often commits corporate crime, and this must be stopped. Pending (Colombia): Health Promoter Entities (In Spanish: EPS ―Empresas Promotoras de Salud).
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Objectives: To develop and pilot a tool to measure and improve pharmaceutical companies’ clinical trial data sharing policies and practices. Design: Cross sectional descriptive analysis. Setting: Large pharmaceutical companies with novel drugs approved by the US Food and Drug Administration in 2015. Data sources: Data sharing measures were adapted from 10 prominent data sharing guidelines from expert bodies and refined through a multi-stakeholder deliberative process engaging patients, industry, academics, regulators, and others. Data sharing practices and policies were assessed using data from ClinicalTrials.gov, Drugs@FDA, corporate websites, data sharing platforms and registries (eg, the Yale Open Data Access (YODA) Project and Clinical Study Data Request (CSDR)), and personal communication with drug companies. Main outcome measures: Company level, multicomponent measure of accessibility of participant level clinical trial data (eg, analysis ready dataset and metadata); drug and trial level measures of registration, results reporting, and publication; company level overall transparency rankings; and feasibility of the measures and ranking tool to improve company data sharing policies and practices. Results: Only 25% of large pharmaceutical companies fully met the data sharing measure. The median company data sharing score was 63% (interquartile range 58-85%). Given feedback and a chance to improve their policies to meet this measure, three companies made amendments, raising the percentage of companies in full compliance to 33% and the median company data sharing score to 80% (73-100%). The most common reasons companies did not initially satisfy the data sharing measure were failure to share data by the specified deadline (75%) and failure to report the number and outcome of their data requests. Across new drug applications, a median of 100% (interquartile range 91-100%) of trials in patients were registered, 65% (36-96%) reported results, 45% (30-84%) were published, and 95% (69-100%) were publicly available in some form by six months after FDA drug approval. When examining results on the drug level, less than half (42%) of reviewed drugs had results for all their new drug applications trials in patients publicly available in some form by six months after FDA approval. Conclusions: It was feasible to develop a tool to measure data sharing policies and practices among large companies and have an impact in improving company practices. Among large companies, 25% made participant level trial data accessible to external investigators for new drug approvals in accordance with the current study’s measures; this proportion improved to 33% after applying the ranking tool. Other measures of trial transparency were higher. Some companies, however, have substantial room for improvement on transparency and data sharing of clinical trials.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The full anonymised dataset from our recent survey into the attitudes towards clinical trial data sharing. The invitation to participate was distributed to clinical trialists funded by the Wellcome Trust, Bill and Melinda Gates Foundation, Cancer Research UK, and UK Medical Research Council.
The National Database for Clinical Trials Related to Mental Illness (NDCT) is an extensible informatics platform for relevant data at all levels of biological and behavioral organization (molecules, genes, neural tissue, behavioral, social and environmental interactions) and for all data types (text, numeric, image, time series, etc.) related to clinical trials funded by the National Institute of Mental Health. Sharing data, associated tools, methodologies and results, rather than just summaries or interpretations, accelerates research progress. Community-wide sharing requires common data definitions and standards, as well as comprehensive and coherent informatics approaches for the sharing of de-identified human subject research data. Built on the National Database for Autism Research (NDAR) informatics platform, NDCT provides a comprehensive data sharing platform for NIMH grantees supporting clinical trials.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book series. It has 1 row and is filtered where the books is Statistical design and analysis of clinical trials : principles and methods. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global clinical trial data visualization market size is projected to grow from USD 0.75 billion in 2023 to USD 2.62 billion by 2032, reflecting a compound annual growth rate (CAGR) of 15.2% during the forecast period. This growth is driven by the increasing complexity of clinical trials, the need for enhanced data transparency, and the rising adoption of digital tools in the healthcare sector.
One of the key drivers for the growth of the clinical trial data visualization market is the escalating complexity and volume of data generated during clinical trials. The pharmaceutical and biotechnology sectors are witnessing a surge in clinical trials, which demand sophisticated data management and visualization tools to make sense of the vast amounts of data collected. These tools enable researchers to identify patterns, trends, and outliers more efficiently, thereby accelerating the decision-making process and improving clinical trial outcomes.
Another significant factor contributing to market growth is the increasing emphasis on data transparency and regulatory compliance. Regulatory bodies, such as the FDA and EMA, are mandating greater transparency in clinical trial data to ensure patient safety and data integrity. Data visualization tools facilitate the clear presentation of complex data, making it easier for regulatory bodies and stakeholders to review and approve clinical trial processes. This ensures that clinical trials are conducted in a more transparent and compliant manner, thus driving the adoption of these tools.
The advent of advanced technologies, such as artificial intelligence (AI) and machine learning (ML), is also playing a crucial role in the growth of the clinical trial data visualization market. These technologies are being increasingly integrated into data visualization tools to enhance their capabilities. AI and ML algorithms can analyze large datasets quickly and provide insights that were previously unattainable. This not only improves the efficiency of clinical trials but also enhances the accuracy and reliability of the data being presented.
As the clinical trial data visualization market continues to expand, the importance of Clinical Trial Data Security becomes increasingly paramount. With the vast amounts of data generated during trials, ensuring the confidentiality, integrity, and availability of this data is critical. Organizations must implement robust security measures to protect sensitive information from unauthorized access and breaches. This involves not only securing the data itself but also safeguarding the systems and networks that store and process this information. As regulatory bodies tighten their data protection requirements, companies are investing in advanced security technologies and practices to comply with these standards and maintain trust with stakeholders. The focus on Clinical Trial Data Security is not just about compliance; it is about ensuring the reliability and credibility of clinical trial outcomes, which ultimately impacts patient safety and the development of new therapies.
Regionally, North America is expected to dominate the clinical trial data visualization market due to the presence of a large number of pharmaceutical and biotechnology companies, a well-established healthcare infrastructure, and a strong focus on research and development. Europe is also expected to witness significant growth, driven by the increasing adoption of digital technologies in clinical trials and supportive regulatory frameworks. The Asia Pacific region is poised to grow at the fastest rate, fueled by the expanding pharmaceutical industry, growing investments in healthcare technology, and an increasing number of clinical trials being conducted in countries like China and India.
The clinical trial data visualization market is segmented into software and services based on components. The software segment is expected to hold the largest market share during the forecast period. This can be attributed to the increasing demand for advanced software solutions that offer real-time data analysis and visualization capabilities. These software tools are designed to handle large volumes of data and provide intuitive visual representations that facilitate better understanding and decision-making.
Furthermore, the integration of AI and ML technologies into data visualization software is enhancing their capabilities, makin
To improve reporting transparency and research integrity, some journals have begun publishing study protocols and statistical analysis plans alongside trial publications. To determine the overall availability and characteristics of protocols and statistical analysis plans this study reviewed all randomized clinical trials (RCT) published in 2016 in the following 5 general medicine journals: Annals of Internal Medicine, BMJ, JAMA, Lancet, and NEJM. Characteristics of RCTs were extracted from the publication and clinical trial registry. A detailed assessment of protocols and statistical analysis plans was conducted in a 20% random sample of trials. Dataset contains extraction sheets (as SAS data files), code to calculate the values in the tables in the manuscript, and a supplemental file with additional notes on methods used in the study.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The DIRECCT study is a multi-phase examination of clinical trial results dissemination during the COVID-19 pandemic.
Interim data for trials completed during the first six months of the pandemic (i.e., 1 January 2020 – 30 June 2020) was previously deposited at https://doi.org/10.5281/zenodo.4669936.
This data deposit comprises the results of searches for trials completed during the first 18-months of the pandemic (i.e., 1 January 2020 – 30 June 2021).
The data structure for the final phase of the project is not identical to the interim data as it was substantially more complex.
The data include datatables (CSVs) that can be treated as relational and joined on the id
or trn
columns. See datamodel.png for an overview of the data.
Details on data sources and methods for the creation and analysis of this dataset are available in a detailed protocol (Version 3.1, 19 July 2023) : https://osf.io/w8t7r
Note: This repository will be updated with additional information including a codebook and archives of raw data.
Additional information on the project is available at the project's OSF page: https://doi.org/10.17605/osf.io/5f8j2.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This fileset provides supporting data and corpora for the empirical study described in: Laura Miron, Rafael S. Goncalves and Mark A. Musen. Obstacles to the Reuse of Metadata in ClinicalTrials.govDescription of filesOriginal data files:- AllPublicXml.zip contains the set of all public XML records in ClinicalTrials.gov (protocols and summary results information), on which all remaining analyses are based. Set contains 302,091 records downloaded on April 3, 2019.- public.xsd is the XML schema downloaded from ClinicalTrials.gov on April 3, 2019, used to validate records in AllPublicXML.BioPortal API Query Results- condition_matches.csv contains the results of querying the BioPortal API for all ontology terms that are an 'exact match' to each condition string scraped from the ClinicalTrials.gov XML. Columns={filename, condition, url, bioportal term, cuis, tuis}. - intervention_matches.csv contains BioPortal API query results for all interventions scraped from the ClinicalTrials.gov XML. Columns={filename, intervention, url, bioportal term, cuis, tuis}.Data Element Definitions- supplementary_table_1.xlsx Mapping of element names, element types, and whether elements are required in ClinicalTrials.gov data dictionaries, the ClinicalTrials.gov XML schema declaration for records (public.XSD), the Protocol Registration System (PRS), FDAAA801, and the WHO required data elements for clinical trial registrations.Column and value definitions: - CT.gov Data Dictionary Section: Section heading for a group of data elements in the ClinicalTrials.gov data dictionary (https://prsinfo.clinicaltrials.gov/definitions.html) - CT.gov Data Dictionary Element Name: Name of an element/field according to the ClinicalTrials.gov data dictionaries (https://prsinfo.clinicaltrials.gov/definitions.html) and (https://prsinfo.clinicaltrials.gov/expanded_access_definitions.html) - CT.gov Data Dictionary Element Type: "Data" if the element is a field for which the user provides a value, "Group Heading" if the element is a group heading for several sub-fields, but is not in itself associated with a user-provided value. - Required for CT.gov for Interventional Records: "Required" if the element is required for interventional records according to the data dictionary, "CR" if the element is conditionally required, "Jan 2017" if the element is required for studies starting on or after January 18, 2017, the effective date of the FDAAA801 Final Rule, "-" indicates if this element is not applicable to interventional records (only observational or expanded access) - Required for CT.gov for Observational Records: "Required" if the element is required for interventional records according to the data dictionary, "CR" if the element is conditionally required, "Jan 2017" if the element is required for studies starting on or after January 18, 2017, the effective date of the FDAAA801 Final Rule, "-" indicates if this element is not applicable to observational records (only interventional or expanded access) - Required in CT.gov for Expanded Access Records?: "Required" if the element is required for interventional records according to the data dictionary, "CR" if the element is conditionally required, "Jan 2017" if the element is required for studies starting on or after January 18, 2017, the effective date of the FDAAA801 Final Rule, "-" indicates if this element is not applicable to expanded access records (only interventional or observational) - CT.gov XSD Element Definition: abbreviated xpath to the corresponding element in the ClinicalTrials.gov XSD (public.XSD). The full xpath includes 'clinical_study/' as a prefix to every element. (There is a single top-level element called "clinical_study" for all other elements.) - Required in XSD? : "Yes" if the element is required according to public.XSD, "No" if the element is optional, "-" if the element is not made public or included in the XSD - Type in XSD: "text" if the XSD type was "xs:string" or "textblock", name of enum given if type was enum, "integer" if type was "xs:integer" or "xs:integer" extended with the "type" attribute, "struct" if the type was a struct defined in the XSD - PRS Element Name: Name of the corresponding entry field in the PRS system - PRS Entry Type: Entry type in the PRS system. This column contains some free text explanations/observations - FDAAA801 Final Rule FIeld Name: Name of the corresponding required field in the FDAAA801 Final Rule (https://www.federalregister.gov/documents/2016/09/21/2016-22129/clinical-trials-registration-and-results-information-submission). This column contains many empty values where elements in ClinicalTrials.gov do not correspond to a field required by the FDA - WHO Field Name: Name of the corresponding field required by the WHO Trial Registration Data Set (v 1.3.1) (https://prsinfo.clinicaltrials.gov/trainTrainer/WHO-ICMJE-ClinTrialsgov-Cross-Ref.pdf)Analytical Results:- EC_human_review.csv contains the results of a manual review of random sample eligibility criteria from 400 CT.gov records. Table gives filename, criteria, and whether manual review determined the criteria to contain criteria for "multiple subgroups" of participants.- completeness.xlsx contains counts and percentages of interventional records missing fields required by FDAAA801 and its Final Rule.- industry_completeness.xlsx contains percentages of interventional records missing required fields, broken up by agency class of trial's lead sponsor ("NIH", "US Fed", "Industry", or "Other"), and before and after the effective date of the Final Rule- location_completeness.xlsx contains percentages of interventional records missing required fields, broken up by whether record listed at least one location in the United States and records with only international location (excluding trials with no listed location), and before and after the effective date of the Final RuleIntermediate Results:- cache.zip contains pickle and csv files of pandas dataframes with values scraped from the XML records in AllPublicXML. Downloading these files greatly speeds up running analysis steps from jupyter notebooks in our github repository.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset (Stata v15.1) containing responses from a survey of UK Clinical Research Collaboration registered clinical trial units (CTUs) and industry statisticians from both pharmaceuticals and clinical research organisations (http://dx.doi. org/10.1136/bmjopen-2020- 036875) Data is de-identified. The dataset contains descriptive variables describing participant's experience, as well as responses to questions on current adverse event analysis practices, awareness of specialist methods for adverse event analysis and priorities, concerns and barriers participants experience when analysing adverse event data.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In this project, we work on repairing three datasets:
country_protocol_code
, conduct the same clinical trials which is identified by eudract_number
. Each clinical trial has a title
that can help find informative details about the design of the trial.eudract_number
. The ground truth samples in the dataset were established by aligning information about the trial populations provided by external registries, specifically the CT.gov database and the German Trials database. Additionally, the dataset comprises other unstructured attributes that categorize the inclusion criteria for trial participants such as inclusion
.code
. Samples with the same code
represent the same product but are extracted from a differentb source
. The allergens are indicated by (‘2’) if present, or (‘1’) if there are traces of it, and (‘0’) if it is absent in a product. The dataset also includes information on ingredients
in the products. Overall, the dataset comprises categorical structured data describing the presence, trace, or absence of specific allergens, and unstructured text describing ingredients. N.B: Each '.zip' file contains a set of 5 '.csv' files which are part of the afro-mentioned datasets:
By Aero Data Lab [source]
This dataset contains information on clinical trials conducted by sponsors. Each row represents a clinical trial, and the columns represent various attributes of the trial, such as the National Clinical Trial Number, the sponsor of the trial, the title of the trial, and so on.
The purpose of this dataset is to provide a bird's-eye view of the clinical trial landscape. By understanding which sponsors are conducting which trials and for what conditions, we can get a better sense of where research is headed and what new treatments may be on the horizon
- NCT is a unique identifier for clinical trials. It stands for National Clinical Trial Number.
- Sponsor is the organization that is funding the clinical trial.
- Title is the name of the clinical trial.
- Summary is a brief summary of the clinical trial.
- Start Year is the year that the clinical trial started.
- Start Month is the month that the clinical trial started.
- Phase is the stage of development of the investigative drug or device (I), which can be one of four types: I, II, III, or IV.
- Enrollment is The number of participants in the clinical trial.
- Status is The status of enrollment in the study, which can be Recruiting, Not yet recruiting, Active, not recruiting, Completed, Suspended, or Terminated.
Condition indicates what medical condition(s) are being studied in this particular NCT record
- Identify patterns in clinical trials to improve the development process
- Understand how different sponsors fund clinical trials
By Aero Data Lab [source]
License
License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.
File: AERO-BirdsEye-Data.csv | Column name | Description | |:----------------|:-----------------------------------------------------------------| | NCT | National Clinical Trial number. (String) | | Sponsor | Name of the sponsor conducting the clinical trial. (String) | | Title | Title of the clinical trial. (String) | | Summary | Brief summary of the clinical trial. (String) | | Start_Year | Year the clinical trial started. (Integer) | | Start_Month | Month the clinical trial started. (String) | | Phase | Phase of the clinical trial. (String) | | Enrollment | Number of participants enrolled in the clinical trial. (Integer) | | Status | Status of the clinical trial. (String) | | Condition | Condition being tested in the clinical trial. (String) |
If you use this dataset in your research, please credit By Aero Data Lab [source]
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These files relate to data extracted from ClinicalTrials.gov. In the database file, data for individual clinical trials are included, and attributes include study identifier, study type, trial dates, interventions, sample size, countries in which the study was conducted, etc. The Edges file contains geographic data derived from the clinical trials data set that can be used to generate networks to illustrate geographic connectivity through clinical research, using open access software such as Gephi. The Gephi file includes networks for all countries worldwide, as well as regional networks for each major grographic region. The figures are network diagrams generated by Gephi showing geographic connectivity among individual countries through common participation in multinational clinical trials. The thickness of the connecting lines (edges) reflects the strength of a connection.
As per our latest research, the Big Data Analytics for Clinical Research market size reached USD 7.45 billion globally in 2024, reflecting a robust adoption pace driven by the increasing digitization of healthcare and clinical trial processes. The market is forecasted to grow at a CAGR of 17.2% from 2025 to 2033, reaching an estimated USD 25.54 billion by 2033. This significant growth is primarily attributed to the rising need for real-time data-driven decision-making, the proliferation of electronic health records (EHRs), and the growing emphasis on precision medicine and personalized healthcare solutions. The industry is experiencing rapid technological advancements, making big data analytics a cornerstone in transforming clinical research methodologies and outcomes.
Several key growth factors are propelling the expansion of the Big Data Analytics for Clinical Research market. One of the primary drivers is the exponential increase in clinical data volumes from diverse sources, including EHRs, wearable devices, genomics, and imaging. Healthcare providers and research organizations are leveraging big data analytics to extract actionable insights from these massive datasets, accelerating drug discovery, optimizing clinical trial design, and improving patient outcomes. The integration of artificial intelligence (AI) and machine learning (ML) algorithms with big data platforms has further enhanced the ability to identify patterns, predict patient responses, and streamline the entire research process. These technological advancements are reducing the time and cost associated with clinical research, making it more efficient and effective.
Another significant factor fueling market growth is the increasing collaboration between pharmaceutical & biotechnology companies and technology firms. These partnerships are fostering the development of advanced analytics solutions tailored specifically for clinical research applications. The demand for real-world evidence (RWE) and real-time patient monitoring is rising, particularly in the context of post-market surveillance and regulatory compliance. Big data analytics is enabling stakeholders to gain deeper insights into patient populations, treatment efficacy, and adverse event patterns, thereby supporting evidence-based decision-making. Furthermore, the shift towards decentralized and virtual clinical trials is creating new opportunities for leveraging big data to monitor patient engagement, adherence, and safety remotely.
The regulatory landscape is also evolving to accommodate the growing use of big data analytics in clinical research. Regulatory agencies such as the FDA and EMA are increasingly recognizing the value of data-driven approaches for enhancing the reliability and transparency of clinical trials. This has led to the establishment of guidelines and frameworks that encourage the adoption of big data technologies while ensuring data privacy and security. However, the implementation of stringent data protection regulations, such as GDPR and HIPAA, poses challenges related to data integration, interoperability, and compliance. Despite these challenges, the overall outlook for the Big Data Analytics for Clinical Research market remains highly positive, with sustained investments in digital health infrastructure and analytics capabilities.
From a regional perspective, North America currently dominates the Big Data Analytics for Clinical Research market, accounting for the largest share due to its advanced healthcare infrastructure, high adoption of digital technologies, and strong presence of leading pharmaceutical companies. Europe follows closely, driven by increasing government initiatives to promote health data interoperability and research collaborations. The Asia Pacific region is emerging as a high-growth market, supported by expanding healthcare IT investments, rising clinical trial activities, and growing awareness of data-driven healthcare solutions. Latin America and the Middle East & Africa are also witnessing gradual adoption, albeit at a slower pace, due to infrastructural and regulatory challenges. Overall, the global market is poised for substantial growth across all major regions over the forecast period.
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Health Canada's Clinical Trials Database is a listing of information about phase I, II and III clinical trials in patients. The database is managed by Health Canada and provides a source of information about Canadian clinical trials involving human pharmaceutical and biological drugs. Additional information on Health Canada’s CTD is available at: https://www.canada.ca/en/health-canada/services/drugs-health-products/drug-products/health-canada-clinical-trials-database/frequently-asked-questions.html
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The primary research question for which these data have been used, was: ‘How patient-relevant are outcomes measured in clinical trials for breast cancer drugs?’. Subquestions were: 1. Which treatment outcomes are relevant for breast cancer patients? 2. Which outcome measures are used in clinical trials for breast cancer drugs? 3. How much overlap is there between patient-relevant outcomes and outcomes measured in clinical trials?The dataset has been used to answer subquestion 2. Data have been obtained by searching Clinicaltrials.gov for trials conducted between January 2014 and March 2024 inclusive. Further inclusion criteria were that studies had to be phase III trials and had to focus on breast cancer, adults (18-64 years old) and drugs. Interventions focusing on lifestyle changes, Chinese medicine, anaesthesia, surgery and diagnostic methods were excluded. Ultimately, 264 trials were included and forty-five excluded. To determine the outcome measures used, the study plan of every included trial was reviewed and recorded on the data sheet.
Website which allows data from completed clinical trials to be distributed to investigators and public. Researchers can download de-identified data from completed NIDA clinical trial studies to conduct analyses that improve quality of drug abuse treatment. Incorporates data from Division of Therapeutics and Medical Consequences and Center for Clinical Trials Network.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data deposit includes large raw data used for the "IntoValue" dataset, which underlies several projects at the QUEST Center for Responsible Research in the Berlin Institute of Health (BIH) @ Charité. An initial version of the IntoValue dataset is available in Zenodo: https://doi.org/10.5281/zenodo.5141342. Based on this initial version, the dataset is actively developed and maintained in GitHub: https://github.com/maia-sh/intovalue-data. This Zenodo deposit serves to store large raw data files for individual trials and are used in that GitHub repository. These data are deposited for computational reproducibility and documentation; they are not intended to be used for additional projects and do not reflect the most current/accurate data available from each source.
This deposit contains raw data from the following sources:
PubMed (pubmed.zip
): PubMed XML files are provided courtesty of the U.S. National Library of Medicine and were accessed via the Entrez Programming Utilities (E-utilities) API. The files were downloaded on 2021-08-15 and do not reflect the most current/accurate data available from NLM. The following scripts were used to download and create these files: get-pubmed.R; download-pubmed.R.
German Clinical Trials Registry (DRKS) (drks.zip
): DRKS does not provide an API and was webscrapped on 2022-11-01. The following scripts were used to download and create these XML files: get-drks.R; drks-functions.R
ClinicalTrials.gov (ctgov.zip
): ClinicalTrials.gov was accessed via the Clinical Trials Transformation Initiative (CTTI) Aggregate Content of ClinicalTrials.gov (AACT) via its PostgreSQL database API.The API was queried and CSV files were generated on 2022-11-01. The following scripts were used to download and create these files: get-process-aact.R.
ClinicalTrials.gov 2018 (ctgov_2018.zip
): Additional trial data for 2018. ClinicalTrials.gov was accessed via the Clinical Trials Transformation Initiative (CTTI) Aggregate Content of ClinicalTrials.gov (AACT) via its PostgreSQL database API.The API was queried and CSV files were generated on 2022-11-01. The following scripts were used to download and create these files: get-process-aact.R.
This data package contains datasets on clinical trials conducted in the United States. Diseases include cervical cancer, diabetes, acute respiratory infection as well as stress. This data package also includes clinical trials registry and results database.