100+ datasets found

Indian Medicine Data
kaggle.com
zip
Updated Aug 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohneesh_Sreegirisetty (2023). Indian Medicine Data [Dataset]. https://www.kaggle.com/datasets/mohneesh7/indian-medicine-data
Explore at:
zip(18681848 bytes)Available download formats
Dataset updated
Aug 20, 2023
Authors
Mohneesh_Sreegirisetty
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
India Medicine Database

Dataset consists of 8 columns : - sub_category: This classification pertains to specific medical categories that define the domain in which the medicine finds its application. - product_name: This is the name of the product, as available in the indian market. - salt_composition: This is the chemical composition of the drug. - product_price:This represents the previous price of the product. Please consider this as a reference, as it tends to be highly volatile in relation to the health market. - product_manufactured:The pharmaceutical company responsible for producing the medicine/drug. - medicine_desc: Comprehensive overview and detailed description of the specific product. - side_effects:Potential adverse effects associated with the drug/medicine. - drug_interactions:Interactions and effects when combining this specific medicine with other drugs.

There are a few missing values in the dataset, but most information is available for the row, so I have left as is.
A-Z Medicine Dataset of India
kaggle.com
zip
Updated Nov 17, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shudhanshu Singh (2022). A-Z Medicine Dataset of India [Dataset]. https://www.kaggle.com/datasets/shudhanshusingh/az-medicine-dataset-of-india
Explore at:
zip(6918947 bytes)Available download formats
Dataset updated
Nov 17, 2022
Authors
Shudhanshu Singh
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Area covered
India
Description
The file has data which contains all possible medicines that we were able to find out during our research on finding out medicine's details such as compositions of medicine, type of medicines, there market availability, pricing and many other things.

The data consist of medicines from various pharmaceutical companies including:

Sun Pharmaceutical Industries

Torrent Pharma

Glenmark Pharma Limited

Emcure Pharmaceuticals

Cipla Limited

Zydus Lifesciences Limited (formerly Cadila Healthcare)

Abbott India Ltd.

Alkem Laboratories

Lupin Limited

Piramal Enterprises Limited and 7638 other pharmaceutical companies.

*Prices of medicines are reported / recorded as of November,2022.

*is_discontinued column defines Availability of medicines that is reported as of November,2022.

There is another dataset we have published which gives information about drugs side effects, substitutes and usage. *Visit here : https://www.kaggle.com/datasets/shudhanshusingh/250k-medicines-usage-side-effects-and-substitutes *

Dataset will be updated on yearly basis.

Announcement : I have released a new dataset on Real Estate Properties , if you are interested must checkout here: https://www.kaggle.com/datasets/shudhanshusingh/real-estate-properties-dataset ,If you liked it, do give an upvote :)
Medicine data: herbal medicines
data.europa.eu
excel xlsx, html
Updated Dec 14, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Medicines Agency (2015). Medicine data: herbal medicines [Dataset]. https://data.europa.eu/data/datasets/herbal-substances-that-are-designated-for-assessment-by-hmpc?locale=en
Explore at:
excel xlsx, htmlAvailable download formats
Dataset updated
Dec 14, 2015
Dataset authored and provided by
European Medicines Agencyhttp://ema.europa.eu/
License
http://data.europa.eu/eli/dec/2011/833/ojhttp://data.europa.eu/eli/dec/2011/833/oj
Description
This search allows you to find herbal substances that are designated for assessment by the European Medicines Agency's Committee on Herbal Medicinal Products (HMPC). Search results can be exported in Excel format.

Each substance is at a different stage of assessment and various documents are associated with the substance depending on where it is in the assessment process. The HMPC's conclusions on the herbal substance at the end of the assessment process can be found in the final European Union herbal monograph and may also be found in European Union list entry.
RxNorm Data
kaggle.com
bioregistry.io
zip
Updated Mar 20, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Library of Medicine (2019). RxNorm Data [Dataset]. https://www.kaggle.com/datasets/nlm-nih/nlm-rxnorm
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Mar 20, 2019
Dataset authored and provided by
National Library of Medicine
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

RxNorm is a name of a US-specific terminology in medicine that contains all medications available on US market. Source: https://en.wikipedia.org/wiki/RxNorm

RxNorm provides normalized names for clinical drugs and links its names to many of the drug vocabularies commonly used in pharmacy management and drug interaction software, including those of First Databank, Micromedex, Gold Standard Drug Database, and Multum. By providing links between these vocabularies, RxNorm can mediate messages between systems not using the same software and vocabulary. Source: https://www.nlm.nih.gov/research/umls/rxnorm/

Content

RxNorm was created by the U.S. National Library of Medicine (NLM) to provide a normalized naming system for clinical drugs, defined as the combination of {ingredient + strength + dose form}. In addition to the naming system, the RxNorm dataset also provides structured information such as brand names, ingredients, drug classes, and so on, for each clinical drug. Typical uses of RxNorm include navigating between names and codes among different drug vocabularies and using information in RxNorm to assist with health information exchange/medication reconciliation, e-prescribing, drug analytics, formulary development, and other functions.

This public dataset includes multiple data files originally released in RxNorm Rich Release Format (RXNRRF) that are loaded into Bigquery tables. The data is updated and archived on a monthly basis.

The following tables are included in the RxNorm dataset:

RXNCONSO contains concept and source information

RXNREL contains information regarding relationships between entities

RXNSAT contains attribute information

RXNSTY contains semantic information

RXNSAB contains source info

RXNCUI contains retired rxcui codes

RXNATOMARCHIVE contains archived data

RXNCUICHANGES contains concept changes

Update Frequency: Monthly

Fork this kernel to get started with this dataset.

Acknowledgements

https://www.nlm.nih.gov/research/umls/rxnorm/

https://bigquery.cloud.google.com/dataset/bigquery-public-data:nlm_rxnorm

https://cloud.google.com/bigquery/public-data/rxnorm

Dataset Source: Unified Medical Language System RxNorm. The dataset is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset. This dataset uses publicly available data from the U.S. National Library of Medicine (NLM), National Institutes of Health, Department of Health and Human Services; NLM is not responsible for the dataset, does not endorse or recommend this or any other dataset.

Banner Photo by @freestocks from Unsplash.

Inspiration

What are the RXCUI codes for the ingredients of a list of drugs?

Which ingredients have the most variety of dose forms?

In what dose forms is the drug phenylephrine found?

What are the ingredients of the drug labeled with the generic code number 072718?
DrugBank Database Data Package
johnsnowlabs.com
csv
Updated Jan 20, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John Snow Labs (2021). DrugBank Database Data Package [Dataset]. https://www.johnsnowlabs.com/marketplace/drugbank-database-data-package/
Explore at:
csvAvailable download formats
Dataset updated
Jan 20, 2021
Dataset authored and provided by
John Snow Labs
Description
DrugBank Vocabulary contains information on DrugBank identifiers, names, and synonyms to permit easy linking and integration into any type of project. DrugBank is a richly annotated resource that combines detailed drug data with comprehensive drug target and drug action information. DrugBank is widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education.
Drug Pharma New Dataset
kaggle.com
zip
Updated Feb 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shuvo Kumar Basak-4004.o (2025). Drug Pharma New Dataset [Dataset]. https://www.kaggle.com/datasets/shuvokumarbasak2030/drug-pharma-new-dataset
Explore at:
zip(10039849 bytes)Available download formats
Dataset updated
Feb 28, 2025
Authors
Shuvo Kumar Basak-4004.o
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
The "Drug Pharma New Dataset" is a comprehensive and up-to-date collection of pharmaceutical drugs registered by the Drug Administration of Bangladesh (DGDA). This dataset spans five major drug categories: Allopathic, Unani, Ayurvedic, Homeopathic, and Herbal. It serves as a valuable resource for researchers, data analysts, and anyone interested in the pharmaceutical industry, offering a detailed overview of the variety of drugs registered for medical use. Source: DGDA http://dgdagov.info/index.php/registered-products/ayurvedic Dataset Breakdown 📊: Allopathic: 36,254 entries 💉 Unani: 8,460 entries 🌿 Ayurvedic: 5,262 entries 🌱 Homeopathic: 2,580 entries 💧 Herbal: 1,028 entries 🌸 https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15408835%2F4403d24ec95657ce6cf0f67a5091fe24%2FScreenshot%20(88).png?generation=1740769909204066&alt=media" alt=""> Columns in the Dataset 📝: The dataset contains the following columns to provide detailed drug information:

SL: Serial number for each product 📍 Name of the Manufacturer: The company or manufacturer producing the drug 🏭 Brand Name: The commercial brand name under which the drug is sold 🏷️ Generic Name: The official name of the drug's active ingredient 🏥 Strength: The concentration of the active ingredient(s) in the drug 💪 Dosages Description: The form in which the drug is administered (e.g., tablet, lotion, powder) 💊💧 Use For: The medical use or indication of the drug (e.g., pain relief, antibiotics, etc.) 🩺 DAR: The drug’s registration code, ensuring it’s officially approved for use 🆔 Type: The type of drug (e.g., Allopathic, Ayurvedic, etc.) 💡 Up-to-Date Drug Information 🕒: This dataset contains latest data on drugs registered and approved for use in Bangladesh. The information is continuously updated to reflect new drugs, changes in drug classifications, and updated manufacturer details.

High Data Integrity ✅: The data comes from a trusted and official source — the Drug Administration of Bangladesh (DGDA). This guarantees accuracy and consistency, making it highly reliable for analysis, research, and pharmaceutical studies.

Comprehensive Coverage 🗺️: By incorporating multiple types of drugs, this dataset covers both modern pharmaceutical drugs and traditional medicines, giving a well-rounded view of the pharmaceutical industry. It includes information for over-the-counter (OTC) drugs, prescription medicines, as well as herbal supplements.

Usage & Applications 🌍: The Drug Pharma New Dataset can be leveraged in several fields and for multiple applications:

Pharmaceutical Research 🔬:

New Drug Development: Researchers can use the dataset to identify trends, gaps in the market, and areas for innovation in the pharmaceutical industry. By analyzing drug classifications, strengths, dosages, and usage patterns, pharmaceutical companies can identify areas for new drug development and research. Pharmacovigilance: The dataset can be used in studying the safety and effectiveness of different drugs, monitoring adverse drug reactions (ADR), and identifying drugs that require more attention or changes in dosage recommendations. Market Analysis & Pharmaceutical Industry 📈:

Product Trends: Analyze the popularity of specific drug types (Allopathic, Ayurvedic, etc.) and understand market trends in pharmaceutical consumption. This helps manufacturers and marketers make data-driven decisions on drug production, marketing strategies, and customer targeting. Competitive Analysis: With drug manufacturer names included, this dataset allows for a competitive analysis by comparing the market share of different manufacturers and tracking new market entrants. Drug Classification & Insights ⚖️:

Drug Categorization: The dataset’s categorization of drugs by type (Allopathic, Unani, Ayurvedic, etc.) allows for detailed classification and comparison of the different therapeutic approaches in modern and traditional medicine. Therapeutic Use Analysis: Study the medicinal use of each drug type and identify the most common therapeutic applications (e.g., pain relief, treatment of infections). This is useful for healthcare professionals, policy makers, and regulatory bodies to better understand the most widely used treatments. Medical Database Creation 💻:

The dataset can be used to create comprehensive medical databases or drug repositories for hospitals, pharmacies, or pharmaceutical companies. It can help healthcare professionals quickly access important drug-related information such as dosages, brand names, and generic alternatives. Government & Regulatory Purposes 🏛️:

Regulatory Compliance: Regulatory agencies can use this dataset to monitor which drugs are officially registered and ensure that only approved drugs are sold in the market. The DAR (Drug Approval Registration) codes are especially useful for this purpose. Polic...
p
MIMIC-III Clinical Database
physionet.org
Updated Sep 4, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alistair Johnson; Tom Pollard; Roger Mark (2016). MIMIC-III Clinical Database [Dataset]. http://doi.org/10.13026/C2XW26
Explore at:
Unique identifier
https://doi.org/10.13026/C2XW26
Dataset updated
Sep 4, 2016
Authors
Alistair Johnson; Tom Pollard; Roger Mark
License
https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts
Description
MIMIC-III is a large, freely-available database comprising deidentified health-related data associated with over forty thousand patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. The database includes information such as demographics, vital sign measurements made at the bedside (~1 data point per hour), laboratory test results, procedures, medications, caregiver notes, imaging reports, and mortality (including post-hospital discharge).MIMIC supports a diverse range of analytic studies spanning epidemiology, clinical decision-rule improvement, and electronic tool development. It is notable for three factors: it is freely available to researchers worldwide; it encompasses a diverse and very large population of ICU patients; and it contains highly granular data, including vital signs, laboratory results, and medications.
Medi-Span
catalog.data.gov
Updated Jan 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Office of Personnel Management (2024). Medi-Span [Dataset]. https://catalog.data.gov/dataset/medi-span-2592d
Explore at:
Dataset updated
Jan 26, 2024
Dataset provided by
United States Office of Personnel Managementhttps://opm.gov/
Description
Medi-Span pharmacy reference database
d
Data from: RxNorm
catalog.data.gov
datadiscovery.nlm.nih.gov
+5more
Updated Jun 19, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Library of Medicine (2025). RxNorm [Dataset]. https://catalog.data.gov/dataset/rxnorm-3180d
Explore at:
Dataset updated
Jun 19, 2025
Dataset provided by
National Library of Medicine
Description
RxNorm provides normalized names for clinical drugs and links its names to many of the drug vocabularies commonly used in pharmacy management and drug interaction software, including those of First Databank, Micromedex, Gold Standard, and Multum. By providing links between these vocabularies, RxNorm can mediate messages between systems not using the same software and vocabulary. Technical documentation at http://www.nlm.nih.gov/research/umls/rxnorm/docs/index.html
Data from: DailyMed
healthdata.gov
datadiscovery.nlm.nih.gov
+6more
csv, xlsx, xml
Updated Mar 31, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
datadiscovery.nlm.nih.gov (2021). DailyMed [Dataset]. https://healthdata.gov/NIH/DailyMed/j3hv-i8vg
Explore at:
xlsx, csv, xmlAvailable download formats
Dataset updated
Mar 31, 2021
Dataset provided by
datadiscovery.nlm.nih.gov
Description
DailyMed provides health information providers and the public with a standard, comprehensive, up-to-date, look-up and download resource of medication content and labeling as found in medication package inserts, also known as Structured Product Labeling (SPL).
List of Registered Pharmaceutical Products | DATA.GOV.HK
data.gov.hk
Updated Feb 2, 2026
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.gov.hk (2026). List of Registered Pharmaceutical Products | DATA.GOV.HK [Dataset]. https://data.gov.hk/en-data/dataset/hk-dh-dh_do-hk-dh-do-pharmaceutical-product
Explore at:
Dataset updated
Feb 2, 2026
Dataset provided by
data.gov.hk
Description
List showing the name of product, name of registration certificate holder, Hong Kong registration number (Permit No) and active ingredient(s) of each registered pharmaceutical product.
Drug Dataset: Uses, Side Effects and User Reviews
kaggle.com
zip
Updated Oct 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aadya Singh (2024). Drug Dataset: Uses, Side Effects and User Reviews [Dataset]. https://www.kaggle.com/datasets/aadyasingh55/drug-dataset
Explore at:
zip(780159 bytes)Available download formats
Dataset updated
Oct 22, 2024
Authors
Aadya Singh
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Overview

This dataset provides comprehensive information on various medications, including their composition, uses, side effects, manufacturer details, and user reviews. It aims to assist healthcare professionals and patients in making informed decisions about medications.

Use Cases: The dataset is valuable for various applications, including:

Classification: Categorizing medicines based on their usage or effectiveness. Segmentation Analysis: Analyzing different groups of medications based on reviews and side effects. Recommendation Systems: Developing models to recommend medications based on user profiles and preferences.
National Drug Code Directory
catalog.data.gov
data.virginia.gov
+3more
Updated Jul 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Food and Drug Administration (2025). National Drug Code Directory [Dataset]. https://catalog.data.gov/dataset/national-drug-code-directory
Explore at:
Dataset updated
Jul 11, 2025
Dataset provided by
Food and Drug Administrationhttp://www.fda.gov/
Description
The Drug Listing Act of 1972 requires registered drug establishments to provide the Food and Drug Administration (FDA) with a current list of all drugs manufactured, prepared, propagated, compounded, or processed by it for commercial distribution. (See Section 510 of the Federal Food, Drug, and Cosmetic Act (Act) (21 U.S.C. � 360)). Drug products are identified and reported using a unique, three-segment number, called the National Drug Code (NDC), which serves as a universal product identifier for drugs. FDA publishes the listed NDC numbers and the information submitted as part of the listing information in the NDC Directory which is updated daily.
EHRSHOT
redivis.com
application/jsonl +7
Updated Feb 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shah Lab (2025). EHRSHOT [Dataset]. http://doi.org/10.57761/0gv9-nd83
Explore at:
csv, application/jsonl, sas, parquet, stata, spss, arrow, avroAvailable download formats
Unique identifier
https://doi.org/10.57761/0gv9-nd83
Dataset updated
Feb 13, 2025
Dataset provided by
Redivis Inc.
Authors
Shah Lab
Description
Abstract

👂💉 EHRSHOT is a dataset for benchmarking the few-shot performance of foundation models for clinical prediction tasks. EHRSHOT contains de-identified structured data (e.g., diagnosis and procedure codes, medications, lab values) from the electronic health records (EHRs) of 6,739 Stanford Medicine patients and includes 15 prediction tasks. Unlike MIMIC-III/IV and other popular EHR datasets, EHRSHOT is longitudinal and includes data beyond ICU and emergency department patients.

⚡️Quickstart 1. To recreate the original EHRSHOT paper, download the EHRSHOT_ASSETS.zip file from the "Files" tab 2. To work with OMOP CDM formatted data, download all the tables in the "Tables" tab

⚙️ Please see the "Methodology" section below for details on the dataset and downloadable files.

Methodology

1. 📖 Overview

EHRSHOT is a benchmark for evaluating models on few-shot learning for patient classification tasks. The dataset contains:

**6,739 **patients

41.6 million clinical events

921,499 visits

15 prediction tasks

%3C!-- --%3E

2. 💽 Dataset

EHRSHOT is sourced from Stanford’s STARR-OMOP database.

Data follows the OMOP CDM and is fully de-identified.

Unlike most other EHR research datasets, EHRSHOT is not restricted to ED/ICU visits and instead includes longitudinal patient data for all hospital encounter types.

EHRSHOT does not contain clinical notes or images.

%3C!-- --%3E

We provide two versions of the dataset:

EHRSHOT-Original is the same exact dataset used in the original EHRSHOT paper.

EHRSHOT-OMOP is a more complete version of the EHRSHOT dataset which includes all OMOP CDM tables and additional OMOP metadata.

%3C!-- --%3E

To access the raw data, please see the "Tables" and "Files"** **tabs above:

3. 💽 Data Files and Formats

We provide EHRSHOT in two file formats:

OMOP CDM v5.4

Medical Event Data Standard (MEDS)

%3C!-- --%3E

Within the "Tables" tab...

1. %3Cu%3EEHRSHOT-OMOP%3C/u%3E

* Dataset Version: EHRSHOT-OMOP

* Notes: Contains all OMOP CDM tables for the EHRSHOT patients. Note that this dataset is slightly different than the original EHRSHOT dataset, as these tables contain the full OMOP schema rather than a filtered subset.

Within the "Files" tab...

1. %3Cu%3EEHRSHOT_ASSETS.zip%3C/u%3E

* Dataset Version: EHRSHOT-Original

* Data Format: FEMR 0.1.16

* Notes: The original EHRSHOT dataset as detailed in the paper. Also includes model weights.

2. %3Cu%3EEHRSHOT_MEDS.zip%3C/u%3E

* Dataset Version: EHRSHOT-Original

* Data Format: MEDS 0.3.3

* Notes: The original EHRSHOT dataset as detailed in the paper. It does not include any models.

3. %3Cu%3EEHRSHOT_OMOP_MEDS.zip%3C/u%3E

* Dataset Version: EHRSHOT-OMOP

* Data Format: MEDS 0.3.3 + MEDS-ETL 0.3.8

* Notes: Converts the dataset from EHRSHOT-OMOP into MEDS format via the `meds_etl_omop`command from MEDS-ETL.

4. %3Cu%3EEHRSHOT_OMOP_MEDS_Reader.zip%3C/u%3E

* Dataset Version: EHRSHOT-OMOP

* Data Format: MEDS Reader 0.1.9 + MEDS 0.3.3 + MEDS-ETL 0.3.8

* Notes: Same data as EHRSHOT_OMOP_MEDS.zip, but converted into a MEDS-Reader database for faster reads.

4. 🤖 Model

We also release the full weights of **CLMBR-T-base, **a 141M parameter clinical foundation model pretrained on the structured EHR data of 2.57M patients. Please download from https://huggingface.co/StanfordShahLab/clmbr-t-base

**5. 🧑‍💻 Code **

Please see our Github repo to obtain code for loading the dataset and running a set of pretrained baseline models: https://github.com/som-shahlab/ehrshot-benchmark/

Usage

**NOTE: You must authenticate to Redivis using your formal affiliation's email address. If you use gmail or other personal email addresses, you will not be granted access. **

Access to the EHRSHOT dataset requires the following:

Verified Affiliation with an **Academic, Government, **o
GUDID Download
catalog.data.gov
data.virginia.gov
+5more
Updated Jul 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Food and Drug Administration (2025). GUDID Download [Dataset]. https://catalog.data.gov/dataset/gudid-download
Explore at:
Dataset updated
Jul 11, 2025
Dataset provided by
Food and Drug Administrationhttp://www.fda.gov/
Description
The Global Unique Device Identification Database (GUDID) contains key device identification information submitted to the FDA about medical devices that have Unique Device Identifiers (UDI). Unique device identification is a system being established by the
GlobalEssentialMedicinesDatabase.xlsx
figshare.com
xlsx
Updated Mar 7, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nav Persaud; Maggie Jiang; Roha Shaikh; Anjli Bali; Efosa Oronsaye; Hannah Woods; Gregory Drozdzal; Yathavan Rajakulasingam; Darshanand Maraj; Sapna Wadhawan; Norman Umali; Ri Wang; Marcy McCall; Jeffrey K Aronson; Annette Plüddemann; Lorenzo Moja; Nicola Magrini; Carl Heneghan (2019). GlobalEssentialMedicinesDatabase.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.7814246.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7814246.v1
Dataset updated
Mar 7, 2019
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Nav Persaud; Maggie Jiang; Roha Shaikh; Anjli Bali; Efosa Oronsaye; Hannah Woods; Gregory Drozdzal; Yathavan Rajakulasingam; Darshanand Maraj; Sapna Wadhawan; Norman Umali; Ri Wang; Marcy McCall; Jeffrey K Aronson; Annette Plüddemann; Lorenzo Moja; Nicola Magrini; Carl Heneghan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Global Essential Medicines Database

In June of 2017, we searched the WHO Essential Medicines and Health Products Information Portal, an online repository that contains hundreds of publication on medicines and health products related to WHO priorities, and a full-section dedicated to national essential medicines lists (EMLs). A WHO information specialist actively searched for updated versions of national EMLs, including national formularies, reimbursement lists, and lists based on standard treatment guidelines.

We included all national EMLs that were posted on the WHO’s NEMLs Repository irrespective of publication date and language. When we found more than one national EML from the same country, we used the most recent. We excluded documents that were not EMLs, such as prescribing guidelines. We also included the 20th edition of the WHO Model EML (2017) in this database.

From each EML we abstracted medicines using International Nonproprietary Names (INNs). For medicines whose names were not in English we used the Anatomical Therapeutic Chemical (ATC) classification system, if available, or translated the names with the help of Google Translate. We listed each medicine individually, whether it was part of a combination product or not. We treated as the same medicine bases and their salts (e.g. promethazine hydrochloride and promethazine) as well as different compounds of the same vitamin or mineral (e.g. ferrous fumarate and ferrous sulfate). We excluded diagnostic agents, antiseptics, disinfectants, and saline solutions.

In this database "1" and "0" indicate the presence or absence of the medicine respectively on an EML.
b
Repurposing related drug annotations
repo-hub.broadinstitute.org
repo-hub-qa.broadinstitute.org
txt
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Broad Institute of Harvard and MIT, Repurposing related drug annotations [Dataset]. https://repo-hub.broadinstitute.org/repurposing
Explore at:
txt(0.50 MB), txt(1.7 MB)Available download formats
Dataset authored and provided by
Broad Institute of Harvard and MIT
License
https://clue.io/termshttps://clue.io/terms
Variables measured
RNA abundance
Measurement technique
L1000
Description
Provided are annotations for 6,125 drug and tool compounds (2,369 FDA-approved drugs, 1,619 drugs that reached phases 1-3 of clinical development, 96 compounds that were previously approved but withdrawn from use, and 2,041 preclinical or tool compounds). Annotations include compound name, chemical structure, clinical trial status, mechanism of action, protein targets, disease areas, approved indications (where applicable), purity of the purchased sample, and vendor ID.
e
EU Veterinary Medicinal Product Database
data.europa.eu
html
Updated Nov 20, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
European Medicines Agency (2016). EU Veterinary Medicinal Product Database [Dataset]. https://data.europa.eu/data/datasets/eu-veterinary-medicinal-product-database?locale=en
Explore at:
htmlAvailable download formats
Dataset updated
Nov 20, 2016
Dataset authored and provided by
European Medicines Agency
License
http://data.europa.eu/eli/dec/2011/833/ojhttp://data.europa.eu/eli/dec/2011/833/oj
Area covered
European Union
Description
The EU Veterinary Medicinal Product Database is intended to be a source of information on all medicinal products for veterinary use that have been authorised in the European Union and the European Economic Area. The database is hosted by the European Medicines Agency.
d
MEDLINE/PubMed Citations
catalog.data.gov
datadiscovery.nlm.nih.gov
+3more
Updated Jun 19, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Library of Medicine (2025). MEDLINE/PubMed Citations [Dataset]. https://catalog.data.gov/dataset/medline-pubmed-citations-d2ed0
Explore at:
Dataset updated
Jun 19, 2025
Dataset provided by
National Library of Medicine
Description
PubMed is a free resource supporting the search and retrieval of biomedical and life sciences literature with the aim of improving health–both globally and personally. The PubMed database contains citations and abstracts of biomedical literature. It does not include full text journal articles; however, links to the full text are often present when available from other sources, such as the publisher's website or PubMed Central (PMC). See the PubMed User Guide for more information. https://pubmed.ncbi.nlm.nih.gov/help/
Drug-Drug Interactions
kaggle.com
zip
Updated Aug 31, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MGhobashy (2024). Drug-Drug Interactions [Dataset]. https://www.kaggle.com/datasets/mghobashy/drug-drug-interactions
Explore at:
zip(1923486 bytes)Available download formats
Dataset updated
Aug 31, 2024
Authors
MGhobashy
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This dataset provides a comprehensive collection of drug-drug interactions (DDIs) intended for research in predicting and understanding complex interaction relationships between drugs. It is sourced from the Drug Bank database and is designed to support multi-task learning approaches in the domain of bioinformatics and pharmacology.

Feature Details: Drug 1: Name of the first drug in the interaction. Drug 2: Name of the second drug in the interaction. Interaction Description: Detailed description of the interaction between the two drugs.

Source: The dataset is derived from the datasets provided by the team at TDCommons

Facebook

Twitter

Click to copy link

Link copied

Cite

Mohneesh_Sreegirisetty (2023). Indian Medicine Data [Dataset]. https://www.kaggle.com/datasets/mohneesh7/indian-medicine-data

Indian Medicine Data

Indian medicine database for all categories of medicines.

Explore at:

8 scholarly articles cite this dataset (View in Google Scholar)

zip(18681848 bytes)Available download formats

Dataset updated

Aug 20, 2023

Authors

Mohneesh_Sreegirisetty

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

India Medicine Database

Dataset consists of 8 columns : - sub_category: This classification pertains to specific medical categories that define the domain in which the medicine finds its application. - product_name: This is the name of the product, as available in the indian market. - salt_composition: This is the chemical composition of the drug. - product_price:This represents the previous price of the product. Please consider this as a reference, as it tends to be highly volatile in relation to the health market. - product_manufactured:The pharmaceutical company responsible for producing the medicine/drug. - medicine_desc: Comprehensive overview and detailed description of the specific product. - side_effects:Potential adverse effects associated with the drug/medicine. - drug_interactions:Interactions and effects when combining this specific medicine with other drugs.

There are a few missing values in the dataset, but most information is available for the row, so I have left as is.

Clear search

Close search

Google apps

Main menu

Indian Medicine Data

India Medicine Database

A-Z Medicine Dataset of India

Medicine data: herbal medicines

RxNorm Data

Context

Content

Acknowledgements

Inspiration

DrugBank Database Data Package

Drug Pharma New Dataset

MIMIC-III Clinical Database

Medi-Span

Data from: RxNorm

Data from: DailyMed

List of Registered Pharmaceutical Products | DATA.GOV.HK

Drug Dataset: Uses, Side Effects and User Reviews

Overview

Use Cases: The dataset is valuable for various applications, including:

National Drug Code Directory

EHRSHOT

Abstract

Methodology

Usage

GUDID Download

GlobalEssentialMedicinesDatabase.xlsx

Repurposing related drug annotations

EU Veterinary Medicinal Product Database

MEDLINE/PubMed Citations

Drug-Drug Interactions

Indian Medicine Data

Indian medicine database for all categories of medicines.

India Medicine Database