100+ datasets found

PHI-base: the Pathogen-Host Interactions Database, version 5.0
zenodo.org
zip
Updated Aug 12, 2025
Cite
Alayne Cuzick; James Seager; Martin Urban; Kim Hammond-Kosack (2025). PHI-base: the Pathogen-Host Interactions Database, version 5.0 [Dataset]. http://doi.org/10.5281/zenodo.10722193
Explore at:
Available download formats: zip
Unique identifier
https://doi.org/10.5281/zenodo.10722193
Dataset updated
Aug 12, 2025
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Alayne Cuzick; James Seager; Martin Urban; Kim Hammond-Kosack
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
⚠️ This release is outdated. The latest release is available at: doi:10.5281/zenodo.10722192

The Pathogen–Host Interactions Database (PHI-base) is an online database that catalogues experimentally-verified pathogenicity, virulence and effector genes from fungal, oomycete, and bacterial pathogens, which infect animal, plant, fungal, and insect hosts. PHI-base is a valuable resource in the discovery of genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention.

Information in PHI-base is manually curated by domain experts and is supported by strong experimental evidence (for example, gene disruption and gene complementation experiments), as well as references to the literature in which the original experiments are described. Annotations are made using terms from ontologies and controlled vocabularies, including the Gene Ontology (GO), Brenda Tissue Ontology (BTO), and the Pathogen–Host Interaction Phenotype Ontology (PHIPO).

PHI-base 5 includes data that was curated using a new curation process described in Cuzick et al. (2023). Data releases for PHI-base 5 do not use the same schema as data releases from PHI-base 4, but all data records from PHI-base 4 that can be made compatible with the new schema are included with this release. Data releases from PHI-base 4 and PHI-base 5 will occur in parallel until all data from PHI-base 4 can be migrated to PHI-base 5. The PHI-base 4 data releases are available on Zenodo at https://zenodo.org/doi/10.5281/zenodo.5356870.

Data content

phi-base_v5.0.xlsx: the PHI-base dataset as an Excel spreadsheet. This format follows the layout of the PHI-base 5 website, with sheets corresponding to the sections of gene pages on the website. This format is designed for use by non-technical users.

phi-base_v5.0.json: the PHI-base dataset in JSON format. This format is closer to the data format that is exported by PHI-Canto, the curation tool used by PHI-base. This format is primarily intended for programmatic usage and has additional data (e.g. metadata for curation sessions) that is not included in the spreadsheet format.

phi-base.schema.json: a JSON Schema file for the JSON format of the dataset. This is included as documentation for the fields in the JSON file, but can also be used to validate the dataset.
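Because the schema file doubles as field documentation, one way to get an overview is to list the top-level properties it declares. The following is a minimal sketch using only the Python standard library. The toy schema and its field names are hypothetical and are not taken from the real phi-base.schema.json; only the standard JSON Schema keywords `properties`, `required`, and `description` are assumed.

```python
import json

# Toy schema for illustration only. It is NOT the real phi-base.schema.json;
# the property names below are hypothetical.
TOY_SCHEMA_TEXT = """
{
  "title": "Toy dataset",
  "type": "object",
  "required": ["curation_sessions"],
  "properties": {
    "curation_sessions": {"type": "object", "description": "Hypothetical field."},
    "schema_version": {"type": "integer", "description": "Hypothetical field."}
  }
}
"""


def summarize_schema(schema: dict) -> dict:
    """Collect the top-level field names and descriptions a JSON Schema documents."""
    properties = schema.get("properties", {})
    return {
        "title": schema.get("title"),
        "required": sorted(schema.get("required", [])),
        "fields": {
            name: spec.get("description", "")
            for name, spec in sorted(properties.items())
        },
    }


summary = summarize_schema(json.loads(TOY_SCHEMA_TEXT))
for name, description in summary["fields"].items():
    marker = "required" if name in summary["required"] else "optional"
    print(f"{name} ({marker}): {description}")
```

To inspect the real file, the toy text could be replaced by the contents of phi-base.schema.json read from disk. Full validation of the dataset against the schema would need a dedicated JSON Schema validator, which this sketch does not attempt.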
The Pathogen-Host Interactions Database, version 4.17
data.niaid.nih.gov
Updated Sep 16, 2024
Cite
Urban, Martin; Cuzick, Alayne; Seager, James; Hammond-Kosack, Kim (2024). The Pathogen-Host Interactions Database, version 4.17 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5356870
Explore at:
Dataset updated
Sep 16, 2024
Dataset provided by
Rothamsted Research
Authors
Urban, Martin; Cuzick, Alayne; Seager, James; Hammond-Kosack, Kim
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
PHI-base is an online database (available at phi-base.org) that catalogues experimentally verified pathogenicity, virulence and effector genes from fungal, oomycete and bacterial pathogens, which infect animal, plant, fungal and insect hosts. PHI-base is a valuable resource in the discovery of genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention.

Each entry in PHI-base is curated by domain experts and is supported by strong experimental evidence (for example, gene disruption and gene complementation experiments), as well as literature references in which the original experiments are described. Each gene in PHI-base is presented with its nucleotide sequence and deduced amino acid sequence (available in a FASTA file), as well as a detailed description of the predicted protein's function during the host infection process. To facilitate data interoperability, we have annotated genes using ontologies, controlled vocabularies, and links to external sources (including UniProt, Gene Ontology, Enzyme Commission, NCBI Taxonomy, EMBL, PubMed and FRAC).

This PHI-base dataset is a Frictionless Data Package that contains an export of the PHI-base database in CSV format (comma-separated values), plus a FASTA file with sequences for each gene in the database. This version of the dataset, version 4.17, contains 5,521 publications, covering 22,408 pathogen–host interactions and 9,973 pathogen genes across 296 pathogen species and 249 host species.
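The FASTA file mentioned above can be read with a few lines of standard-library Python. This is a minimal sketch, assuming the usual FASTA convention of a `>` header line followed by one or more sequence lines. The record identifiers and sequences in the example are made up, and the actual header layout used in the PHI-base FASTA file is not described here, so the sketch simply keeps the first whitespace-delimited token of each header as the key.

```python
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Map each record identifier to its concatenated sequence."""
    records: Dict[str, str] = {}
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            if header is not None:
                records[header] = "".join(chunks)
            parts = line[1:].split()
            header = parts[0] if parts else ""
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records[header] = "".join(chunks)
    return records


# Made-up records for illustration; not real PHI-base entries.
example = [
    ">example_gene_1 hypothetical description",
    "MKTAYIAK",
    "QRQISFVK",
    ">example_gene_2",
    "MSDNE",
]
sequences = read_fasta(example)
print({name: len(seq) for name, seq in sequences.items()})
```

For a file on disk, an open file handle can be passed directly, since the function only iterates over lines.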

Erratum

Please note that the funding information included in the readme file for this dataset (specifically README.md and README.html) is incorrect. The correct funding sources are Growing Health [BB/X010953/1; BBS/E/RH/230003A] and Delivering Sustainable Wheat [BB/X011003/1; BBS/E/RH/230001B], both ultimately funded by the Biotechnology and Biological Sciences Research Council (BBSRC). The metadata for this dataset has been amended to use the correct funding sources (updated 16 September 2024).
Data from: PHI-base in 2022: a multi-species phenotype database for...
ckan.grassroots.tools
pdf
Updated Sep 16, 2022
Cite
Rothamsted Research (2022). PHI-base in 2022: a multi-species phenotype database for Pathogen–Host Interactions [Dataset]. https://ckan.grassroots.tools/ar/dataset/c19809eb-f86c-4ae0-a0ad-36804b73dbd3
Explore at:
Available download formats: pdf
Dataset updated
Sep 16, 2022
Dataset provided by
Rothamsted Research
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract: Since 2005, the Pathogen–Host Interactions Database (PHI-base) has manually curated experimentally verified pathogenicity, virulence and effector genes from fungal, bacterial and protist pathogens, which infect animal, plant, fish, insect and/or fungal hosts. PHI-base (www.phi-base.org) is devoted to the identification and presentation of phenotype information on pathogenicity and effector genes and their host interactions. Specific gene alterations that did not alter the in host interaction phenotype are also presented. PHI-base is invaluable for comparative analyses and for the discovery of candidate targets in medically and agronomically important species for intervention. Version 4.12 (September 2021) contains 4387 references, and provides information on 8411 genes from 279 pathogens, tested on 228 hosts in 18,190 interactions. This provides a 24% increase in gene content since Version 4.8 (September 2019). Bacterial and fungal pathogens represent the majority of the interaction data, with a 54:46 split of entries, whilst protists, protozoa, nematodes and insects represent 3.6% of entries. Host species consist of approximately 54% plants and 46% others of medical, veterinary and/or environmental importance. PHI-base data is disseminated to UniProtKB, FungiDB and Ensembl Genomes. PHI-base will migrate to a new gene-centric version (version 5.0) in early 2022. This major development is briefly described.

PHI-base: the Pathogen-Host Interactions Database, version 5.3
zenodo.org
zip
Updated Feb 1, 2026
Cite
Hsin Yu Chang; James Seager; Martin Urban; Kim Hammond-Kosack (2026). PHI-base: the Pathogen-Host Interactions Database, version 5.3 [Dataset]. http://doi.org/10.5281/zenodo.18449986
Explore at:
Available download formats: zip
Unique identifier
https://doi.org/10.5281/zenodo.18449986
Dataset updated
Feb 1, 2026
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Hsin Yu Chang; James Seager; Martin Urban; Kim Hammond-Kosack
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description

Download the dataset here: phi-base_v5.3.zip

The Pathogen–Host Interactions Database (PHI-base) is an online database that catalogues experimentally-verified pathogenicity, virulence and effector genes from fungal, oomycete, and bacterial pathogens, which infect animal, plant, fungal, and insect hosts. PHI-base is a valuable resource in the discovery of genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention.

Information in PHI-base is manually curated by domain experts and is supported by strong experimental evidence (for example, gene disruption and gene complementation experiments), as well as references to the literature in which the original experiments are described. Annotations are made using terms from ontologies and controlled vocabularies, including the Gene Ontology (GO), Brenda Tissue Ontology (BTO), and the Pathogen–Host Interaction Phenotype Ontology (PHIPO).

PHI-base 5 includes data that was curated using a new curation process described in Cuzick et al. (2023). Data releases for PHI-base 5 do not use the same schema as data releases from PHI-base 4, but all data records from PHI-base 4 that can be made compatible with the new schema are included with this release. Data releases from PHI-base 4 and PHI-base 5 will occur in parallel until all data from PHI-base 4 can be migrated to PHI-base 5. The PHI-base 4 data releases are available on Zenodo at https://zenodo.org/doi/10.5281/zenodo.5356870.

For more information about the planned transition from PHI-base 4 to PHI-base 5, see the Help and Announcements page on the PHI-base 5 website.

Release statistics

This version of the PHI-base 5 dataset contains the following types of information:

Data type	Count
Genes	10353
Interactions	33498
Pathogen species	303
Host species	237
Diseases	343
References (publications)	5222
Annotations
Pathogen-host interaction phenotype	19419
Gene-for-gene phenotype	569
Pathogen phenotype	12130
Host phenotype	15
GO biological process	1476
GO cellular component	109
GO molecular function	157
Post-translational modification	7
Physical interaction	73
WT RNA expression	48
WT protein expression	2

File contents

phi-base_v5.3.xlsx: the PHI-base dataset as an Excel spreadsheet. This format follows the layout of the PHI-base 5 website, with sheets corresponding to the sections of gene pages on the website. This format is designed for use by non-technical users.
phi-base_v5.3.json: the PHI-base dataset in JSON format. This is modelled on the export format used by PHI-Canto, the curation tool used by PHI-base. This format is primarily intended for programmatic usage and has additional information (e.g. metadata for curation sessions) that is not included in the spreadsheet format.
phi-base.schema.json: a JSON Schema file for the JSON format of the dataset. This is included as documentation for the fields in the JSON file, but can also be used to validate the dataset.
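For programmatic use, the JSON file can be read straight out of the release archive without unpacking it. The sketch below is illustrative only: it assumes the archive contains exactly one dataset `.json` file alongside the `.schema.json` file, and it makes no assumption about the keys inside the dataset. The toy archive and the key `hypothetical_key` exist only so the example runs without the real download.

```python
import json
import tempfile
import zipfile
from pathlib import Path


def load_dataset_json(zip_path):
    """Load the single dataset JSON file from a release archive."""
    with zipfile.ZipFile(zip_path) as archive:
        # Skip the schema file, which also ends in ".json".
        candidates = [
            name for name in archive.namelist()
            if name.endswith(".json") and not name.endswith(".schema.json")
        ]
        if len(candidates) != 1:
            raise ValueError(f"expected one dataset JSON file, found {candidates}")
        with archive.open(candidates[0]) as handle:
            return json.load(handle)


# Build a tiny stand-in archive so the sketch runs without the real download.
with tempfile.TemporaryDirectory() as tmp:
    toy_zip = Path(tmp) / "toy_release.zip"
    with zipfile.ZipFile(toy_zip, "w") as archive:
        archive.writestr("toy_release.json", json.dumps({"hypothetical_key": [1, 2, 3]}))
        archive.writestr("toy.schema.json", json.dumps({"type": "object"}))
    data = load_dataset_json(toy_zip)

top_level_keys = sorted(data)
print(top_level_keys)
```

With the real release, `load_dataset_json("phi-base_v5.3.zip")` would be the equivalent call, provided the archive layout matches the assumption above. Printing the top-level keys is a reasonable first step before consulting the schema file for their meaning.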

Data from: A DICOM dataset for evaluation of medical image de-identification...
cancerimagingarchive.net
csv, dicom, n/a
Updated Jan 31, 2021
Cite
The Cancer Imaging Archive (2021). A DICOM dataset for evaluation of medical image de-identification [Dataset]. http://doi.org/10.7937/s17z-r072
Explore at:
Available download formats: dicom, csv, n/a
Unique identifier
https://doi.org/10.7937/s17z-r072
Dataset updated
Jan 31, 2021
Dataset authored and provided by
The Cancer Imaging Archive
License
https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/
Time period covered
Apr 7, 2021
Dataset funded by
National Cancer Institute (http://www.cancer.gov/)
Description
Open access or shared research data must comply with HIPAA patient privacy regulations. These regulations require the de-identification of datasets before they can be placed in the public domain. The process of image de-identification is time consuming, requires significant human resources, and is prone to human error. Automated image de-identification algorithms have been developed, but the research community requires some method of evaluation before such tools can be widely accepted. This evaluation requires a robust dataset that can be used as part of an evaluation process for de-identification algorithms.
We developed a DICOM dataset that can be used to evaluate the performance of de-identification algorithms. DICOM image information objects were selected from datasets published in TCIA. Synthetic Protected Health Information (PHI) was generated and inserted into selected DICOM data elements to mimic typical clinical imaging exams. The evaluation dataset was de-identified by a TCIA curation team using standard TCIA tools and procedures. We are publishing the evaluation dataset (containing synthetic PHI) and de-identified evaluation dataset (result of TCIA curation) in advance of a potential competition, sponsored by the National Cancer Institute (NCI), for de-identification algorithm evaluation, and de-identification of medical image datasets. The evaluation dataset published here is a subset of a larger evaluation dataset that was created under contract for the National Cancer Institute. This subset is being published to allow researchers to test their de-identification algorithms and promote standardized procedures for validating automated de-identification.
PHI-base
neuinfo.org
Updated Jan 13, 2025
Cite
(2025). PHI-base [Dataset]. http://identifiers.org/RRID:SCR_003331
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_003331
Dataset updated
Jan 13, 2025
Description
Database that catalogs experimentally verified pathogenicity, virulence and effector genes from fungal, Oomycete and bacterial pathogens, which infect animal, plant, fungal and insect hosts. It is an invaluable resource in the discovery of genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention. In collaboration with the FRAC team, it also includes antifungal compounds and their target genes. Each entry is curated by domain experts and is supported by strong experimental evidence (gene disruption experiments, STM etc), as well as literature references in which the original experiments are described. Each gene is presented with its nucleotide and deduced amino acid sequence, as well as a detailed description of the predicted protein's function during the host infection process. To facilitate data interoperability, genes have been annotated using controlled vocabularies and links to external sources (Gene Ontology terms, EC Numbers, NCBI taxonomy, EMBL, PubMed and FRAC).
Phi Import Data & Buyers List in USA
seair.co.in
Updated Apr 14, 2025
Cite
Seair Exim Solutions (2025). Phi Import Data & Buyers List in USA [Dataset]. https://www.seair.co.in/us-import/product-phi.aspx
Explore at:
Available download formats: .text/.csv/.xml/.xls/.bin
Dataset updated
Apr 14, 2025
Dataset authored and provided by
Seair Exim Solutions
Area covered
United States
Description
Get the latest USA Phi import data with importer names, shipment details, buyers list, product description, price, quantity, and major US ports.
Pi Beta Phi | Universities & Colleges | Education Data
datastore.forage.ai
Updated Sep 19, 2024
Cite
(2024). Pi Beta Phi | Universities & Colleges | Education Data [Dataset]. https://datastore.forage.ai/searchresults/?resource_keyword=Universities%20&%20Colleges
Explore at:
Dataset updated
Sep 19, 2024
Description
Pi Beta Phi has been supporting and empowering women since 1867. The organization has grown into a global community with a lifelong membership experience rooted in timeless values, promoting friendship, leadership potential, and community service. Pi Phi aims to build confident women leaders who are equipped to make a difference in their communities and beyond. The sorority's literacy program, Read > Lead > Achieve, has been a cornerstone of its philanthropic efforts for over a century, inspiring a love of reading and learning among its members and in the communities it serves.

With its headquarters based in Town and Country, Missouri, Pi Beta Phi has a presence on college campuses and in communities across the United States and Canada. The organization is dedicated to providing its members with opportunities for personal growth, leadership development, and community engagement. Pi Phi's alumni network is a key part of the sorority's identity, with many sisters going on to become leaders in their communities and professions. Through its various programs and initiatives, Pi Beta Phi aims to create a lasting impact on the lives of its members and their communities.
Comprehensive analysis of Verticillium nonalfalfae in silico secretome...
plos.figshare.com
xlsx
Updated Jun 2, 2023
Cite
Kristina Marton; Marko Flajšman; Sebastjan Radišek; Katarina Košmelj; Jernej Jakše; Branka Javornik; Sabina Berne (2023). Comprehensive analysis of Verticillium nonalfalfae in silico secretome uncovers putative effector proteins expressed during hop invasion [Dataset]. http://doi.org/10.1371/journal.pone.0198971
Explore at:
Available download formats: xlsx
Unique identifier
https://doi.org/10.1371/journal.pone.0198971
Dataset updated
Jun 2, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Kristina Marton; Marko Flajšman; Sebastjan Radišek; Katarina Košmelj; Jernej Jakše; Branka Javornik; Sabina Berne
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The vascular plant pathogen Verticillium nonalfalfae causes Verticillium wilt in several important crops. VnaSSP4.2 was recently discovered as a V. nonalfalfae virulence effector protein in the xylem sap of infected hop. Here, we expanded our search for candidate secreted effector proteins (CSEPs) in the V. nonalfalfae predicted secretome using a bioinformatic pipeline built on V. nonalfalfae genome data, RNA-Seq and proteomic studies of the interaction with hop. The secretome, rich in carbohydrate active enzymes, proteases, redox proteins and proteins involved in secondary metabolism, cellular processing and signaling, includes 263 CSEPs. Several homologs of known fungal effectors (LysM, NLPs, Hce2, Cerato-platanins, Cyanovirin-N lectins, hydrophobins and CFEM domain containing proteins) and avirulence determinants in the PHI database (Avr-Pita1 and MgSM1) were found. The majority of CSEPs were non-annotated and were narrowed down to 44 top priority candidates based on their likelihood of being effectors. These were examined by spatio-temporal gene expression profiling of infected hop. Among the highest in planta expressed CSEPs, five deletion mutants were tested in pathogenicity assays. A deletion mutant of VnaUn.279, a lethal pathotype specific gene with sequence similarity to SAM-dependent methyltransferase (LaeA), had lower infectivity and showed highly reduced virulence, but no changes in morphology, fungal growth or conidiation were observed. Several putative secreted effector proteins that probably contribute to V. nonalfalfae colonization of hop were identified in this study. Among them, LaeA gene homolog was found to act as a potential novel virulence effector of V. nonalfalfae. The combined results will serve for future characterization of V. nonalfalfae effectors, which will advance our understanding of Verticillium wilt disease.
Data from: Amino acid repeat signatures underlying human-pathogen...
data.mendeley.com
Updated Nov 28, 2025
Cite
Anjali Kumari Singh (2025). Amino acid repeat signatures underlying human-pathogen interactions [Dataset]. http://doi.org/10.17632/nzk4swk7xy.1
Explore at:
Unique identifier
https://doi.org/10.17632/nzk4swk7xy.1
Dataset updated
Nov 28, 2025
Authors
Anjali Kumari Singh
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Emerging evidence suggests that amino acid homorepeats (HRs) in proteins (HRPs) contribute to protein interactability. What is the role of HRs in human-pathogen protein interactions? We find that pathogens engage physiologically important human HRPs, thereby affecting diverse host physiological processes. From the pathogen standpoint, (i) eukaryotic pathogens engage more HRPs but with host-sparse HRs, leading to disparate and discriminate interactions, (ii) prokaryotic pathogens engage fewer HRPs but with host-abundant non-polar HRs via host protein proxies bringing about discriminate or promiscuous interactions and (iii) viral pathogens engage more HRPs with host-abundant polar uncharged HRs affecting promiscuous interactions using host-partner HR tract mimicry. To propel further research, we introduce a resource Hi-PHI (http://hiphi.iisertirupati.ac.in/) cataloging critical information about human and pathogen HRPs and HRs. We propose mechanisms to (i) repurpose drugs targeting human HRPs engaged by pathogens for treating different infections and (ii) exploit HRs and their flanks as targets for pathogen-targeted anti-infectives.

Here, we have uploaded the assembled and curated human-pathogen protein interactome (HPI), which has 19,535 interactions between human and pathogen proteins. We have also provided the source code to facilitate repetition of this work and address other fundamental systems- and molecular-level questions. Instructions for using the code are provided in the individual scripts. All the datasets assembled, curated, generated and used in this study are available as a resource, the Hi-PHI database (http://hiphi.iisertirupati.ac.in/).
F. graminearum genes with known phenotypes from the PHI-base database...
plos.figshare.com
xlsx
Updated Jan 9, 2025
Cite
Erika Kroll; Carlos Bayon; Jason Rudd; Victoria J. Armer; Anjana Magaji-Umashankar; Ryan Ames; Martin Urban; Neil A. Brown; Kim Hammond-Kosack (2025). F. graminearum genes with known phenotypes from the PHI-base database (www.PHI-base.org) in each fungal module. [Dataset]. http://doi.org/10.1371/journal.ppat.1012769.s017
Explore at:
Available download formats: xlsx
Unique identifier
https://doi.org/10.1371/journal.ppat.1012769.s017
Dataset updated
Jan 9, 2025
Dataset provided by
PLOS (http://plos.org/)
Authors
Erika Kroll; Carlos Bayon; Jason Rudd; Victoria J. Armer; Anjana Magaji-Umashankar; Ryan Ames; Martin Urban; Neil A. Brown; Kim Hammond-Kosack
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Table provides RRES v5 gene ID, PHI identifier ID from PHI-base, Uniprot protein ID, gene function, mutant phenotype, experimental technique, author reference, and year published. (XLSX)
Data from: $\mathrm{K}^{*}(\mathrm{892})^{0}$ and $\mathrm{\phi(1020)}$...
hepdata.net
Updated Jan 24, 2023
Cite
(2023). $\mathrm{K}^{*}(\mathrm{892})^{0}$ and $\mathrm{\phi(1020)}$ production in p-Pb collisions at $\sqrt{s_{\rm NN}}$ = 8.16 TeV [Dataset]. http://doi.org/10.17182/hepdata.136309.v1
Explore at:
Unique identifier
https://doi.org/10.17182/hepdata.136309.v1
Dataset updated
Jan 24, 2023
Description
The production of $\mathrm{K}^{*}(\mathrm{892})^{0}$ and $\mathrm{\phi(1020)}$ resonances has been measured in p-Pb collisions at $\sqrt{s_{\rm NN}}$ = 8.16 TeV using the ALICE detector. Resonances are reconstructed via their hadronic decay channels in the rapidity interval $-$0.5 $<$y$<$ 0 and the transverse momentum spectra are measured for various multiplicity classes up to $p_{\rm T}$ = 20 GeV/$c$ for $\mathrm{K}^{*}(\mathrm{892})^{0}$ and $p_{\rm T}$ = 16 GeV/$c$ for $\mathrm{\phi(1020)}$. The $p_{\rm T}$ -integrated yields and mean transverse momenta are reported and compared with previous results in pp, p-Pb and Pb-Pb collisions. The $x_{\mathrm{T}}$ scaling for $\mathrm{K}^{*}(\mathrm{892})^{0}$ and $\mathrm{\phi(1020)}$ resonance production is newly tested in p-Pb collisions and found to hold in the high-$p_{\rm T}$ region at LHC energies. The nuclear modification factors ($R_{\rm pPb}$) as a function of $p_{\rm T}$ for $\mathrm{K}^{*0}$ and $\mathrm{\phi}$ at $\sqrt{s_{NN}}$ = 8.16 TeV are presented along with the new $R_{\rm pPb}$ measurements of $\mathrm{K}^{*0}$, $\mathrm{\phi}$ , $\Xi$, and $\Omega$ at $\sqrt{s_{\rm NN}}$ = 5.02 TeV. At intermediate $p_{\rm T}$ (2-8 GeV/$c$), $R_{\rm pPb}$ of $\Xi$, $\Omega$ show a Cronin-like enhancement, while $\mathrm{K}^{*0}$ and $\mathrm{\phi}$ show no or little nuclear modification. At high $p_{\rm T}$ ($>$ 8 GeV/$c$), the $R_{\rm pPb}$ values of all hadrons are consistent with unity within uncertainties. The $R_{\rm pPb}$ of $\mathrm{K}^{*}(\mathrm{892})^{0}$ and $\mathrm{\phi(1020)}$ at $\sqrt{s_{\rm NN}}$ = 8.16 and 5.02 TeV show no significant energy dependence.
mimic-iv-clinical-database-demo-2.2
kaggle.com
zip
Updated Apr 1, 2025
Cite
Montassar bellah (2025). mimic-iv-clinical-database-demo-2.2 [Dataset]. https://www.kaggle.com/montassarba/mimic-iv-clinical-database-demo-2-2
Explore at:
Available download formats: zip (16441065 bytes)
Dataset updated
Apr 1, 2025
Authors
Montassar bellah
License
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Description
Abstract The Medical Information Mart for Intensive Care (MIMIC)-IV database is comprised of deidentified electronic health records for patients admitted to the Beth Israel Deaconess Medical Center. Access to MIMIC-IV is limited to credentialed users. Here, we have provided an openly-available demo of MIMIC-IV containing a subset of 100 patients. The dataset includes similar content to MIMIC-IV, but excludes free-text clinical notes. The demo may be useful for running workshops and for assessing whether the MIMIC-IV is appropriate for a study before making an access request.

Background The increasing adoption of digital electronic health records has led to the existence of large datasets that could be used to carry out important research across many areas of medicine. Research progress has been limited, however, due to limitations in the way that the datasets are curated and made available for research. The MIMIC datasets allow credentialed researchers around the world unprecedented access to real world clinical data, helping to reduce the barriers to conducting important medical research. The public availability of the data allows studies to be reproduced and collaboratively improved in ways that would not otherwise be possible.

Methods First, the set of individuals to include in the demo was chosen. Each person in MIMIC-IV is assigned a unique subject_id. As the subject_id is randomly generated, ordering by subject_id results in a random subset of individuals. We only considered individuals with an anchor_year_group value of 2011 - 2013 or 2014 - 2016 to ensure overlap with MIMIC-CXR v2.0.0. The first 100 subject_id who satisfied the anchor_year_group criteria were selected for the demo dataset.

All tables from MIMIC-IV were included in the demo dataset. Tables containing patient information, such as emar or labevents, were filtered using the list of selected subject_id. Tables which do not contain patient level information were included in their entirety (e.g. d_items or d_labitems). Note that all tables which do not contain patient level information are prefixed with the characters 'd_'.
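The table-selection rule described above can be sketched in a few lines: tables whose names start with `d_` are kept whole, and every other table is filtered to the selected `subject_id` values. This is an illustrative sketch, not the code used to build the demo. The table names `emar` and `d_items` and the column `subject_id` come from the description; the other column names, the identifiers, and the row values are made up.

```python
import csv
import io


def filter_table(table_name, rows, subject_ids):
    """Keep dictionary tables whole; filter patient-level tables by subject_id."""
    if table_name.startswith("d_"):
        return list(rows)
    return [row for row in rows if row["subject_id"] in subject_ids]


# Made-up identifiers; csv.DictReader yields strings, so the set holds strings too.
selected = {"10000001", "10000002"}

# Toy CSV contents for illustration only.
emar_csv = "subject_id,medication\n10000001,drug_a\n10000003,drug_b\n"
d_items_csv = "itemid,label\n1,Heart rate\n2,Temperature\n"

emar_rows = filter_table("emar", csv.DictReader(io.StringIO(emar_csv)), selected)
d_items_rows = filter_table("d_items", csv.DictReader(io.StringIO(d_items_csv)), selected)

print(len(emar_rows), len(d_items_rows))
```

The same function would apply to the list in demo_subject_id.csv if its identifiers were loaded into the `selected` set.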

Deidentification was performed following the same approach as the MIMIC-IV database. Protected health information (PHI) as listed in the HIPAA Safe Harbor provision was removed. Patient identifiers were replaced using a random cipher, resulting in deidentified integer identifiers for patients, hospitalizations, and ICU stays. Stringent rules were applied to structured columns based on the data type. Dates were shifted consistently using a random integer removing seasonality, day of the week, and year information. Text fields were filtered by manually curated allow and block lists, as well as context-specific regular expressions. For example, columns containing dose values were filtered to only contain numeric values. If necessary, a free-text deidentification algorithm was applied to remove PHI from free-text. Results of this algorithm were manually reviewed and verified to remove identified PHI.
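The idea of shifting dates consistently can be illustrated with a per-patient random offset: every date belonging to one patient moves by the same number of days, so intervals within that patient's record are preserved while the absolute dates are hidden. The sketch below is an assumption-laden illustration and not the MIMIC-IV procedure. The offset range, the seed handling, and the function names are hypothetical, and it does not reproduce the handling of seasonality, day of the week, or year described above.

```python
import random
from datetime import date, timedelta


def make_offsets(subject_ids, seed, max_days=36500):
    """Draw one fixed random day offset per subject (range is an assumption)."""
    rng = random.Random(seed)
    return {
        subject_id: timedelta(days=rng.randint(1, max_days))
        for subject_id in sorted(subject_ids)
    }


def shift_date(original, subject_id, offsets):
    """Apply the subject's offset, so all of that subject's dates move together."""
    return original + offsets[subject_id]


# Made-up subjects and dates for illustration only.
offsets = make_offsets({"A", "B"}, seed=0)
admitted = date(2015, 3, 1)
discharged = date(2015, 3, 9)

shifted_admitted = shift_date(admitted, "A", offsets)
shifted_discharged = shift_date(discharged, "A", offsets)

# The length of stay is unchanged even though both dates moved.
print(shifted_discharged - shifted_admitted)
```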

Data Description MIMIC-IV is a relational database consisting of 26 tables. For a detailed description of the database structure, see the MIMIC-IV Clinical Database page [1] or the MIMIC-IV online documentation [2]. The demo shares an identical schema and structure to the equivalent version of MIMIC-IV.

Data files are distributed in comma separated value (CSV) format following the RFC 4180 standard [3]. The dataset is also made available on Google BigQuery. Instructions for accessing the dataset on BigQuery are provided in the online MIMIC-IV documentation, under the cloud page [2].

An additional file is included: demo_subject_id.csv. This is a list of the subject_id used to filter MIMIC-IV to the demo subset.

Usage Notes The MIMIC-IV demo provides researchers with the opportunity to better understand MIMIC-IV data.

CSV files can be opened natively using any text editor or spreadsheet program. However, as some tables are large it may be preferable to navigate the data via a relational database. We suggest either working with the data in Google BigQuery (see the "Files" section for access details) or creating an SQLite database using the CSV files. SQLite is a lightweight database format which stores all constituent tables in a single file, and SQLite databases interoperate well with a number of software tools.
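As a rough illustration of the SQLite suggestion, a CSV file can be loaded into a table with Python's built-in `csv` and `sqlite3` modules. This is a minimal sketch, not the project's own loading code: the function name is hypothetical, columns are created without declared types so every value is stored as text, and the toy `d_items` content is made up. Table and column names are quoted but not otherwise sanitized, so it should only be used with trusted headers.

```python
import csv
import io
import sqlite3


def load_csv(conn, table, csv_file):
    """Create a table from a CSV header row and insert the remaining rows."""
    reader = csv.reader(csv_file)
    header = next(reader)
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', reader)
    conn.commit()


# In-memory database and toy CSV content so the sketch runs on its own.
conn = sqlite3.connect(":memory:")
toy_csv = io.StringIO("itemid,label\n1,Heart rate\n2,Temperature\n")
load_csv(conn, "d_items", toy_csv)

row_count = conn.execute('SELECT COUNT(*) FROM "d_items"').fetchone()[0]
labels = [row[0] for row in conn.execute('SELECT label FROM "d_items" ORDER BY itemid')]
print(row_count, labels)
```

Passing a file path to `sqlite3.connect` instead of `":memory:"` would produce the single-file database described above, and an open CSV file handle can replace the `io.StringIO` object.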

Code is made available for use with MIMIC-IV on the MIMIC-IV code repository [4]. Code provided includes derivation of clinical concepts, tutorials, and reproducible analyses.

Release Notes Release notes for the demo follow the release notes for the MIMIC-IV database.

Ethics This project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the pr...
CalHHS Data De-Identification Guidelines Reference Dataset
data.ca.gov
csv, zip
Updated Nov 6, 2025
Cite
California Health and Human Services Agency (2025). CalHHS Data De-Identification Guidelines Reference Dataset [Dataset]. https://data.ca.gov/dataset/calhhs-data-de-identification-guidelines-reference-dataset
Available download formats: csv, zip
Dataset updated
Nov 6, 2025
Dataset authored and provided by
California Health and Human Services Agency (https://www.chhs.ca.gov/)
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
These datasets are part of the California Health and Human Services Agency (CalHHS) Data De-Identification Guidelines (DDG) in the "State and County Population Projections" Appendix. The DDG assists CalHHS departments in evaluating data for public release while ensuring the privacy of individuals represented in the data. California population estimates serve as a foundation for the population-based scoring assessments outlined in the DDG.
Genes encoding pathogenicity related factors derived from PHI database
figshare.com
xlsx
Updated Jul 9, 2024
Cite
RAJ KUMAR JOSHI (2024). Genes encoding pathogenicity related factors derived from PHI database [Dataset]. http://doi.org/10.6084/m9.figshare.26213564.v1
Available download formats: xlsx
Unique identifier
https://doi.org/10.6084/m9.figshare.26213564.v1
Dataset updated
Jul 9, 2024
Dataset provided by
Figshare (http://figshare.com/)
Authors
RAJ KUMAR JOSHI
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Genes encoding pathogenicity related factors derived from PHI database
sentiment for phi
kaggle.com
zip
Updated Aug 12, 2024
Cite
Mizanur Rahman (2024). sentiment for phi [Dataset]. https://www.kaggle.com/datasets/mizanurrahmanrafi/sentiment-for-phi/data
Available download formats: zip (233749 bytes)
Dataset updated
Aug 12, 2024
Authors
Mizanur Rahman
License
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Mizanur Rahman

Released under Apache 2.0

Pi Beta Phi Locations Data for United States
poidata.io
csv, json
Updated Jan 14, 2026
Cite
Business Data Provider (2026). Pi Beta Phi Locations Data for United States [Dataset]. https://poidata.io/brand-report/pi-beta-phi/united-states
Available download formats: json, csv
Dataset updated
Jan 14, 2026
Dataset authored and provided by
Business Data Provider
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2026
Area covered
United States
Variables measured
Website URL, Phone Number, Review Count, Business Name, Email Address, Business Hours, Customer Rating, Business Address, Brand Affiliation, Geographic Coordinates
Description
Comprehensive dataset containing 46 verified Pi Beta Phi locations in United States with complete contact information, ratings, reviews, and location data.
PHI-4-Hindi-Instruct-Data
huggingface.co
Updated Feb 6, 2025
Cite
Ram Kadiyala (2025). PHI-4-Hindi-Instruct-Data [Dataset]. https://huggingface.co/datasets/1024m/PHI-4-Hindi-Instruct-Data
Dataset updated
Feb 6, 2025
Authors
Ram Kadiyala
Description
1024m/PHI-4-Hindi-Instruct-Data dataset hosted on Hugging Face and contributed by the HF Datasets community
Data from: Table 4
hepdata.net
csv +3
Updated 2020
+ more versions
Cite
HEPData (2020). Table 4 [Dataset]. http://doi.org/10.17182/hepdata.95908.v1/t2
Available download formats: ROOT (https://root.cern), YODA (https://yoda.hepforge.org), csv, YAML (https://yaml.org)
Unique identifier
https://doi.org/10.17182/hepdata.95908.v1/t2
Dataset updated
2020
Dataset provided by
HEPData
Description
Observed and expected 95% CL upper limits on B(H $\rightarrow$ Z$\phi$), for different polarizations.
Alpha Phi Locations Data for United States
poidata.io
csv, json
Updated Feb 12, 2026
+ more versions
Cite
Business Data Provider (2026). Alpha Phi Locations Data for United States [Dataset]. https://poidata.io/brand-report/alpha-phi/united-states
Available download formats: csv, json
Dataset updated
Feb 12, 2026
Dataset authored and provided by
Business Data Provider
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2026
Area covered
United States
Variables measured
Website URL, Phone Number, Review Count, Business Name, Email Address, Business Hours, Customer Rating, Business Address, Brand Affiliation, Geographic Coordinates
Description
Comprehensive dataset containing 41 verified Alpha Phi locations in United States with complete contact information, ratings, reviews, and location data.


PHI-base: the Pathogen-Host Interactions Database, version 5.0


Data content

phi-base_v5.0.xlsx: the PHI-base dataset as an Excel spreadsheet. This format follows the layout of the PHI-base 5 website, with sheets corresponding to the sections of gene pages on the website. This format is designed for use by non-technical users.
phi-base_v5.0.json: the PHI-base dataset in JSON format. This format is closer to the data format that is exported by PHI-Canto, the curation tool used by PHI-base. This format is primarily intended for programmatic usage and has additional data (e.g. metadata for curation sessions) that is not included in the spreadsheet format.
phi-base.schema.json: a JSON Schema file for the JSON format of the dataset. This is included as documentation for the fields in the JSON file, but can also be used to validate the dataset.
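
For programmatic use, a first look at the JSON file might resemble the sketch below. It assumes the top level of the file is a JSON object; the function name summarise_top_level and the keys in the sample text are placeholders for illustration, since the real field names are documented in phi-base.schema.json.

```python
import json


def summarise_top_level(document):
    """Map each top-level key of a JSON object to the type name of its value."""
    return {key: type(value).__name__ for key, value in document.items()}


# Placeholder document; the real keys are defined by phi-base.schema.json.
sample_text = '{"example_records": {}, "example_version": "5.0"}'
summary = summarise_top_level(json.loads(sample_text))
print(summary)
```

For the real file, json.load() on a handle opened for phi-base_v5.0.json would replace json.loads() here. Validating against the schema needs a JSON Schema validator, which the Python standard library does not provide.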
