100+ datasets found
  1. P

    PPI Dataset

    • paperswithcode.com
    Updated Apr 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    William L. Hamilton; Rex Ying; Jure Leskovec (2017). PPI Dataset [Dataset]. https://paperswithcode.com/dataset/ppi
    Explore at:
    Dataset updated
    Apr 27, 2021
    Authors
    William L. Hamilton; Rex Ying; Jure Leskovec
    Description

    protein roles—in terms of their cellular functions from gene ontology—in various protein-protein interaction (PPI) graphs, with each graph corresponding to a different human tissue [41]. positional gene sets are used, motif gene sets and immunological signatures as features and gene ontology sets as labels (121 in total), collected from the Molecular Signatures Database [34]. The average graph contains 2373 nodes, with an average degree of 28.8.

  2. ATOM3D: Protein-Protein Interactions (PPI) Dataset

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip, bin
    Updated Jun 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raphael J.L. Townshend; Raphael J.L. Townshend; Martin Vögele; Martin Vögele; Patricia Suriana; Patricia Suriana; Alexander Derry; Alexander Derry; Alexander Powers; Yianni Laloudakis; Sidhika Balachandar; Brandon Anderson; Stephan Eismann; Risi Kondor; Russ B. Altman; Ron O. Dror; Alexander Powers; Yianni Laloudakis; Sidhika Balachandar; Brandon Anderson; Stephan Eismann; Risi Kondor; Russ B. Altman; Ron O. Dror (2021). ATOM3D: Protein-Protein Interactions (PPI) Dataset [Dataset]. http://doi.org/10.5281/zenodo.4911102
    Explore at:
    application/gzip, binAvailable download formats
    Dataset updated
    Jun 16, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Raphael J.L. Townshend; Raphael J.L. Townshend; Martin Vögele; Martin Vögele; Patricia Suriana; Patricia Suriana; Alexander Derry; Alexander Derry; Alexander Powers; Yianni Laloudakis; Sidhika Balachandar; Brandon Anderson; Stephan Eismann; Risi Kondor; Russ B. Altman; Ron O. Dror; Alexander Powers; Yianni Laloudakis; Sidhika Balachandar; Brandon Anderson; Stephan Eismann; Risi Kondor; Russ B. Altman; Ron O. Dror
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Protein-Protein Interactions (PPI) dataset from the ATOM3D project. This upload includes three zipped data directories:

    1. Full, unsplit dataset in LMDB format
    2. Split datasets, with each in LMDB format
    3. Text files containing train, validation, and test indices used to split raw dataset
    4. README containing dataset details

  3. f

    Human PPI from IntAct database (IAH)

    • figshare.com
    txt
    Updated Apr 12, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elena Sugis; Henning Hermjakob (2019). Human PPI from IntAct database (IAH) [Dataset]. http://doi.org/10.6084/m9.figshare.5674858.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Apr 12, 2019
    Dataset provided by
    figshare
    Authors
    Elena Sugis; Henning Hermjakob
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The datasets contains information about protein-protein interactions (PPI) and protein-protein complex interactions (PCI) in human. It was received by querying the IntAct database based on the criteria that the organism is human and the confidence level of the interaction is based on MI score ≥ 0.45 The confidence level of each interaction is characterised by IntAct MI score. The result was downloaded from IntAct molecular interaction database version 4.2.6 https://www.ebi.ac.uk/intact/.

  4. i

    GO dataset

    • ieee-dataport.org
    Updated Jan 13, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wei Liu (2020). GO dataset [Dataset]. https://ieee-dataport.org/documents/ppi-datasetsgo-dataset-subcellular-localization-information-and-essential-protein-dataset
    Explore at:
    Dataset updated
    Jan 13, 2020
    Authors
    Wei Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SGD

  5. Producer price inflation time series

    • ons.gov.uk
    • cy.ons.gov.uk
    csdb, csv, xlsx
    Updated Feb 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2025). Producer price inflation time series [Dataset]. https://www.ons.gov.uk/economy/inflationandpriceindices/datasets/producerpriceindexstatisticalbulletindataset
    Explore at:
    csv, csdb, xlsxAvailable download formats
    Dataset updated
    Feb 19, 2025
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    A comprehensive selection of data on input and output indices. Contains producer price indices of materials and fuels purchased and output of manufacturing industry by broad sector.

  6. f

    PPI prediction from sequence, gold standard dataset

    • figshare.com
    txt
    Updated Feb 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Judith Bernett (2025). PPI prediction from sequence, gold standard dataset [Dataset]. http://doi.org/10.6084/m9.figshare.21591618.v4
    Explore at:
    txtAvailable download formats
    Dataset updated
    Feb 10, 2025
    Dataset provided by
    figshare
    Authors
    Judith Bernett
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Gold Standard Dataset for sequence-based PPI prediction:Big dataset: 163,192 training points (Intra-1), 59,260 validation points (Intra-0), 52,048 test points (Intra-2)) + corresponding protein sequences from SwissprotNo direct data leakage: proteins from training are not contained in validation or test, proteins from validation are not in training or test, proteins from test are not in validation or trainingMinimized sequence similarity between training, validation, test because whole human proteome was split with KaHIP such that sequence similarities are minimized w.r.t. length-normalized bitscoresRedundancy-reduction with CD-HIT: inside of the datasets, no proteins with >40% pairwise sequence similarityNew version: added sequence of Q96PU5 to the human_swissprot_oneliner

  7. h

    PPI

    • huggingface.co
    Updated May 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vladimir Kovačević (2025). PPI [Dataset]. https://huggingface.co/datasets/vladak/PPI
    Explore at:
    Dataset updated
    May 13, 2025
    Authors
    Vladimir Kovačević
    Description

    Dataset Card for PPI

      Summary
    

    The PPI dataset is part of the LUCAONE downstream tasks collection for biomolecular interaction prediction. It is structured for binary classification and includes standard splits for training (train.csv), validation (dev.csv → val), and test (test.csv).

      Dataset Structure
    

    This dataset includes three splits:

    train val (converted from dev.csv) test

    Each split is in CSV format.

      Task
    

    Binary classification of interactions… See the full description on the dataset page: https://huggingface.co/datasets/vladak/PPI.

  8. p

    iPPI-DB

    • ippidb.pasteur.fr
    Updated Dec 26, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Institut Pasteur (2019). iPPI-DB [Dataset]. https://ippidb.pasteur.fr
    Explore at:
    Dataset updated
    Dec 26, 2019
    Dataset authored and provided by
    Institut Pasteur
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    a database of modulators of protein-protein interactions. It contains exclusively small molecules and therefore no peptides. The data are retrieved from the literature either peer reviewed scientific articles or world patents. A large variety of data is stored within IPPI-DB: structural, pharmacological, binding and activity profile, pharmacokinetic and cytotoxicity when available, as well as some data about the PPI targets themselves.

  9. T

    PRODUCER PRICE INDEX. by Country in EUROPE

    • tradingeconomics.com
    csv, excel, json, xml
    Updated Jan 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2024). PRODUCER PRICE INDEX. by Country in EUROPE [Dataset]. https://tradingeconomics.com/country-list/producer-price-index.?continent=europe
    Explore at:
    csv, xml, json, excelAvailable download formats
    Dataset updated
    Jan 20, 2024
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    Europe
    Description

    This dataset provides values for PRODUCER PRICE INDEX. reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.

  10. h

    PINDER

    • huggingface.co
    Updated Apr 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Synthyra (2025). PINDER [Dataset]. https://huggingface.co/datasets/Synthyra/PINDER
    Explore at:
    Dataset updated
    Apr 15, 2025
    Dataset authored and provided by
    Synthyra
    Description

    PINDER PPI dataset

    The PINDER: The Protein INteraction Dataset and Evaluation Resource is a high quality compilation of positive protein protein interactions. Of particular note, the train, valid, and test splits are deduplicated and heavily trimmed based on sequence and structure similarity. For more information on the original dataset compilation, please read their paper, GitHub, or docs.

      Differences between this version and the official version
    

    We further processed… See the full description on the dataset page: https://huggingface.co/datasets/Synthyra/PINDER.

  11. f

    Alzheimer's disease PPI from IntAct (ADIA)

    • figshare.com
    txt
    Updated Jan 30, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elena Sugis; Henning Hermjakob (2019). Alzheimer's disease PPI from IntAct (ADIA) [Dataset]. http://doi.org/10.6084/m9.figshare.5674990.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 30, 2019
    Dataset provided by
    figshare
    Authors
    Elena Sugis; Henning Hermjakob
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset is a subset of the expert curated PPI dataset based on the proteins with an association to Alzheimer’s disease available from IntAct molecular interaction database https://www.ebi.ac.uk/intact/. The confidence level of each interaction is characterised by IntAct MI score.Dataset was downloaded from IntAct database version 4.2.6.

  12. Transportation and Inflation - PPI

    • catalog.data.gov
    • data.virginia.gov
    Updated Aug 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bureau of Transportation Statistics (2024). Transportation and Inflation - PPI [Dataset]. https://catalog.data.gov/dataset/transportation-and-inflation-ppi
    Explore at:
    Dataset updated
    Aug 9, 2024
    Dataset provided by
    Bureau of Transportation Statisticshttp://www.rita.dot.gov/bts
    Description

    A look at the producer price index for transportation and its components as a measure of inflation faced by consumers.

  13. F

    Producer Price Index by Commodity: All Commodities

    • fred.stlouisfed.org
    json
    Updated Jun 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Producer Price Index by Commodity: All Commodities [Dataset]. https://fred.stlouisfed.org/series/PPIACO
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Jun 12, 2025
    License

    https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain

    Description

    Graph and download economic data for Producer Price Index by Commodity: All Commodities (PPIACO) from Jan 1913 to May 2025 about commodities, PPI, inflation, price index, indexes, price, and USA.

  14. T

    United States Producer Prices

    • tradingeconomics.com
    • fa.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Aug 18, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2015). United States Producer Prices [Dataset]. https://tradingeconomics.com/united-states/producer-prices
    Explore at:
    excel, csv, xml, jsonAvailable download formats
    Dataset updated
    Aug 18, 2015
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Nov 30, 2009 - May 31, 2025
    Area covered
    United States
    Description

    Producer Prices in the United States increased to 148.07 points in May from 147.88 points in April of 2025. This dataset provides the latest reported value for - United States Producer Prices - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.

  15. Data for RAPPPID: Towards Generalisable Protein Interaction Prediction with...

    • zenodo.org
    zip
    Updated Jun 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joseph Szymborski; Joseph Szymborski; Amin Emad; Amin Emad (2022). Data for RAPPPID: Towards Generalisable Protein Interaction Prediction with AWD-LSTM Twin Networks [Dataset]. http://doi.org/10.5281/zenodo.6709790
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 24, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Joseph Szymborski; Joseph Szymborski; Amin Emad; Amin Emad
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data for RAPPPID, a method for the Regularised Automative Prediction of Protein-Protein Interactions using Deep Learning.

    These datasets are in a format that RAPPPID is ready to read.

    Comparatives Dataset
    These datasets were derived from the STRING v11 H. sapiens dataset, according to the C1, C2, and C3 procedures outlined by Park and Marcotte, 2012. Negative samples are sampled randomly from the space of proteins not known to interact. See Szymborski & Emad for details.

    Repeatability Datasets
    The following datasets are all derived from STRING in the manner as the comparatives dataset, but three different random seeds are used for drawing proteins.

    References
    Park,Y. and Marcotte,E.M. (2012) Flaws in evaluation schemes for pair-input computational predictions. Nat Methods, 9, 1134–1136.

    Szklarczyk, D., Gable, A. L., Lyon, D., Junge, A., Wyder, S., Huerta-Cepas, J., Simonovic, M., Doncheva, N. T., Morris, J. H., Bork, P., Jensen, L. J., and Mering, C. (2019). String v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research, 47(D1), D607–D613.

    Szymborski,J. and Emad,A. (2021) RAPPPID: Towards Generalisable Protein Interaction Prediction with AWD-LSTM Twin Networks. bioRxiv https://doi.org/10.1101/2021.08.13.456309

  16. PPI-Scanner Supplementary Dataset

    • zenodo.org
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous; Anonymous (2025). PPI-Scanner Supplementary Dataset [Dataset]. http://doi.org/10.5281/zenodo.15492750
    Explore at:
    Dataset updated
    May 28, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anonymous; Anonymous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset accompanies the NeurIPS 2025 submission titled "Face to Face with Proteins: Contrastive Surface Learning for Protein–Protein Interaction Prediction."

    It contains homology-aware and random train/val/test splits.

    All files were generated using the pipeline included in the supplementary code repository.

  17. h

    bacbench-ppi-stringdb-protein-sequences

    • huggingface.co
    Updated May 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maciej Wiatrak (2025). bacbench-ppi-stringdb-protein-sequences [Dataset]. https://huggingface.co/datasets/macwiatrak/bacbench-ppi-stringdb-protein-sequences
    Explore at:
    Dataset updated
    May 12, 2025
    Authors
    Maciej Wiatrak
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset for protein-protein interaction prediction across bacteria (Protein sequences)

    A dataset of 10,533 bacterial genomes across 6,956 species with protein-protein interaction (PPI) scores for each genome. The genome protein sequences and PPI scores have been extracted from STRING DB. Each row contains a set of protein sequences from a genome, ordered by their location on the chromosome and plasmids and a set of associated PPI scores. The PPI scores have been extracted using the… See the full description on the dataset page: https://huggingface.co/datasets/macwiatrak/bacbench-ppi-stringdb-protein-sequences.

  18. Datasets used in the INTREPPPID manuscript

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip
    Updated Feb 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joseph Szymborski; Joseph Szymborski (2024). Datasets used in the INTREPPPID manuscript [Dataset]. http://doi.org/10.5281/zenodo.10594150
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Feb 9, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Joseph Szymborski; Joseph Szymborski
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    INTREPPPID Manuscript Datasets

    The enclosed archive holds all the datasets used in the INTREPPPID manuscript. See the INTREPPPID documentation for details on the format of the HDF5 files.

    Files are organised as follows:

    [FORMAT]/seed_[SEED]/[TAXON]/[DATASET_NAME].h5

    Where:

    • FORMAT is whether the HDF5 is in the RAPPPID or INTREPPPID format.
    • SEED is the random seed used to generate the dataset. They are all phone numbers found in songs.
    • TAXON is the NCBI Taxon ID of the organism from which the dataset was generated
    • DATASET_NAME is the name of the dataset.

    "Why are there only Human (9606) datasets in the INTREPPPID format?"

    In the manuscript, we use the INTREPPPID format to train them model on Human data, and then test the model using datasets in the RAPPPID format. INTREPPPID can only be trained on datasets with orthology data, but can be tested on datasets without since the orthologous locality loss is only used during training.

  19. T

    United States Producer Price Inflation MoM

    • tradingeconomics.com
    • pt.tradingeconomics.com
    • +12more
    csv, excel, json, xml
    Updated Jun 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). United States Producer Price Inflation MoM [Dataset]. https://tradingeconomics.com/united-states/producer-price-inflation-mom
    Explore at:
    csv, xml, json, excelAvailable download formats
    Dataset updated
    Jun 12, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 31, 2009 - May 31, 2025
    Area covered
    United States
    Description

    Producer Price Inflation MoM in the United States increased to 0.10 percent in May from -0.20 percent in April of 2025. This dataset includes a chart with historical data for the United States Producer Price Inflation MoM.

  20. T

    United States Producer Prices Change

    • tradingeconomics.com
    • pl.tradingeconomics.com
    • +13more
    csv, excel, json, xml
    Updated Nov 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2024). United States Producer Prices Change [Dataset]. https://tradingeconomics.com/united-states/producer-prices-change
    Explore at:
    xml, csv, excel, jsonAvailable download formats
    Dataset updated
    Nov 29, 2024
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 31, 1950 - May 31, 2025
    Area covered
    United States
    Description

    Producer Prices in the United States increased 2.60 percent in May of 2025 over the same month in the previous year. This dataset provides - United States Producer Prices Change - actual values, historical data, forecast, chart, statistics, economic calendar and news.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
William L. Hamilton; Rex Ying; Jure Leskovec (2017). PPI Dataset [Dataset]. https://paperswithcode.com/dataset/ppi

PPI Dataset

Protein-Protein Interactions (PPI)

Explore at:
Dataset updated
Apr 27, 2021
Authors
William L. Hamilton; Rex Ying; Jure Leskovec
Description

protein roles—in terms of their cellular functions from gene ontology—in various protein-protein interaction (PPI) graphs, with each graph corresponding to a different human tissue [41]. positional gene sets are used, motif gene sets and immunological signatures as features and gene ontology sets as labels (121 in total), collected from the Molecular Signatures Database [34]. The average graph contains 2373 nodes, with an average degree of 28.8.

Search
Clear search
Close search
Google apps
Main menu