100+ datasets found
  1. d

    Max Planck Unified Proteome Database

    • dknet.org
    • neuinfo.org
    Updated Jul 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Max Planck Unified Proteome Database [Dataset]. http://identifiers.org/RRID:SCR_007771
    Explore at:
    Dataset updated
    Jul 6, 2025
    Description

    Database containing several body fluid proteomes, including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools.

  2. n

    Rice Proteome Database

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Jun 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Rice Proteome Database [Dataset]. http://identifiers.org/RRID:SCR_000743
    Explore at:
    Dataset updated
    Jun 23, 2025
    Description

    THIS RESOURCE IS NO LONGER IN SERVICE, documented July 22, 2016.A database on the proteome of rice that contains reference maps based on two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) of proteins from rice tissues and subcellular compartments.

  3. s

    Data from: NoPdb: Nucleolar Proteome Database

    • scicrunch.org
    • neuinfo.org
    • +1more
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). NoPdb: Nucleolar Proteome Database [Dataset]. http://identifiers.org/RRID:SCR_013459
    Explore at:
    Dataset updated
    Dec 4, 2023
    Description

    It archives data on more than 700 proteins that were identified by multiple mass spectrometry (MS) analyses from highly purified preparations of human nucleoli the most prominent nuclear organelle. Each protein entry is annotated with information about its corresponding gene its domain structures and relevant protein homologues across species as well as documenting its MS identification history including all the peptides sequenced by tandem MS/MS. Moreover, data showing the quantitative changes in the relative levels of 500 nucleolar proteins are compared at different timepoints upon transcriptional inhibition. Correlating changes in protein abundance at multiple timepoints highlighted by visualization means in the NOPdb provides clues regarding the potential interactions and relationships between nucleolar proteins and thereby suggests putative functions for factors within the 30% of the proteome which comprises novel/ uncharacterized proteins. The NOPdb is searchable by either gene names protein sequences Gene Ontology terms or motifs or by limiting the range for isoelectric points and/or molecular weights and links to other databases (e.g. LocusLink OMIM and PubMed).

  4. d

    Proteome database of 36 million proteins from 4,351 species, including...

    • search.dataone.org
    • datadryad.org
    • +1more
    Updated Nov 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariana Rius; Jackie Collier; Joshua Rest (2023). Proteome database of 36 million proteins from 4,351 species, including marine microbial sequences [Dataset]. http://doi.org/10.5061/dryad.4tmpg4ffn
    Explore at:
    Dataset updated
    Nov 29, 2023
    Dataset provided by
    Dryad Digital Repository
    Authors
    Mariana Rius; Jackie Collier; Joshua Rest
    Time period covered
    Jan 1, 2023
    Description

    A fasta-formatted database of 36,866,870 predicted proteins representing 4,351 unique species from 117 phyla., A database of 36,866,870 predicted proteins representing 4,351 unique species from 117 phyla (see table below) was constructed using the UniProt Reference Proteome (RP) at the 35% co-membership threshold including 4,295 Representative Proteome Groups (RPGs) (Chen et al. 2011) in addition to all taxonomically identifiable transcriptomes of the Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP) (Keeling et al. 2014) that were processed through WinstonCleaner (https://github.com/kolecko007/WinstonCleaner). The database also included proteins inferred from the annotated and assembled genomes of Aurantiochytrium limacinum ATCC MYA-1381, Schizochytrium aggregatum ATCC 28209, and Aplanochytrium kerguelensis PBS07 from the U.S. Department of Energy’s Joint Genome Institute (JGI), all PFAM PF00494 Aurantiochytrium sp. KH105 proteome hits from the Okinawa Institute of Science and Technology Marine Genomics Unit genome browser, all of UniProt's annotated Hondaea fermentalgiana pr...,

  5. f

    Probing the Missing Human Proteome: A Computational Perspective

    • figshare.com
    • acs.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dhirendra Kumar; Aradhya Jain; Debasis Dash (2023). Probing the Missing Human Proteome: A Computational Perspective [Dataset]. http://doi.org/10.1021/acs.jproteome.5b00728.s003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    ACS Publications
    Authors
    Dhirendra Kumar; Aradhya Jain; Debasis Dash
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The missing human proteome comprises predicted protein-coding genes with no credible protein level evidence detected so far and constitutes ∼18% of the human protein coding genes (neXtProt release 19/9/2014). The missing proteins may be of pharmacological interest as many of these are membrane receptors, thus requiring comprehensive characterization. In the present study, we explored various computational parameters, crucial during protein searches from tandem mass spectrometry (MS) data, for their impact on missing protein identification. Variables taken into consideration are differences in search database composition, shared peptides, semitryptic searches, post-translational modifications (PTMs), and transcriptome guided proteogenomic searches. We used a multialgorithmic approach for protein detection from publicly available mass spectra from recent studies covering diverse human tissues and cell types. Using the aforementioned approaches, we successfully detected 24 missing proteins (22-PE2, 1-PE4, and 1-PE5). Maximum of these identifications could be attributed to differences in reference proteome databases, exemplifying use of a single standard database for human protein detection from MS data. Our results suggest that search strategies with modified parameters can be rewarding alternatives for extensive profiling of missing proteins. We conclude that using complementary spectral data searches incorporating different parameters like PTMs, against a comprehensive and compact search database, might lead to discoveries of the proteins attributed so far as the missing human proteome.

  6. f

    Proteome database of Bacillus cereus Rock3-44

    • figshare.com
    xlsx
    Updated Jan 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenfa Ng (2020). Proteome database of Bacillus cereus Rock3-44 [Dataset]. http://doi.org/10.6084/m9.figshare.11493681.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jan 2, 2020
    Dataset provided by
    figshare
    Authors
    Wenfa Ng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A facultative anaerobic bacterium, Bacillus cereus is one member of a larger family of microorganisms that are closely-related and which are characterized as Bacillus cereus group. Rod shaped and Gram-positive, B. cereus forms colonies with waxy appearance on blood agar. This work sought to provide fundamental information of this bacterium by parsing the UniProt proteome file of B. cereus Rock3-44 with an in-house MATLAB proteome analysis software. The ensuing proteome database comprise protein name, amino acid sequence, number of residues, molecular weight, and nucleotide sequence of all proteins of the bacterium, and should help in a variety of fundamental and applied research involving B. cereus.

  7. Proteomics contaminant databases

    • zenodo.org
    application/gzip, bin
    Updated Mar 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laurent Gatto; Laurent Gatto (2025). Proteomics contaminant databases [Dataset]. http://doi.org/10.5281/zenodo.15115102
    Explore at:
    bin, application/gzipAvailable download formats
    Dataset updated
    Mar 31, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Laurent Gatto; Laurent Gatto
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These 3 fasta files are widely used proteomics contaminants. The files are

    1. crap_gpm.fasta: the common Repository of Adventitious Proteins (cRAP) from the Global Proteome Machine (GPM) organisation.
    2. crap_ccp.fasta: Cambridge Centre for Proteomics' own cRAP fasta database.
    3. crap_maxquant.fasta.gz: MaxQuant's contaminant database.

    These files are extracted from the {camprotR} package and described in the cRAP databases vignette (https://cambridgecentreforproteomics.github.io/camprotR/articles/crap.html).

  8. n

    Global Proteome Machine Database (GPM DB)

    • neuinfo.org
    Updated Apr 4, 2004
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2004). Global Proteome Machine Database (GPM DB) [Dataset]. http://identifiers.org/RRID:SCR_006617
    Explore at:
    Dataset updated
    Apr 4, 2004
    Description

    The Global Proteome Machine Organization was set up so that scientists involved in proteomics using tandem mass spectrometry could use that data to analyze proteomes. The projects supported by the GPMO have been selected to improve the quality of analysis, make the results portable and to provide a common platform for testing and validating proteomics results. The Global Proteome Machine Database was constructed to utilize the information obtained by GPM servers to aid in the difficult process of validating peptide MS/MS spectra as well as protein coverage patterns. This database has been integrated into GPM server pages, allowing users to quickly compare their experimental results with the best results that have been previously observed by other scientists.

  9. n

    PPDB: Plant Proteomics Database

    • neuinfo.org
    • scicrunch.org
    • +1more
    Updated Aug 23, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). PPDB: Plant Proteomics Database [Dataset]. http://identifiers.org/RRID:SCR_007872
    Explore at:
    Dataset updated
    Aug 23, 2023
    Description

    A Plant Proteome DataBase for Arabidopsis thaliana and maize (Zea mays). The PPDB stores experimental data from in-house proteome and mass spectrometry analysis, curated information about protein function, protein properties and subcellular localization. Importantly, proteins are particularly curated for possible (intra) plastid location and their plastid function. Protein accessions identified in published Arabidopsis (and other Brassicacea) proteomics papers are cross-referenced to rapidly determine previous experimental identification by mass spectrometry. All protein-encoding gene models in the Arabidopsis nuclear and organellar genomes, as assembled by TAIR, as well as all maize EST assemblies (ZmGI) as assembled by DFCI Maize Gene Index project. These are all uploaded in PPDB and are linked to each other via a BLAST alignment. Thus every predicted protein in both species can be searched for experimental and other information (even if not experimentally identified).

  10. mESC shotgun and positional proteomics based on deep proteome sequence...

    • data.niaid.nih.gov
    • ebi.ac.uk
    xml
    Updated Feb 25, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gerben Menschaert; Gerben Menschaert (2013). mESC shotgun and positional proteomics based on deep proteome sequence database (derived from RIBOseq data) [Dataset]. https://data.niaid.nih.gov/resources?id=pxd000124
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Feb 25, 2013
    Dataset provided by
    Faculty of Bioscience Engineering
    Authors
    Gerben Menschaert; Gerben Menschaert
    Variables measured
    Proteomics
    Description

    Shotgun and positional proteomics study of a mouse embryonic stem cell line. We devised a proteogenomic approach constructing a custom protein sequence search space, built from both SwissProt and RIBO-seq derived translation products, applicable for LC-MSMS spectrum identification. To record the impact of using the constructed deep proteome database we performed two alternative MS-based proteomic strategies: (I) a regular shotgun proteomic and (II) an N-terminal COFRADIC approach. The obtained fragmentation spectra were searched against the custom database (combination of UniProtKB-SwissProt and RIBO-seq derived translation sequences) using three different search engines: OMSSA (version 2.1.9), X!Tandem (TORNADO, version 2010.01.01.04) and Mascot (version 2.3). The first two were run from the SearchGUI graphical user interface (version 1.10.4). A combination of X!Tandem and Mascot was used for the N-terminal COFRADIC analysis, a combination of all three search engines for the shotgun proteome analysis. Note that OMMSA cannot cope with the protease setting semi-ArgC/P needed to analyze N-terminal COFRADIC data.For the shotgun proteome data, trypsin was set as cleavage enzyme allowing for one missed cleavage, and singly to triply charged precursors or singly to quadruple charged precursors were taken into account respectively for the Mascot or X!Tandem/OMSSA search engines, and the precursor and fragment mass tolerance were set to respectively 10 ppm and 0.5 Da. Methionine oxidation to methionine-sulfoxide, pyroglutamate formation of N-terminal glutamine and acetylation (protein N-terminus) were set as variable modifications. For the N-terminal COFRADIC analysis the protease setting semi-ArgC/P (Arg-C specificity with arginine-proline cleavage allowed) was used. No missed cleavages were allowed and the precursor and fragment mass tolerance were also set to respectively 10 ppm and 0.5 Da. Carbamidomethylation of cysteine and methionine oxidation to methionine-sulfoxide and 13C3D2-acetylation of lysines were set as fixed modifications. Peptide N-terminal acetylation or 13C3D2-acetylation and pyroglutamate formation of N-terminal glutamine were set as variable modifications and instrument setting was put on ESI-TRAP. Protein and peptide identification in addition to data interpretation was done using the PeptideShaker algorithm (http://code.google.com/p/peptide-shaker, version 0.18.3), setting the false discovery rate to 1% at all levels (protein, peptide, and peptide to spectrum matching). Aforementioned tools and algorithms (SearchGui, X!Tandem, OMSSA, and PeptideShaker) are freely available as open source.

  11. e

    Data from: Deep profiling and custom databases improve detection of...

    • ebi.ac.uk
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laura Agosto, Deep profiling and custom databases improve detection of proteoforms generated by alternative splicing [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD012556
    Explore at:
    Authors
    Laura Agosto
    Variables measured
    Proteomics
    Description

    Alternative pre-mRNA splicing has long been proposed to greatly contribute to proteome complexity. However, the extent to which mature mRNA isoforms are successfully translated into protein remains controversial. Here, we used high-throughput RNA sequencing and mass spectrometry (MS)-based proteomics to better evaluate the translation of alternatively spliced mRNAs. To increase proteome coverage and improve protein quantitation, we optimized cell fractionation and sample processing steps at both the protein and peptide level. Furthermore, we generated a custom spectral library trained on analysis of RNA-Seq data with MAJIQ, an algorithm optimized to detect differential and unannotated junction usage for a given splice site. Using this custom library to match against tandem mass spectra acquired by data independent acquisition (DIA), we improved identification of splicing-derived proteoforms by ~30% as compared to use of the SwissProt database alone. Moreover, our increased depth and detection of proteins allowed us to track changes in the transcriptome and proteome induced by T cell stimulation, as well as fluctuations in protein sub-cellular localization. In sum, our data here confirms that use of generic databases in proteomic studies under-estimates the number of spliced mRNA isoforms that are translated into protein and provides a workflow that improves isoform detection in large-scale proteomic experiments.

  12. H

    Proteome database of Staphylococcus lentus

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Nov 22, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenfa Ng (2020). Proteome database of Staphylococcus lentus [Dataset]. http://doi.org/10.7910/DVN/KJ83L8
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 22, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    Wenfa Ng
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This work presents the proteome database of Staphylococcus lentus by parsing its annotated proteome downloaded from UniProt. Comprising protein name, amino acid sequence, number of residues, molecular weight, and nucleotide sequence of each protein in the proteome, the resource should find use in functional genomics studies seeking to unravel the mysteries of this bacterium.

  13. b

    Global Proteome Machine Database

    • bioregistry.io
    Updated Apr 26, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Global Proteome Machine Database [Dataset]. https://bioregistry.io/gpmdb
    Explore at:
    Dataset updated
    Apr 26, 2021
    Description

    The Global Proteome Machine Database was constructed to utilize the information obtained by GPM servers to aid in the difficult process of validating peptide MS/MS spectra as well as protein coverage patterns.

  14. d

    LIPID MAPS Proteome Database

    • dknet.org
    • neuinfo.org
    • +1more
    Updated Jan 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). LIPID MAPS Proteome Database [Dataset]. http://identifiers.org/RRID:SCR_003062
    Explore at:
    Dataset updated
    Jan 3, 2024
    Description

    Database of lipid related proteins representing human and mouse proteins involved in lipid metabolism. Collection of lipid related genes and proteins contains data for genes and proteins from Homo sapiens, Mus musculus, Rattus norvegicus, Saccharomyces cerevisiae, Caenorhabditis elegans, Escherichia coli, Macaca mulata, Drosophila melanogaster, Arabidopsis thaliana and Danio rerio.

  15. f

    Data from: Nanoparticle–Protein Corona Boosted Cancer Diagnosis with...

    • figshare.com
    • acs.figshare.com
    xlsx
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haoxiang Guo; Baichuan Jin; Zhenjie Zhu; Xin Dai; Mengjie Wang; Yueli Xie; Chenlu Xu; Zongping Wang; Yuan Liu; Weihong Tan (2025). Nanoparticle–Protein Corona Boosted Cancer Diagnosis with Proteomic Transfer Learning [Dataset]. http://doi.org/10.1021/acsnano.5c01197.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 25, 2025
    Dataset provided by
    ACS Publications
    Authors
    Haoxiang Guo; Baichuan Jin; Zhenjie Zhu; Xin Dai; Mengjie Wang; Yueli Xie; Chenlu Xu; Zongping Wang; Yuan Liu; Weihong Tan
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Keeping pace with the rapid growth of proteomic data, the integration of multiproteomic data can improve biomarker identification and cancer diagnosis. However, the data integration needs to overcome substantial challenges owing to considerable variability among diverse data set sources and the extensive range of protein expression levels. In this study, with serum and urine from the same individuals, we established two in-depth paired proteome databases, including 956 serum proteins and 4730 urine proteins. To integrate multiproteomic data, we developed a proteomic-based transfer learning neural network (ProteoTransNet) to enhance the accuracy of bladder cancer diagnosis and progression monitoring. Using random forest analysis on the integrated database, we selected two panels comprising the top 10 key proteins, achieving a diagnostic AUC of 0.996 and a stage classification AUC of 0.914. ProteoTransNet integrates serum and urine proteome databases with proteomic transfer learning, significantly enhancing the diagnostic accuracy through minimizing biases and errors caused by variations in proteomic data. Our study provides insights that transfer learning of sophisticated biological information may solve complicated biological problems in disease diagnosis, prognosis, and treatment.

  16. e

    Pandoravirus proteome - Pandoraviruses: amoeba viruses with genomes up to 2

    • ebi.ac.uk
    • data.niaid.nih.gov
    Updated Jul 19, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yohann Couté (2013). Pandoravirus proteome - Pandoraviruses: amoeba viruses with genomes up to 2 [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD000213
    Explore at:
    Dataset updated
    Jul 19, 2013
    Authors
    Yohann Couté
    Variables measured
    Proteomics
    Description

    Proteome analysis of a novel type of virus. The virus was grown in A. castellanii amobea before being purified. Viral particules were enriched after centrifugation and proteins solubilised by SDS before being stacked in the top of a SDS-PGE gel. After in-gel digestion, resulting peptides were injected for a 120min nanoLC-MS/MS analysis using an Ultimate U3000 system and a LTQ-Orbitrap Velos pro hybrid mass spectrometer (Top 20).Data processing and bioinformatics: Data were processed automatically using Mascot Daemon software (version 2.3.2, Matrix Science). Concomitant searches against Pandoravirus and A. castellanii protein sequence databanks as well as classical contaminants database and the corresponding reversed databases were performed using Mascot (version 2.4). ESI-TRAP was chosen as the instrument, trypsin/P as the enzyme and 2 missed cleavage allowed. Precursor and fragment mass error tolerances were set respectively at 10 ppm and 0.6 Da. Peptide modifications allowed during the search were: carbamidomethyl (C, fixes) acetyl (N-ter, variable), oxidation (M, variable) and deamidation (NQ, variable). The IRMa software (Dupierris et al., Bioinformatics, 2009, 25:1980-1, version 1.30.4) was used to filter the results: selection of rank 1 peptides, peptide identification FDR < 1% (as calculated by employing the reverse database strategy), and minimum of 1 specific peptide per identified protein group.

  17. s

    Proteome 2D-PAGE Database

    • scicrunch.org
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Proteome 2D-PAGE Database [Dataset]. http://identifiers.org/RRID:SCR_001678
    Explore at:
    Dataset updated
    Dec 4, 2023
    Description

    The Proteome 2D-PAGE Database system for microbial research is a curated database for storing and investigating proteomics data. Software tools are available and for data submission, please contact the Database Curator. Established at the Max Plank Institution for Infection Biology, this system contains four interconnected databases: i.) 2D-PAGE Database: Two dimensional electrophoresis (2-DE) and mass spectrometry of diverse microorganisms and other organisms. This database currently contains 4971 identified spots and 1228 mass peaklists in 44 reference maps representing experiments from 24 different organisms and strains. The data were submitted by 84 Submitters from 24 Institutes and 12 nations. It also contains various software tools that are important in formatting and analyzing gels and mass peaks; software include: *TopSpot: Scanning the gel, editing the spots and saving the information *Fragmentation: Fragmentation of the gel image into sections *MS-Screener: Perl script to compare the similarity of MALDI-PMF peaklists *MS-Screener update: MS-Screener can be used to compare mass spectra (MALDI-MS(/MS) as well as ESI-MS/MS spectra) on the basis of their peak lists (.dta, .pkm, .pkt, or .txt files), to recalibrate mass spectra, to determine and eliminate exogenous contaminant peaks, and to create matrices for cluster analyses. *GelCali: Online calibration of the Mr- and pI-axis of 2-DE gels with mathematical regression methods ii.)Isotope Coded Affinity Tag (ICAT)-LC/MS database: Isotope Coded Affinity Tag (ICAT)-LC/MS data for Mycobacterium tuberculosis strain BCG versus H37Rv. iii.) FUNC_CLASS database: Functional classification of diverse microorganism. This database also integrates genomic, proteomic, and metabolic data. iv.) DIFF database: Presentation of differently regulated proteins obtained by comparative proteomic experiments using computerized gel image analysis.

  18. H

    Proteome database of Toxoplasma gondii strain ATCC 50861

    • dataverse.harvard.edu
    Updated Dec 1, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenfa Ng (2020). Proteome database of Toxoplasma gondii strain ATCC 50861 [Dataset]. http://doi.org/10.7910/DVN/5BR8AW
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 1, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    Wenfa Ng
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Toxoplasma gondii is a single-cell parasite that infects human host cells during pathogenesis. This work presents the proteome database of T. gondii strain ATCC 50861 by parsing its annotated proteome downloaded from UniProt. Comprising protein name, amino acid sequence, number of residues, molecular weight, and nucleotide sequence of each protein in the proteome, the resource should be useful for gaining a deeper understanding of the pathogenesis mechanism in the parasite.

  19. d

    Proteome database of Serratia sp. FGI94

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ng, Wenfa (2023). Proteome database of Serratia sp. FGI94 [Dataset]. http://doi.org/10.7910/DVN/C5OPUK
    Explore at:
    Dataset updated
    Nov 23, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Ng, Wenfa
    Description

    This work presents the proteome database of Serratia sp. FGI94 by parsing its annotated proteome downloaded from UniProt. Comprising protein name, amino acid sequence, number of residues, molecular weight, and nucleotide sequence of each protein in the proteome, the resource should be useful in gaining a deeper understanding of the metabolism of the organism.

  20. n

    MitoMiner

    • neuinfo.org
    • scicrunch.org
    • +2more
    Updated Jun 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). MitoMiner [Dataset]. http://identifiers.org/RRID:SCR_001368
    Explore at:
    Dataset updated
    Jun 21, 2025
    Description

    A database of mitochondrial proteomics data. It includes two sets of proteins: the MitoMiner Reference Set, which has 10477 proteins from 12 species; and MitoCarta, which has 2909 proteins from mouse and human mitochondrial proteins. MitoMiner provides annotation from the Gene Ontology (GO) and UniProt databases. This reference set contains all proteins that are annotated by either of these resources as mitochondrial in any of the species included in MitoMiner. MitoMiner data via is available via Application Programming Interface (API). The client libraries are provided in Perl, Python, Ruby and Java.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2025). Max Planck Unified Proteome Database [Dataset]. http://identifiers.org/RRID:SCR_007771

Max Planck Unified Proteome Database

RRID:SCR_007771, nif-0000-03102, Max Planck Unified Proteome Database (RRID:SCR_007771), MAPU

Explore at:
37 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jul 6, 2025
Description

Database containing several body fluid proteomes, including plasma, urine, and cerebrospinal fluid. Cell lines have been mapped to a depth of several thousand proteins and the red blood cell proteome has also been analyzed in depth. The liver proteome is represented with 3200 proteins. By employing high resolution MS and stringent validation criteria, false positive identification rates in MAPU are lower than 1:1000. Thus MAPU datasets can serve as reference proteomes in biomarker discovery. MAPU contains the peptides identifying each protein, measured masses, scores and intensities using a clickable interface of cell or body parts. Proteome data can be queried across proteomes by protein name, accession number, sequence similarity, peptide sequence and annotation information. More than 4500 mouse and 2500 human proteins have already been identified in at least one proteome. Basic annotation information and links to other public databases are provided in MAPU and we plan to add further analysis tools.

Search
Clear search
Close search
Google apps
Main menu