Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Large-scale quantitative analyses of biological systems are often performed with few replicate experiments, leading to multiple nonidentical data sets due to missing values. For example, mass spectrometry-driven proteomics experiments are frequently performed with few biological or technical replicates because of sample scarcity, duty-cycle or sensitivity constraints, or limited capacity of the available instrumentation, leading to incomplete results in which the detection of significant feature changes becomes a challenge. This problem is further exacerbated for the detection of significant changes at the peptide level, for example, in phospho-proteomics experiments. To assess the extent of this problem and its implications for large-scale proteome analysis, we investigated and optimized the performance of three statistical approaches using simulated and experimental data sets with varying numbers of missing values. We applied three tools, the standard t test, the moderated t test (limma), and rank products, for the detection of significantly changing features in simulated and experimental proteomics data sets with missing values. The rank product method was improved to work with data sets containing missing values. Extensive analysis of simulated and experimental data sets revealed that the performance of the statistical tools depended on simple properties of the data sets. High-confidence results were obtained using the limma and rank products methods for analyses of triplicate data sets that exhibited more than 1000 features and more than 50% missing values. The maximum number of differentially represented features was identified by using the limma and rank products methods in a complementary manner. We therefore recommend combined usage of these methods as a novel and optimal way to detect significantly changing features in these data sets. This approach is suitable for large quantitative data sets from stable isotope labeling and mass spectrometry experiments and should be applicable to large data sets of any type. An R script that implements the improved rank products algorithm and the combined analysis is available.
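As an illustration of the rank product idea described above, the following is a minimal Python sketch of a missing-value-tolerant rank product statistic. It is not the authors' R script, and the permutation-based significance estimation used in practice is omitted.

```python
import numpy as np

def rank_products(log_ratios):
    """Missing-value-tolerant rank product statistic.

    log_ratios : 2-D array (features x replicates) of log fold changes,
                 with NaN marking missing values.
    Each replicate is ranked independently (rank 1 = strongest up-regulation),
    ranks are normalised by the number of observed features in that replicate,
    and the statistic is the geometric mean of the ranks available for each
    feature. Smaller values indicate consistently top-ranked features.
    """
    log_ratios = np.asarray(log_ratios, dtype=float)
    ranks = np.full(log_ratios.shape, np.nan)
    for j in range(log_ratios.shape[1]):
        col = log_ratios[:, j]
        obs = ~np.isnan(col)
        order = np.argsort(-col[obs])            # descending fold change
        r = np.empty(obs.sum())
        r[order] = np.arange(1, obs.sum() + 1)   # 1 = top-ranked feature
        ranks[obs, j] = r / obs.sum()            # normalise to (0, 1]
    # geometric mean over the replicates in which each feature was observed
    return np.exp(np.nanmean(np.log(ranks), axis=1))
```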
The project contains raw and result files from a comparative proteomic analysis of malignant [primary breast tumor (PT) and axillary metastatic lymph nodes (LN)] and non-tumor [contralateral (NCT) and adjacent breast (ANT)] tissues of patients diagnosed with invasive ductal carcinoma. Label-free mass spectrometry was conducted using nano-liquid chromatography coupled to electrospray ionization tandem mass spectrometry (LC-ESI-MS/MS), followed by functional annotation to reveal differentially expressed proteins and their predicted impact on pathways and cellular functions in breast cancer. A total of 462 differentially expressed proteins (DEPs) were identified among the groups of samples analyzed. Ingenuity Pathway Analysis software version 2.3 (QIAGEN Inc.) was employed to identify the most relevant signaling and metabolic pathways, diseases, biological functions, and interaction networks affected by the deregulated proteins. Upstream regulator and biomarker analyses were also performed using IPA's tools. Altogether, our findings revealed differential proteomic profiles that affected the associated and interconnected cancer signaling processes.
MGVB is a collection of tools for proteomics data analysis. It covers data processing from in silico digestion of protein sequences to comprehensive identification of post-translational modifications and solving the protein inference problem. The toolset is developed with efficiency in mind and enables analysis at a fraction of the resource cost typically required by existing commercial and free tools. As a native application, MGVB is much faster than existing proteomics tools such as MaxQuant and MSFragger and, at the same time, finds a very similar, and in some cases even larger, number of peptides at a chosen level of statistical significance. It implements a probabilistic scoring function to match spectra to sequences, a novel combinatorial search strategy for finding post-translational modifications, and a Bayesian approach to locate modification sites. This report describes the algorithms behind the tools, presents analyses of benchmarking data sets comparing MGVB performance to MaxQuant/Andromeda, and provides step-by-step instructions for using it in typical analytical scenarios. The toolset is free to download and use for academic research and in software projects, but is not open source at present. It is the intention of the author to make it open source in the near future, following rigorous evaluation and feedback from the proteomics research community.
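MGVB's exact scoring function is not reproduced in this summary. As a hedged illustration of what a probabilistic spectrum-to-sequence score can look like, the sketch below uses a generic binomial survival-function score; the tolerance and random-match probability are assumptions for illustration, not MGVB's values.

```python
import math
from scipy.stats import binom

def probabilistic_psm_score(observed_mz, theoretical_mz, tol=0.02, p_random=0.01):
    """Generic binomial-style peptide-spectrum match score.

    Counts how many theoretical fragment m/z values are matched by any observed
    peak within `tol`, then reports -log10 of the probability of seeing at least
    that many matches by chance if each fragment matched a random spectrum with
    probability `p_random`. Higher scores indicate less probable (better) matches.
    """
    matched = sum(
        any(abs(obs - theo) <= tol for obs in observed_mz)
        for theo in theoretical_mz
    )
    # P(X >= matched) under Binomial(n_fragments, p_random)
    p_value = binom.sf(matched - 1, len(theoretical_mz), p_random)
    return -math.log10(max(p_value, 1e-300))
```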
https://www.law.cornell.edu/uscode/text/17/106
Bottom-up proteomics (BUP) is a powerful analytical technique that involves digesting complex protein mixtures into peptides and analyzing them by liquid chromatography and tandem mass spectrometry to identify and quantify many proteins simultaneously. This produces massive multidimensional datasets that require informatics tools to analyze. The landscape of software tools for BUP analysis is vast and complex, and custom programs and scripts are often required to answer the biological questions of interest in any given experiment.
This dissertation introduces novel methods and tools for analyzing BUP experiments and applies those methods to new samples. First, PrIntMap-R, a custom application for intraprotein intensity mapping, is developed and validated. This application is the first open-source tool to allow statistical comparisons of peptides within a protein sequence along with quantitative sequence coverage visualization. Next, innovative sample preparation techniques and informatics methods are applied to characterize MUC16, a key ovarian cancer biomarker. This includes the proteomic validation of a novel model of MUC16 that differs from the dominant isoform reported in the literature. Shifting to bacterial studies, custom differential expression workflows are employed to investigate the role of virulence lipids in mycobacterial protein secretion by analyzing mutant strains of mycobacteria. This work links lipid presence and virulence factor secretion for the first time. Building on these efforts, OnePotN??TA, a labeling technique enabling quantification of N-terminal acetylation in mycobacterial samples, is introduced. This method is the first technique to simultaneously quantify protein and N-terminal acetylation abundance using bottom-up proteomics, advancing the field of post-translational modification quantification. This project resulted in the identification of 37 new putative substrates for an N-acetyltransferase, three of which have since been validated biochemically. These tools and methodologies are further applied to various biological research areas, including breast cancer drug characterization and insect saliva analysis, yielding the first proteomic studies of their kind for these respective treatments and samples. Additionally, a project focused on teaching programming skills relevant to analytical chemistry is presented. Collectively, this work enhances the analytical capabilities of bottom-up proteomics, providing novel tools and methodologies that advance protein characterization, post-translational modification analysis, and biological discovery across diverse research areas.
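The dissertation's tools are not reproduced here, but the core idea behind intraprotein intensity mapping, projecting peptide intensities onto the parent protein sequence to obtain per-residue intensity and quantitative sequence coverage, can be sketched as below. This is a minimal, hypothetical illustration, not the PrIntMap-R implementation.

```python
import re
import numpy as np

def per_residue_intensity(protein_seq, peptide_intensities):
    """Map peptide-level intensities onto a protein sequence.

    protein_seq        : protein sequence as a string.
    peptide_intensities: dict mapping peptide sequence -> summed intensity.
    Returns a per-residue intensity profile (sum of intensities of all peptides
    covering each position) and the fractional sequence coverage.
    """
    profile = np.zeros(len(protein_seq))
    for pep, intensity in peptide_intensities.items():
        for match in re.finditer(re.escape(pep), protein_seq):
            profile[match.start():match.end()] += intensity
    coverage = np.count_nonzero(profile) / len(protein_seq)
    return profile, coverage

# Example with toy values (hypothetical peptides and intensities)
prof, cov = per_residue_intensity("MKTAYIAKQRQISFVK", {"TAYIAK": 1.5e6, "QISFVK": 3.2e5})
print(f"coverage = {cov:.1%}")
```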
https://www.datainsightsmarket.com/privacy-policy
The global genomics and proteomics tools market is projected to witness substantial growth over the coming years, driven by advancements in molecular biology, increasing adoption of personalized medicine, and rising demand for early disease diagnosis. The market size is estimated to reach a valuation of USD 85.50 billion by 2033, expanding at a CAGR of 10.7% from 2025 to 2033. North America and Europe are currently the dominant markets, with significant contributions from the United States, China, and Germany. Key trends shaping the genomics and proteomics tools market include the integration of artificial intelligence (AI) and machine learning (ML) in data analysis, the development of next-generation sequencing (NGS) technologies, and the growing focus on precision medicine. However, challenges related to data management, regulatory compliance, and privacy concerns may hinder market growth. Despite these restraints, the increasing healthcare expenditure, technological advancements, and rising awareness about the benefits of genomics and proteomics tools are expected to create lucrative opportunities in the coming years.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results obtained for the 10 formative biomarkers selected by FRMT, reported in separate sheets for each dataset. (XLSX)
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Isobaric labeling-based proteomics is widely applied in deep proteome quantification. Among the platforms for isobaric-labeled proteomic data analysis, the commercial software Proteome Discoverer (PD) is widely used and incorporates the search engine CHIMERYS, while FragPipe (FP) is relatively new, free for noncommercial purposes, and integrates the engine MSFragger. Here, we compared PD and FP on three public proteomic data sets labeled using 6plex, 10plex, and 16plex tandem mass tags. Our results showed that the protein abundances generated by the two software packages are highly correlated. PD quantified more proteins (10.02%, 15.44%, 8.19%) than FP, with comparable NA ratios (0.00% vs. 0.00%, 0.85% vs. 0.38%, and 11.74% vs. 10.52%) in the three data sets. Using the 16plex data set, PD and FP outputs showed high consistency in quantifying technical replicates, batch effects, and functional enrichment in differentially expressed proteins. However, FP saved 93.93%, 96.65%, and 96.41% of the processing time compared to PD for the three data sets, respectively. In conclusion, while PD is a well-maintained commercial package that integrates various additional functions and quantifies more proteins, FP is freely available and achieves similar output in a shorter computational time. Our results will guide users in choosing the most suitable quantification software for their needs.
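A minimal sketch of the kind of comparison described above, assuming each tool has been exported to a protein-by-channel abundance table; the file names and layout are hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical exports: one protein x TMT-channel abundance table per tool,
# indexed by protein accession, with matching channel column names.
pd_quant = pd.read_csv("pd_protein_abundance.csv", index_col="Accession")
fp_quant = pd.read_csv("fp_protein_abundance.csv", index_col="Accession")
fp_quant = fp_quant.reindex(columns=pd_quant.columns)  # align channel order

def na_ratio(df):
    """Fraction of missing protein-by-channel measurements."""
    return float(df.isna().to_numpy().mean())

print(f"NA ratio  PD: {na_ratio(pd_quant):.2%}   FP: {na_ratio(fp_quant):.2%}")

# Correlation of log2 abundances over proteins quantified by both tools
shared = pd_quant.index.intersection(fp_quant.index)
x = np.log2(pd_quant.loc[shared]).to_numpy().ravel()
y = np.log2(fp_quant.loc[shared]).to_numpy().ravel()
ok = np.isfinite(x) & np.isfinite(y)
r = np.corrcoef(x[ok], y[ok])[0, 1]
print(f"Pearson r over {len(shared)} shared proteins: {r:.3f}")
```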
https://dataintelo.com/privacy-and-policy
The global proteomics analytical service market size was valued at USD 5.2 billion in 2023 and is projected to reach USD 14.8 billion by 2032, exhibiting a CAGR of 12.4% during the forecast period. This substantial growth is driven by the increasing demand for precision medicine, advancements in mass spectrometry, and a growing focus on drug discovery and development.
One of the primary growth factors for the proteomics analytical service market is the rise in demand for personalized medicine. Personalized medicine aims to tailor medical treatment to the individual characteristics of each patient, and proteomics plays a crucial role by providing detailed insights into the protein expressions and modifications in different diseases. With increasing incidences of chronic diseases and the emphasis on early diagnosis and treatment, the need for advanced proteomics analytical services has surged, propelling market growth.
Another significant growth driver is technological advancements in proteomics. Advancements in mass spectrometry, chromatography, and bioinformatics tools have revolutionized the field of proteomics. These advanced technologies offer high-throughput, accurate, and comprehensive protein analysis, driving their adoption in research and clinical settings. Moreover, continuous innovations in proteomic technologies are expected to further accelerate market growth by improving the efficiency and scope of proteomics analytical services.
The growing investments in proteomics research and development also play a pivotal role in market expansion. Both government and private sectors are increasingly investing in proteomics projects due to the potential of proteomics in drug discovery, biomarker discovery, and clinical diagnostics. Funding for proteomics research has increased, leading to more comprehensive studies and a higher demand for analytical services that can provide detailed and accurate protein analysis.
Regionally, North America dominates the proteomics analytical service market, followed by Europe and Asia Pacific. North America's dominance is attributed to the presence of well-established pharmaceutical and biotechnology companies, leading academic institutions, and advanced healthcare infrastructure. Europe follows closely due to substantial government funding and focus on research and development. The Asia Pacific region, on the other hand, is expected to witness the highest growth rate due to increasing R&D activities, rising healthcare expenditure, and growing awareness about personalized medicine in emerging economies like China and India.
The proteomics analytical service market is segmented by service type into protein identification, protein quantification, protein characterization, data analysis and interpretation, and others. Protein identification services hold a significant market share due to the critical role of identifying and cataloging proteins in understanding disease mechanisms and discovering potential therapeutic targets. The demand for protein identification services is further driven by advancements in mass spectrometry and electrophoresis technologies that enhance the accuracy and throughput of protein identification processes.
Protein quantification services are also experiencing substantial growth. Accurate quantification of proteins is essential for understanding protein function, interaction, and regulation within biological systems. The increasing need for quantitative proteomics in biomarker discovery, drug development, and clinical diagnostics is driving the demand for these services. Moreover, advancements in isotope labeling techniques and mass spectrometry have significantly improved the precision and sensitivity of protein quantification methods.
Protein characterization services are gaining traction due to their importance in understanding protein structure, function, and interactions. Characterizing the post-translational modifications and conformational changes of proteins is crucial for comprehending cellular processes and disease pathology. The integration of advanced techniques like X-ray crystallography, NMR spectroscopy, and cryo-electron microscopy in protein characterization is fueling the growth of this segment.
Data analysis and interpretation services are becoming increasingly vital due to the complexity and volume of proteomics data generated. Sophisticated bioinformatics tools and software are required to analyze and interpret proteomics datasets.
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) analyzes cancer biospecimens by mass spectrometry, characterizing and quantifying their constituent proteins, or proteome. Proteomic analysis for each CPTAC study is carried out independently by Proteomic Characterization Centers (PCCs) using a variety of protein fractionation techniques, instrumentation, and workflows. Mass spectrometry and related data files are organized into datasets by study, sub-proteome, and analysis site.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Proteomic analysis of sensory organs such as the cochlea is challenging due to its small size and difficulties with membrane protein isolation. Mass spectrometry in conjunction with separation methods can provide a more comprehensive proteome, because of the ability to enrich protein samples, detect hydrophobic proteins, and identify low abundant proteins by reducing the proteome dynamic range. GELFrEE as well as different separation and digestion techniques were combined with FASP and nanoLC–MS/MS to obtain an in-depth proteome analysis of cochlear sensory epithelium from 30-day-old mice. Digestion with LysC/trypsin followed by SCX fractionation and multiple nanoLC–MS/MS analyses identified 3773 proteins with a 1% FDR. Of these, 694 protein IDs were in the plasmalemma. Protein IDs obtained by combining outcomes from GELFrEE/LysC/trypsin with GELFrEE/trypsin/trypsin generated 2779 proteins, of which 606 additional proteins were identified using the GELFrEE/LysC/trypsin approach. Combining results from the different techniques resulted in a total of 4620 IDs, including a number of previously unreported proteins. GO analyses showed high expression of binding and catalytic proteins as well as proteins associated with metabolism. The results show that the application of multiple techniques is needed to provide an exhaustive proteome of the cochlear sensory epithelium that includes many membrane proteins. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD000231.
THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 03, 2011. IT HAS BEEN REPLACED BY A NEW UniProtKB/Swiss-Prot ANNOTATION PROGRAM CALLED the UniProt Chordata protein annotation program. The Human Proteome Initiative (HPI) aims to annotate all known human protein sequences, as well as their orthologous sequences in other mammals, according to the quality standards of UniProtKB/Swiss-Prot. In addition to accurate sequences, the initiative strives to provide, for each protein, a wealth of information that includes the description of its function, domain structure, subcellular location, similarities to other proteins, etc. Although as complete as currently possible, the human protein set provided is still imperfect and will have to be reviewed and updated as new research results become available. The initiative will also create entries for newly discovered human proteins, increase the number of splice variants, explore the full range of post-translational modifications (PTMs), and continue to build a comprehensive view of protein variation in the human population. The availability of the human genome sequence has enabled the exploration and exploitation of the human genome and proteome to begin. Research has now focused on the annotation of the genome and, in particular, of the proteome. With expert annotation extracted from the literature by biologists as the foundation, it has been possible to expand into the areas of data mining and automatic annotation. With further development and integration of pattern recognition methods and the application of alignment clustering, proteome analysis can now be provided in a meaningful way. These various approaches have been integrated to attach, extract, and combine as much relevant information as possible with the proteome. This resource should be valuable to users from both research and industry. A file containing all human UniProtKB/Swiss-Prot entries is maintained; it is updated at every biweekly release of UniProt and can be downloaded by FTP, by HTTP, or with a mirroring program that automatically retrieves the file at regular intervals.
• Evaluation of two methods for protein extraction from muscle tissues.
• The two extraction methods showed distinct patterns of protein abundance.
• Two-dimensional electrophoresis revealed 131 spots with different volumes.
• The SDS-based buffer (method A) mostly extracted proteins in the high-MW range, characteristic of the sarcomere and muscle fibres, such as Troponin, Myosin, Myozenin-2, and Ankyrin.
• The urea/CHAPS/DTE/Tris solution (method B) enriched numerous protein species in the low-MW range, several protein fragments, and organelle membrane proteins.
To unbiasedly evaluate the quantitative performance of different quantitative methods and compare popular proteomics data processing workflows, we prepared a benchmark dataset with various levels of spiked-in E. coli proteome, in which the true fold changes (1-fold, 1.5-fold, 2-fold, 2.5-fold, and 3-fold) and the true identities of positives and negatives (E. coli proteins are true positives, human proteins are true negatives) are known. To best mimic proteomics applications that compare multiple replicates, each fold-change group contains 4 replicates, giving 20 LC-MS/MS analyses in this benchmark dataset. To our knowledge, this is the largest-scale spike-in benchmark dataset to date, encompassing 5 different spike levels, >500 true-positive proteins, and >3000 true-negative proteins (2-peptide criterion, 1% protein FDR), with a wide concentration dynamic range. The dataset is ideal for testing quantitative accuracy, precision, false-positive biomarker discovery, and missing-data levels.
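A sketch of how such a ground-truth benchmark can be scored, assuming a hypothetical results table with a species label, an observed log2 fold change, and a q-value per protein; column names and the q-value cutoff are assumptions, not part of the dataset itself.

```python
import numpy as np
import pandas as pd

# Hypothetical workflow output: one row per protein with columns
# 'species' ('ECOLI' or 'HUMAN'), 'log2FC' (observed fold change between two
# spike levels), and 'qvalue' from the workflow being benchmarked.
res = pd.read_csv("workflow_output.csv")

truth = res["species"] == "ECOLI"        # E. coli proteins are true positives
called = res["qvalue"] < 0.05            # differential calls by the workflow

tp = int((called & truth).sum())
fp = int((called & ~truth).sum())
empirical_fdr = fp / max(tp + fp, 1)
print(f"TP={tp}  FP={fp}  empirical FDR={empirical_fdr:.2%}")

# Quantitative accuracy: observed vs. expected log2 ratio for one spike group
expected_log2fc = np.log2(1.5)           # e.g. the 1.5-fold spike comparison
median_error = (res.loc[truth, "log2FC"] - expected_log2fc).abs().median()
print(f"median |log2FC error| for E. coli proteins: {median_error:.3f}")
```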
Portal to make cancer related proteomic datasets easily accessible to public. Facilitates multiomic integration in support of precision medicine through interoperability with other resources. Developed to advance our understanding of how proteins help to shape risk, diagnosis, development, progression, and treatment of cancer. One of several repositories within NCI Cancer Research Data Commons which enables researchers to link proteomic data with other data sets (e.g., genomic and imaging data) and to submit, collect, analyze, store, and share data throughout cancer data ecosystem. PDC provides access to highly curated and standardized biospecimen, clinical, and proteomic data, intuitive interface to filter, query, search, visualize and download data and metadata. Provides common data harmonization pipeline to uniformly analyze all PDC data and provides advanced visualization of quantitative information. Cloud based (Amazon Web Services) infrastructure facilitates interoperability with AWS based data analysis tools and platforms natively. Application programming interface (API) provides cloud-agnostic data access and allows third parties to extend functionality beyond PDC. Structured workspace that serves as private user data store and also data submission portal. Distributes controlled access data, such as patient-specific protein fasta sequence databases, with dbGaP authorization and eRA Commons authentication.
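As an illustration, the PDC API can be queried programmatically over GraphQL. The endpoint URL and the example query below are drawn from PDC's public API documentation but should be treated as assumptions; the field names may differ in the current schema, so consult the documentation before relying on them.

```python
import requests

# PDC exposes a GraphQL endpoint; URL and query below are illustrative only.
PDC_GRAPHQL = "https://pdc.cancer.gov/graphql"

query = """
{ allPrograms { program_id program_submitter_id name } }
"""  # hypothetical example query; verify against the live schema

response = requests.post(PDC_GRAPHQL, json={"query": query}, timeout=30)
response.raise_for_status()
print(response.json())
```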
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data files in .raw format are available from the Chorus repository (https://chorusproject.org) under project ID number 650 (project name: "Diatom response to allelopathy"), experiment ID number 843.
Files can be downloaded from the following URL. Note the total file size is 13.4 GB: https://chorusproject.org/anonymous/download/experiment/6729965144923166962
The data files are in Thermo-Finnigan .raw format. Vendor software can be used to view these files, and free viewers are also available (one example is PVIEW). The files can also be converted to mzXML using msConvert (part of ProteoWizard).
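For example, a batch conversion to mzXML could be scripted around msConvert roughly as follows, assuming msconvert is installed and on the PATH (on Linux it is commonly run through the ProteoWizard Docker image); the directory names are placeholders.

```python
import subprocess
from pathlib import Path

# Convert Thermo .raw files to mzXML with ProteoWizard's msconvert.
raw_dir = Path("raw_files")   # placeholder: directory holding downloaded .raw files
out_dir = Path("mzxml")       # placeholder: destination directory
out_dir.mkdir(exist_ok=True)

for raw_file in sorted(raw_dir.glob("*.raw")):
    subprocess.run(
        ["msconvert", str(raw_file), "--mzXML", "--outdir", str(out_dir)],
        check=True,  # raise if conversion of any file fails
    )
```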
Cumulative malaria parasite exposure in endemic regions often results in the acquisition of partial immunity and asymptomatic infections. There is limited information on how host-parasite interactions mediate the maintenance of chronic symptomless infections that sustain malaria transmission. Here, we have determined the gene expression profiles of the parasite population and the corresponding host peripheral blood mononuclear cells (PBMCs) from 21 children (<15 years). We compared children who were defined as uninfected, asymptomatic, and those with febrile malaria. Children with asymptomatic infections had a parasite transcriptional profile characterized by a bias toward trophozoite-stage (~12 hours post-invasion) parasites and low parasite levels, while earlier ring-stage parasites were characteristic of febrile malaria. The host response of asymptomatic children was characterized by downregulated transcription of genes associated with inflammatory responses, compared to children with ...

Proteins were extracted from PBMCs by resuspending the pellet with 5 µl of 6 M urea (Thermo Scientific). The protein samples were then adjusted to 100 µl with 50 mM triethylammonium bicarbonate (TEAB, Sigma-Aldrich) and the protein concentration determined using the bicinchoninic acid (BCA) protein assay (Thermo Scientific). The protein samples were then reduced with 40 mM dithiothreitol, alkylated with 80 mM iodoacetamide in the dark, and quenched with 80 mM iodoacetamide at room temperature, followed by digestion with 1 µg/µl of trypsin (57). Nine pools, each containing 9 samples and 1 control for batch correction, were prepared by combining 1 µl aliquots from each sample. The samples were pooled using a custom randomization R script. The pooled samples were then individually labelled using the Tandem Mass Tag (TMT) 10-plex kit (Thermo Scientific) according to the manufacturer's instructions. One isobaric tag was used solely for the pooled samples and combined with peptide samples labelled with ...

The files can be opened using MaxQuant software; version 2.0.3.0 was used for the analysis. Differential protein abundance analysis of the MaxQuant output was done using PERSEUS version 2.05.0. Protein-protein interaction and Gene Ontology analyses were performed using the STRING database version 11.5 (https://string-db.org/).

# Proteome of peripheral mononuclear cells (PBMCs) from asymptomatic malaria and uninfected individuals and the ensuing febrile malaria episodes
Proteins were extracted from peripheral mononuclear cells (PBMCs), pooled using Tandem Mass Tags (TMT, 10-plex), and injected into the LC-MS/MS for proteomics analysis. The output raw files were loaded into MaxQuant software v2.0.3.0 for protein quantification. The output from MaxQuant was then read using PERSEUS software v2.05.0 and differential protein abundance analysis was performed. The Proteomics_metadata file contains the metadata that links each sample to the raw data files and the treatment group (condition).
The RAW data files provided contain the LC-MS/MS output data for each pool. These pools serve as the input data for the MaxQuant software.
The Proteomics_metadata file contains the metadata that links each sample to the condition/treatment group (i.e. asymptomatic, uninfecte...
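A minimal pandas sketch of how such a metadata file could be joined to MaxQuant's proteinGroups output to build a protein-by-sample intensity matrix. The metadata column names (sample_id, pool, channel, condition) are assumptions, while the MaxQuant column names follow its standard output format.

```python
import pandas as pd

# Standard MaxQuant output plus a hypothetical metadata table mapping each
# sample to its TMT pool, reporter channel, and condition.
pg = pd.read_csv("proteinGroups.txt", sep="\t", low_memory=False)
meta = pd.read_csv("Proteomics_metadata.csv")

# Remove decoy and contaminant entries before quantification
pg = pg[(pg["Reverse"] != "+") & (pg["Potential contaminant"] != "+")]

# Assemble a protein x sample matrix from the corrected reporter intensities
intensity = {}
for _, row in meta.iterrows():
    col = f"Reporter intensity corrected {row['channel']} {row['pool']}"
    if col in pg.columns:
        intensity[row["sample_id"]] = pg[col]
matrix = pd.DataFrame(intensity, index=pg["Majority protein IDs"])
print(matrix.shape, meta["condition"].value_counts().to_dict())
```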
https://www.marketreportanalytics.com/privacy-policy
The AI in Proteomics market is experiencing significant growth, driven by the increasing need for high-throughput data analysis in drug discovery and scientific research. The market's expansion is fueled by advancements in artificial intelligence algorithms, particularly deep learning, which are capable of handling the complex and massive datasets generated by proteomic technologies. This allows for faster and more accurate identification and quantification of proteins, leading to improved insights into biological processes and disease mechanisms. The software segment currently holds a larger market share compared to services, reflecting the rising adoption of AI-powered software solutions for proteomic data analysis. However, the services segment is expected to witness substantial growth due to the increasing demand for specialized expertise in implementing and interpreting AI-driven proteomic analyses. Key players like Google DeepMind, Microsoft, and Thermo Fisher Scientific are driving innovation through strategic partnerships, acquisitions, and the development of cutting-edge AI-powered proteomics platforms. The North American region currently dominates the market due to substantial investments in research and development, and a robust presence of major players. However, regions like Asia Pacific are expected to witness significant growth in the coming years driven by increasing government funding and growing adoption of advanced technologies.

Despite the rapid growth, the market faces certain challenges. High initial investment costs associated with AI-powered proteomics technologies, the requirement for specialized technical expertise, and the need for robust data infrastructure can limit widespread adoption. Furthermore, data privacy concerns and regulatory hurdles related to the use of AI in healthcare are also factors that need to be considered. Nevertheless, the long-term outlook for the AI in Proteomics market remains positive, with continued advancements in AI algorithms and decreasing costs expected to drive further market expansion. The market is poised to benefit from increased integration with other "omics" technologies and the development of AI-powered diagnostic tools, creating new opportunities for growth and innovation across various applications. The forecast period of 2025-2033 suggests a period of sustained and robust market growth, potentially outpacing even the more conservative estimates based on historical data.
https://dataintelo.com/privacy-and-policy
The global genomic and proteomic tool market size is projected to grow from $32.5 billion in 2023 to an estimated $75.6 billion by 2032, exhibiting a compound annual growth rate (CAGR) of 9.7% over the forecast period. This substantial growth is driven by increasing investments in genomics and proteomics research, advancements in sequencing technologies, and the rising prevalence of chronic diseases that demand precise diagnostic tools and personalized treatment options.
One of the primary growth factors for the genomic and proteomic tool market is the rapid advancement in high-throughput sequencing technologies. The advent of next-generation sequencing (NGS) has revolutionized both genomics and proteomics, enabling researchers to sequence entire genomes and transcriptomes quickly and cost-effectively. This technological leap has been pivotal in driving research in understanding genetic and protein functions, which in turn fuels the demand for sophisticated tools and devices in this sector.
Another significant driver is the increasing focus on personalized medicine. Personalized medicine aims to tailor medical treatment to the individual characteristics of each patient, which requires comprehensive genomic and proteomic profiling. The growing recognition of the importance of personalized approaches in improving patient outcomes is propelling the demand for advanced genomic and proteomic tools. Additionally, government initiatives and funding for genomics and proteomics research further bolster market growth.
Pharmaceutical and biotechnology companies are also playing a crucial role in the market's expansion. These companies invest heavily in research and development to discover novel drugs and therapies, relying extensively on genomic and proteomic tools for drug discovery and development. The rise in the prevalence of chronic diseases like cancer, diabetes, and cardiovascular diseases necessitates the development of new, effective therapies, thereby increasing the utilization of these advanced tools.
Regionally, North America is expected to dominate the market due to the presence of a robust research infrastructure, significant government and private sector funding, and a strong focus on precision medicine. Europe is also anticipated to hold a substantial market share, driven by technological advancements and increasing healthcare expenditure. The Asia Pacific region is projected to witness the fastest growth, attributed to rising investments in healthcare infrastructure, increasing prevalence of chronic diseases, and growing awareness of advanced diagnostic and therapeutic procedures.
The genomic and proteomic tool market by product type is segmented into instruments, consumables, and software. Instruments constitute a significant portion of this market, as they encompass a variety of high-precision devices essential for genomic and proteomic research. These include sequencers, mass spectrometers, and electrophoresis apparatus, among others. The constant innovation in these instruments, aimed at enhancing their accuracy, speed, and cost-effectiveness, is a key factor driving their demand. Researchers and laboratories continuously upgrade their existing equipment to keep pace with technological advancements, thereby fueling the market growth for instruments.
Consumables form another major segment within the genomic and proteomic tool market. These include reagents, kits, and other laboratory supplies that are indispensable for conducting various genomic and proteomic analyses. The recurring need for these consumables in numerous experimental procedures ensures a steady demand. Additionally, the expansion of genomic and proteomic research across academic, clinical, and industrial settings contributes to the growing consumables market. The ongoing research and development activities, coupled with the widespread application of these tools in diagnostics and drug discovery, further propel the consumable segment.
The software segment is equally important, providing the computational power necessary to analyze large sets of genomic and proteomic data. Software tools for data analysis, visualization, and interpretation are integral to the workflow of genomics and proteomics research. These tools help in managing the vast amounts of data generated by high-throughput technologies and extracting meaningful insights. As the complexity and volume of data continue to increase, the demand for sophisticated software solutions is expected to rise. Moreover, continuous a
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Precursor intensity-based label-free quantification software tools for proteomic and multi-omic analysis within the Galaxy Platform.
ABRF: Data were generated through the collaborative work of the ABRF Proteomics Research Group (https://abrf.org/research-group/proteomics-research-group-prg). See the reference for details: Van Riper, S. et al. "An ABRF-PRG study: Identification of low abundance proteins in a highly complex protein sample," presented at the 64th Annual Conference of the American Society for Mass Spectrometry and Allied Topics, San Antonio, TX.
UPS: MaxLFQ. Cox J, Hein MY, Luber CA, Paron I, Nagaraj N, Mann M. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol Cell Proteomics. 2014 Sep;13(9):2513-26. doi: 10.1074/mcp.M113.031591. Epub 2014 Jun 17. PMID: 24942700; PMCID: PMC4159666.
PRIDE #5412; ProteomeXchange repository PXD000279: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2014/09/PXD000279
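For orientation, the MaxLFQ idea cited above (pairwise peptide-ratio medians between samples followed by a least-squares reconstruction of relative protein abundances) can be sketched as follows. This is a didactic approximation under simplified assumptions, not the MaxQuant implementation.

```python
import itertools
import numpy as np

def maxlfq_like(peptide_intensities):
    """Very simplified MaxLFQ-style protein quantification for one protein.

    peptide_intensities : 2-D array (peptides x samples) of positive
    intensities, with NaN marking peptides not observed in a sample.
    For each sample pair the protein log2 ratio is estimated as the median of
    peptide log-ratios over peptides seen in both samples; per-sample log2
    abundances are then reconstructed by least squares, anchored so their
    mean is zero (relative quantification only).
    """
    log_i = np.log2(peptide_intensities)
    n_samples = log_i.shape[1]
    rows, rhs = [], []
    for a, b in itertools.combinations(range(n_samples), 2):
        both = ~np.isnan(log_i[:, a]) & ~np.isnan(log_i[:, b])
        if not both.any():
            continue  # no shared peptides for this sample pair
        ratio = np.median(log_i[both, a] - log_i[both, b])
        row = np.zeros(n_samples)
        row[a], row[b] = 1.0, -1.0
        rows.append(row)
        rhs.append(ratio)
    rows.append(np.ones(n_samples))  # anchor the overall level
    rhs.append(0.0)
    estimate, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    return estimate  # relative log2 protein abundance per sample
```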