Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mass spectrometry (MS)-based metaproteomics is used to identify and quantify proteins in microbiome samples, with the frequently used methodology being Data-Dependent Acquisition mass spectrometry (DDA-MS). However, DDA-MS is limited in its ability to reproducibly identify and quantify lower abundant peptides and proteins. To address DDA-MS deficiencies, proteomics researchers have started using Data-Independent Acquisition Mass Spectrometry (DIA-MS) for reproducible detection and quantification of peptides and proteins. We sought to evaluate the reproducibility and accuracy of DIA-MS metaproteomic measurements relative to DDA-MS metaproteomic measurements using a mock community of known taxonomic composition. Artificial microbial communities of known composition were analyzed independently in three laboratories using DDA- and DIA-MS acquisition methods. DIA-MS yielded more protein and peptide identifications than DDA-MS in each laboratory. In addition, the protein and peptide identifications were more reproducible in all laboratories and provided an accurate quantification of proteins and taxonomic groups in the samples. We also identified some limitations of current DIA tools when applied to metaproteomic data highlighting specific needs to further improve DIA tools to enable analysis of metaproteomic datasets from complex microbiomes. Ultimately, DIA-MS represents a promising data collection strategy for MS-based metaproteomics due to its large number of detected proteins and peptides, reproducibility, deep sequencing capabilities, and accurate quantitation.
Facebook
TwitterData dependent acquisition (DDA) is the method of choice for mass spectrometry based proteomics discovery experiments, data-independent acquisition (DIA) is steadily becoming more important. One of the most important requirement to perform a DIA analysis is the availability of spectral libraries for the peptide identification and quantification. Several researches were already conducted regarding the creation of spectral libraries from DDA analyses and obtaining identifications with these in DIA measurements. But so far only few experiments were conducted, to estimate the effect of these libraries on the quantitative level. In this work we created a spike-in gold standard dataset with known contents and ratios of proteins in a complex sample matrix. With this dataset, we first created spectral libraries using different sample preparation approaches with and without sample prefractionation on peptide and protein level. Two different search engines were used for protein identification. In total, five different spike-in states were compared with DIA analyses, comparing eight different spectral libraries generated by varying approaches and one library free method, as well as one default DDA analysis. Not only the number of identifications on peptide and protein level in the spectral libraries and the corresponding analyses was inspected, but also the number of expected and identified significant quantifications and their ratios were thoroughly examined. We found, that while libraries of prefractionationed samples are generally larger, the actually yielded identifications are not increased compared to repetitive non-fractionated measurements. Furthermore, we show that the accuracy of the quantifications is also highly dependent on the applied spectra library and also whether the peptide or protein level is analysed. Overall, the reproducibility and accuracy of DIA is superior to DDA in all analysed approaches.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Analysis of DDA data searched with MaxQuant default values.
Facebook
TwitterGeneration of a new library of targeted mass spectrometry assays for accurate protein quantification in triple negative breast cancer (TNBC) tissues. Primary tumor tissue lysates from 105 TNBC patients treated at Masaryk Memorial Cancer Institute (MMCI) in Brno, Czech Republic, were used to generate the spectral library. This project covers raw files from data-dependent acquisition (DDA) – parallel accumulation-serial fragmentation (PASEF) measurements of 12 hydrophilic chromatography (HILIC) fractions of aliquot pool from complete set of 105 samples measured on timsTOF Pro; raw files of 16 individual samples measured in data-independent acquisition (DIA) – PASEF mode and used for hybrid library generation and for demonstrative quantitative DIA data extraction; Pulsar archive generated in Spectronaut 16.0 from 12 DDA-PASEF measurements of HILIC fractions and from 16 data-independent acquisition DIA-PASEF measurements of individual samples. The 16 DIA-PASEF runs of individual samples used for library generation were analyzed using newest versions of Spectronaut (version 18.5) and DIA-NN (version 1.8.1) software tools in library-based setting using the newly generated library as well as in library-free setting showing library-based method to outperform the use of predicted libraries in the terms of identification numbers.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
State-of-the-art strategies for proteomics are not able to rapidly interrogate complex peptide mixtures in an untargeted manner with sensitive peptide and protein identification rates. We describe a data-independent acquisition (DIA) approach, microDIA (μDIA), that applies a novel tandem mass spectrometry (MS/MS) mass spectral deconvolution method to increase the specificity of tandem mass spectra acquired during proteomics experiments. Using the μDIA approach with a 10 min liquid chromatography gradient allowed detection of 3.1-fold more HeLa proteins than the results obtained from data-dependent acquisition (DDA) of the same samples. Additionally, we found the μDIA MS/MS deconvolution procedure is critical for resolving modified peptides with relatively small precursor mass shifts that cause the same peptide sequence in modified and unmodified forms to theoretically cofragment in the same raw MS/MS spectra. The μDIA workflow is implemented in the PROTALIZER software tool which fully automates tandem mass spectral deconvolution, queries every peptide with a library-free search algorithm against a user-defined protein database, and confidently identifies multiple peptides in a single tandem mass spectrum. We also benchmarked μDIA against DDA using a 90 min gradient analysis of HeLa and Escherichia coli peptides that were mixed in predefined quantitative ratios, and our results showed μDIA provided 24% more true positives at the same false positive rate.
Facebook
TwitterMass spectrometry (MS)-based proteomics aims to characterize comprehensive proteomes in a fast and reproducible manner. Here, we present an ultra-fast scanning data-independent acquisition (DIA) strategy consisting on 2-Th precursor isolation windows, dissolving the differences between data-dependent and independent methods. This is achieved by pairing a Quadrupole Orbitrap mass spectrometer with the asymmetric track lossless (Astral) analyzer that provides >200 Hz MS/MS scanning speed, high resolving power and sensitivity, as well as low ppm-mass accuracy. Narrowwindow DIA enables profiling of up to 100 full yeast proteomes per day, or ~10,000 human proteins in half-an-hour. Moreover, multi-shot acquisition of fractionated samples allows comprehensive coverage of human proteomes in ~3h, showing comparable depth to next-generation RNA sequencing and with 10x higher throughput compared to current state-of-the-art MS. High quantitative precision and accuracy is demonstrated with high peptide coverage in a 3-species proteome mixture, quantifying 14,000+ proteins in a single run in half-an-hour.
Facebook
TwitterThese data cover all of the analyses described in the paper "Specter: linear deconvolution as a new paradigm for targeted analysis of data-independent acquisition mass spectrometry proteomics". Specifically, the data consist of - 20 DIA and DDA files from the HEK293T/synthetic phosphopeptides spike-in experiments - 10 DDA files, one for each of ten fractions of an E. coli lysate digest - 14 DIA files for the experiments involving mixtures of synthetic peptides - 11 DDA files for the isolated runs of these synthetic peptides - 84 DIA files for measurements of the phosphoproteome of perturbed PC3 cells - 10 DDA files for spectral library construction for the phosphoproteomics data - 3 DIA and 3 DDA files for analysis of an unfractionated E. coli lysate digest. See the spreadsheet "Specter Datasets Catalog.xlsx" for further descriptions and file metadata.
Facebook
TwitterData dependent acquisition (DDA) is the method of choice for mass spectrometry based proteomics discovery experiments, data-independent acquisition (DIA) is steadily becoming more important. One of the most important requirement to perform a DIA analysis is the availability of spectral libraries for the peptide identification and quantification. Several researches were already conducted regarding the creation of spectral libraries from DDA analyses and obtaining identifications with these in DIA measurements. But so far only few experiments were conducted, to estimate the effect of these libraries on the quantitative level. In this work we created a spike-in gold standard dataset with known contents and ratios of proteins in a complex sample matrix. With this dataset, we first created spectral libraries using different sample preparation approaches with and without sample prefractionation on peptide and protein level. Two different search engines were used for protein identification. In total, five different spike-in states were compared with DIA analyses, comparing eight different spectral libraries generated by varying approaches and one library free method, as well as one default DDA analysis. Not only the number of identifications on peptide and protein level in the spectral libraries and the corresponding analyses was inspected, but also the number of expected and identified significant quantifications and their ratios were thoroughly examined. We found, that while libraries of prefractionationed samples are generally larger, the actually yielded identifications are not increased compared to repetitive non-fractionated measurements. Furthermore, we show that the accuracy of the quantifications is also highly dependent on the applied spectra library and also whether the peptide or protein level is analysed. Overall, the reproducibility and accuracy of DIA is superior to DDA in all analysed approaches.
Facebook
TwitterCerebrospinal fluid (CSF) is in direct contact with the brain and serves as a valuable specimen to examine diseases of the central nervous system through analyzing its components. These include the analysis of metabolites, cells as well as proteins. For identifying new suitable diagnostic protein biomarkers bottom-up data-dependent acquisition (DDA) mass spectrometry-based approaches are most popular. Drawbacks of this method are stochastic and irreproducible precursor ion selection. Recently, data-independent acquisition (DIA) emerged as an alternative method. It overcomes several limitations of DDA, since it combines the benefits of DDA and targeted methods like selected reaction monitoring (SRM). We established a DIA method for in-depth proteome analysis of CSF. For this, four spectral libraries were generated with samples from native CSF (n=5) CSF fractionation (15 in total) and substantia nigra fractionation (54 in total) applying to CSF DIA of three replicates. The DDA and DIA methods for CSF were conducted with the same nanoLC parameters using a 180 minute gradient. Compared to a conventional DDA method, our DIA approach both increased the number of identified protein groups with 1574 compared to DDA with 648 over 50 % with a comprehensive spectral library (generated with DDA measurements from five native CSF and 54 substantia nigra fractions) and decreased the coefficient of variation to 6 %, compared to 11 % with a DDA method. We also could show that a sample specific spectral library generated only from native CSF increased the identification reproducibility from three DIA replicates to 90 % (77 % with a DDA method). Moreover, by utilizing a substantia nigra specific spectral library for CSF DIA over 60 brain-originated proteins could be identified compared to only eleven with DDA. In conclusion, the here presented optimized DIA method substantially outperforms DDA and could develop into a powerful tool for biomarker discovery in CSF.
Facebook
TwitterA novel approach for metaproteomics that utilizes data independent acquisition (DIA) mass spectrometry which enables accurate and consistent quantification of peptides from samples with a complex microbial composition.
Facebook
TwitterHair cells undergo postnatal development that leads to formation of their sensory organelles, synaptic machinery, and in the case of cochlear outer hair cells, their electromotile mechanism. To examine the proteome changes over development, we isolated pools of 5000 Pou4f3-Gfp positive or negative cells from the cochlea or utricles; these cell pools were analyzed by data-dependent and data-independent acquisition (DDA and DIA) mass spectrometry. DDA data were used to generate spectral libraries, which enabled identification and accurate quantitation of specific proteins using the DIA datasets. We also isolated and pooled individual inner and outer hair cells from adult cochlea and compared their proteomes to those of developing hair cells. The DDA and DIA datasets will be valuable for accurately quantifying proteins in hair cells and non-hair cells over this developmental window.
Facebook
TwitterThis dataset consists of 44 raw MS files, comprising 27 DIA (SWATH) and 15 DDA runs on a TripleTOF 5600 and of two raw mass spectrometry files acquired on a Q Exactive. The composition of the dataset is described in the manuscript by Tsou et al., titled: "DIA-Umpire: comprehensive computational framework for data independent acquisition proteomics", Nature Methods, in press Raw files are deposited here in ProteomeXchange and are associated with the DIA-Umpire processed data. All DIA-Umpire processed results for each sample together with DDA results are deposited in separated folders. Also see the "DataSampleID.xlsx" associated with this Readme file. Internal reference from the Gingras lab ProHits implementation: Project 94, Export version VS2 (Tsou_DIA-Umpire)
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Analysis of DIA data from 2000 most abundant DDA-identified proteins.
Facebook
TwitterThe Dynamic Organellar Maps (DOMs) approach combines cell fractionation and shotgun-proteomics for global profiling analysis of protein subcellular localization. Here, we have drastically enhanced the performance of DOMs through data-independent acquisition (DIA) mass spectrometry (MS). DIA-DOMs achieve twice the depth of our previous workflow in the same MS runtime, and substantially improve profiling precision and reproducibility. This repository contains all DDA-LFQ datasets used in our work: A reference dataset acquired in single shot on 100 min gradients and three fractionated datasets providing measured libraries for DIA searches on 100 min, 44 min and 21 min gradients.
Facebook
TwitterData-Dependent and Data-Independent Acquisition modes (DDA and DIA, respectively) are both widely used to acquire MS2 spectra in untargeted liquid chromatography tandem mass spectrometry (LC-MS/MS) metabolomics analyses. Despite their wide use, little work has been attempted to systematically compare their MS/MS spectral annotation performance in untargeted settings due to the lack of ground truth and the costs involved in running a large number of acquisitions. Here, we present a systematic in silico comparison of these two acquisition methods in untargeted metabolomics by extending our Virtual Metabolomics Mass Spectrometer (ViMMS) framework with a DIA module. Our results show that the performance of these methods varies with the average number of co-eluting ions as the most important factor. At low numbers, DIA outperforms DDA, but at higher numbers, DDA has an advantage as DIA can no longer deal with the large amount of overlapping ion chromatograms. Results from simulation were further validated on an actual mass spectrometer, demonstrating that using ViMMS we can draw conclusions from simulation that translate well into the real world. The versatility of the Virtual Metabolomics Mass Spectrometer (ViMMS) framework in simulating different parameters of both Data-Dependent and Data-Independent Acquisition (DDA and DIA) modes is a key advantage of this work. Researchers can easily explore and compare the performance of different acquisition methods within the ViMMS framework, without the need for expensive and time-consuming experiments with real experimental data. By identifying the strengths and limitations of each acquisition method, researchers can optimize their choice and obtain more accurate and robust results. Furthermore, the ability to simulate and validate results using the ViMMS framework can save significant time and resources, as it eliminates the need for numerous experiments. This work not only provides valuable insights into the performance of DDA and DIA, but it also opens the door for further advancements in LC-MS/MS data acquisition methods.
Facebook
TwitterRecently, we presented the DirectMS1 method of ultrafast proteome-wide analysis based on minute-long LC gradients and MS1-only mass spectra acquisition. Currently, the method provides the depth of human cell proteome coverage of 2500 proteins at a 1% false discovery rate (FDR) when using 5 min LC gradients and 7.3 min runtime in total. While the standard MS/MS approaches provide 4000–5000 protein identifications within a couple of hours of instrumentation time, we advocate here that the higher number of identified proteins does not always translate into better quantitation quality of the proteome analysis. To further elaborate on this issue, we performed a one-on-one comparison of quantitation results obtained using DirectMS1 with three popular MS/MS-based quantitation methods: label-free (LFQ) and tandem mass tag quantitation (TMT), both based on data-dependent acquisition (DDA) and data-independent acquisition (DIA). For comparison, we performed a series of proteome-wide analyses of well-characterized (ground truth) and biologically relevant samples, including a mix of UPS1 proteins spiked at different concentrations into an Echerichia coli digest used as a background and a set of glioblastoma cell lines. MS1-only data was analyzed using a novel quantitation workflow called DirectMS1Quant developed in this work. The results obtained in this study demonstrated comparable quantitation efficiency of 5 min DirectMS1 with both TMT and DIA methods, yet the latter two utilized a 10–20-fold longer instrumentation time.
Facebook
TwitterTo examine the different mass spectrometry approaches to monitoring kinases after enrichment with desthiobiotinylating probes for activity-based protein profiling (ABPP), two experiments were performed with H1993 lung cancer cells. First, cell lysates were pre-treated with DMSO vehicle, dasatinib, or erlotinib prior to addition of the ATP probe for ABPP to compare the differences in kinase labeling associated with examples of kinase inhibitors that vary in target selectivity. Then, to examine changes in cellular signaling, H1993 cells were treated with vehicle controls, BEX-235 (PI3K inhibitor), or Crizotinib. LC-MS/MS using data dependent acquisition, data-independent acquisition (pSMART), parallel reaction monitoring, and selected reaction monitoring (or multiple reaction monitoring) mass spectrometry were used to detect and relatively quantify the desthiobiotinylated kinase peptides.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Tandem mass spectrometry (MS/MS) is an invaluable experimental tool for providing analytical data supporting the identification of small molecules and peptides in mass-spectrometry-based “omics” experiments. Data-dependent MS/MS (DDA) is a real-time MS/MS-acquisition strategy that is responsive to the signals detected in a given sample. However, in analysis of even moderately complex samples with state-of-the-art instrumentation, the speed of MS/MS acquisition is insufficient to offer comprehensive MS/MS coverage of all detected molecules. Data-independent approaches (DIA) offer greater MS/MS coverage, typically at the expense of selectivity or sensitivity. This report describes data-set-dependent MS/MS (DsDA), a novel integration of MS1-data processing and target prioritization to enable comprehensive MS/MS sampling during the initial MS-level experiment. This approach is guided by the premise that in omics experiments, individual injections are typically made as part of a larger set of samples, and feedback between data processing and data acquisition can allow approximately real-time optimization of MS/MS-acquisition parameters and nearly complete MS/MS-sampling coverage. Using a combination of R, Proteowizard, XCMS, and WRENS software, this concept was implemented on a liquid-chromatograph-coupled quadrupole time-of-flight mass spectrometer. The results illustrate comprehensive MS/MS coverage for a set of complex small-molecule samples and demonstrate a strong improvement on traditional DDA.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Data independent acquisition-mass spectrometry (DIA-MS) coupled with liquid chromatography is a promising approach for rapid, automatic sampling of MS/MS data in untargeted metabolomics. However, wide isolation windows in DIA-MS generate MS/MS spectra containing a mixed population of fragment ions together with their precursor ions. This precursor-fragment ion map in a comprehensive MS/MS spectral library is crucial for relative quantification of fragment ions uniquely representative of each precursor ion. However, existing reference libraries are not sufficient for this purpose since the fragmentation patterns of small molecules can vary in different instrument setups. Here we developed a bioinformatics workflow called MetaboDIA to build customized MS/MS spectral libraries using a user’s own data dependent acquisition (DDA) data and to perform MS/MS-based quantification with DIA data, thus complementing conventional MS1-based quantification. MetaboDIA also allows users to build a spectral library directly from DIA data in studies of a large sample size. Using a marine algae data set, we show that quantification of fragment ions extracted with a customized MS/MS library can provide as reliable quantitative data as the direct quantification of precursor ions based on MS1 data. To test its applicability in complex samples, we applied MetaboDIA to a clinical serum metabolomics data set, where we built a DDA-based spectral library containing consensus spectra for 1829 compounds. We performed fragment ion quantification using DIA data using this library, yielding sensitive differential expression analysis.
Facebook
TwitterThe spike protein of SARS-CoV-2, the virus responsible for the global pandemic of COVID-19, is an abundant, heavily glycosylated surface protein that plays a key role in receptor binding and host cell fusion, and is the focus of all current vaccine development efforts. Variants of concern are now circulating worldwide that exhibit mutations in the spike protein. Protein sequence and glycosylation variations of the spike may affect viral fitness, antigenicity, and immune evasion. Global surveillance of the virus currently involves genome sequencing, but tracking emerging variants should include quantitative measurement of changes in site-specific glycosylation as well. In this work, we used data-dependent acquisition (DDA) and data-independent acquisition (DIA) mass spectrometry to quantitatively characterize the five N-linked glycosylation sites of the glycoprotein standard alpha-1-acid glycoprotein (AGP), as well as the 22 sites of SARS-CoV-2 spike protein. We found that DIA compared favorably to DDA in sensitivity, resulting in more assignments of low abundance glycopeptides. However, the reproducibility across replicates of DIA-identified glycopeptides was lower than that of DDA, possibly due to the difficulty of reliably assigning low abundance glycopeptides confidently. The differences in the data acquired between the two methods suggest that DIA out-performs DDA in terms of glycoprotein coverage but that overall performance is a balance of sensitivity, selectivity, and statistical confidence in glycoproteomics. We assert that these analytical and bioinformatics methods for assigning and quantifying glycoforms would benefit the process of tracking viral variants as well as for vaccine development.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mass spectrometry (MS)-based metaproteomics is used to identify and quantify proteins in microbiome samples, with the frequently used methodology being Data-Dependent Acquisition mass spectrometry (DDA-MS). However, DDA-MS is limited in its ability to reproducibly identify and quantify lower abundant peptides and proteins. To address DDA-MS deficiencies, proteomics researchers have started using Data-Independent Acquisition Mass Spectrometry (DIA-MS) for reproducible detection and quantification of peptides and proteins. We sought to evaluate the reproducibility and accuracy of DIA-MS metaproteomic measurements relative to DDA-MS metaproteomic measurements using a mock community of known taxonomic composition. Artificial microbial communities of known composition were analyzed independently in three laboratories using DDA- and DIA-MS acquisition methods. DIA-MS yielded more protein and peptide identifications than DDA-MS in each laboratory. In addition, the protein and peptide identifications were more reproducible in all laboratories and provided an accurate quantification of proteins and taxonomic groups in the samples. We also identified some limitations of current DIA tools when applied to metaproteomic data highlighting specific needs to further improve DIA tools to enable analysis of metaproteomic datasets from complex microbiomes. Ultimately, DIA-MS represents a promising data collection strategy for MS-based metaproteomics due to its large number of detected proteins and peptides, reproducibility, deep sequencing capabilities, and accurate quantitation.