100+ datasets found
  1. n

    Open Proteomics Database

    • neuinfo.org
    • scicrunch.org
    Updated Jan 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Open Proteomics Database [Dataset]. http://identifiers.org/RRID:SCR_007789
    Explore at:
    Dataset updated
    Jan 21, 2025
    Description

    OPD is a public database for storing and disseminating mass spectrometry based proteomics data. It covers Escherichia coli, Homo sapiens, Saccharomyces cerevisiae, Mycobacterium smegmatis, and Mus musculus. The database currently contains roughly 3,000,000 spectra representing experiments from these 5 different organisms. The mirror url is provided below as the OPD website is no longer functional (http://bioinformatics.icmb.utexas.edu/OPD/).

  2. Data from: Enhancing Open Modification Searches via a Combined Approach...

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Dec 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefan Schulze; Stefan Schulze; Aime Bienfait Igiraneza; Manuel Kösters; Johannes Leufken; Johannes Leufken; Sebastian A. Leidel; Sebastian A. Leidel; Benjamin A. Garcia; Benjamin A. Garcia; Christian Fufezan; Christian Fufezan; Mechthild Pohlschröder; Mechthild Pohlschröder; Aime Bienfait Igiraneza; Manuel Kösters (2020). Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal [Dataset]. http://doi.org/10.5281/zenodo.4299358
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 2, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Stefan Schulze; Stefan Schulze; Aime Bienfait Igiraneza; Manuel Kösters; Johannes Leufken; Johannes Leufken; Sebastian A. Leidel; Sebastian A. Leidel; Benjamin A. Garcia; Benjamin A. Garcia; Christian Fufezan; Christian Fufezan; Mechthild Pohlschröder; Mechthild Pohlschröder; Aime Bienfait Igiraneza; Manuel Kösters
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The identification of peptide sequences and their post-translational modifications (PTMs) is a crucial step in the analysis of bottom-up proteomics data. The recent development of open modification search (OMS) engines allows virtually all PTMs to be searched for. This not only increases the number of spectra that can be matched to peptides but also greatly advances the understanding of biological roles of PTMs through the identification, and thereby facilitated quantification, of peptidoforms (peptide sequences and their potential PTMs). While the benefits of combining results from multiple protein database search engines has been established previously, similar approaches for OMS results are missing so far. Here, we compare and combine results from three different OMS engines, demonstrating an increase in peptide spectrum matches of 8-18%. The unification of search results furthermore allows for the combined downstream processing of search results, including the mapping to potential PTMs. Finally, we test for the ability of OMS engines to identify glycosylated peptides. The implementation of these engines in the Python framework Ursgal facilitates the straightforward application of OMS with unified parameters and results files, thereby enabling yet unmatched high-throughput, large-scale data analysis.

    This dataset includes all relevant results files, databases, and scripts that correspond to the accompanying journal article. Specifically, the following files are deposited:

    • Homo_sapiens_PXD004452_results.zip: result files from OMS and CS for the dataset PXD004452
    • Homo_sapiens_PXD013715_results.zip: result files from OMS and CS for the dataset PXD013715
    • Haloferax_volcanii_PXD021874_results.zip: result files from OMS and CS for the dataset PXD021874
    • Escherichia_coli_PXD000498_results.zip: result files from OMS and CS for the dataset PXD000498
    • databases.zip: target-decoy databases for Homo sapiens, Escherichia coli and Haloferax volcanii as well as a glycan database for Homo sapiens
    • scripts.zip: example scripts for all relevant steps of the analysis
    • mzml_files.zip: mzML files for all included datasets
    • ursgal.zip: current version of Ursgal (0.6.7) that has been used to generate the results (for most recent versions see https://github.com/ursgal/ursgal)
  3. XMAn_Homo_Sapiens_Mutated_Peptide_Database_cancer_fasta

    • figshare.com
    txt
    Updated Jul 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Iulia M. Lazar; Xu Yang (2023). XMAn_Homo_Sapiens_Mutated_Peptide_Database_cancer_fasta [Dataset]. http://doi.org/10.6084/m9.figshare.2825557.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 4, 2023
    Dataset provided by
    figshare
    Authors
    Iulia M. Lazar; Xu Yang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To enable the identification of mutated peptide sequences in complex biological samples, in this work, a cancer protein database with mutation information collected from several public resources such as COSMIC, IARC P53, OMIM and UniProtKB, was developed. In-house developed Perl-scripts were used to search and process the data, and to translate each gene-level mutation into a mutated peptide sequence. The cancer mutation database comprises a total of 872,125 peptide entries from 25,642 protein IDs. A description line for each entry provides the parent protein ID and name, the cDNA- and protein-level mutation site and type, the originating database, and the cancer tissue type and corresponding hits. The database is FASTA formatted to enable data retrieval by commonly used tandem MS search engines.

  4. Picked Protein Group FDR

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip
    Updated Oct 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew The; Matthew The (2022). Picked Protein Group FDR [Dataset]. http://doi.org/10.5281/zenodo.6602949
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Oct 16, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Matthew The; Matthew The
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Accompanying MaxQuant, Percolator and Picked Protein Group FDR files to reproduce results in the publication "Re-analysis of ProteomicsDB using an accurate, sensitive and scalable false discovery rate estimation approach for protein groups". The code for reproducing Protein Group FDRs is available on GitHub at https://github.com/kusterlab/picked_group_fdr

  5. Bioinformatics Services Market By Type (Sequence, Gene Expression), By...

    • verifiedmarketresearch.com
    Updated Oct 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Bioinformatics Services Market By Type (Sequence, Gene Expression), By Application (Genomics, Proteomics, Transcriptomics), By End-User (Biopharmaceutical Companies, Academic & Research Institutes), & Region For 2024-2031 [Dataset]. https://www.verifiedmarketresearch.com/product/bioinformatics-services-market/
    Explore at:
    Dataset updated
    Oct 21, 2024
    Dataset provided by
    Verified Market Researchhttps://www.verifiedmarketresearch.com/
    Authors
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2031
    Area covered
    Global
    Description

    Bioinformatics Services Market size was valued at USD 11.1 Billion in 2023 and is projected to reach USD 3.58 Billion by 2031, growing at a CAGR of 15.06% from 2024-2031.

    Bioinformatics Services Market: Definition/ Overview

    Bioinformatics services cover a wide range of computational tools and methods for managing, analyzing, and interpreting biological data. These services enable the integration of data from domains such as genomics, proteomics, transcriptomics, and metabolomics to provide insights into biological systems. Drug discovery, customized medicine, gene sequencing, and biological data management are some of the most important applications of bioinformatics. Researchers and healthcare professionals use these services to analyze big datasets, detect disease markers, and develop tailored medicines, considerably improving the precision and efficiency of life science research.

  6. o

    Data from: A bioinformatics approach for integrated transcriptomic and...

    • explore.openaire.eu
    • ebi.ac.uk
    Updated Nov 2, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2012). A bioinformatics approach for integrated transcriptomic and proteomic comparative analyses of model and non-sequenced anopheline vectors of human malaria parasites [Dataset]. https://explore.openaire.eu/search/dataset?datasetId=_OmicsDI::2b29a3ba803c6e9119682b9270592f4b
    Explore at:
    Dataset updated
    Nov 2, 2012
    Description

    Malaria morbidity and mortality caused by both Plasmodium falciparum and Plasmodium vivax extend well beyond the African continent, and, although P. vivax causes 80-300 million severe cases each year, vivax transmission remains poorly understood. Plasmodium parasites are transmitted by Anopheles mosquitoes, and the critical site of interaction between parasite and host is at the mosquitos luminal midgut brush border. While the genome of the "model" African P. falciparum vector, Anopheles gambiae, has been sequenced, evolutionary divergence limits its utility as a reference across anophelines, especially non-sequenced P. vivax vectors such as Anopheles albimanus. Clearly, enabling technologies and platforms that bridge this substantial scientific gap are required in order to provide public health scientists key transcriptomic and proteomic information that could spur the development of novel interventions to combat this disease. To our knowledge, no approaches have been published which address this issue. To bolster our understanding of P. vivax-An. albimanus midgut interactions, we developed an integrated bioinformatic-hybrid RNA-Seq-LC-MS/MS approach involving An. albimanus transcriptome (15,764 contigs) and luminal midgut subproteome (9,445 proteins) assembly, which, when used with our custom Diptera protein database (685,078 sequences), facilitated a comparative proteomic analysis of the midgut brush borders of two important malaria vectors, An. gambiae and An. albimanus. Summary from: http://www.mcponline.org/content/early/2012/10/17/mcp.M112.019596.long The An. albimanus transcriptome dataset is available at http://funcgen.vectorbase.org/RNAseq/Anopheles_albimanus/INSP/v2

  7. f

    Functional Annotation of the Human Chromosome 7 “Missing” Proteins: A...

    • figshare.com
    • acs.figshare.com
    xls
    Updated Jun 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shoba Ranganathan; Javed M. Khan; Gagan Garg; Mark S. Baker (2023). Functional Annotation of the Human Chromosome 7 “Missing” Proteins: A Bioinformatics Approach [Dataset]. http://doi.org/10.1021/pr301082p.s004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 7, 2023
    Dataset provided by
    ACS Publications
    Authors
    Shoba Ranganathan; Javed M. Khan; Gagan Garg; Mark S. Baker
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The chromosome-centric human proteome project aims to systematically map all human proteins, chromosome by chromosome, in a gene-centric manner through dedicated efforts from national and international teams. This mapping will lead to a knowledge-based resource defining the full set of proteins encoded in each chromosome and laying the foundation for the development of a standardized approach to analyze the massive proteomic data sets currently being generated. The neXtProt database lists 946 proteins as the human proteome of chromosome 7. However, 170 (18%) proteins of human chromosome 7 have no evidence at the proteomic, antibody, or structural levels and are considered “missing” in this study as they lack experimental support. We have developed a protocol for the functional annotation of these “missing” proteins by integrating several bioinformatics analysis and annotation tools, sequential BLAST homology searches, protein domain/motif and gene ontology (GO) mapping, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Using the BLAST search strategy, homologues for reviewed non-human mammalian proteins with protein evidence were identified for 90 “missing” proteins while another 38 had reviewed non-human mammalian homologues. Putative functional annotations were assigned to 27 of the remaining 43 novel proteins. Proteotypic peptides have been computationally generated to facilitate rapid identification of these proteins. Four of the “missing” chromosome 7 proteins have been substantiated by the ENCODE proteogenomic peptide data.

  8. o

    Screening of serum biomarkers of preeclampsia by proteomics combination with...

    • omicsdi.org
    • ebi.ac.uk
    xml
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    xiaomei li, Screening of serum biomarkers of preeclampsia by proteomics combination with bioinformatics [Dataset]. https://www.omicsdi.org/dataset/pride/PXD013269
    Explore at:
    xmlAvailable download formats
    Authors
    xiaomei li
    Variables measured
    Proteomics
    Description

    Objective: To screen for novel predictive serum markers of preeclampsia (PE). Method: Blood samples were collected from 7 women with PE and 5 with healthy pregnancies. Serum proteins were identified using ITRAQ technology combined with liquid chromatography mass spectrometry analysis. The differential expressed proteins in the PE samples were identified using the SwissProt database, and functionally annotated by gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses Results: We identified 121 differential expressed proteins, of which 76 were up-regulated and 45 were down-regulated, and 14 were differential expressed by more than 2-folds. The top GO terms for Cellular Components (CC) were high-density lipoprotein particles and plasma lipoprotein particles, defense response for Biological Processes (BP), and glycosaminoglycan binding, heparin binding and sulfur compound for Molecular functions (MF). The pathway hsa04979 for Cholesterol metabolism was significantly enriched among the upregulated proteins, while structural domain was enriched in immunoglobulin subtype 2. Conclusion: PE pathogenesis is related to lipid metabolism and inflammation, and proteins related to these pathways are potential early diagnostic markers for PE.

  9. Global Bioinformatics Service Market Report 2025 Edition, Market Size,...

    • cognitivemarketresearch.com
    pdf,excel,csv,ppt
    Updated Dec 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cognitive Market Research (2024). Global Bioinformatics Service Market Report 2025 Edition, Market Size, Share, CAGR, Forecast, Revenue [Dataset]. https://www.cognitivemarketresearch.com/bioinformatics-service-market-report
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Dec 15, 2024
    Dataset provided by
    Decipher Market Research
    Authors
    Cognitive Market Research
    License

    https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy

    Time period covered
    2021 - 2033
    Area covered
    Global
    Description

    According to Cognitive Market Research, the Global Bioinformatics Services Market Size will be USD XX Billion in 2023 and is set to achieve a market size of USD XX Billion by the end of 2031 growing at a CAGR of XX% from 2024 to 2031.

    • The global Bioinformatics services Market will expand significantly by XX% CAGR between 2024 and 2031.

    • Based on technology, Because of the growing number of platform applications and the need for improved tools for drug development, the bioinformatics platforms segment dominated the market.

    • In terms of service type, The sequencing services segment held the largest share and is anticipated to grow over the coming years

    • Based on application, The genomic segment dominated the bioinformatics market

    • Based on End-user, academic institutes and research centers segment hold the largest share.

    • Based on speciality segment, The medical bioinformatics segment holds the large share and is anticipated to expand at a substantial CAGR during the forecast period.

    • The North America region accounted for the highest market share in the Global Bioinformatics Services Market. CURRENT SCENARIO OF THE BIOINFORMATICS SERVICES

    Driving Factors of the Bioinformatics Services Market

    Expansive uses of bioinformatics across multiple sectors is propelling the market's growth.
    

    Several industries, such as the food, bioremediation, agriculture, forensics, and consumer industries, are also using bioinformatics services to improve the quality of their products and supply chain processes. Companies in a variety of sectors are rapidly utilizing bioinformatics services such as data integration, manipulation, lead generation, data management, in silico analysis, and advanced knowledge discovery.

    • Bioinformatics Approaches in Food Sciences

    In order to meet the needs of food production, food processing, enhancing the quality and nutritional content of food sources, and many other areas, bioinformatics plays a significant role in forecasting and evaluating the intended and undesired impacts of microorganisms on food, genomes, and proteomics research. Furthermore, bioinformatics techniques can be applied to produce crops with high yields and resistance to disease, among other desirable qualities. Additionally, there are numerous databases with information about food, including its components, nutritional value, chemistry, and biology.

    Genome Canada is proud to partner with five Institutes where there are five funding pools within this opportunity and Genome Canada is partnering on the Bioinformatics, Computational Biology and Health Data Sciences pool. (Source:https://genomecanada.ca/genome-canada-partners-with-cihr-to-launch-health-research-training-platform-2024-25/)

    • Bioinformatics in agriculture

    Bioinformatics is becoming more and more crucial in the gathering, storing, and processing of genomic data in the field of agricultural genomics, or agri-genomics. Generally referred to as agri-informatics, some of the various applications of bioinformatics tools and methods in agriculture focus on improving plant resistance against biotic and abiotic stressors as well as enhancing the nutritional quality in depleted soils. Beyond these uses, computer software-assisted gene discovery has enabled researchers to create focused strategies for seed quality enhancement, incorporate extra micronutrients into plants for improved human health, and create plants with phytoremediation potential.

    India/UK-based Agri-Genomics startup, Piatrika Biosystems has raised $1.2 Million in a seed round led by Ankur Capital. The company is bringing sustainable seeds and agri chemicals to market faster and cheaper. The investment will be used to build a strong Product Development team, also for more profound research, and to accelerate the productionising and commercialization of MVP. (Source:https://pressroom.icrisat.org/agri-genomics-startup-piatrika-biosystems-raises-12-million-in-seed-funding-led-by-ankur-capital)

    This expansion in the application areas of bioinformatics services is likely to drive the overall market growth. Bioinformatics services such as data integration, manipulation, lead discovery, data management, in silico analysis, and advanced knowledge discovery are increasingly being adopted by companies across various industries.&...

  10. Complete proteomics (LC-MS/MS) dataset of Pseudoalteromonas tunicata...

    • figshare.com
    Updated Mar 17, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sura Ali; Benjamin Jenkins; Andrew Doxey (2020). Complete proteomics (LC-MS/MS) dataset of Pseudoalteromonas tunicata planktonic and biofilm cultures. [Dataset]. http://doi.org/10.6084/m9.figshare.11993505.v1
    Explore at:
    application/x-sqlite3Available download formats
    Dataset updated
    Mar 17, 2020
    Dataset provided by
    figshare
    Authors
    Sura Ali; Benjamin Jenkins; Andrew Doxey
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A complete proteomics dataset of Pseudoalteromonas tunicata D2 liquid cultures grown for 8 h (planktonic), 26 h (biofilm), 42 h (biofilm), and 68 h (biofilm). This Scaffold (.sf3) file contains all MS/MS based peptide and protein identifications for each of the four samples. PEAKS Studio v. 8.5 was used as a search engine with the NCBI P. tunicata D2 proteome (4503 entries, NCBI database, May 7, 2018) as a reference database. Additional details can be found within the .sf3 file.

  11. P

    Proteomics Market By Technology (Spectroscopy, Chromatography,...

    • prophecymarketinsights.com
    pdf
    Updated Apr 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prophecy Market Insights (2024). Proteomics Market By Technology (Spectroscopy, Chromatography, Electrophoresis, Protein Microarrays, X-Ray Crystallography, and Others), By Software & Services (Bioinformatics Software & Services, and Core Proteomics Services), By Application By End-Users, By Region, Trends, Analysis and Forecast Till 2034 [Dataset]. https://www.prophecymarketinsights.com/market_insight/Global-Proteomics-Market-By-Instrumentation-3907
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Apr 2024
    Dataset authored and provided by
    Prophecy Market Insights
    License

    https://www.prophecymarketinsights.com/privacy_policyhttps://www.prophecymarketinsights.com/privacy_policy

    Time period covered
    2024 - 2034
    Area covered
    Global
    Description

    Proteomics market size and share projected to reach USD 380.7 Billion by 2034 from USD 37.0 Billion in 2024 and is expected to grow at CAGR of 29.20% during the forecast period. The proteomics market is segmented based on instrumentation technology, services and software, application, and region.

  12. Additional file 1 of Aqueous humor proteomics analyzed by bioinformatics and...

    • springernature.figshare.com
    xlsx
    Updated Aug 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tan Wang; Huan Chen; Ningning Li; Bao Zhang; Hanyi Min (2024). Additional file 1 of Aqueous humor proteomics analyzed by bioinformatics and machine learning in PDR cases versus controls [Dataset]. http://doi.org/10.6084/m9.figshare.25857520.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Aug 18, 2024
    Dataset provided by
    figshare
    Authors
    Tan Wang; Huan Chen; Ningning Li; Bao Zhang; Hanyi Min
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary Material 1

  13. f

    Table_1_Proteomic and Bioinformatic Studies for the Characterization of...

    • frontiersin.figshare.com
    docx
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Leda Severi; Lorena Losi; Sergio Fonda; Laura Taddia; Gaia Gozzi; Gaetano Marverti; Fulvio Magni; Clizia Chinello; Martina Stella; Jalid Sheouli; Elena I. Braicu; Filippo Genovese; Angela Lauriola; Chiara Marraccini; Alessandra Gualandi; Domenico D'Arca; Stefania Ferrari; Maria P. Costi (2023). Table_1_Proteomic and Bioinformatic Studies for the Characterization of Response to Pemetrexed in Platinum Drug Resistant Ovarian Cancer.DOCX [Dataset]. http://doi.org/10.3389/fphar.2018.00454.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Frontiers
    Authors
    Leda Severi; Lorena Losi; Sergio Fonda; Laura Taddia; Gaia Gozzi; Gaetano Marverti; Fulvio Magni; Clizia Chinello; Martina Stella; Jalid Sheouli; Elena I. Braicu; Filippo Genovese; Angela Lauriola; Chiara Marraccini; Alessandra Gualandi; Domenico D'Arca; Stefania Ferrari; Maria P. Costi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Proteomics and bioinformatics are a useful combined technology for the characterization of protein expression level and modulation associated with the response to a drug and with its mechanism of action. The folate pathway represents an important target in the anticancer drugs therapy. In the present study, a discovery proteomics approach was applied to tissue samples collected from ovarian cancer patients who relapsed after the first-line carboplatin-based chemotherapy and were treated with pemetrexed (PMX), a known folate pathway targeting drug. The aim of the work is to identify the proteomic profile that can be associated to the response to the PMX treatment in pre-treatement tissue. Statistical metrics of the experimental Mass Spectrometry (MS) data were combined with a knowledge-based approach that included bioinformatics and a literature review through ProteinQuest™ tool, to design a protein set of reference (PSR). The PSR provides feedback for the consistency of MS proteomic data because it includes known validated proteins. A panel of 24 proteins with levels that were significantly different in pre-treatment samples of patients who responded to the therapy vs. the non-responder ones, was identified. The differences of the identified proteins were explained for the patients with different outcomes and the known PMX targets were further validated. The protein panel herein identified is ready for further validation in retrospective clinical trials using a targeted proteomic approach. This study may have a general relevant impact on biomarker application for cancer patients therapy selection.

  14. Data on proteins identified by TMT-labeled proteomics in alveolar type II...

    • search.datacite.org
    • figshare.com
    Updated Mar 26, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xue Lu; Feng Xu (2019). Data on proteins identified by TMT-labeled proteomics in alveolar type II epithelial cells [Dataset]. http://doi.org/10.6084/m9.figshare.7892465
    Explore at:
    Dataset updated
    Mar 26, 2019
    Dataset provided by
    Figsharehttp://figshare.com/
    DataCitehttps://www.datacite.org/
    Authors
    Xue Lu; Feng Xu
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The objective of this study was to characterize differentially regulated proteins and biological processes in hydrogen-treated hyperoxic primary type II alveolar epithelial cells (AECIIs) to elucidate the protective mechanism of hydrogen using quantitative proteomics. AECIIs were divided into three groups that were cultured for 24 h in three different conditions: control (21% oxygen), hyperoxia (95% oxygen), and hyperoxia + hydrogen. The TMT labeling quantitative proteome technique was used to detect changes in the protein expression profile, and bioinformatics analysis was performed.

  15. Proteins for Metal Binding Identification

    • figshare.com
    txt
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Frazier N Baker; Nicholas Maltbie; Joseph Hirschfeld; Aleksey Porollo (2023). Proteins for Metal Binding Identification [Dataset]. http://doi.org/10.6084/m9.figshare.8170802.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Authors
    Frazier N Baker; Nicholas Maltbie; Joseph Hirschfeld; Aleksey Porollo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    FASTA files for training, validating, and testing a neural network for coevolution-based metal binding site identification.

  16. Unified Human Gastrointestinal Proteome clustering results by DPCfam

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip, txt +1
    Updated Feb 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federico Barone; Federico Barone; Elena Tea Russo; Elena Tea Russo; Edith Natalia Villegas Garcia; Edith Natalia Villegas Garcia; Marco Punta; Marco Punta; Stefano Cozzini; Stefano Cozzini; Alessio Ansuini; Alessio Ansuini; Alberto Cazzaniga; Alberto Cazzaniga (2024). Unified Human Gastrointestinal Proteome clustering results by DPCfam [Dataset]. http://doi.org/10.5281/zenodo.10611777
    Explore at:
    zip, application/gzip, txtAvailable download formats
    Dataset updated
    Feb 10, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Federico Barone; Federico Barone; Elena Tea Russo; Elena Tea Russo; Edith Natalia Villegas Garcia; Edith Natalia Villegas Garcia; Marco Punta; Marco Punta; Stefano Cozzini; Stefano Cozzini; Alessio Ansuini; Alessio Ansuini; Alberto Cazzaniga; Alberto Cazzaniga
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains the result of clustering the Unified Human Gastrointestinal Proteome (UHGP) using the DPCfam algorithm.

    More details on the DPCfam clustering algorithm can be found in the original publication:

    Russo, Elena Tea, et al. "DPCfam: Unsupervised protein family classification by Density Peak Clustering of large sequence datasets." PLOS Computational Biology 18.10 (2022): e1010610. https://doi.org/10.1371/journal.pcbi.1010610

    All of the putative protein families obtained through DPCfam (including previous results) can be browsed online at our dedicated webserver: https://dpcfam.areasciencepark.it/uhgp

    The original protein dataset is version 1.0 of the UHGP-50 dataset, available for download from MGnify at https://www.ebi.ac.uk/metagenomics/.

    FILES DESCRIPTION:

    Only MCs with seeds with 1) more than 50 elements and 2) average length larger than 50 aminoacids are reported.

    metaclusters_xml.tar.gz:

    • dpcfam_uhgp_metaclusters.xml: Metaclusters' seeds. Metaclusters entries include also some statistical information about each MC (such as size, average length, low complexity fraction, etc.) and Pfam comparison (Dominant Architecture).
    • dpcfam_metaclusters.xsd: XML schema file for the data.
    • MCxml_to_tables.awk: Awk script to convert from XML to tabular text files. Use through the parse.sh script.
    • parse.sh: XML parser.
    • README.md

    uhgp_xml.tar.gz:

    • uhgp_seed_match.xml: XML file containing all of UHGP-50 proteins and its corresponding sequences, annotated with Pfam and DPCfam metacluster data. Annotations comprise the membership of a protein as a seed or matches found though the profile-hmms of the DPCfam-UHGP and the DPCfam-Uniref clusterings.
    • uhgp_matches.xsd: XML schema file for the data.
    • xml_to_list.awk: Awk script to convert from XML to tabular text files. Use through the parse.sh script.
    • xml_to_list_mcfiles.awk: Awk script to convert from XML to tabular text files (including individual files for metaclusters' seeds). Use through the parse.sh script.
    • parse.sh: XML parser.
    • README.md

    Metacluster Files:

    • seeds.zip: Metaclusters' seed sequences. A fasta file for each metacluster before filtering.
    • filtered_seeds.zip: Metaclusters' seed sequences after clustering at 60 percent identity.
    • metaclusters_hmms.tar.gz: Metaclusters' profile-hmms. A ".hmm" file for each metacluser.
    • metaclusters_msas.tar.gz: Metaclusters' multiple sequence alignments, in fasta format.

    uhgp_protein_mapping.txt:

    • Contains a mapping between the identifiers of versions 1.0 and 2.0.2 of UHGP. The first column corresponds to the ID in UHGP-50 1.0 (representatives for the clustering at 50% protein identity), the second column to the ID in version 2.0.2 and the third column to the ID of the representative of the protein for clustering at 100% sequence identity, for which the protein sequence can be found in UHGP-100.
  17. Data from: The Archaeal Proteome Project advances knowledge about archaeal...

    • zenodo.org
    bin, csv, zip
    Updated Jun 30, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefan Schulze; Stefan Schulze (2021). The Archaeal Proteome Project advances knowledge about archaeal cell biology through comprehensive proteomics [Dataset]. http://doi.org/10.5281/zenodo.5044621
    Explore at:
    csv, zip, binAvailable download formats
    Dataset updated
    Jun 30, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Stefan Schulze; Stefan Schulze
    Description

    Modern proteomics approaches can explore whole proteomes within a single mass spectrometry (MS) run. However, the enormous amount of MS data generated often remains incompletely analyzed due to a lack of sophisticated bioinformatic tools and expertise needed from a diverse array of fields. In particular, in the field of microbiology, efforts to combine large-scale proteomic datasets have so far largely been missing. Thus, despite their relatively small genomes, the proteomes of most archaea remain incompletely characterized. This in turn undermines our ability to gain a greater understanding of archaeal cell biology.

    Therefore, we have initiated the Archaeal Proteome Project (ArcPP), a community effort that works towards a comprehensive analysis of archaeal proteomes. Starting with the model archaeon Haloferax volcanii, using state-of-the-art bioinformatic tools, we have:

    • reanalyzed more than 26 Mio. spectra
    • optimized the analysis using parameter sweeps, multiple search engines implemented in Ursgal, and the combination of results through the combined PEP approach
    • thoroughly controlled false discovery rates for high confidence protein identifications using the picked protein FDR approach and limiting FDR to 0.5%
    • identified more than 45k peptides, corresponding to 3069 proteins (>75% of the proteome) with a median sequence coverage of 55%.
    • analyzed N-terminal protein processing, including N-terminal acetylation and signal peptide cleavage
    • performed a detailed glycoproteomic analysis, identifying >230 glycopeptides corresponding to 45 glycoproteins

    Benefiting from the established bioinformatic infrastructure, we will follow up on this analysis focusing on H. volcanii proteogenomics as well as the characterization of additional post-translational modifications. Furthermore, ArcPP will integrate quantitative results obtained from the individual datasets in order to identify common regulatory mechanisms. These studies on the H. volcanii proteome can serve as a blueprint for comprehensive proteomic analyses performed on a diverse range of archaea and bacteria.

    For further details, please refer to the following publications. Please also cite this work if you use these results for further analyses:

    Schulze, S., Adams, Z., Cerletti, M. et al. The Archaeal Proteome Project advances knowledge about archaeal cell biology through comprehensive proteomics. Nat Commun 11, 3145 (2020). https://doi.org/10.1038/s41467-020-16784-7

    Schulze, S.; Pfeiffer, F.; Garcia, B.A.; Pohlschroder, M. (2021). Comprehensive glycoproteomics shines new light on the complexity and extent of glycosylation in archaea. PLOS Biol. https://doi.org/10.1371/journal.pbio.3001277

    An interactive website to explore the combined results can be found at https://archaealproteomeproject.org/

    Scripts and metadata used for the analysis can be found at https://github.com/arcpp/ArcPP

    Updates version 1.3.0:

    - includes dataset PXD021827

    Updates version 1.2.0:

    - Includes dataset PXD021874
    - Includes results from a comprehensive glycoproteomic analysis of ArcPP datasets

    Updates version 1.1.0:
    - Natrialba magadii results are included in PXD009116.zip

  18. D

    Bioinformatics Software Market Research Report 2032

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Bioinformatics Software Market Research Report 2032 [Dataset]. https://dataintelo.com/report/bioinformatics-software-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Jan 7, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Bioinformatics Software Market Outlook



    The global bioinformatics software market size was valued at approximately USD 10 billion in 2023, and it is projected to reach around USD 25 billion by 2032, growing at a robust CAGR of 11% during the forecast period. This remarkable growth is fueled by the increased application of bioinformatics in drug discovery and development, the rising demand for personalized medicine, and the ongoing advancements in sequencing technologies. The convergence of biology and information technology has led to the optimization of biological data management, propelling the market's expansion as it transforms the landscape of biotechnology and pharmaceutical research. The rapid integration of artificial intelligence and machine learning techniques to process complex biological data further accentuates the growth trajectory of this market.



    An essential growth factor for the bioinformatics software market is the burgeoning demand for sequencing technologies. The decreasing cost of sequencing has led to a massive increase in the volume of genomic data generated, necessitating advanced software solutions to manage and interpret this data efficiently. This demand is particularly evident in genomics and proteomics, where bioinformatics software plays a critical role in analyzing and visualizing large datasets. Additionally, the adoption of cloud computing in bioinformatics offers scalable resources and cost-effective solutions for data storage and processing, further fueling market growth. The increasing collaboration between research institutions and software companies to develop innovative bioinformatics tools is also contributing positively to market expansion.



    Another significant driver is the growth of personalized medicine, which relies heavily on bioinformatics for the analysis of individual genetic information to tailor therapeutic strategies. As healthcare systems worldwide move towards precision medicine, the demand for bioinformatics software that can integrate genetic, phenotypic, and environmental data becomes more pronounced. This trend is not only transforming patient care but also significantly impacting drug development processes, as pharmaceutical companies aim to create more effective and targeted therapies. The strategic partnerships and collaborations between biotech firms and bioinformatics software providers are critical in advancing personalized medicine and enhancing patient outcomes.



    The increasing prevalence of complex diseases such as cancer and neurological disorders necessitates comprehensive research efforts, driving the need for robust bioinformatics software. These diseases require multi-omics approaches for better understanding, diagnosis, and treatment, where bioinformatics tools are indispensable. The ongoing research and development activities in this area, supported by government funding and private investments, are fostering innovation in bioinformatics solutions. Furthermore, the development of user-friendly and intuitive software interfaces is expanding the market beyond specialized research labs to include clinical settings and hospitals, broadening the potential user base and enhancing market penetration.



    From a regional perspective, North America currently leads the bioinformatics software market, thanks to its advanced technological infrastructure, significant investment in healthcare R&D, and the presence of numerous key market players. The region accounted for the largest market share in 2023 and is expected to maintain its dominance throughout the forecast period. Meanwhile, the Asia Pacific region is anticipated to exhibit the highest CAGR, driven by increasing investments in biotechnology and pharmaceutical research, expanding healthcare infrastructure, and the rising adoption of bioinformatics in emerging economies like China and India. Europe's market growth is also significant, supported by substantial funding for genomic research and a strong focus on precision medicine initiatives.



    Lifesciences Data Mining and Visualization are becoming increasingly vital in the bioinformatics software market. As the volume of biological data continues to grow exponentially, the need for sophisticated tools to mine and visualize this data is paramount. These tools enable researchers to uncover hidden patterns and insights from complex datasets, facilitating breakthroughs in genomics, proteomics, and other life sciences fields. The integration of advanced data mining techniques with visualization capabilities allows for a more intuitive

  19. Data S4 - Common and unique gene–tissue associations to all the sets

    • search.datacite.org
    • figshare.com
    Updated Jan 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alberto Santos (2016). Data S4 - Common and unique gene–tissue associations to all the sets [Dataset]. http://doi.org/10.6084/m9.figshare.1405679.v1
    Explore at:
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    Figsharehttp://figshare.com/
    DataCitehttps://www.datacite.org/
    Authors
    Alberto Santos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file contains: - Overlap between all the sets (transcriptomic set, UniProtKB, Text-mining and proteomics set) - Overlap between the transcriptomic and the proteomic set - The list of gene–tissue associations unique to each set

  20. Bioinformatics Services Market Size & Share Analysis - Industry Research...

    • mordorintelligence.com
    pdf,excel,csv,ppt
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mordor Intelligence, Bioinformatics Services Market Size & Share Analysis - Industry Research Report - Growth Trends [Dataset]. https://www.mordorintelligence.com/industry-reports/bioinformatics-services-market
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset authored and provided by
    Mordor Intelligence
    License

    https://www.mordorintelligence.com/privacy-policyhttps://www.mordorintelligence.com/privacy-policy

    Time period covered
    2019 - 2030
    Area covered
    Global
    Description

    The Bioinformatics Services Market Report is Segmented by Service Type (Data Analysis, Database Management, Sequencing, and Others), by Application (Drug Design, Genomics & Proteomics, Metabolomics, Transcriptomics, and Others), by End-User (Pharmaceutical & Biotechnology Companies, Contract Research Organization, Academic Institutes & Research Centers, and Others), and Geography (North America, Europe, Asia-Pacific, Middle East and Africa, and South America). The Report Offers the Value (in USD) for the Above Segments.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2025). Open Proteomics Database [Dataset]. http://identifiers.org/RRID:SCR_007789

Open Proteomics Database

RRID:SCR_007789, nif-0000-03220, Open Proteomics Database (RRID:SCR_007789), OPD

Explore at:
95 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jan 21, 2025
Description

OPD is a public database for storing and disseminating mass spectrometry based proteomics data. It covers Escherichia coli, Homo sapiens, Saccharomyces cerevisiae, Mycobacterium smegmatis, and Mus musculus. The database currently contains roughly 3,000,000 spectra representing experiments from these 5 different organisms. The mirror url is provided below as the OPD website is no longer functional (http://bioinformatics.icmb.utexas.edu/OPD/).

Search
Clear search
Close search
Google apps
Main menu