55 datasets found
  1. e

    Proteome UP000001816 - (Caulobacter crescentus) SWISS-MODEL dataset

    • swissmodel.expasy.org
    gz
    Updated Jul 2, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Proteome UP000001816 - (Caulobacter crescentus) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
    Explore at:
    gzAvailable download formats
    Dataset updated
    Jul 2, 2016
    Description

    SWISS-MODEL homology models mapping to UniProtKB Proteome UP000001816 (Caulobacter crescentus)

  2. e

    PROSITE profiles

    • ebi.ac.uk
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). PROSITE profiles [Dataset]. https://www.ebi.ac.uk/interpro/
    Explore at:
    Dataset updated
    Feb 5, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PROSITE is a database of protein families and domains. It consists of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family a new sequence belongs. PROSITE is based at the Swiss Institute of Bioinformatics (SIB), Geneva, Switzerland.

  3. r

    SWISS-2DPAGE

    • rrid.site
    • scicrunch.org
    • +2more
    Updated Feb 1, 2001
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2001). SWISS-2DPAGE [Dataset]. http://identifiers.org/RRID:SCR_006946
    Explore at:
    Dataset updated
    Feb 1, 2001
    Description

    A database of proteins identified by various 2-D PAGE and SDS-PAGE reference maps. Each SWISS-2DPAGE entry contains textual data on one protein, including mapping procedures, physiological and pathological information, experimental data (isoelectric point, molecular weight, amino acid composition, peptide masses) and bibliographical references. In addition to this textual data, SWISS-2DPAGE provides several 2-D PAGE and SDS-PAGE images showing the experimentally determined location of the protein, as well as a theoretical region computed from the sequence protein, indicating where the protein might be found in the gel. Using the database, users can locate these proteins on the 2-D PAGE maps or display the region of a 2-D PAGE map where one might expect to find a protein from UniProtKB/Swiss-Prot.

  4. e

    Proteome UP000001584 - (Mycobacterium tuberculosis) SWISS-MODEL dataset

    • swissmodel.expasy.org
    gz
    Updated Jul 2, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Proteome UP000001584 - (Mycobacterium tuberculosis) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
    Explore at:
    gzAvailable download formats
    Dataset updated
    Jul 2, 2016
    Description

    SWISS-MODEL homology models mapping to UniProtKB Proteome UP000001584 (Mycobacterium tuberculosis)

  5. e

    Proteome UP000005640 - (Homo sapiens) SWISS-MODEL dataset

    • swissmodel.expasy.org
    gz
    Updated Jul 2, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Proteome UP000005640 - (Homo sapiens) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
    Explore at:
    gzAvailable download formats
    Dataset updated
    Jul 2, 2016
    Description

    SWISS-MODEL homology models mapping to UniProtKB Proteome UP000005640 (Homo sapiens)

  6. n

    RESID

    • neuinfo.org
    • rrid.site
    • +2more
    Updated Jan 5, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2026). RESID [Dataset]. http://identifiers.org/RRID:SCR_003505/resolver/mentions
    Explore at:
    Dataset updated
    Jan 5, 2026
    Description

    A comprehensive collection of annotations and structures for protein modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link post-translational modifications. It provides: systematic and alternate names, atomic formulas and masses, enzyme activities generating the modifications, keywords, literature citations, Gene Ontology cross-references, Protein Information Resource (PIR) and SWISS-PROT protein sequence database feature table annotations, structure diagrams and molecular models. Each RESID Database entry presents a chemically unique modification and shows how that modification is currently annotated in the protein sequence databases, Swiss-Prot and the Protein Information Resource (PIR). The RESID Database provides a table of corresponding equivalent feature annotations that is used in the UniProt project, an international effort to combine the resources of the Swiss-Prot, TrEMBL and PIR. As an annotation tool, the RESID Database is used in standardizing and enhancing modification descriptions in the feature tables of Swiss-Prot entries.

  7. Additional file 2: of BASILIScan: a tool for high-throughput analysis of...

    • springernature.figshare.com
    txt
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michal Barski (2023). Additional file 2: of BASILIScan: a tool for high-throughput analysis of intrinsic disorder patterns in homologous proteins [Dataset]. http://doi.org/10.6084/m9.figshare.7453838.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Michal Barski
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BASILIScan search with human CDC7 kinase (Uniprot ID: O00311) against all vertebrate sequences available from UniprotKB (both Swissprot and TrEMBL). (CSV 263 kb)

  8. e

    SFLD

    • ebi.ac.uk
    Updated Sep 7, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). SFLD [Dataset]. https://www.ebi.ac.uk/interpro/
    Explore at:
    Dataset updated
    Sep 7, 2018
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SFLD (Structure-Function Linkage Database) is a hierarchical classification of enzymes that relates specific sequence-structure features to specific chemical capabilities.

  9. e

    NCBIFAM

    • ebi.ac.uk
    Updated Aug 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). NCBIFAM [Dataset]. https://www.ebi.ac.uk/interpro/
    Explore at:
    Dataset updated
    Aug 6, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    NCBIfam is a collection of protein families, featuring curated multiple sequence alignments, hidden Markov models (HMMs) and annotation, which provides a tool for identifying functionally related proteins based on sequence homology. NCBIfam is maintained at the National Center for Biotechnology Information (Bethesda, MD). NCBIfam includes models from TIGRFAMs, another database of protein families developed at The Institute for Genomic Research, then at the J. Craig Venter Institute (Rockville, MD, US).

  10. f

    Protein structure quality information as given by the SWISS-MODEL structure...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Sep 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Das, Padmashree; Alexiou, Athanasios; Castrosanto, Melvin A.; Kamal, Mohammad Amjad; Aljarba, Nada H.; Maitra, Swastika; Manuben, John Julius P.; Ghosh, Arabinda; Hasan, Mohammad Mehedi; Dey, Abhijit; Mukerjee, Nobendu; Ramos, Ana Rose; Alkahtani, Saad; Malik, Sumira (2022). Protein structure quality information as given by the SWISS-MODEL structure assessment tool. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000399341
    Explore at:
    Dataset updated
    Sep 13, 2022
    Authors
    Das, Padmashree; Alexiou, Athanasios; Castrosanto, Melvin A.; Kamal, Mohammad Amjad; Aljarba, Nada H.; Maitra, Swastika; Manuben, John Julius P.; Ghosh, Arabinda; Hasan, Mohammad Mehedi; Dey, Abhijit; Mukerjee, Nobendu; Ramos, Ana Rose; Alkahtani, Saad; Malik, Sumira
    Description

    Protein structure quality information as given by the SWISS-MODEL structure assessment tool.

  11. d

    IPI

    • dknet.org
    • rrid.site
    • +2more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). IPI [Dataset]. http://identifiers.org/RRID:SCR_003012
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    IPI provides a top level guide to the main databases (UniProtKB/Swiss-Prot, UniProtKB/TrEMBL, RefSeq, Ensembl, TAIR, H-InvDB, Vega) that describe the proteomes of higher eukaryotic organisms. IPI: :1. effectively maintains a database of cross references between the primary data sources :2. provides minimally redundant yet maximally complete sets of proteins for featured species (one sequence per transcript) :3. maintains stable identifiers (with incremental versioning) to allow the tracking of sequences in IPI between IPI releases. IPI is updated monthly in accordance with the latest data released by the primary data sources. As previously announced, the closure of IPI has been proposed for some time. Replacement data sets are now available through UniProt for human and mouse; sets for the other species contained within IPI are expected to be included as part of the UniProt release 2011_07. To allow users time to transition to using the new UniProt data sets, IPI releases will continue to be produced throughout the summer. The final release will be made in September 2011. Thereafter, the IPI website will cease to be maintained, although previous releases of the dataset will continue to be available from the FTP site. We would like to thank our users for their support and interest in this service.

  12. e

    HAMAP

    • ebi.ac.uk
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). HAMAP [Dataset]. https://www.ebi.ac.uk/interpro/
    Explore at:
    Dataset updated
    Feb 5, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    HAMAP stands for High-quality Automated and Manual Annotation of Proteins. HAMAP profiles are manually created by expert curators. They identify proteins that are part of well-conserved protein families or subfamilies. HAMAP is based at the SIB Swiss Institute of Bioinformatics, Geneva, Switzerland.

  13. Similarity sets created for every amino acid pair to complement...

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg (2023). Similarity sets created for every amino acid pair to complement semi-conserved amino acid positions with other highly similar amino acids. [Dataset]. http://doi.org/10.1371/journal.pone.0182216.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Similarity sets created for every amino acid pair to complement semi-conserved amino acid positions with other highly similar amino acids.

  14. f

    DataSheet1_Prostruc: an open-source tool for 3D structure prediction using...

    • frontiersin.figshare.com
    pdf
    Updated Nov 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shivani V. Pawar; Wilson Sena Kwaku Banini; Musa Muhammad Shamsuddeen; Toheeb A. Jumah; Nigel N. O. Dolling; Abdulwasiu Tiamiyu; Olaitan I. Awe (2024). DataSheet1_Prostruc: an open-source tool for 3D structure prediction using homology modeling.PDF [Dataset]. http://doi.org/10.3389/fchem.2024.1509407.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Nov 29, 2024
    Dataset provided by
    Frontiers
    Authors
    Shivani V. Pawar; Wilson Sena Kwaku Banini; Musa Muhammad Shamsuddeen; Toheeb A. Jumah; Nigel N. O. Dolling; Abdulwasiu Tiamiyu; Olaitan I. Awe
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionHomology modeling is a widely used computational technique for predicting the three-dimensional (3D) structures of proteins based on known templates,evolutionary relationships to provide structural insights critical for understanding protein function, interactions, and potential therapeutic targets. However, existing tools often require significant expertise and computational resources, presenting a barrier for many researchers.MethodsProstruc is a Python-based homology modeling tool designed to simplify protein structure prediction through an intuitive, automated pipeline. Integrating Biopython for sequence alignment, BLAST for template identification, and ProMod3 for structure generation, Prostruc streamlines complex workflows into a user-friendly interface. The tool enables researchers to input protein sequences, identify homologous templates from databases such as the Protein Data Bank (PDB), and generate high-quality 3D structures with minimal computational expertise. Prostruc implements a two-stage vSquarealidation process: first, it uses TM-align for structural comparison, assessing Root Mean Deviations (RMSD) and TM scores against reference models. Second, it evaluates model quality via QMEANDisCo to ensure high accuracy.ResultsThe top five models are selected based on these metrics and provided to the user. Prostruc stands out by offering scalability, flexibility, and ease of use. It is accessible via a cloud-based web interface or as a Python package for local use, ensuring adaptability across research environments. Benchmarking against existing tools like SWISS-MODEL,I-TASSER and Phyre2 demonstrates Prostruc's competitive performance in terms of structural accuracy and job runtime, while its open-source nature encourages community-driven innovation.DiscussionProstruc is positioned as a significant advancement in homology modeling, making high-quality protein structure prediction more accessible to the scientific community.

  15. Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg (2023). Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release 2014_10. [Dataset]. http://doi.org/10.1371/journal.pone.0182216.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release 2014_10.

  16. r

    Classifying protein kinase conformations with machine learning: data

    • resodate.org
    • zenodo.org
    Updated Jul 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ivan REVEGUK (2023). Classifying protein kinase conformations with machine learning: data [Dataset]. https://resodate.org/resources/aHR0cHM6Ly96ZW5vZG8ub3JnL3JlY29yZHMvODE3NTM3MA==
    Explore at:
    Dataset updated
    Jul 23, 2023
    Dataset provided by
    Zenodo
    Authors
    Ivan REVEGUK
    Description

    This data collection accompanies the manuscript Classifying protein kinase conformations with machine learning.

    It is created using thekinactivev0.1tool written in pure Python v3.10. Note that the data areprovided for the reference and reproducibility purposes and will not be compatible with later versions ofkinactive built uponlXtractor 0.1.1. Refer to thekinactive documentationfor instructions on how to obtain an actualized version of the structural kinome collection.

    File descriptions:

    db_v3.tar.gz -- a structural kinome collection archive. One can unpack it and inspect the contents orload it into the Python interpreter using `kinactive` or `lXtractor` tools.
    db_af2.tar.gz -- an AlphaFold2 kinome collection for Swiss-Prot sequences.
    default_*_vs.tsv -- structure/sequence variables calculated with lXtractor and used in an interpretable ML pipeline.
    *_features.tsv -- lists of ranked features selected by the eBoruta tool for each classifier.
    Supplement_labels.tsv -- ML model predictions for each PK domain structure found in db_v3.
    predictions_af2.csv -- Active/Inactive and DFG labels predicted for domains in db_af2.
    
  17. e

    SMART

    • ebi.ac.uk
    Updated Feb 14, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). SMART [Dataset]. https://www.ebi.ac.uk/interpro/
    Explore at:
    Dataset updated
    Feb 14, 2020
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SMART (a Simple Modular Architecture Research Tool) allows the identification and annotation of genetically mobile domains and the analysis of domain architectures. SMART is based at EMBL, Heidelberg, Germany.

  18. e

    SUPERFAMILY

    • ebi.ac.uk
    Updated Nov 8, 2010
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2010). SUPERFAMILY [Dataset]. https://www.ebi.ac.uk/interpro/
    Explore at:
    Dataset updated
    Nov 8, 2010
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SUPERFAMILY is a library of profile hidden Markov models that represent all proteins of known structure. The library is based on the SCOP classification of proteins: each model corresponds to a SCOP domain and aims to represent the entire SCOP superfamily that the domain belongs to. SUPERFAMILY is based at the University of Bristol, UK.

  19. Additional file 3: of BASILIScan: a tool for high-throughput analysis of...

    • springernature.figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michal Barski (2023). Additional file 3: of BASILIScan: a tool for high-throughput analysis of intrinsic disorder patterns in homologous proteins [Dataset]. http://doi.org/10.6084/m9.figshare.7453847.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Michal Barski
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data used to generate Fig. 4. Summary of the BASILIScan search with all human protein sequences available from UniprotKB/Swissprot against the UniprotKB/Swissprot repository. (CSV 1418 kb)

  20. d

    neXtProt

    • dknet.org
    • rrid.site
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    neXtProt [Dataset]. http://identifiers.org/RRID:SCR_008911/resolver
    Explore at:
    Description

    THIS RESOURCE IS NO LONGER IN SERVICE. Documented on April 15,2025. Human protein knowledge platform. Knowledge platform for human proteins selects and filters high throughput data pertinent to human proteins from UniProtKB. Extends UniProtKB/Swiss-Prot annotations for human proteins to include several new data types.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2016). Proteome UP000001816 - (Caulobacter crescentus) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository

Proteome UP000001816 - (Caulobacter crescentus) SWISS-MODEL dataset

Explore at:
gzAvailable download formats
Dataset updated
Jul 2, 2016
Description

SWISS-MODEL homology models mapping to UniProtKB Proteome UP000001816 (Caulobacter crescentus)

Search
Clear search
Close search
Google apps
Main menu