55 datasets found

e
Proteome UP000001816 - (Caulobacter crescentus) SWISS-MODEL dataset
swissmodel.expasy.org
gz
Updated Jul 2, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Proteome UP000001816 - (Caulobacter crescentus) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
Explore at:
gzAvailable download formats
Dataset updated
Jul 2, 2016
Description
SWISS-MODEL homology models mapping to UniProtKB Proteome UP000001816 (Caulobacter crescentus)
e
PROSITE profiles
ebi.ac.uk
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). PROSITE profiles [Dataset]. https://www.ebi.ac.uk/interpro/
Explore at:
Dataset updated
Feb 5, 2025
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
PROSITE is a database of protein families and domains. It consists of biologically significant sites, patterns and profiles that help to reliably identify to which known protein family a new sequence belongs. PROSITE is based at the Swiss Institute of Bioinformatics (SIB), Geneva, Switzerland.
r
SWISS-2DPAGE
rrid.site
scicrunch.org
+2more
Updated Feb 1, 2001
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2001). SWISS-2DPAGE [Dataset]. http://identifiers.org/RRID:SCR_006946
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_006946
Dataset updated
Feb 1, 2001
Description
A database of proteins identified by various 2-D PAGE and SDS-PAGE reference maps. Each SWISS-2DPAGE entry contains textual data on one protein, including mapping procedures, physiological and pathological information, experimental data (isoelectric point, molecular weight, amino acid composition, peptide masses) and bibliographical references. In addition to this textual data, SWISS-2DPAGE provides several 2-D PAGE and SDS-PAGE images showing the experimentally determined location of the protein, as well as a theoretical region computed from the sequence protein, indicating where the protein might be found in the gel. Using the database, users can locate these proteins on the 2-D PAGE maps or display the region of a 2-D PAGE map where one might expect to find a protein from UniProtKB/Swiss-Prot.
e
Proteome UP000001584 - (Mycobacterium tuberculosis) SWISS-MODEL dataset
swissmodel.expasy.org
gz
Updated Jul 2, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Proteome UP000001584 - (Mycobacterium tuberculosis) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
Explore at:
gzAvailable download formats
Dataset updated
Jul 2, 2016
Description
SWISS-MODEL homology models mapping to UniProtKB Proteome UP000001584 (Mycobacterium tuberculosis)
e
Proteome UP000005640 - (Homo sapiens) SWISS-MODEL dataset
swissmodel.expasy.org
gz
Updated Jul 2, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Proteome UP000005640 - (Homo sapiens) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
Explore at:
gzAvailable download formats
Dataset updated
Jul 2, 2016
Description
SWISS-MODEL homology models mapping to UniProtKB Proteome UP000005640 (Homo sapiens)
n
RESID
neuinfo.org
rrid.site
+2more
Updated Jan 5, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2026). RESID [Dataset]. http://identifiers.org/RRID:SCR_003505/resolver/mentions
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_003505 https://identifiers.org/RRID:SCR_003505/resolver/mentions
Dataset updated
Jan 5, 2026
Description
A comprehensive collection of annotations and structures for protein modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link post-translational modifications. It provides: systematic and alternate names, atomic formulas and masses, enzyme activities generating the modifications, keywords, literature citations, Gene Ontology cross-references, Protein Information Resource (PIR) and SWISS-PROT protein sequence database feature table annotations, structure diagrams and molecular models. Each RESID Database entry presents a chemically unique modification and shows how that modification is currently annotated in the protein sequence databases, Swiss-Prot and the Protein Information Resource (PIR). The RESID Database provides a table of corresponding equivalent feature annotations that is used in the UniProt project, an international effort to combine the resources of the Swiss-Prot, TrEMBL and PIR. As an annotation tool, the RESID Database is used in standardizing and enhancing modification descriptions in the feature tables of Swiss-Prot entries.
Additional file 2: of BASILIScan: a tool for high-throughput analysis of...
springernature.figshare.com
txt
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michal Barski (2023). Additional file 2: of BASILIScan: a tool for high-throughput analysis of intrinsic disorder patterns in homologous proteins [Dataset]. http://doi.org/10.6084/m9.figshare.7453838.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7453838.v1
Dataset updated
May 31, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Michal Barski
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BASILIScan search with human CDC7 kinase (Uniprot ID: O00311) against all vertebrate sequences available from UniprotKB (both Swissprot and TrEMBL). (CSV 263 kb)
e
SFLD
ebi.ac.uk
Updated Sep 7, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2018). SFLD [Dataset]. https://www.ebi.ac.uk/interpro/
Explore at:
Dataset updated
Sep 7, 2018
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
SFLD (Structure-Function Linkage Database) is a hierarchical classification of enzymes that relates specific sequence-structure features to specific chemical capabilities.
e
NCBIFAM
ebi.ac.uk
Updated Aug 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). NCBIFAM [Dataset]. https://www.ebi.ac.uk/interpro/
Explore at:
Dataset updated
Aug 6, 2025
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
NCBIfam is a collection of protein families, featuring curated multiple sequence alignments, hidden Markov models (HMMs) and annotation, which provides a tool for identifying functionally related proteins based on sequence homology. NCBIfam is maintained at the National Center for Biotechnology Information (Bethesda, MD). NCBIfam includes models from TIGRFAMs, another database of protein families developed at The Institute for Genomic Research, then at the J. Craig Venter Institute (Rockville, MD, US).
f
Protein structure quality information as given by the SWISS-MODEL structure...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Sep 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Das, Padmashree; Alexiou, Athanasios; Castrosanto, Melvin A.; Kamal, Mohammad Amjad; Aljarba, Nada H.; Maitra, Swastika; Manuben, John Julius P.; Ghosh, Arabinda; Hasan, Mohammad Mehedi; Dey, Abhijit; Mukerjee, Nobendu; Ramos, Ana Rose; Alkahtani, Saad; Malik, Sumira (2022). Protein structure quality information as given by the SWISS-MODEL structure assessment tool. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000399341
Explore at:
Dataset updated
Sep 13, 2022
Authors
Das, Padmashree; Alexiou, Athanasios; Castrosanto, Melvin A.; Kamal, Mohammad Amjad; Aljarba, Nada H.; Maitra, Swastika; Manuben, John Julius P.; Ghosh, Arabinda; Hasan, Mohammad Mehedi; Dey, Abhijit; Mukerjee, Nobendu; Ramos, Ana Rose; Alkahtani, Saad; Malik, Sumira
Description
Protein structure quality information as given by the SWISS-MODEL structure assessment tool.
d
IPI
dknet.org
rrid.site
+2more
Updated Jan 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). IPI [Dataset]. http://identifiers.org/RRID:SCR_003012
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_003012
Dataset updated
Jan 29, 2022
Description
IPI provides a top level guide to the main databases (UniProtKB/Swiss-Prot, UniProtKB/TrEMBL, RefSeq, Ensembl, TAIR, H-InvDB, Vega) that describe the proteomes of higher eukaryotic organisms. IPI: :1. effectively maintains a database of cross references between the primary data sources :2. provides minimally redundant yet maximally complete sets of proteins for featured species (one sequence per transcript) :3. maintains stable identifiers (with incremental versioning) to allow the tracking of sequences in IPI between IPI releases. IPI is updated monthly in accordance with the latest data released by the primary data sources. As previously announced, the closure of IPI has been proposed for some time. Replacement data sets are now available through UniProt for human and mouse; sets for the other species contained within IPI are expected to be included as part of the UniProt release 2011_07. To allow users time to transition to using the new UniProt data sets, IPI releases will continue to be produced throughout the summer. The final release will be made in September 2011. Thereafter, the IPI website will cease to be maintained, although previous releases of the dataset will continue to be available from the FTP site. We would like to thank our users for their support and interest in this service.
e
HAMAP
ebi.ac.uk
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). HAMAP [Dataset]. https://www.ebi.ac.uk/interpro/
Explore at:
Dataset updated
Feb 5, 2025
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
HAMAP stands for High-quality Automated and Manual Annotation of Proteins. HAMAP profiles are manually created by expert curators. They identify proteins that are part of well-conserved protein families or subfamilies. HAMAP is based at the SIB Swiss Institute of Bioinformatics, Geneva, Switzerland.
Similarity sets created for every amino acid pair to complement...
plos.figshare.com
xls
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg (2023). Similarity sets created for every amino acid pair to complement semi-conserved amino acid positions with other highly similar amino acids. [Dataset]. http://doi.org/10.1371/journal.pone.0182216.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0182216.t002
Dataset updated
Jun 3, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Similarity sets created for every amino acid pair to complement semi-conserved amino acid positions with other highly similar amino acids.
f
DataSheet1_Prostruc: an open-source tool for 3D structure prediction using...
frontiersin.figshare.com
pdf
Updated Nov 29, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shivani V. Pawar; Wilson Sena Kwaku Banini; Musa Muhammad Shamsuddeen; Toheeb A. Jumah; Nigel N. O. Dolling; Abdulwasiu Tiamiyu; Olaitan I. Awe (2024). DataSheet1_Prostruc: an open-source tool for 3D structure prediction using homology modeling.PDF [Dataset]. http://doi.org/10.3389/fchem.2024.1509407.s001
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fchem.2024.1509407.s001
Dataset updated
Nov 29, 2024
Dataset provided by
Frontiers
Authors
Shivani V. Pawar; Wilson Sena Kwaku Banini; Musa Muhammad Shamsuddeen; Toheeb A. Jumah; Nigel N. O. Dolling; Abdulwasiu Tiamiyu; Olaitan I. Awe
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
IntroductionHomology modeling is a widely used computational technique for predicting the three-dimensional (3D) structures of proteins based on known templates,evolutionary relationships to provide structural insights critical for understanding protein function, interactions, and potential therapeutic targets. However, existing tools often require significant expertise and computational resources, presenting a barrier for many researchers.MethodsProstruc is a Python-based homology modeling tool designed to simplify protein structure prediction through an intuitive, automated pipeline. Integrating Biopython for sequence alignment, BLAST for template identification, and ProMod3 for structure generation, Prostruc streamlines complex workflows into a user-friendly interface. The tool enables researchers to input protein sequences, identify homologous templates from databases such as the Protein Data Bank (PDB), and generate high-quality 3D structures with minimal computational expertise. Prostruc implements a two-stage vSquarealidation process: first, it uses TM-align for structural comparison, assessing Root Mean Deviations (RMSD) and TM scores against reference models. Second, it evaluates model quality via QMEANDisCo to ensure high accuracy.ResultsThe top five models are selected based on these metrics and provided to the user. Prostruc stands out by offering scalability, flexibility, and ease of use. It is accessible via a cloud-based web interface or as a Python package for local use, ensuring adaptability across research environments. Benchmarking against existing tools like SWISS-MODEL,I-TASSER and Phyre2 demonstrates Prostruc's competitive performance in terms of structural accuracy and job runtime, while its open-source nature encourages community-driven innovation.DiscussionProstruc is positioned as a significant advancement in homology modeling, making high-quality protein structure prediction more accessible to the scientific community.
Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release...
plos.figshare.com
xls
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg (2023). Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release 2014_10. [Dataset]. http://doi.org/10.1371/journal.pone.0182216.t004
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0182216.t004
Dataset updated
May 30, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Christian-Alexander Dudek; Henning Dannheim; Dietmar Schomburg
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release 2014_10.
r
Classifying protein kinase conformations with machine learning: data
resodate.org
zenodo.org
Updated Jul 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ivan REVEGUK (2023). Classifying protein kinase conformations with machine learning: data [Dataset]. https://resodate.org/resources/aHR0cHM6Ly96ZW5vZG8ub3JnL3JlY29yZHMvODE3NTM3MA==
Explore at:
Dataset updated
Jul 23, 2023
Dataset provided by
Zenodo
Authors
Ivan REVEGUK
Description
This data collection accompanies the manuscript Classifying protein kinase conformations with machine learning.

It is created using thekinactivev0.1tool written in pure Python v3.10. Note that the data areprovided for the reference and reproducibility purposes and will not be compatible with later versions ofkinactive built uponlXtractor 0.1.1. Refer to thekinactive documentationfor instructions on how to obtain an actualized version of the structural kinome collection.

File descriptions:

db_v3.tar.gz -- a structural kinome collection archive. One can unpack it and inspect the contents orload it into the Python interpreter using `kinactive` or `lXtractor` tools. db_af2.tar.gz -- an AlphaFold2 kinome collection for Swiss-Prot sequences. default_*_vs.tsv -- structure/sequence variables calculated with lXtractor and used in an interpretable ML pipeline. *_features.tsv -- lists of ranked features selected by the eBoruta tool for each classifier. Supplement_labels.tsv -- ML model predictions for each PK domain structure found in db_v3. predictions_af2.csv -- Active/Inactive and DFG labels predicted for domains in db_af2.
e
SMART
ebi.ac.uk
Updated Feb 14, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2020). SMART [Dataset]. https://www.ebi.ac.uk/interpro/
Explore at:
Dataset updated
Feb 14, 2020
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
SMART (a Simple Modular Architecture Research Tool) allows the identification and annotation of genetically mobile domains and the analysis of domain architectures. SMART is based at EMBL, Heidelberg, Germany.
e
SUPERFAMILY
ebi.ac.uk
Updated Nov 8, 2010
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2010). SUPERFAMILY [Dataset]. https://www.ebi.ac.uk/interpro/
Explore at:
Dataset updated
Nov 8, 2010
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
SUPERFAMILY is a library of profile hidden Markov models that represent all proteins of known structure. The library is based on the SCOP classification of proteins: each model corresponds to a SCOP domain and aims to represent the entire SCOP superfamily that the domain belongs to. SUPERFAMILY is based at the University of Bristol, UK.
Additional file 3: of BASILIScan: a tool for high-throughput analysis of...
springernature.figshare.com
txt
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michal Barski (2023). Additional file 3: of BASILIScan: a tool for high-throughput analysis of intrinsic disorder patterns in homologous proteins [Dataset]. http://doi.org/10.6084/m9.figshare.7453847.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7453847.v1
Dataset updated
Jun 1, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Michal Barski
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data used to generate Fig. 4. Summary of the BASILIScan search with all human protein sequences available from UniprotKB/Swissprot against the UniprotKB/Swissprot repository. (CSV 1418 kb)
d
neXtProt
dknet.org
rrid.site
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
neXtProt [Dataset]. http://identifiers.org/RRID:SCR_008911/resolver
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_008911 https://identifiers.org/RRID:SCR_008911/resolver
Description
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on April 15,2025. Human protein knowledge platform. Knowledge platform for human proteins selects and filters high throughput data pertinent to human proteins from UniProtKB. Extends UniProtKB/Swiss-Prot annotations for human proteins to include several new data types.