100+ datasets found
  1. d

    Structure - Molecular Modeling Database (MMDB)

    • catalog.data.gov
    • datadiscovery.nlm.nih.gov
    • +2more
    Updated Jun 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (2025). Structure - Molecular Modeling Database (MMDB) [Dataset]. https://catalog.data.gov/dataset/molecular-modeling-database-mmdb
    Explore at:
    Dataset updated
    Jun 19, 2025
    Dataset provided by
    National Library of Medicine
    Description

    Three dimensional structures provide a wealth of information on the biological function and the evolutionary history of macromolecules. They can be used to examine sequence-structure-function relationships, interactions, active sites, and more.

  2. b

    PSCDB

    • dbarchive.biosciencedbc.jp
    Updated Mar 15, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Graduate School of Information Science Nagoya University (2012). PSCDB [Dataset]. http://doi.org/10.18908/lsdba.nbdc01636-000
    Explore at:
    Dataset updated
    Mar 15, 2012
    Dataset provided by
    Graduate School of Information Science Nagoya University
    Description

    The purpose of this database is to represent the relationship between protein structural change and ligand binding. We classified protein structural changes into 7 classes, in terms of the ligand binding sites and the location where the dominant motion occurs.

  3. Protein Chemical Structure Comparison from Three Drug Databases

    • johnsnowlabs.com
    csv
    Updated Jan 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Snow Labs (2021). Protein Chemical Structure Comparison from Three Drug Databases [Dataset]. https://www.johnsnowlabs.com/marketplace/protein-chemical-structure-comparison-from-three-drug-databases/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 20, 2021
    Dataset authored and provided by
    John Snow Labs
    Area covered
    N/A
    Description

    This dataset Protein Chemical Structure Comparison from Three Drug Databases is a selection of a 3-way consensus list from the paper "Comparing the Chemical Structure and Protein Content of ChEMBL, DrugBank, Human Metabolome Database and the Therapeutic Target Database" (2013) [Abstract]. It includes 352 proteins-in-common between the three drug databases.

  4. b

    Database of structurally defined protein interfaces

    • bioregistry.io
    Updated Jan 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Database of structurally defined protein interfaces [Dataset]. https://bioregistry.io/pibase
    Explore at:
    Dataset updated
    Jan 29, 2023
    Description

    PIBASE is a collection of all protein structural interfaces extracted from the Protein Data Bank and PQS structure databases. Both chain-chain and domain-domain (SCOP and CATH definitions) interfaces are detected.

  5. d

    Protein-Small Molecule Database

    • dknet.org
    • rrid.site
    • +2more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Protein-Small Molecule Database [Dataset]. http://identifiers.org/RRID:SCR_002112
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    Database of non-redundant sets of protein - small-molecule complexes that are especially suitable for structure-based drug design and protein - small-molecule interaction research. PSMB supports: * Support frequent updates - The number of new structures in the PDB is growing rapidly. In order to utilize these structures, frequent updates are required. In contrast to manual procedures which require significant time and effort per update, generation of the PSMDB database is fully automatic thereby facilitating frequent database updates. * Consider both protein and ligand structural redundancy - In the database, two complexes are considered redundant if they share a similar protein and ligand (the protein - small-molecule non-redundant set). This allows the database to contain structural information for the same protein bound to several different ligands (and vice-versa). Additionally, for completeness, the database contains a set of non-redundant complexes when only protein structural redundancy is considered (our protein non-redundant set). The following images demonstrate the structural redundancy of the protein complexes in the PDB compared to the PSMDB. * Efficient handling of covalent bonds -Many protein complexes contain covalently bound ligands. Typically, protein-ligand databases discard these complexes; however, the PSMDB simply removes the covalently bound ligand from the complex, retaining any non-covalently bound ligands. This increases the number of usable complexes in the database. * Separate complexes into protein and ligand files -The PSMDB contains individual structure files for both the protein and all non-covalently bound ligands. The unbound proteins are in PDB format while the individual ligands are in SDF format (in their native coordinate frame).

  6. n

    Structural Antibody Database

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Feb 1, 2001
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2001). Structural Antibody Database [Dataset]. http://identifiers.org/RRID:SCR_022096
    Explore at:
    Dataset updated
    Feb 1, 2001
    Description

    Database containing all antibody structures available in the PDB, annotated and presented in consistent fashion.Each structure is annotated with number of properties including experimental details, antibody nomenclature (e.g. heavy-light pairings), curated affinity data and sequence annotations. You can use the database to inspect individual structures, create and download datasets for analysis, search the database for structures with similar sequences to your query, monitor the known structural repetoire of antibodies.

  7. r

    Enzyme Structures Database

    • rrid.site
    • neuinfo.org
    • +2more
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enzyme Structures Database [Dataset]. http://identifiers.org/RRID:SCR_007125
    Explore at:
    Description

    Database of known enzyme structures that have been deposited in the Protein Data Bank (PDB). The enzyme structures are classified by their E.C. number of the ENZYME Data Bank. Browse the classification hierarchy or enter an EC number or search-string. There are currently 45,638 PDB-enzyme entries in the PDB (as at 23 February, 2013) involving 38,109 separate PDB files - some files having more than one E.C. number associated with them.

  8. NAR Molecular Biology Database Collection Data

    • springernature.figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christine Zardecki; Monica Sekharan; Chenghua Shao (2023). NAR Molecular Biology Database Collection Data [Dataset]. http://doi.org/10.6084/m9.figshare.6122099.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Christine Zardecki; Monica Sekharan; Chenghua Shao
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    List of NAR Online Molecular Biology Database Collection resources that utilize PDB data (July 2018)

  9. NIST Inorganic Crystal Structure Database (ICSD)

    • data.nist.gov
    • s.cnmilf.com
    • +1more
    Updated Oct 19, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Igor Levin (2018). NIST Inorganic Crystal Structure Database (ICSD) [Dataset]. http://doi.org/10.18434/M32147
    Explore at:
    Dataset updated
    Oct 19, 2018
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Authors
    Igor Levin
    License

    https://www.nist.gov/open/licensehttps://www.nist.gov/open/license

    Description

    Materials discovery and development necessarily begins with the preparation and identification of product phase(s). Crystalline compounds can be identified by their characteristic diffraction patterns using X-rays, neutrons, and or electrons. An estimated 20,000 X-ray diffractometers and a comparable number of electron microscopes are used daily in materials research and development laboratories for this purpose. Access to crystal structure data is a critical step in solving research and applications problems in materials researches, and these data are of interest to analysts in areas such as materials design, property prediction and compound identification. NIST Crystallographic Data Center, in collaboration with partners all over the world, evaluates and disseminates chemical, physical and crystallographic information published on these materials. NIST Standard Reference Database 3: NIST Inorganic Crystal Structure Database (NIST ICSD) is a comprehensive collection of crystal structure data of nonorganic compounds (including inorganics, ceramics, minerals, pure elements, metals, and intermetallic systems) containing over 210,000 entries and covering the literature from 1913. NIST ICSD includes entries that fall into the following categories: full structure data from experimental refinement or derived from their iso-structural structure types, theoretically predicted structures from computer simulations, as well as partially characterized structures. The NIST ICSD web application provides materials researchers with a user-friendly interface to search the database based on bibliographic information, chemistry, unit cell, space group, experimental settings, mineral name/group and other derived data from expert evaluation. In addition, it also provides users with functions to easily create and examine results from various crystallographic computations, such as reduced cell, bond distance/angle, calculated powder diffraction data, and structure standardization.

  10. s

    Molecular Modeling DataBase

    • scicrunch.org
    • rrid.site
    • +2more
    Updated Dec 4, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Molecular Modeling DataBase [Dataset]. http://identifiers.org/RRID:SCR_010623
    Explore at:
    Dataset updated
    Dec 4, 2023
    Description

    The Molecular Modeling DataBase (MMDB), also known as Entrez Structure, is a database of experimentally determined structures obtained from the RCSB Protein Data Bank (PDB). MMDB is developed by the Structure Group of the NCBI Computational Biology Branch. The data processing procedure at NCBI results in the addition of a number of useful features that facilitate computation on the data and link them to many other data types in the Entrez system. The structure database is considerably smaller than Entrez''s Protein or Nucleotide databases, but a large fraction of all known protein sequences have homologs in this set, and one may often learn more about a protein by examining 3-D structures of its homologs. These are accessible as Related Structures in the Links menu of Entrez Protein sequence records (illustrated example). It is then possible to align the query protein to the structure-based sequence, as shown in the illustration on this page. Additional resources can be used along with MMDB to interactively view the structures, find similar 3D structures, learn about the types of interactions and bound chemicals that have been found to exist among the similar 3D structures, and more.

  11. c

    Protein Structural Domain Classification

    • cathdb.info
    • ec.i4cologne.com
    • +3more
    Updated Sep 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Protein Structural Domain Classification [Dataset]. http://identifiers.org/MIR:00100005
    Explore at:
    Dataset updated
    Sep 30, 2024
    Description

    CATH Domain Classification List (latest release) - protein structural domains classified into CATH hierarchy.

  12. Protein Structure Initiative - TargetTrack 2000-2017 - all data files

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Helen M. Berman, Margaret J. Gabanyi, Andrei Kouranov, David I. Micallef, John Westbrook; Protein Structure Initiative network of investigators; Helen M. Berman, Margaret J. Gabanyi, Andrei Kouranov, David I. Micallef, John Westbrook; Protein Structure Initiative network of investigators (2020). Protein Structure Initiative - TargetTrack 2000-2017 - all data files [Dataset]. http://doi.org/10.5281/zenodo.821654
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Helen M. Berman, Margaret J. Gabanyi, Andrei Kouranov, David I. Micallef, John Westbrook; Protein Structure Initiative network of investigators; Helen M. Berman, Margaret J. Gabanyi, Andrei Kouranov, David I. Micallef, John Westbrook; Protein Structure Initiative network of investigators
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Protein Structure Initiative - TargetTrack protein target registration database (795 MB, gzipped tarball)

    The Protein Structure Initiative was a high-throughput structural genomics effort from 2000-2015 focused on developing technologies to enable greater coverage of protein structure space. Over its 15-year tenure, over 100 investigators at 35 centers (see ContributingCenters.xls) declared over 350,000 protein sequences (targets) that they would study using state-of-the-art protein production and structure determination methods. Many of these targets were selected through bioinformatics-based methods to serve as representatives for sequence and structure clusters.

    From 2003-2010, these selected sequences and some basic identifying metadata were kept in a database called TargetDB, created at the Research Collaboratory for Structural Bioinformatics at Rutgers University. In 2008, a second database named PepcDB was created to track detailed experimental trial history and the standard protocols used by the PSI centers. These two databases became the principal structural genomics target databases, and were rolled into the PSI Structural Biology Knowledgebase in 2008.

    As part of the third phase of the PSI, TargetDB and PepcDB were merged into a single resource, TargetTrack, to facilitate one-stop access to the data as well as expanding the schema to include new required data items. Participating centers deposited the latest status on their active targets and the protocols that were used (along with any deviations) on a weekly or quarterly basis. TargetTrack provided a variety of pre-computed data downloads on a weekly basis as well.

    In July 2017, the Structural Biology Knowledgebase ceased operations. The files provided in this tarball represent the final datafiles generated by TargetTrack (timestamp June 30, 2017). Please read the README included in this dataset for descriptions of each file.

    The entire TargetTrack datafile in XML format can be found in /TargetTrack XML files/tt.xml.gz

    Key documentation can be found in the /Documentation folder.
    TargetTrack schema: targetTrack-v1.4.1.pdf
    Spreadsheet with TargetTrack enumerations for relevant fields: targetTrackEnumeratedDataItems-v1.4.1-1.xls
    Image depicted the XML data schema: targetTrack-v1.4.1.jpg

    These files are 868 MB in total size, uncompressed.
    To open the tarball, use the command 'tar -zxvf TargetTrack-1Jul2017.tar.gz'

    -- created by the PSI Structural Biology Knowledgebase, July 5, 2017

  13. d

    SCOP: Structural Classification of Proteins

    • dknet.org
    • scicrunch.org
    • +2more
    Updated Nov 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). SCOP: Structural Classification of Proteins [Dataset]. http://identifiers.org/RRID:SCR_007039/resolver
    Explore at:
    Dataset updated
    Nov 15, 2025
    Description

    The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships. Protein domains in SCOP are hierarchically classified into families, superfamilies, folds and classes. The continual accumulation of sequence and structural data allows more rigorous analysis and provides important information for understanding the protein world and its evolutionary repertoire. SCOP participates in a project that aims to rationalize and integrate the data on proteins held in several sequence and structure databases. As part of this project, starting with release 1.63, we have initiated a refinement of the SCOP classification, which introduces a number of changes mostly at the levels below superfamily. The pending SCOP reclassification will be carried out gradually through a number of future releases. In addition to the expanded set of static links to external resources, available at the level of domain entries, we have started modernization of the interface capabilities of SCOP allowing more dynamic links with other databases.

  14. d

    Apo and Holo structures DataBase

    • dknet.org
    • scicrunch.org
    • +1more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Apo and Holo structures DataBase [Dataset]. http://identifiers.org/RRID:SCR_004800
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    Database of apo and holo structure pairs of proteins before and after binding. Various protein functions have been shown directly associated with conformational transitions triggered by binding other molecules. Tertiary structures determined in the unbound and bound state are usually named apo and holo structures, respectively. AH-DB is the largest database of apo-holo structure pairs and provides a sophisticated interface to search and view the collected data. It contains 746314 apo-holo pairs of 3638 proteins from 702 organisms.

  15. T

    The Protein Ensemble Database (February 2022)

    • proteinensemble.org
    • bioregistry.io
    Updated Feb 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BioComputing UP, Department of Biomedical Sciences, University of Padua (2022). The Protein Ensemble Database (February 2022) [Dataset]. https://proteinensemble.org/
    Explore at:
    Dataset updated
    Feb 3, 2022
    Dataset authored and provided by
    BioComputing UP, Department of Biomedical Sciences, University of Padua
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Protein Ensemble Database is an open access database for the deposition of structural ensembles, including intrinsically disordered proteins.

  16. AlphaFold Protein Structure Database

    • console.cloud.google.com
    Updated Aug 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    https://console.cloud.google.com/marketplace/browse?filter=partner:BigQuery%20Public%20Data&hl=en-GB (2023). AlphaFold Protein Structure Database [Dataset]. https://console.cloud.google.com/marketplace/product/bigquery-public-data/deepmind-alphafold?hl=en-GB
    Explore at:
    Dataset updated
    Aug 9, 2023
    Dataset provided by
    BigQueryhttps://cloud.google.com/bigquery
    Googlehttp://google.com/
    License
    Description

    The AlphaFold Protein Structure Database is a collection of protein structure predictions made using the machine learning model AlphaFold. AlphaFold was developed by DeepMind , and this database was created in partnership with EMBL-EBI . For information on how to interpret, download and query the data, as well as on which proteins are included / excluded, and change log, please see our main dataset guide and FAQs . To interactively view individual entries or to download proteomes / Swiss-Prot please visit https://alphafold.ebi.ac.uk/ . The current release aims to cover most of the over 200M sequences in UniProt (a commonly used reference set of annotated proteins). The files provided for each entry include the structure plus two model confidence metrics (pLDDT and PAE). The files can be found in the Google Cloud Storage bucket gs://public-datasets-deepmind-alphafold-v4 with metadata in the BigQuery table bigquery-public-data.deepmind_alphafold.metadata . If you use this data, please cite: Jumper, J et al. Highly accurate protein structure prediction with AlphaFold. Nature (2021) Varadi, M et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Research (2021) This public dataset is hosted in Google Cloud Storage and is available free to use. Use this quick start guide to quickly learn how to access public datasets on Google Cloud Storage.

  17. b

    Database of homology-derived secondary structure of proteins

    • bioregistry.io
    Updated Jan 16, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Database of homology-derived secondary structure of proteins [Dataset]. https://bioregistry.io/hssp
    Explore at:
    Dataset updated
    Jan 16, 2022
    Description

    HSSP (homology-derived structures of proteins) is a derived database merging structural (2-D and 3-D) and sequence information (1-D). For each protein of known 3D structure from the Protein Data Bank, the database has a file with all sequence homologues, properly aligned to the PDB protein.

  18. b

    Database of protein-protein complexes

    • bioregistry.io
    Updated Nov 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Database of protein-protein complexes [Dataset]. https://bioregistry.io/registry/protcom
    Explore at:
    Dataset updated
    Nov 14, 2022
    Description

    This database is a collection of protein-protein homo- and hetero-complexes as well as domain-domain structures. This issue of the database contains 17.024 entries (as of October 2007) of which 1350 are two-chain protein hetero-complexes, 7773 homodimers and 1589 are one-chain proteins parsed into two domains (domain structures).

  19. d

    HOMSTRAD - Homologous Structure Alignment Database

    • dknet.org
    • scicrunch.org
    • +2more
    Updated Jan 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). HOMSTRAD - Homologous Structure Alignment Database [Dataset]. http://identifiers.org/RRID:SCR_006544
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    A curated database of structure-based alignments for homologous protein families. All known protein structure are clustered into homologous families (i.e., common ancestry), and the sequences of representative members of each family are aligned on the basis of their 3D structures using the programs MNYFIT, STAMP and COMPARER. These structure-based alignments are annotated with JOY and examined individually.

  20. Protein-protein interactions by database.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Margaret E. Johnson; Gerhard Hummer (2023). Protein-protein interactions by database. [Dataset]. http://doi.org/10.1371/journal.pcbi.1003065.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Margaret E. Johnson; Gerhard Hummer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BioGRID had 337 interactions between the set of 56 proteins. Of the 5 databases, BioGRID contained the most edges, with high coverage of interactions in the other 4 databases. The interactions missing from BioGRID did not arise due to missed references (except for 3 studies of functional rather than physical associations) but due to missed interactions in the same references. The other 4 databases contained a total of 69 interactions not present in BioGRID, and 52 not present in our original database that had been augmented by added edges and through curation of the SH3/PRD and kinase references. Of these 52 interactions, three were removed for erroneous citations, 20 were found only through functional association, and therefore removed, and the remainder were observed in only a single probe of physical interactions.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Library of Medicine (2025). Structure - Molecular Modeling Database (MMDB) [Dataset]. https://catalog.data.gov/dataset/molecular-modeling-database-mmdb

Structure - Molecular Modeling Database (MMDB)

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 19, 2025
Dataset provided by
National Library of Medicine
Description

Three dimensional structures provide a wealth of information on the biological function and the evolutionary history of macromolecules. They can be used to examine sequence-structure-function relationships, interactions, active sites, and more.

Search
Clear search
Close search
Google apps
Main menu