5 datasets found
  1. f

    Data from: Global Free Energy Scoring Functions Based on Distance-Dependent...

    • acs.figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Kramer; Peter Gedeck (2023). Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors [Dataset]. http://doi.org/10.1021/ci100473d.s001
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    ACS Publications
    Authors
    Christian Kramer; Peter Gedeck
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Scoring functions for protein−ligand docking have received much attention in the past two decades. In many cases, remarkable success has been demonstrated in predicting the correct geometry of interaction. On independent test sets, however, the predicted binding energies or scores correlate only slightly with the observed free energies of binding.In this study, we analyze how well free energies of binding can be predicted on the basis of crystal structures using traditional QSAR techniques in a proteochemometric approach. We introduce a new set of protein−ligand interaction descriptors on the basis of distance-binned Crippen-like atom type pairs. A subset of the publicly available PDBbind09-CN refined set (MW < 900 g/mol, #P < 2, ndon + nacc < 20; N = 1387) is being used as data set. It is demonstrated how simple, yet surprisingly good, scoring functions can be generated for the whole diverse database (R2out-of-bag = 0.48, Rp = 0.69, RMSE = 1.44, MUE = 1.14) and individual protein family subsets. This performance is significantly better than the performance of almost all other scoring functions published that have been validated on a test set as large and diverse as the PDBbind refined set.We also find that on some protein families surprisingly good scoring functions can be obtained using simple ligand-only descriptors like logS, logP, and molecular weight. The ligand−descriptor based scoring function equals or even outperforms commonly used scoring functions, highlighting the need for better scoring functions. We demonstrate how the observed performance depends on the validation strategy, and we outline a general validation protocol for future free energy scoring functions.

  2. f

    Reusable core functions with general utility for generic TK models. (Note,...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Apr 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Schacht, Celia M.; Sfeir, Mark A.; Breen, Miyuki; Sluka, James P.; Ring, Caroline L.; Wambaugh, John F.; Pearce, Robert G.; Chang, Xiaoqing; Meade, Annabel; Kenyon, Elaina; Devito, Michael J.; Davidson-Fritz, Sarah E.; Honda, Gregory S.; Linakis, Matthew W.; Evans, Marina V. (2025). Reusable core functions with general utility for generic TK models. (Note, this table only lists a subset of core functions available in the httk R package and is limited to functions explicitly mentioned in this paper. We refer readers to the S2 File and help files for further details on these and other available functions.). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002097319
    Explore at:
    Dataset updated
    Apr 16, 2025
    Authors
    Schacht, Celia M.; Sfeir, Mark A.; Breen, Miyuki; Sluka, James P.; Ring, Caroline L.; Wambaugh, John F.; Pearce, Robert G.; Chang, Xiaoqing; Meade, Annabel; Kenyon, Elaina; Devito, Michael J.; Davidson-Fritz, Sarah E.; Honda, Gregory S.; Linakis, Matthew W.; Evans, Marina V.
    Description

    Reusable core functions with general utility for generic TK models. (Note, this table only lists a subset of core functions available in the httk R package and is limited to functions explicitly mentioned in this paper. We refer readers to the S2 File and help files for further details on these and other available functions.).

  3. MOESM6 of Modelling the structure of a ceRNA-theoretical, bipartite...

    • springernature.figshare.com
    txt
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    J. Robinson; W. Henderson (2023). MOESM6 of Modelling the structure of a ceRNA-theoretical, bipartite microRNA–mRNA interaction network regulating intestinal epithelial cellular pathways using R programming [Dataset]. http://doi.org/10.6084/m9.figshare.5786070.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    J. Robinson; W. Henderson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 6. R-input, Fig3c subset adjacency list. A subset of the MiRWalk_Trimmed.csv adjacency list, used to derive the graph plot displayed in Fig. 3c.

  4. f

    R code.

    • figshare.com
    • plos.figshare.com
    txt
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ju Ji; Chong Wang; Zhulin He; Karen E. Hay; Tamsin S. Barnes; Annette M. O’Connor (2023). R code. [Dataset]. http://doi.org/10.1371/journal.pone.0233960.s001
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ju Ji; Chong Wang; Zhulin He; Karen E. Hay; Tamsin S. Barnes; Annette M. O’Connor
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    R code for functions and Subset-BRD data analysis. (R)

  5. The data subset showing the loadings of the liner discriminant 1 for only...

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Darya Presnyakova; Will Archer; David R. Braun; Wesley Flear (2023). The data subset showing the loadings of the liner discriminant 1 for only those experimental flakes made on silcrete. [Dataset]. http://doi.org/10.1371/journal.pone.0130732.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Darya Presnyakova; Will Archer; David R. Braun; Wesley Flear
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data subset showing the loadings of the liner discriminant 1 for only those experimental flakes made on silcrete.

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Christian Kramer; Peter Gedeck (2023). Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors [Dataset]. http://doi.org/10.1021/ci100473d.s001

Data from: Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors

Related Article
Explore at:
txtAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
ACS Publications
Authors
Christian Kramer; Peter Gedeck
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Scoring functions for protein−ligand docking have received much attention in the past two decades. In many cases, remarkable success has been demonstrated in predicting the correct geometry of interaction. On independent test sets, however, the predicted binding energies or scores correlate only slightly with the observed free energies of binding.In this study, we analyze how well free energies of binding can be predicted on the basis of crystal structures using traditional QSAR techniques in a proteochemometric approach. We introduce a new set of protein−ligand interaction descriptors on the basis of distance-binned Crippen-like atom type pairs. A subset of the publicly available PDBbind09-CN refined set (MW < 900 g/mol, #P < 2, ndon + nacc < 20; N = 1387) is being used as data set. It is demonstrated how simple, yet surprisingly good, scoring functions can be generated for the whole diverse database (R2out-of-bag = 0.48, Rp = 0.69, RMSE = 1.44, MUE = 1.14) and individual protein family subsets. This performance is significantly better than the performance of almost all other scoring functions published that have been validated on a test set as large and diverse as the PDBbind refined set.We also find that on some protein families surprisingly good scoring functions can be obtained using simple ligand-only descriptors like logS, logP, and molecular weight. The ligand−descriptor based scoring function equals or even outperforms commonly used scoring functions, highlighting the need for better scoring functions. We demonstrate how the observed performance depends on the validation strategy, and we outline a general validation protocol for future free energy scoring functions.

Search
Clear search
Close search
Google apps
Main menu