5 datasets found

f
Data from: Global Free Energy Scoring Functions Based on Distance-Dependent...
acs.figshare.com
txt
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Christian Kramer; Peter Gedeck (2023). Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors [Dataset]. http://doi.org/10.1021/ci100473d.s001
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.1021/ci100473d.s001
Dataset updated
Jun 1, 2023
Dataset provided by
ACS Publications
Authors
Christian Kramer; Peter Gedeck
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Scoring functions for protein−ligand docking have received much attention in the past two decades. In many cases, remarkable success has been demonstrated in predicting the correct geometry of interaction. On independent test sets, however, the predicted binding energies or scores correlate only slightly with the observed free energies of binding.In this study, we analyze how well free energies of binding can be predicted on the basis of crystal structures using traditional QSAR techniques in a proteochemometric approach. We introduce a new set of protein−ligand interaction descriptors on the basis of distance-binned Crippen-like atom type pairs. A subset of the publicly available PDBbind09-CN refined set (MW < 900 g/mol, #P < 2, ndon + nacc < 20; N = 1387) is being used as data set. It is demonstrated how simple, yet surprisingly good, scoring functions can be generated for the whole diverse database (R2out-of-bag = 0.48, Rp = 0.69, RMSE = 1.44, MUE = 1.14) and individual protein family subsets. This performance is significantly better than the performance of almost all other scoring functions published that have been validated on a test set as large and diverse as the PDBbind refined set.We also find that on some protein families surprisingly good scoring functions can be obtained using simple ligand-only descriptors like logS, logP, and molecular weight. The ligand−descriptor based scoring function equals or even outperforms commonly used scoring functions, highlighting the need for better scoring functions. We demonstrate how the observed performance depends on the validation strategy, and we outline a general validation protocol for future free energy scoring functions.
f
Reusable core functions with general utility for generic TK models. (Note,...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Apr 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Schacht, Celia M.; Sfeir, Mark A.; Breen, Miyuki; Sluka, James P.; Ring, Caroline L.; Wambaugh, John F.; Pearce, Robert G.; Chang, Xiaoqing; Meade, Annabel; Kenyon, Elaina; Devito, Michael J.; Davidson-Fritz, Sarah E.; Honda, Gregory S.; Linakis, Matthew W.; Evans, Marina V. (2025). Reusable core functions with general utility for generic TK models. (Note, this table only lists a subset of core functions available in the httk R package and is limited to functions explicitly mentioned in this paper. We refer readers to the S2 File and help files for further details on these and other available functions.). [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002097319
Explore at:
Dataset updated
Apr 16, 2025
Authors
Schacht, Celia M.; Sfeir, Mark A.; Breen, Miyuki; Sluka, James P.; Ring, Caroline L.; Wambaugh, John F.; Pearce, Robert G.; Chang, Xiaoqing; Meade, Annabel; Kenyon, Elaina; Devito, Michael J.; Davidson-Fritz, Sarah E.; Honda, Gregory S.; Linakis, Matthew W.; Evans, Marina V.
Description
Reusable core functions with general utility for generic TK models. (Note, this table only lists a subset of core functions available in the httk R package and is limited to functions explicitly mentioned in this paper. We refer readers to the S2 File and help files for further details on these and other available functions.).
MOESM6 of Modelling the structure of a ceRNA-theoretical, bipartite...
springernature.figshare.com
txt
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
J. Robinson; W. Henderson (2023). MOESM6 of Modelling the structure of a ceRNA-theoretical, bipartite microRNA–mRNA interaction network regulating intestinal epithelial cellular pathways using R programming [Dataset]. http://doi.org/10.6084/m9.figshare.5786070.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.5786070.v1
Dataset updated
Jun 3, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
J. Robinson; W. Henderson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Additional file 6. R-input, Fig3c subset adjacency list. A subset of the MiRWalk_Trimmed.csv adjacency list, used to derive the graph plot displayed in Fig. 3c.
f
R code.
figshare.com
plos.figshare.com
txt
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ju Ji; Chong Wang; Zhulin He; Karen E. Hay; Tamsin S. Barnes; Annette M. O’Connor (2023). R code. [Dataset]. http://doi.org/10.1371/journal.pone.0233960.s001
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0233960.s001
Dataset updated
May 30, 2023
Dataset provided by
PLOS ONE
Authors
Ju Ji; Chong Wang; Zhulin He; Karen E. Hay; Tamsin S. Barnes; Annette M. O’Connor
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
R code for functions and Subset-BRD data analysis. (R)
The data subset showing the loadings of the liner discriminant 1 for only...
plos.figshare.com
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Darya Presnyakova; Will Archer; David R. Braun; Wesley Flear (2023). The data subset showing the loadings of the liner discriminant 1 for only those experimental flakes made on silcrete. [Dataset]. http://doi.org/10.1371/journal.pone.0130732.t004
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0130732.t004
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Darya Presnyakova; Will Archer; David R. Braun; Wesley Flear
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data subset showing the loadings of the liner discriminant 1 for only those experimental flakes made on silcrete.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Christian Kramer; Peter Gedeck (2023). Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors [Dataset]. http://doi.org/10.1021/ci100473d.s001

Data from: Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors

Explore at:

txtAvailable download formats

Unique identifier

https://doi.org/10.1021/ci100473d.s001

Dataset updated

Jun 1, 2023

Dataset provided by

ACS Publications

Authors

Christian Kramer; Peter Gedeck

License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Scoring functions for protein−ligand docking have received much attention in the past two decades. In many cases, remarkable success has been demonstrated in predicting the correct geometry of interaction. On independent test sets, however, the predicted binding energies or scores correlate only slightly with the observed free energies of binding.In this study, we analyze how well free energies of binding can be predicted on the basis of crystal structures using traditional QSAR techniques in a proteochemometric approach. We introduce a new set of protein−ligand interaction descriptors on the basis of distance-binned Crippen-like atom type pairs. A subset of the publicly available PDBbind09-CN refined set (MW < 900 g/mol, #P < 2, ndon + nacc < 20; N = 1387) is being used as data set. It is demonstrated how simple, yet surprisingly good, scoring functions can be generated for the whole diverse database (R2out-of-bag = 0.48, Rp = 0.69, RMSE = 1.44, MUE = 1.14) and individual protein family subsets. This performance is significantly better than the performance of almost all other scoring functions published that have been validated on a test set as large and diverse as the PDBbind refined set.We also find that on some protein families surprisingly good scoring functions can be obtained using simple ligand-only descriptors like logS, logP, and molecular weight. The ligand−descriptor based scoring function equals or even outperforms commonly used scoring functions, highlighting the need for better scoring functions. We demonstrate how the observed performance depends on the validation strategy, and we outline a general validation protocol for future free energy scoring functions.

Clear search

Close search

Google apps

Main menu

Data from: Global Free Energy Scoring Functions Based on Distance-Dependent...

Reusable core functions with general utility for generic TK models. (Note,...

MOESM6 of Modelling the structure of a ceRNA-theoretical, bipartite...

R code.

The data subset showing the loadings of the liner discriminant 1 for only...

Data from: Global Free Energy Scoring Functions Based on Distance-Dependent Atom-Type Pair Descriptors