4 datasets found

f
Additional file 2 of Thresher: determining the number of clusters while...
springernature.figshare.com
zip
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Min Wang; Zachary B. Abrams; Steven M. Kornblau; Kevin R. Coombes (2023). Additional file 2 of Thresher: determining the number of clusters while removing outliers [Dataset]. http://doi.org/10.6084/m9.figshare.5768622.v1
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.5768622.v1
Dataset updated
Jun 3, 2023
Dataset provided by
figshare
Authors
Min Wang; Zachary B. Abrams; Steven M. Kornblau; Kevin R. Coombes
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
R Code for Analyses. This is a zip file containing all of the R code used to perform simulations and to analyze the breast cancer data. (ZIP 407 kb)
Machine learning pipeline to train toxicity prediction model of...
zenodo.org
data.niaid.nih.gov
zip
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jan Ewald; Jan Ewald (2020). Machine learning pipeline to train toxicity prediction model of FunTox-Networks [Dataset]. http://doi.org/10.5281/zenodo.3529162
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3529162
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Jan Ewald; Jan Ewald
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Machine Learning pipeline used to provide toxicity prediction in FunTox-Networks

01_DATA # preprocessing and filtering of raw activity data from ChEMBL
- Chembl_v25 # latest activity assay data set from ChEMBL (retrieved Nov 2019)
- filt_stats.R # Filtering and preparation of raw data
- Filtered # output data sets from filt_stats.R
- toxicity_direction.csv # table of toxicity measurements and their proportionality to toxicity

02_MolDesc # Calculation of molecular descriptors for all compounds within the filtered ChEMBL data set
- datastore # files with all compounds and their calculated molecular descriptors based on SMILES
- scripts
- calc_molDesc.py # calculates for all compounds based on their smiles the molecular descriptors
- chemopy-1.1 # used python package for descriptor calculation as decsribed in: https://doi.org/10.1093/bioinformatics/btt105

03_Averages # Calculation of moving averages for levels and organisms as required for calculation of Z-scores
- datastore # output files with statistics calculated by make_Z.R
- scripts
-make_Z.R # script to calculate statistics to calculate Z-scores as used by the regression models

04_ZScores # Calculation of Z-scores and preparation of table to fit regression models
- datastore # Z-normalized activity data and molecular descriptors in the form as used for fitting regression models
- scripts
-calc_Ztable.py # based on activity data, molecular descriptors and Z-statistics, the learning data is calculated

05_Regression # Performing regression. Preparation of data by removing of outliers based on a linear regression model. Learning of random forest regression models. Validation of learning process by cross validation and tuning of hyperparameters.

- datastore # storage of all random forest regression models and average level of Z output value per level and organism (zexp_*.tsv)
- scripts
- data_preperation.R # set up of regression data set, removal of outliers and optional removal of fields and descriptors
- Rforest_CV.R # analysis of machine learning by cross validation, importance of regression variables and tuning of hyperparameters (number of trees, split of variables)
- Rforest.R # based on analysis of Rforest_CV.R learning of final models

rregrs_output
# early analysis of regression model performance with the package RRegrs as described in: https://doi.org/10.1186/s13321-015-0094-2
Causal effect estimates using Radial MVMR with and without outlier removal...
plos.figshare.com
xls
Updated Dec 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Wes Spiller; Jack Bowden; Eleanor Sanderson (2024). Causal effect estimates using Radial MVMR with and without outlier removal with varying levels of balanced pleiotropy. [Dataset]. http://doi.org/10.1371/journal.pgen.1011506.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pgen.1011506.t002
Dataset updated
Dec 30, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Wes Spiller; Jack Bowden; Eleanor Sanderson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Causal effect estimates using Radial MVMR with and without outlier removal with varying levels of balanced pleiotropy.
MLR models of age at onset of T1D after removing outliers (N = 354).
plos.figshare.com
xls
Updated Jun 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ahood Alazwari; Mali Abdollahian; Laleh Tafakori; Alice Johnstone; Rahma A. Alshumrani; Manal T. Alhelal; Abdulhameed Y. Alsaheel; Eman S. Almoosa; Aseel R. Alkhaldi (2023). MLR models of age at onset of T1D after removing outliers (N = 354). [Dataset]. http://doi.org/10.1371/journal.pone.0264118.t006
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0264118.t006
Dataset updated
Jun 16, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Ahood Alazwari; Mali Abdollahian; Laleh Tafakori; Alice Johnstone; Rahma A. Alshumrani; Manal T. Alhelal; Abdulhameed Y. Alsaheel; Eman S. Almoosa; Aseel R. Alkhaldi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
MLR models of age at onset of T1D after removing outliers (N = 354).
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Min Wang; Zachary B. Abrams; Steven M. Kornblau; Kevin R. Coombes (2023). Additional file 2 of Thresher: determining the number of clusters while removing outliers [Dataset]. http://doi.org/10.6084/m9.figshare.5768622.v1

Additional file 2 of Thresher: determining the number of clusters while removing outliers

Explore at:

zipAvailable download formats

Unique identifier

https://doi.org/10.6084/m9.figshare.5768622.v1

Dataset updated

Jun 3, 2023

Dataset provided by

figshare

Authors

Min Wang; Zachary B. Abrams; Steven M. Kornblau; Kevin R. Coombes

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

R Code for Analyses. This is a zip file containing all of the R code used to perform simulations and to analyze the breast cancer data. (ZIP 407 kb)

Clear search

Close search

Google apps

Main menu

Additional file 2 of Thresher: determining the number of clusters while...

Machine learning pipeline to train toxicity prediction model of...

Causal effect estimates using Radial MVMR with and without outlier removal...

MLR models of age at onset of T1D after removing outliers (N = 354).

Additional file 2 of Thresher: determining the number of clusters while removing outliers