5 datasets found

e
CoRRE Trait Database: A collection of 17 categorical and continuous traits...
portal.edirepository.org
dataone.org
csv
Updated Nov 17, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox (2023). CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide [Dataset]. http://doi.org/10.6073/pasta/31b1c78019b618106a0d13caab154e23
Explore at:
csv(3044968 byte), csv(72474339 byte), csv(3524292 byte)Available download formats
Unique identifier
https://doi.org/10.6073/pasta/31b1c78019b618106a0d13caab154e23
Dataset updated
Nov 17, 2023
Dataset provided by
EDI
Authors
Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox
Area covered
Variables measured
genus, trait, family, source, species, DatasetID, Reference, DatabaseID, trait_value, ObservationID, and 4 more
Description
In our changing world, it is critical to understand and predict plant community responses to global change drivers. Plant functional traits promise to be a key predictive tool for many ecosystems, including grasslands, however their use requires both complete plant community and functional trait data. Yet, representation of these data in global databases is incredibly sparse, particularly beyond a handful of most used traits and common species. Here we present the CoRRE Trait Database, spanning 17 traits (9 categorical, 8 continuous) anticipated to predict species’ responses to global change for 4,079 vascular plant species across 173 plant families present in 390 grassland experiments from around the world. The database contains complete categorical trait records for all 4,079 plant species, obtained from a comprehensive literature search. Additionally, the database contains nearly complete coverage (99.97%) of species mean values for continuous traits for a subset of 2,927 plant species, predicted from observed trait data drawn from TRY and a variety of other plant trait databases using Bayesian Probabilistic Matrix Factorization (BHPMF) and multivariate imputation using chained equations (MICE). These data will shed light on mechanisms underlying population, community, and ecosystem responses to global change in grasslands worldwide.
Food and Agriculture Biomass Input–Output (FABIO) database
zenodo.org
data.niaid.nih.gov
bin, csv
Updated Jun 8, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Martin Bruckner; Martin Bruckner; Nikolas Kuschnig; Nikolas Kuschnig (2022). Food and Agriculture Biomass Input–Output (FABIO) database [Dataset]. http://doi.org/10.5281/zenodo.2577067
Explore at:
bin, csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.2577067
Dataset updated
Jun 8, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Martin Bruckner; Martin Bruckner; Nikolas Kuschnig; Nikolas Kuschnig
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
This data repository provides the Food and Agriculture Biomass Input Output (FABIO) database, a global set of multi-regional physical supply-use and input-output tables covering global agriculture and forestry.

The work is based on mostly freely available data from FAOSTAT, IEA, EIA, and UN Comtrade/BACI. FABIO currently covers 191 countries + RoW, 118 processes and 125 commodities (raw and processed agricultural and food products) for 1986-2013. All R codes and auxilliary data are available on GitHub. For more information please refer to https://fabio.fineprint.global.

The database consists of the following main components, in compressed .rds format:

Z: the inter-commodity input-output matrix, displaying the relationships of intermediate use of each commodity in the production of each commodity, in physical units (tons). The matrix has 24000 rows and columns (125 commodities x 192 regions), and is available in two versions, based on the method to allocate inputs to outputs in production processes: Z_mass (mass allocation) and Z_value (value allocation). Note that the row sums of the Z matrix (= total intermediate use by commodity) are identical in both versions.

Y: the final demand matrix, denoting the consumption of all 24000 commodities by destination country and final use category. There are six final use categories (yielding 192 x 6 = 1152 columns): 1) food use, 2) other use (non-food), 3) losses, 4) stock addition, 5) balancing, and 6) unspecified.

X: the total output vector of all 24000 commodities. Total output is equal to the sum of intermediate and final use by commodity.

L: the Leontief inverse, computed as (I – A)^-1, where A is the matrix of input coefficients derived from Z and x. Again, there are two versions, depending on the underlying version of Z (L_mass and L_value).

E: environmental extensions for each of the 24000 commodities, including four resource categories: 1) primary biomass extraction (in tons), 2) land use (in hectares), 3) blue water use (in m3)., and 4) green water use (in m3).

mr_sup_mass/mr_sup_value: For each allocation method (mass/value), the supply table gives the physical supply quantity of each commodity by producing process, with processes in the rows (118 processes x 192 regions = 22656 rows) and commodities in columns (24000 columns).

mr_use: the use table capture the quantities of each commodity (rows) used as an input in each process (columns).

A description of the included countries and commodities (i.e. the rows and columns of the Z matrix) can be found in the auxiliary file io_codes.csv. Separate lists of the country sample (including ISO3 codes and continental grouping) and commodities (including moisture content) are given in the files regions.csv and items.csv, respectively. For information on the individual processes, see auxiliary file su_codes.csv. RDS files can be opened in R. Information on how to read these files can be obtained here: https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/readRDS

Except of X.rds, which contains a matrix, all variables are organized as lists, where each element contains a sparse matrix. Please note that values are always given in physical units, i.e. tonnes or head, as specified in items.csv. The suffixes value and mass only indicate the form of allocation chosen for the construction of the symmetric IO tables (for more details see Bruckner et al. 2019). Product, process and country classifications can be found in the file fabio_classifications.xlsx.

Footprint results are not contained in the database but can be calculated, e.g. by using this script: https://github.com/martinbruckner/fabio_comparison/blob/master/R/fabio_footprints.R

How to cite:

To cite FABIO work please refer to this paper:

Bruckner, M., Wood, R., Moran, D., Kuschnig, N., Wieland, H., Maus, V., Börner, J. 2019. FABIO – The Construction of the Food and Agriculture Input–Output Model. Environmental Science & Technology 53(19), 11302–11312. DOI: 10.1021/acs.est.9b03554

License:

This data repository is distributed under the CC BY-NC-SA 4.0 License. You are free to share and adapt the material for non-commercial purposes using proper citation. If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original. In case you are interested in a collaboration, I am happy to receive enquiries at martin.bruckner@wu.ac.at.

Known issues:

The underlying FAO data have been manipulated to the minimum extent necessary. Data filling and supply-use balancing, yet, required some adaptations. These are documented in the code and are also reflected in the balancing item in the final demand matrices. For a proper use of the database, I recommend to distribute the balancing item over all other uses proportionally and to do analyses with and without balancing to illustrate uncertainties.
Data from: Prediction of Small Molecule–MicroRNA Associations by Sparse...
acs.figshare.com
figshare.com
xlsx
Updated Jun 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jun Yin; Xing Chen; Chun-Chun Wang; Yan Zhao; Ya-Zhou Sun (2023). Prediction of Small Molecule–MicroRNA Associations by Sparse Learning and Heterogeneous Graph Inference [Dataset]. http://doi.org/10.1021/acs.molpharmaceut.9b00384.s001
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1021/acs.molpharmaceut.9b00384.s001
Dataset updated
Jun 4, 2023
Dataset provided by
ACS Publications
Authors
Jun Yin; Xing Chen; Chun-Chun Wang; Yan Zhao; Ya-Zhou Sun
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
As microRNAs (miRNAs) have been reported to be a type of novel high-value small molecule (SM) drug targets for disease treatments, many researchers are engaged in the field of exploring new SM–miRNA associations. Nevertheless, because of the high cost, adopting traditional biological experiments constrains the efficiency of discovering new associations between SMs and miRNAs. Therefore, as an important auxiliary tool, reliable computational models will be of great help to reveal SM–miRNA associations. In this article, we developed a computational model of sparse learning and heterogeneous graph inference for small molecule–miRNA association prediction (SLHGISMMA). Initially, the sparse learning method (SLM) was implemented to decompose the SM–miRNA adjacency matrix. Then, we integrated the reacquired association information together with the similarity information of SMs and miRNAs into a heterogeneous graph to infer potential SM–miRNA associations. Here, the main innovation of SLHGISMMA lies in the introduction of SLM to eliminate noises of the original adjacency matrix to some extent, which plays an important role in performance improvement. In addition, to assess SLHGISMMA’ performance, four different kinds of cross-validations were performed based on two datasets. As a result, based on dataset 1 (dataset 2), SLHGISMMA achieved area under the curves of 0.9273 (0.7774), 0.9365 (0.7973), 0.7703 (0.6556), and 0.9241 ± 0.0052 (0.7724 ± 0.0032) in global leave-one-out cross-validation (LOOCV), miRNA-fixed local LOOCV, SM-fixed local LOOCV, and 5-fold cross-validation, respectively. Moreover, in the case study on three important SMs via removing their known associations, the results showed that most of the top 50 predicted miRNAs were confirmed by the database SM2miR v1.0 or the experimental literature.
f
MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for...
figshare.com
docx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xing Chen; Jun Yin; Jia Qu; Li Huang (2023). MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction [Dataset]. http://doi.org/10.1371/journal.pcbi.1006418
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1006418
Dataset updated
May 31, 2023
Dataset provided by
PLOS Computational Biology
Authors
Xing Chen; Jun Yin; Jia Qu; Li Huang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Recently, a growing number of biological research and scientific experiments have demonstrated that microRNA (miRNA) affects the development of human complex diseases. Discovering miRNA-disease associations plays an increasingly vital role in devising diagnostic and therapeutic tools for diseases. However, since uncovering associations via experimental methods is expensive and time-consuming, novel and effective computational methods for association prediction are in demand. In this study, we developed a computational model of Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction (MDHGI) to discover new miRNA-disease associations by integrating the predicted association probability obtained from matrix decomposition through sparse learning method, the miRNA functional similarity, the disease semantic similarity, and the Gaussian interaction profile kernel similarity for diseases and miRNAs into a heterogeneous network. Compared with previous computational models based on heterogeneous networks, our model took full advantage of matrix decomposition before the construction of heterogeneous network, thereby improving the prediction accuracy. MDHGI obtained AUCs of 0.8945 and 0.8240 in the global and the local leave-one-out cross validation, respectively. Moreover, the AUC of 0.8794+/-0.0021 in 5-fold cross validation confirmed its stability of predictive performance. In addition, to further evaluate the model's accuracy, we applied MDHGI to four important human cancers in three different kinds of case studies. In the first type, 98% (Esophageal Neoplasms) and 98% (Lymphoma) of top 50 predicted miRNAs have been confirmed by at least one of the two databases (dbDEMC and miR2Disease) or at least one experimental literature in PubMed. In the second type of case study, what made a difference was that we removed all known associations between the miRNAs and Lung Neoplasms before implementing MDHGI on Lung Neoplasms. As a result, 100% (Lung Neoplasms) of top 50 related miRNAs have been indexed by at least one of the three databases (dbDEMC, miR2Disease and HMDD V2.0) or at least one experimental literature in PubMed. Furthermore, we also tested our prediction method on the HMDD V1.0 database to prove the applicability of MDHGI to different datasets. The results showed that 50 out of top 50 miRNAs related with the breast neoplasms were validated by at least one of the three databases (HMDD V2.0, dbDEMC, and miR2Disease) or at least one experimental literature.
p
Data from: Graph-theoretic algorithms for sparse matrix transformations /
dona.pwr.edu.pl
Updated 1978
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jadwiga Dzikiewicz; Maciej Sysło (1978). Graph-theoretic algorithms for sparse matrix transformations / [Dataset]. https://dona.pwr.edu.pl/szukaj/default.aspx?nrsys=33919
Explore at:
Dataset updated
1978
Authors
Jadwiga Dzikiewicz; Maciej Sysło
Description
Library of Wroclaw University of Science and Technology scientific output (DONA database)
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox (2023). CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide [Dataset]. http://doi.org/10.6073/pasta/31b1c78019b618106a0d13caab154e23

CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide

Explore at:

296 scholarly articles cite this dataset (View in Google Scholar)

csv(3044968 byte), csv(72474339 byte), csv(3524292 byte)Available download formats

Unique identifier

https://doi.org/10.6073/pasta/31b1c78019b618106a0d13caab154e23

Dataset updated

Nov 17, 2023

Dataset provided by

EDI

Authors

Area covered

Variables measured

genus, trait, family, source, species, DatasetID, Reference, DatabaseID, trait_value, ObservationID, and 4 more

Description

In our changing world, it is critical to understand and predict plant community responses to global change drivers. Plant functional traits promise to be a key predictive tool for many ecosystems, including grasslands, however their use requires both complete plant community and functional trait data. Yet, representation of these data in global databases is incredibly sparse, particularly beyond a handful of most used traits and common species. Here we present the CoRRE Trait Database, spanning 17 traits (9 categorical, 8 continuous) anticipated to predict species’ responses to global change for 4,079 vascular plant species across 173 plant families present in 390 grassland experiments from around the world. The database contains complete categorical trait records for all 4,079 plant species, obtained from a comprehensive literature search. Additionally, the database contains nearly complete coverage (99.97%) of species mean values for continuous traits for a subset of 2,927 plant species, predicted from observed trait data drawn from TRY and a variety of other plant trait databases using Bayesian Probabilistic Matrix Factorization (BHPMF) and multivariate imputation using chained equations (MICE). These data will shed light on mechanisms underlying population, community, and ecosystem responses to global change in grasslands worldwide.

Clear search

Close search

Google apps

Main menu

CoRRE Trait Database: A collection of 17 categorical and continuous traits...

Food and Agriculture Biomass Input–Output (FABIO) database

Data from: Prediction of Small Molecule–MicroRNA Associations by Sparse...

MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for...

Data from: Graph-theoretic algorithms for sparse matrix transformations /

CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwideSee More Versions

CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide