5 datasets found
  1. e

    CoRRE Trait Database: A collection of 17 categorical and continuous traits...

    • portal.edirepository.org
    • dataone.org
    csv
    Updated Nov 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox (2023). CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide [Dataset]. http://doi.org/10.6073/pasta/31b1c78019b618106a0d13caab154e23
    Explore at:
    csv(3044968 byte), csv(72474339 byte), csv(3524292 byte)Available download formats
    Dataset updated
    Nov 17, 2023
    Dataset provided by
    EDI
    Authors
    Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox
    Area covered
    Variables measured
    genus, trait, family, source, species, DatasetID, Reference, DatabaseID, trait_value, ObservationID, and 4 more
    Description

    In our changing world, it is critical to understand and predict plant community responses to global change drivers. Plant functional traits promise to be a key predictive tool for many ecosystems, including grasslands, however their use requires both complete plant community and functional trait data. Yet, representation of these data in global databases is incredibly sparse, particularly beyond a handful of most used traits and common species. Here we present the CoRRE Trait Database, spanning 17 traits (9 categorical, 8 continuous) anticipated to predict species’ responses to global change for 4,079 vascular plant species across 173 plant families present in 390 grassland experiments from around the world. The database contains complete categorical trait records for all 4,079 plant species, obtained from a comprehensive literature search. Additionally, the database contains nearly complete coverage (99.97%) of species mean values for continuous traits for a subset of 2,927 plant species, predicted from observed trait data drawn from TRY and a variety of other plant trait databases using Bayesian Probabilistic Matrix Factorization (BHPMF) and multivariate imputation using chained equations (MICE). These data will shed light on mechanisms underlying population, community, and ecosystem responses to global change in grasslands worldwide.

  2. Food and Agriculture Biomass Input–Output (FABIO) database

    • zenodo.org
    • data.niaid.nih.gov
    bin, csv
    Updated Jun 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Bruckner; Martin Bruckner; Nikolas Kuschnig; Nikolas Kuschnig (2022). Food and Agriculture Biomass Input–Output (FABIO) database [Dataset]. http://doi.org/10.5281/zenodo.2577067
    Explore at:
    bin, csvAvailable download formats
    Dataset updated
    Jun 8, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Martin Bruckner; Martin Bruckner; Nikolas Kuschnig; Nikolas Kuschnig
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This data repository provides the Food and Agriculture Biomass Input Output (FABIO) database, a global set of multi-regional physical supply-use and input-output tables covering global agriculture and forestry.

    The work is based on mostly freely available data from FAOSTAT, IEA, EIA, and UN Comtrade/BACI. FABIO currently covers 191 countries + RoW, 118 processes and 125 commodities (raw and processed agricultural and food products) for 1986-2013. All R codes and auxilliary data are available on GitHub. For more information please refer to https://fabio.fineprint.global.

    The database consists of the following main components, in compressed .rds format:

    • Z: the inter-commodity input-output matrix, displaying the relationships of intermediate use of each commodity in the production of each commodity, in physical units (tons). The matrix has 24000 rows and columns (125 commodities x 192 regions), and is available in two versions, based on the method to allocate inputs to outputs in production processes: Z_mass (mass allocation) and Z_value (value allocation). Note that the row sums of the Z matrix (= total intermediate use by commodity) are identical in both versions.
    • Y: the final demand matrix, denoting the consumption of all 24000 commodities by destination country and final use category. There are six final use categories (yielding 192 x 6 = 1152 columns): 1) food use, 2) other use (non-food), 3) losses, 4) stock addition, 5) balancing, and 6) unspecified.
    • X: the total output vector of all 24000 commodities. Total output is equal to the sum of intermediate and final use by commodity.
    • L: the Leontief inverse, computed as (I – A)-1, where A is the matrix of input coefficients derived from Z and x. Again, there are two versions, depending on the underlying version of Z (L_mass and L_value).
    • E: environmental extensions for each of the 24000 commodities, including four resource categories: 1) primary biomass extraction (in tons), 2) land use (in hectares), 3) blue water use (in m3)., and 4) green water use (in m3).
    • mr_sup_mass/mr_sup_value: For each allocation method (mass/value), the supply table gives the physical supply quantity of each commodity by producing process, with processes in the rows (118 processes x 192 regions = 22656 rows) and commodities in columns (24000 columns).
    • mr_use: the use table capture the quantities of each commodity (rows) used as an input in each process (columns).

    A description of the included countries and commodities (i.e. the rows and columns of the Z matrix) can be found in the auxiliary file io_codes.csv. Separate lists of the country sample (including ISO3 codes and continental grouping) and commodities (including moisture content) are given in the files regions.csv and items.csv, respectively. For information on the individual processes, see auxiliary file su_codes.csv. RDS files can be opened in R. Information on how to read these files can be obtained here: https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/readRDS

    Except of X.rds, which contains a matrix, all variables are organized as lists, where each element contains a sparse matrix. Please note that values are always given in physical units, i.e. tonnes or head, as specified in items.csv. The suffixes value and mass only indicate the form of allocation chosen for the construction of the symmetric IO tables (for more details see Bruckner et al. 2019). Product, process and country classifications can be found in the file fabio_classifications.xlsx.

    Footprint results are not contained in the database but can be calculated, e.g. by using this script: https://github.com/martinbruckner/fabio_comparison/blob/master/R/fabio_footprints.R

    How to cite:

    To cite FABIO work please refer to this paper:

    Bruckner, M., Wood, R., Moran, D., Kuschnig, N., Wieland, H., Maus, V., Börner, J. 2019. FABIO – The Construction of the Food and Agriculture Input–Output Model. Environmental Science & Technology 53(19), 11302–11312. DOI: 10.1021/acs.est.9b03554

    License:

    This data repository is distributed under the CC BY-NC-SA 4.0 License. You are free to share and adapt the material for non-commercial purposes using proper citation. If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original. In case you are interested in a collaboration, I am happy to receive enquiries at martin.bruckner@wu.ac.at.

    Known issues:

    The underlying FAO data have been manipulated to the minimum extent necessary. Data filling and supply-use balancing, yet, required some adaptations. These are documented in the code and are also reflected in the balancing item in the final demand matrices. For a proper use of the database, I recommend to distribute the balancing item over all other uses proportionally and to do analyses with and without balancing to illustrate uncertainties.

  3. Data from: Prediction of Small Molecule–MicroRNA Associations by Sparse...

    • acs.figshare.com
    • figshare.com
    xlsx
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jun Yin; Xing Chen; Chun-Chun Wang; Yan Zhao; Ya-Zhou Sun (2023). Prediction of Small Molecule–MicroRNA Associations by Sparse Learning and Heterogeneous Graph Inference [Dataset]. http://doi.org/10.1021/acs.molpharmaceut.9b00384.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    ACS Publications
    Authors
    Jun Yin; Xing Chen; Chun-Chun Wang; Yan Zhao; Ya-Zhou Sun
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    As microRNAs (miRNAs) have been reported to be a type of novel high-value small molecule (SM) drug targets for disease treatments, many researchers are engaged in the field of exploring new SM–miRNA associations. Nevertheless, because of the high cost, adopting traditional biological experiments constrains the efficiency of discovering new associations between SMs and miRNAs. Therefore, as an important auxiliary tool, reliable computational models will be of great help to reveal SM–miRNA associations. In this article, we developed a computational model of sparse learning and heterogeneous graph inference for small molecule–miRNA association prediction (SLHGISMMA). Initially, the sparse learning method (SLM) was implemented to decompose the SM–miRNA adjacency matrix. Then, we integrated the reacquired association information together with the similarity information of SMs and miRNAs into a heterogeneous graph to infer potential SM–miRNA associations. Here, the main innovation of SLHGISMMA lies in the introduction of SLM to eliminate noises of the original adjacency matrix to some extent, which plays an important role in performance improvement. In addition, to assess SLHGISMMA’ performance, four different kinds of cross-validations were performed based on two datasets. As a result, based on dataset 1 (dataset 2), SLHGISMMA achieved area under the curves of 0.9273 (0.7774), 0.9365 (0.7973), 0.7703 (0.6556), and 0.9241 ± 0.0052 (0.7724 ± 0.0032) in global leave-one-out cross-validation (LOOCV), miRNA-fixed local LOOCV, SM-fixed local LOOCV, and 5-fold cross-validation, respectively. Moreover, in the case study on three important SMs via removing their known associations, the results showed that most of the top 50 predicted miRNAs were confirmed by the database SM2miR v1.0 or the experimental literature.

  4. f

    MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for...

    • figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xing Chen; Jun Yin; Jia Qu; Li Huang (2023). MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction [Dataset]. http://doi.org/10.1371/journal.pcbi.1006418
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS Computational Biology
    Authors
    Xing Chen; Jun Yin; Jia Qu; Li Huang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Recently, a growing number of biological research and scientific experiments have demonstrated that microRNA (miRNA) affects the development of human complex diseases. Discovering miRNA-disease associations plays an increasingly vital role in devising diagnostic and therapeutic tools for diseases. However, since uncovering associations via experimental methods is expensive and time-consuming, novel and effective computational methods for association prediction are in demand. In this study, we developed a computational model of Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction (MDHGI) to discover new miRNA-disease associations by integrating the predicted association probability obtained from matrix decomposition through sparse learning method, the miRNA functional similarity, the disease semantic similarity, and the Gaussian interaction profile kernel similarity for diseases and miRNAs into a heterogeneous network. Compared with previous computational models based on heterogeneous networks, our model took full advantage of matrix decomposition before the construction of heterogeneous network, thereby improving the prediction accuracy. MDHGI obtained AUCs of 0.8945 and 0.8240 in the global and the local leave-one-out cross validation, respectively. Moreover, the AUC of 0.8794+/-0.0021 in 5-fold cross validation confirmed its stability of predictive performance. In addition, to further evaluate the model's accuracy, we applied MDHGI to four important human cancers in three different kinds of case studies. In the first type, 98% (Esophageal Neoplasms) and 98% (Lymphoma) of top 50 predicted miRNAs have been confirmed by at least one of the two databases (dbDEMC and miR2Disease) or at least one experimental literature in PubMed. In the second type of case study, what made a difference was that we removed all known associations between the miRNAs and Lung Neoplasms before implementing MDHGI on Lung Neoplasms. As a result, 100% (Lung Neoplasms) of top 50 related miRNAs have been indexed by at least one of the three databases (dbDEMC, miR2Disease and HMDD V2.0) or at least one experimental literature in PubMed. Furthermore, we also tested our prediction method on the HMDD V1.0 database to prove the applicability of MDHGI to different datasets. The results showed that 50 out of top 50 miRNAs related with the breast neoplasms were validated by at least one of the three databases (HMDD V2.0, dbDEMC, and miR2Disease) or at least one experimental literature.

  5. p

    Data from: Graph-theoretic algorithms for sparse matrix transformations /

    • dona.pwr.edu.pl
    Updated 1978
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jadwiga Dzikiewicz; Maciej Sysło (1978). Graph-theoretic algorithms for sparse matrix transformations / [Dataset]. https://dona.pwr.edu.pl/szukaj/default.aspx?nrsys=33919
    Explore at:
    Dataset updated
    1978
    Authors
    Jadwiga Dzikiewicz; Maciej Sysło
    Description

    Library of Wroclaw University of Science and Technology scientific output (DONA database)

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox (2023). CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide [Dataset]. http://doi.org/10.6073/pasta/31b1c78019b618106a0d13caab154e23

CoRRE Trait Database: A collection of 17 categorical and continuous traits for more than 4000 grassland species worldwide

Explore at:
296 scholarly articles cite this dataset (View in Google Scholar)
csv(3044968 byte), csv(72474339 byte), csv(3524292 byte)Available download formats
Dataset updated
Nov 17, 2023
Dataset provided by
EDI
Authors
Kimberly Komatsu; Meghan Avolio; Josep Padulles Cubino; Franziska Schrodt; Harold Auge; Jeannine Cavender-Bares; Adam Clark; Habacuc Flores-Moreno; Emily Grman; W Stanley Harpole; Jens Kattge; Kaitlin Kimmel; Sally Koerner; Lotte Korell; J Adam Langley; Tamara Münkemüller; Timothy Ohlert; Renske Onstein; Christiane Roscher; Nadejda Soudzilovskaia; Benton Taylor; Leho Tedersoo; Rosalie Terry; Kevin Wilcox
Area covered
Variables measured
genus, trait, family, source, species, DatasetID, Reference, DatabaseID, trait_value, ObservationID, and 4 more
Description

In our changing world, it is critical to understand and predict plant community responses to global change drivers. Plant functional traits promise to be a key predictive tool for many ecosystems, including grasslands, however their use requires both complete plant community and functional trait data. Yet, representation of these data in global databases is incredibly sparse, particularly beyond a handful of most used traits and common species. Here we present the CoRRE Trait Database, spanning 17 traits (9 categorical, 8 continuous) anticipated to predict species’ responses to global change for 4,079 vascular plant species across 173 plant families present in 390 grassland experiments from around the world. The database contains complete categorical trait records for all 4,079 plant species, obtained from a comprehensive literature search. Additionally, the database contains nearly complete coverage (99.97%) of species mean values for continuous traits for a subset of 2,927 plant species, predicted from observed trait data drawn from TRY and a variety of other plant trait databases using Bayesian Probabilistic Matrix Factorization (BHPMF) and multivariate imputation using chained equations (MICE). These data will shed light on mechanisms underlying population, community, and ecosystem responses to global change in grasslands worldwide.

Search
Clear search
Close search
Google apps
Main menu