6 datasets found
  1. e

    Children's questionnaire (data frame) - Dataset - B2FIND

    • b2find.eudat.eu
    Updated Jun 23, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Children's questionnaire (data frame) - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/93fcb56f-065d-502e-8941-49dd99608fb5
    Explore at:
    Dataset updated
    Jun 23, 2024
    Description

    "WeAreHere!" Children's questionnaire. This dataset includes: (1) the WaH children's questionnaire (20 questions including 5-point Likert scale questions, dichotomous questions and an open space for comments). The Catalan version (original), and the Spanish and English versions of the questionnaire can be found in this dataset in pdf format. (2) The data frame in xlsx format, with the children's answers to the questionnaire (a total of 3664 answers) and a reduced version of it for doing the regression (with the 5-point likert scale variable "ask for help" transformed into a dichotomous variable). (3) The data frame in xlsx format, with the children's answers to the questionnaire and the categorization of their comments (sheet 1), the data frame with only the MCA variables selected (sheet 2), and the categories and subcategories table (sheet 3). (4) The data analysis procedure for the regression, the component and multiple component analysis (R script).

  2. H

    GenBank publication network yearly data frame files

    • dataverse.harvard.edu
    Updated Nov 15, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jian Qin; Jeff Hemsley; Sarah Bratt (2021). GenBank publication network yearly data frame files [Dataset]. http://doi.org/10.7910/DVN/YGWKLA
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 15, 2021
    Dataset provided by
    Harvard Dataverse
    Authors
    Jian Qin; Jeff Hemsley; Sarah Bratt
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    1992 - 2018
    Dataset funded by
    NSF
    NIH
    Description

    GenBank publication network R data frame files by year from 1992-2018.

  3. Data and code for the manuscript - The hidden biodiversity knowledge split...

    • zenodo.org
    Updated Apr 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous Anonymous; Anonymous Anonymous (2025). Data and code for the manuscript - The hidden biodiversity knowledge split in biological collections [Dataset]. http://doi.org/10.5281/zenodo.15248066
    Explore at:
    Dataset updated
    Apr 19, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anonymous Anonymous; Anonymous Anonymous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Apr 19, 2025
    Description

    # General overview

    This repository contains the data and code used in the analysis of the
    manuscript entitled **"The hidden biodiversity knowledge split in biological collections"**.

    # Context

    Ecological and evolutionary processes generate biodiversity, yet how biodiversity data are organized and shared globally can shape our understanding of these processes. We show that name-bearing type specimens—the primary reference for species identity—of all freshwater and brackish fish species are predominantly housed in Global North museums, disconnected from their countries of origin. This geographical divide creates a ‘knowledge split’ with consequences for biodiversity science, particularly in the Global South, where researchers face barriers in studying native species’ name bearers housed abroad. Meanwhile, Global North collections remain flooded with non-native name bearers. We relate this imbalance to historical and socioeconomic factors, which ultimately restricts access to critical taxonomic reference materials and hinders global species documentation. To address this disparity, we call for international initiatives to promote fairer access to biological knowledge, including specimen repatriation, improved accessibility protocols for researchers in countries where specimens originated, and inclusive research partnerships.

    # Repository structure

    ## data

    This folder stores raw and processed data used to perform all the
    analysis presented in this study

    ### raw

    - `flow_period_region_country.csv` a data frame in the long format
    containing the flowing of NBT per regions per per time (50-year time
    frame). Variables:

    - `period` numeric variable representing 50-year time intervals

    - `region_type` character representing the name of the World Bank region
    of the country where the NBT was sourced

    - `country_type` character. A three letter code (alpha-3 ISO3166) representing
    the country of the museum where the NBT was sourced

    - `region_museum` character. Name of the World Bank region of the country
    where the NBT is housed

    - `country_museum` character. A three letter code (alpha-3 ISO3166) representing
    the country of the museum where the NBT is housed

    - `n` numeric. The number of NBT flowing from one country to another

    - `spp_native_distribution.csv` data frame in the long format
    containing the native composition at the country level. Variables:

    - `valid_name` character. The name of a species in the format genus_epithet
    according to the Catalog of Fishes

    - `country_distribution` character. Three letter code (alpha-3 ISO3166)
    indicating the name of the country where a species is native to

    - `region_distribution` character. The name of the region acording with
    World Bank where a species is native to

    - `spp_type_distribution.csv` data frame in the long format containing
    the composition of NBT by country. Variables:


    - `valid_name` character. The name of a species in the format genus_epithet
    according to the Catalog of Fishes

    - `country_distribution` character. Three letter code (alpha-3 ISO3166)
    indicating the name of the country where a species is housed

    - `region_distribution` character. The name of the region acording with
    World Bank where a species is housed

    - `bio-dem_data.csv` data frame with data downloaded from
    [Bio-Dem](https://bio-dem.surge.sh/#awards) containing information
    on biological and social information at the country level. Variables:

    - `country` character. A three letter code (alpha-3 ISO3166) representing
    a country

    - `records` numeric. Total number of species occurrence records from Global
    Biodiverity Facility (GBIF)

    - `records_per_area` numeric. Records per area from gbif

    - `yearsSinceIndependence` numeric. Years since independence for each country

    - `e_migdppc` numeric. GDP per capta

    - `museum_data.csv` data frame with museums' acronyms and the world
    region of each. Variables:

    - `code_museum` character. The acronym (three letter code) of the museum

    - `country_museum` character. A three letter code (alpha-3 ISO3166) representing
    a country

    - `region_museum` character. The name of the region acording with
    World Bank

    ### processed

    - `flow_region.csv` a data frame containing flowing of name bearers among world
    regions and the total number of name bearers derived from the source region

    - `flow_period_region.csv` a data frame with the number of name bearers between
    the world regions per 50-year time frame and the total number of name bearers
    in each time frame for each world region

    - `flow_period_region_prop.csv` a data frame with the number of name bearers,
    the Domestic Contribution and Domestic Retention between the world
    regions in a 50-year time frame - this is not used anymore in downstream analyses

    - `flow_region_prop.csv` data with the total number of species flowing
    between world regions, Domestic Contribution and Domestic Retention - this is no longer used in downstream analyses

    - `flow_country.csv` data frame with flowing information of name bearers among
    countries

    - `df_country_native.csv` data frame with the number of native species
    at the country level

    - `df_country_type.csv` data frame with the number of name bearers at the
    country level

    - `df_all_beta.csv` data frame with values of endemic deficit and non-endemic
    representation at the country level

    ## R

    The letters `D`, `A` and `V` represents scripts for, respectively, data
    processing (D), data analysis (A) and results visualization (V). The
    script sequence to reproduce the workflow is indicated by the numbers at
    the beginning of the name of the script file

    - [`01_D_data_preparation.qmd`](R/01_D_data_preparation.qmd) initial data preparation

    - [`02_A_beta-endemics-countries.qmd`](R/02_A_beta-endemics-countries.qmd) analysis of endemic deficit and non endemic representation. This script is used to calculate `native/endemic deficit` and `non-native/non-endemic representation`

    - [`03_D_data_preparation_models.qmd`](R/03_D_data_preparation_models.qmd) script used to build data frames that will be used in statistical models ([`04_A_model_NBTs.qmd`](R/04_A_model_NBTs.qmd))

    - [`04_A_model_NBTs.qmd`](R/04_A_model_NBTs.qmd) statistical models for the total number of name bearers, endemic deficit and non-endemic representation

    - [`05_V_chord_diagram_Fig1.qmd`](R/05_V_chord_diagram_Fig1.qmd) code used to produce circular flow diagram. This is the Figure 1 of the study

    - [`06_V_world_map_Fig1.qmd`](R/06_V_world_map_Fig1.qmd) code used to produce the world map in the Figure 1 of the main text

    - [08_V_beta_endemics_Fig3.qmd](R/08_V_beta_endemics_Fig3.qmd) code used to build Figure 2 of the main text

    - [`09_V_model_Fig4.qmd`](R/09_V_model_Fig4.qmd) code used to build the Figure 3 of the main text. This is the representation of the results of the models present in the script [04_A_model_NBTs.qmd](R/04_A_model_NBTs.qmd)

    - [`0010_Supplementary_analysis.qmd`](R/0010_Supplementary_analysis.qmd) code to produce all the tables and figures presented in the Supplementary material of this study

    ## output

    ### Figures

    In this folder you will find all figures used in the main text and supplementary material of this study

    `Fig1_flow_circle_plot.png` Figure with circular plots showing the flux of name bearers among regions of the world in a 50-year time window

    `Fig3_turnover_metrics_endemics.png` Cartogram with 3 maps showing the level of endemic deficit
    non-endemic representation and the combination of both metrics in a combined map

    `Fig4_models.png` Figure showing the predictions of the number of name bearers,
    endemic deficit and non-endemic representation for different predictors.
    This is derived from the statistical models

    #### Supp-material

    This folder contains the figures in the Supplementary material

    - `FigS1_native_richness.png` World map with countries coloured according to the number of native species richness according to the Catalog of Fishes

    - `FigS3_turnover_metrics.png` Cartogram with 3 maps showing the level of
    native deficit, non-native representation and the combination of both metrics in a combined map

  4. f

    Summary statistics of water source using practice, quality and sanitation...

    • plos.figshare.com
    xls
    Updated Jun 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shewayiref Geremew Gebremichael; Emebet Yismaw; Belete Dejen Tsegaw; Adeladilew Dires Shibeshi (2023). Summary statistics of water source using practice, quality and sanitation perception among urban households in North-West Ethiopia, 2019. [Dataset]. http://doi.org/10.1371/journal.pone.0239502.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 11, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Shewayiref Geremew Gebremichael; Emebet Yismaw; Belete Dejen Tsegaw; Adeladilew Dires Shibeshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Ethiopia
    Description

    Summary statistics of water source using practice, quality and sanitation perception among urban households in North-West Ethiopia, 2019.

  5. f

    Summary of analyses of molecular variance (AMOVA) in R. crataegifolius and...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JiYoung Yang; Jae-Hong Pak; Masayuki Maki; Seung-Chul Kim (2023). Summary of analyses of molecular variance (AMOVA) in R. crataegifolius and R. takesimensis, showing degree of freedom (df), sum of squares (SS), variance components, and the total variance contributed by each component (%) and its associated significance (n = 1,023 permutations). [Dataset]. http://doi.org/10.1371/journal.pone.0222707.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    JiYoung Yang; Jae-Hong Pak; Masayuki Maki; Seung-Chul Kim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary of analyses of molecular variance (AMOVA) in R. crataegifolius and R. takesimensis, showing degree of freedom (df), sum of squares (SS), variance components, and the total variance contributed by each component (%) and its associated significance (n = 1,023 permutations).

  6. f

    Supplement 2. R code and associated source files used to generate parameters...

    • wiley.figshare.com
    html
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marti J. Anderson; Daniel C. I. Walsh (2023). Supplement 2. R code and associated source files used to generate parameters and conduct simulations based on real ecological data sets. [Dataset]. http://doi.org/10.6084/m9.figshare.3568332.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Wiley
    Authors
    Marti J. Anderson; Daniel C. I. Walsh
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    File List All Files at Once: Supplement_2.zip (MD5: b3572e87400306d7b0543dffda7edbde)
    Readme_Supplement2.txt (MD5: a7635149c51da79c6149ddb607823866) Ekofisk.csv (MD5: 7bdbdccc30a9799e7a3b12a0bdce5dcb) Ekofisk_dw.csv (MD5: dac36e965f5993986597dbfa95193d5b) Norway.csv (MD5: 4f1f2a7aa772e5835588c06b35072b7c) FitParameters_Ekofisk.R (MD5: 874ef4203154247252db241fd4aac852) FitParameters_Norway.R (MD5: faa855f8e6adb007ef4ea505c11e576e) Sim_Functions.R (MD5: 3033e9b636ce595f636baf0903dbea32) Par_Ekofisk_A-B_ln.ct.eu.R (MD5: 929fd83818e148538c5d2d035bb6a41e) Par_Ekofisk_A-B_ln.l1.eu.R (MD5: b02ff1a10455719ad6a9ab21804dc255) Par_Ekofisk_A-B_ln.qp.bc0.R (MD5: f004464a598bb89f605c5eaad1b2ce9d) Par_Ekofisk_A-B_nb.ct.eu.df.R (MD5: 9b10ad9e9d5fd2699c9d9c13ab3e0357) Par_Ekofisk_A-B_nb.ct.eu.dv.R (MD5: 9b10ad9e9d5fd2699c9d9c13ab3e0357) Par_Ekofisk_A-B_nb.l1.eu.df.R (MD5: b8bf67dc1ea90b8f37485ea7e3cfd1a2) Par_Ekofisk_A-B_nb.l1.eu.dv.R (MD5: e0db4b3d8fbb249e6a68f36488be56e8) Par_Ekofisk_A-B_nb.qp.bc0.df.R (MD5: 803b0d8a608db0948442ee7f90b40e7c) Par_Ekofisk_A-B_nb.qp.bc0.dv.R (MD5: 06771f1f8f371dbae913472962622385) Par_Ekofisk_B-C_ln.ct.eu.R (MD5: 41c2ae7182309da5b966e31bf643b053) Par_Ekofisk_B-C_ln.l1.eu.R (MD5: 787945612f927046a9697376b1584b33) Par_Ekofisk_B-C_ln.qp.bc0.R (MD5: f26014b0c0b44b6df4901d35ac149918) Par_Ekofisk_B-C_nb.ct.eu.df.R (MD5: 42704fc56ef7a1e46344f26b28bf8f7d) Par_Ekofisk_B-C_nb.ct.eu.dv.R (MD5: 346cbb6a5dd248d5ba4d0e0d7ba289ef) Par_Ekofisk_B-C_nb.l1.eu.df.R (MD5: 9205b4131dc22c7a00bd7230a03d2e8c) Par_Ekofisk_B-C_nb.l1.eu.dv.R (MD5: 1d4ea625561ae6348b448dd69e6515ee) Par_Ekofisk_B-C_nb.qp.bc0.df.R (MD5: 04cfde95529281883a20519781b6d017) Par_Ekofisk_B-C_nb.qp.bc0.dv.R (MD5: 04cfde95529281883a20519781b6d017) Par_Ekofisk_C-D_ln.ct.eu.R (MD5: 926dcdc7862e812045fa1f7635fb25b3) Par_Ekofisk_C-D_ln.l1.eu.R (MD5: 07cb5226bb8f540296a2e62f3f3a8463) Par_Ekofisk_C-D_ln.qp.bc0.R (MD5: b849134b5d32d8aa0953b1dc67e04d0f) Par_Ekofisk_C-D_nb.ct.eu.df.R (MD5: 79ad6bfe4b5ef65df3bed6e0129c0622) Par_Ekofisk_C-D_nb.ct.eu.dv.R (MD5: 6ff650b99c02d958e5b81e1283d1063f) Par_Ekofisk_C-D_nb.l1.eu.df.R (MD5: 91de46426b68a362c45f0df2e8949d7c) Par_Ekofisk_C-D_nb.l1.eu.dv.R (MD5: 212eba37ffc2f7aa884c57383cb9498a) Par_Ekofisk_C-D_nb.qp.bc0.df.R (MD5: 9254a0cd1a47dceeebb1ee35ab037691) Par_Ekofisk_C-D_nb.qp.bc0.dv.R (MD5: 4b7bb4b8847d925ea30fe3526965c2ca) Par_Norway_1-2.R (MD5: f482a356fbf64ecb6ee016752374f8d6) Par_Norway_2-3.R (MD5: cf75e4fd951c8f985cdcabc59287206b) Par_Norway_3-4.R (MD5: 6f1548331162aede52b0b2d5265f5ed9) Par_Norway_4-5.R (MD5: 08e788943b5a8cd807265e4230f60149) Description Readme_Supplement2.txt contains information on how to run the simulations in R using the parameter files generated from real data sets and using the simulation functions provided in the file Sim_Functions.R Ekofisk.csv ontains the Ekofisk data set. Ekofisk_dw.csv contains dispersion weights for each species in the Ekofisk data set. Norway.csv contains the Norwegian continental shelf data set. FitParameters_Ekofisk.R contains the R code for calculating fitted parameters on which simulations based on the Ekofisk data set can be run. FitParameters_Norway.R contains the R code for calculating fitted parameters on which simulations based on the Ekofisk data set can be run. Sim_Functions.R contains R code to calculate the test-statistics and run the simulations for a given set of parameters. Par_Ekofisk_A-B_ln.ct.eu.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the lognormal distribution and using chi-squared distances. Par_Ekofisk_A-B_ln.l1.eu.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the lognormal distribution and using Euclidean distances on log(x+1)-transformed data. Par_Ekofisk_A-B_ln.qp.bc0.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the lognormal distribution and using Bray-Curtis dissimilarities fourth-root-transformed data. Par_Ekofisk_A-B_nb.ct.eu.df.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the negative binomial/Poisson distributions with fixed overdispersion parameters and using chi-squared distances. Par_Ekofisk_A-B_nb.ct.eu.dv.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the negative binomial/Poisson distributions with variable overdispersion parameters and using chi-squared distances. Par_Ekofisk_A-B_nb.l1.eu.df.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the negative binomial/Poisson distributions with fixed overdispersion parameters and using Euclidean distances on log(x+1)-transformed data. Par_Ekofisk_A-B_nb.l1.eu.dv.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the negative binomial/Poisson distributions with variable overdispersion parameters and using Euclidean distances on log(x+1)-transformed data. Par_Ekofisk_A-B_nb.qp.bc0.df.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the negative binomial/Poisson distributions with fixed overdispersion parameters and using Bray-Curtis dissimilarities fourth-root-transformed data. Par_Ekofisk_A-B_nb.qp.bc0.dv.R contains parameters in R code for running simulations based on the Ekofisk data to compare groups A and B on the basis of the negative binomial/Poisson distributions with variable overdispersion parameters and using Bray-Curtis dissimilarities fourth-root-transformed data....

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2024). Children's questionnaire (data frame) - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/93fcb56f-065d-502e-8941-49dd99608fb5

Children's questionnaire (data frame) - Dataset - B2FIND

Explore at:
Dataset updated
Jun 23, 2024
Description

"WeAreHere!" Children's questionnaire. This dataset includes: (1) the WaH children's questionnaire (20 questions including 5-point Likert scale questions, dichotomous questions and an open space for comments). The Catalan version (original), and the Spanish and English versions of the questionnaire can be found in this dataset in pdf format. (2) The data frame in xlsx format, with the children's answers to the questionnaire (a total of 3664 answers) and a reduced version of it for doing the regression (with the 5-point likert scale variable "ask for help" transformed into a dichotomous variable). (3) The data frame in xlsx format, with the children's answers to the questionnaire and the categorization of their comments (sheet 1), the data frame with only the MCA variables selected (sheet 2), and the categories and subcategories table (sheet 3). (4) The data analysis procedure for the regression, the component and multiple component analysis (R script).

Search
Clear search
Close search
Google apps
Main menu