93 datasets found
  1. i

    Scatterplots

    • ieee-dataport.org
    Updated Apr 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hennes Rave (2024). Scatterplots [Dataset]. https://ieee-dataport.org/documents/scatterplots
    Explore at:
    Dataset updated
    Apr 2, 2024
    Authors
    Hennes Rave
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    trends

  2. f

    Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm

    • plos.figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS Biology
    Authors
    Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.

  3. f

    A Study of Factors Related to Readership of Scientific Articles

    • stemfellowship.figshare.com
    png
    Updated Jan 29, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tony Xu; Shayan Khalili; Cynthia Deng (2017). A Study of Factors Related to Readership of Scientific Articles [Dataset]. http://doi.org/10.6084/m9.figshare.4595443.v1
    Explore at:
    pngAvailable download formats
    Dataset updated
    Jan 29, 2017
    Dataset provided by
    STEM Fellowship Big Data Challenge
    Authors
    Tony Xu; Shayan Khalili; Cynthia Deng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This report analysed the relationship between the number of Twitter and Mendeley views of scientific articles and the following factors: the GDP per capita of the country where the views came from, the title length, the publisher, the journal, and the subject of the article, in order to determine which factors affected how many views scientific articles accumulate. The purpose of this report was to help future researchers gain the most views possible on their published articles. The data provided by Altmetric containing 550,000 json files, was extracted and displayed into several data frames, each one containing a different variable. The variables were then compared to views articles gained using a combination of bubble graphs, linear regression models, and scatter plots. From the results gathered, it could be seen that although the title length, the GDP per capita, and the subject of an article all affected the total views an article amassed, the journal and publisher had no notable effect on the views. In particular, title length within 41-100 characters had the most noticeable effect on the number of readers per article. Subjects, while showing that the more articles published about them brought in more total views, did not have a strong effect on the average number of viewers per article.

  4. m

    Ultimate_Analysis

    • data.mendeley.com
    Updated Jan 28, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akara Kijkarncharoensin (2022). Ultimate_Analysis [Dataset]. http://doi.org/10.17632/t8x96g88p3.2
    Explore at:
    Dataset updated
    Jan 28, 2022
    Authors
    Akara Kijkarncharoensin
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This database studies the performance inconsistency on the biomass HHV ultimate analysis. The research null hypothesis is the consistency in the rank of a biomass HHV model. Fifteen biomass models are trained and tested in four datasets. In each dataset, the rank invariability of these 15 models indicates the performance consistency.

    The database includes the datasets and source codes to analyze the performance consistency of the biomass HHV. These datasets are stored in tabular on an excel workbook. The source codes are the biomass HHV machine learning model through the MATLAB Objected Orient Program (OOP). These machine learning models consist of eight regressions, four supervised learnings, and three neural networks.

    An excel workbook, "BiomassDataSetUltimate.xlsx," collects the research datasets in six worksheets. The first worksheet, "Ultimate," contains 908 HHV data from 20 pieces of literature. The names of the worksheet column indicate the elements of the ultimate analysis on a % dry basis. The HHV column refers to the higher heating value in MJ/kg. The following worksheet, "Full Residuals," backups the model testing's residuals based on the 20-fold cross-validations. The article (Kijkarncharoensin & Innet, 2021) verifies the performance consistency through these residuals. The other worksheets present the literature datasets implemented to train and test the model performance in many pieces of literature.

    A file named "SourceCodeUltimate.rar" collects the MATLAB machine learning models implemented in the article. The list of the folders in this file is the class structure of the machine learning models. These classes extend the features of the original MATLAB's Statistics and Machine Learning Toolbox to support, e.g., the k-fold cross-validation. The MATLAB script, name "runStudyUltimate.m," is the article's main program to analyze the performance consistency of the biomass HHV model through the ultimate analysis. The script instantly loads the datasets from the excel workbook and automatically fits the biomass model through the OOP classes.

    The first section of the MATLAB script generates the most accurate model by optimizing the model's higher parameters. It takes a few hours for the first run to train the machine learning model via the trial and error process. The trained models can be saved in MATLAB .mat file and loaded back to the MATLAB workspace. The remaining script, separated by the script section break, performs the residual analysis to inspect the performance consistency. Furthermore, the figure of the biomass data in the 3D scatter plot, and the box plots of the prediction residuals are exhibited. Finally, the interpretations of these results are examined in the author's article.

    Reference : Kijkarncharoensin, A., & Innet, S. (2022). Performance inconsistency of the Biomass Higher Heating Value (HHV) Models derived from Ultimate Analysis [Manuscript in preparation]. University of the Thai Chamber of Commerce.

  5. f

    pone.0294455.t001 - Impact of debt, reserves, and political stability on Sri...

    • figshare.com
    xls
    Updated Nov 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Candauda Arachchige Saliya (2023). pone.0294455.t001 - Impact of debt, reserves, and political stability on Sri Lanka’s financial crisis [Dataset]. http://doi.org/10.1371/journal.pone.0294455.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Nov 17, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Candauda Arachchige Saliya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Sri Lanka
    Description

    pone.0294455.t001 - Impact of debt, reserves, and political stability on Sri Lanka’s financial crisis

  6. data-scatter-plots

    • figshare.com
    hdf
    Updated Jan 17, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Katharina Buelow (2019). data-scatter-plots [Dataset]. http://doi.org/10.6084/m9.figshare.7599809.v1
    Explore at:
    hdfAvailable download formats
    Dataset updated
    Jan 17, 2019
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Katharina Buelow
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Data used for the scatter-plots in the ems-2018 extended abstract

  7. Flow manipulation in a Hele-Shaw cell with an electrically-controlled...

    • zenodo.org
    • data.niaid.nih.gov
    tiff
    Updated Jul 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carl Brown; Carl Brown (2024). Flow manipulation in a Hele-Shaw cell with an electrically-controlled viscous obstruction [Dataset]. http://doi.org/10.5281/zenodo.11173025
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jul 5, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Carl Brown; Carl Brown
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Apr 5, 2024
    Description

    The dataset named “Dataset: Flow manipulation in a Hele-Shaw cell with an electrically-controlled viscous obstruction” consists of Raw time-averaged images, which are generated by sequence of 100 frames extracted from experimental videos captured at various voltages (5V, 10V, 15V, 20V, and 50V), and saved as .tif files. These images were analysed to produce the data used in figure 2 and 3 of the article. The dataset also includes two Excel files named as “Figure 2_Experimental data.xlsx” and “Figure 3_Experimental data.xlsx”. These excel files contain the data used to create the experimental plots shown in Figure 2C, and Figure 3 of the research article respectively.

    In the “Figure 2C_Experimental Data.xlsx” excel file, each sheet corresponds to a different voltage value shown in the figure, and contains three columns: A, B, and C. which represents the X-location, Y-location, and orientation angle (in degrees) of the experimental plot (red rods in the figure) respectively. This plot is overlaid on the model data (black rods in the figure) and displayed in Figure 2C given in the article.

    The “Figure 3_Experimental data.xlsx” file contains three sheets for each voltage (5V, 10V, 15V, 20V, and 50V) and each of these three sheets provide data at three different X-locations (X=579, X= 1079, and X= 1779) as a function of Y-location as shown in the Figure 3 of the article. Each sheet has five columns: A, B, C, D, and E. These columns represent the X-location, Y-location, Orientation angle (in degrees), Coherency, and Error in the orientation angle (in degrees), respectively. These data points are used to create the experimental scatter plot shown in Figure 3 of the article.

  8. Data from: Climate change threatens crop diversity at low latitudes

    • zenodo.org
    bin, tiff
    Updated Mar 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sara Heikonen; Sara Heikonen; Matias Heino; Matias Heino; Mika Jalava; Mika Jalava; Stefan Siebert; Stefan Siebert; Daniel Viviroli; Daniel Viviroli; Matti Kummu; Matti Kummu (2025). Data from: Climate change threatens crop diversity at low latitudes [Dataset]. http://doi.org/10.5281/zenodo.14801623
    Explore at:
    bin, tiffAvailable download formats
    Dataset updated
    Mar 4, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Sara Heikonen; Sara Heikonen; Matias Heino; Matias Heino; Mika Jalava; Mika Jalava; Stefan Siebert; Stefan Siebert; Daniel Viviroli; Daniel Viviroli; Matti Kummu; Matti Kummu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains data on the effect of global warming (1.5°C to 4°C) on the current production of 30 major food crops, as well as on the potential diversity of food crops on the current global cropland.

    The journal article that this dataset is supplement to: https://doi.org/10.1038/s43016-025-01135-w

    Contents

    • Source data files (.xlsx) for bar plots and scatter plots in the journal article “Climate change threatens crop diversity at low latitudes"
    • Source data files (.tiff) for map image figures in the journal article “Climate change threatens crop diversity at low latitudes". Note: these are parts of the other .tiff files listed below, and it is mentioned in the end of the file descriptions which layer produces each map figure image in the article and supplement.
    • Raster files (.tiff) of the lowest warming levels that would push 25%, 50%, and 75% of current food crop production in grid cell outside the SCS
    • Raster files (.tiff) of the total potential diversity for all food crops and for food crop groups, covering baseline climate conditions and the four global warming levels
    • Raster files (.tiff) of the change in potential diversity compared to baseline for all food crops and for food crop groups at the four global warming levels.
    • Raster files (.tiff) of the areas within and outside the crop specific Safe Climatic Spaces of 30 food crop types
    • Raster files (.tiff) of the difference in areas within the crop specific Safe Climatic Spaces when using seasonal versus annual method for defining the Safe Climatic Space

    See Data description.docx for detailed descriptions of the data files.

  9. d

    Data from: Trade-offs between growth rate, tree size and lifespan of...

    • datadryad.org
    • data.niaid.nih.gov
    • +2more
    zip
    Updated May 26, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christof Bigler (2016). Trade-offs between growth rate, tree size and lifespan of mountain pine (Pinus montana) in the Swiss National Park [Dataset]. http://doi.org/10.5061/dryad.d2680
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 26, 2016
    Dataset provided by
    Dryad
    Authors
    Christof Bigler
    Time period covered
    2016
    Area covered
    Switzerland, Swiss National Park, canton of Grisons
    Description

    A within-species trade-off between growth rates and lifespan has been observed across different taxa of trees, however, there is some uncertainty whether this trade-off also applies to shade-intolerant tree species. The main objective of this study was to investigate the relationships between radial growth, tree size and lifespan of shade-intolerant mountain pines. For 200 dead standing mountain pines (Pinus montana) located along gradients of aspect, slope steepness and elevation in the Swiss National Park, radial annual growth rates and lifespan were reconstructed. While early growth (i.e. mean tree-ring width over the first 50 years) correlated positively with diameter at the time of tree death, a negative correlation resulted with lifespan, i.e. rapidly growing mountain pines face a trade-off between reaching a large diameter at the cost of early tree death. Slowly growing mountain pines may reach a large diameter and a long lifespan, but risk to die young at a small size. Early gro...

  10. Z

    Silt content (silttotal) soil maps of the Upper Colorado River Basin

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Travis Nauman (2024). Silt content (silttotal) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2549860
    Explore at:
    Dataset updated
    Jul 25, 2024
    Dataset authored and provided by
    Travis Nauman
    Area covered
    Colorado River
    Description

    The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

    Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

    Associated publication:

    Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

    Repository includes maps of silt content (siltdtotal) as defined by United States soil survey program. Silt content is estimated by percent weight of the <2mm portion of the soil.

    These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

    This data should be used in combination with a soil depth or depth to restriction layer map (both layers that will be released soon as part of this project) to eliminate areas mapped at deeper depths than the soil actually goes. This is a limitation of this data which will hopefully be updated in future updates.

    The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

    Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

    File Name Details:

    ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

    Elements are separated by underscore (_) in the following sequence:

    property_r_depth_cm_geometry_model_additional_elements.extension

    Example: silttotal_r_0_cm_2D_QRF.tif

    Indicates silt content (silttotal) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

    The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

    _95PI_h: Indicates the layer is the upper 95% prediction interval value.

    _95PI_l: Indicates the layer is the lower 95% prediction interval value.

    _95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

    References

    Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma

  11. f

    Correlations.

    • figshare.com
    xls
    Updated Nov 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Candauda Arachchige Saliya (2023). Correlations. [Dataset]. http://doi.org/10.1371/journal.pone.0294455.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Nov 17, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Candauda Arachchige Saliya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This study attempts to explore the impact of external debt ($Debt), foreign reserves ($Reserves), and political stability & absence of violence/terrorism (PS&AVT) on the current financial crisis in Sri Lanka. Using data from 1996 to 2022 obtained from the World Bank (WB) and the Central Bank of Sri Lanka (CBSL), a regression analysis is conducted, with a composite variable named "CRISIS," which accounts for interest rate, inflation, currency devaluation adjusted to GDP growth, as the dependent variable. The findings indicate that, collectively, these predictors significantly contribute to explaining the variance in the financial crisis, although their impact is relatively minor. While the direct influence of PS&AVT on the financial crisis is not statistically significant, it indirectly affects the crisis through its considerable impact on debt and reserves. Granger causality tests showed predictive value for $Debt and $Reserve in relation to CRISIS, but the reverse relationship was not significant. Regression analysis using the error term and scatter plots supports the absence of endogeneity issues in the model. These findings suggest that while external debt and foreign reserves are more directly related to financial crises, political stability and the absence of violence/terrorism can influence the crisis indirectly through their effects on debt accumulation and reserve levels. This study represents a pioneering effort in investigating the impact of external debt, foreign reserves, and political stability on the financial crises in Sri Lanka. By utilizing a comprehensive dataset and applying a regression analysis, it sheds light on the complex interactions between these variables and their influence on the country’s financial stability.

  12. Z

    Organic matter content (om) soil maps of the Upper Colorado River Basin

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Travis Nauman (2024). Organic matter content (om) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2550935
    Explore at:
    Dataset updated
    Jul 22, 2024
    Dataset authored and provided by
    Travis Nauman
    Area covered
    Colorado River
    Description

    The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

    Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

    Associated publication:

    Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

    UPDATE: WE FOUND A RENDERING ERROR IN MANY AREAS OF THE 5 CM MAP. WE HAVE RECREATED THE MAP AND INCLUDED IN THIS VERSION OF THE REPOSITORY.

    Repository includes maps of organic matter content (% wt) as defined by United States soil survey program.

    These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

    This data should be used in combination with a soil depth or depth to restriction layer map (both layers that will be released soon as part of this project) to eliminate areas mapped at deeper depths than the soil actually goes. This is a limitation of this data which will hopefully be updated in future updates.

    The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

    Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

    File Name Details:

    ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (_CV_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000). Predictions are also evaluated with the U.S. soil survey laboratory database soil organic carbon (SOC) data. The SOC measurements were coverted to OM matter values using the common 1.724 conversion factor. The converted OM values are compared to predicted OM values using an accuracy plot (OM_SOC_plots.tif).

    Elements are separated by underscore (_) in the following sequence:

    property_r_depth_cm_geometry_model_additional_elements.extension

    Example: om_r_0_cm_2D_QRF_bt.tif

    Indicates soil organic matter content (om) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions. The _bt indicates that the map has been back transformed from ln or sqrt transformation used in modeling.

    The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

    _95PI_h: Indicates the layer is the upper 95% prediction interval value.

    _95PI_l: Indicates the layer is the lower 95% prediction interval value.

    _95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

    References

    Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma

  13. Z

    Gypsum content (gypsum) soil maps of the Upper Colorado River Basin

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Travis Nauman (2024). Gypsum content (gypsum) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2550911
    Explore at:
    Dataset updated
    Jul 25, 2024
    Dataset authored and provided by
    Travis Nauman
    Area covered
    Colorado River
    Description

    The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

    Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

    Associated publication:

    Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

    Repository includes maps of soil gypsum content (% wt of the <20 mm fraction) as defined by United States soil survey program.

    These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

    This data should be used in combination with a soil depth or depth to restriction layer map (both layers that will be released soon as part of this project) to eliminate areas mapped at deeper depths than the soil actually goes. This is a limitation of this data which will hopefully be updated in future updates.

    The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

    Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

    File Name Details:

    ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

    Elements are separated by underscore (_) in the following sequence:

    property_r_depth_cm_geometry_model_additional_elements.extension

    Example: gypsum_r_0_cm_2D_QRF_bt.tif

    Indicates soil gypsum content (gypsum) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions. The _bt indicates that the map has been back transformed from ln or sqrt transformation used in modeling.

    The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

    _95PI_h: Indicates the layer is the upper 95% prediction interval value.

    _95PI_l: Indicates the layer is the lower 95% prediction interval value.

    _95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

    References

    Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma

  14. d

    Selective Raster Graphics - Dataset - data.govt.nz - discover and use data

    • catalogue.data.govt.nz
    Updated May 24, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Selective Raster Graphics - Dataset - data.govt.nz - discover and use data [Dataset]. https://catalogue.data.govt.nz/dataset/oai-figshare-com-article-6349496
    Explore at:
    Dataset updated
    May 24, 2018
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This report explores ways to render specific components of an R plot in raster format, when the overall format of the plot is vector. For example, we demonstrate ways to draw raster data symbols within a PDF scatter plot. A general solution is provided by the grid.rasterize function from the R package ‘rasterize’.

  15. f

    Additional file 1 of Logistic regression has similar performance to...

    • springernature.figshare.com
    • commons.datacite.org
    zip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anita L. Lynam; John M. Dennis; Katharine R. Owen; Richard A. Oram; Angus G. Jones; Beverley M. Shields; Lauric A. Ferrat (2023). Additional file 1 of Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults [Dataset]. http://doi.org/10.6084/m9.figshare.12569144.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    Anita L. Lynam; John M. Dennis; Katharine R. Owen; Richard A. Oram; Angus G. Jones; Beverley M. Shields; Lauric A. Ferrat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 1: Figure S1. Flow diagram of participants through the model development stages. T1D: type 1 diabetes, T2D: type 2 diabetes. Figure S2. ROC AUC plots obtained using external validation dataset for seven prediction models. Legend: Solid lines: black = Support Vector Machine, dark grey = Logistic Regression, light grey = Random Forest. Dotted lines: black = Neural Network, dark grey = K-Nearest Neighbours, light grey = Gradient Boosting Machine. Figure S3. Correlation coefficient matrix and scatter plot of model predictions obtained from external test validation data.

  16. Fine sand content (sandfine; 0.10 to 0.25 mm) soil maps of the Upper...

    • zenodo.org
    Updated Jul 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Travis Nauman; Travis Nauman (2024). Fine sand content (sandfine; 0.10 to 0.25 mm) soil maps of the Upper Colorado River Basin [Dataset]. http://doi.org/10.5281/zenodo.2547610
    Explore at:
    Dataset updated
    Jul 25, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Travis Nauman; Travis Nauman
    Area covered
    Colorado River
    Description

    The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

    Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

    Associated publication:

    Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

    Repository includes maps of fine sand content (sandfine; 0.10 to 0.25 mm) as defined by United States soil survey program. Content is calculated on the fine earth fraction (<2mm).

    These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

    The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

    Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

    File Name Details:

    ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

    Elements are separated by underscore (_) in the following sequence:

    property_r_depth_cm_geometry_model_additional_elements.extension

    Example: sandfine_r_0_cm_2D_QRF.tif

    Indicates fine sand content (sandfine) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

    The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

    _95PI_h: Indicates the layer is the upper 95% prediction interval value.

    _95PI_l: Indicates the layer is the lower 95% prediction interval value.

    _95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

    References

    Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma.

  17. Question 2 for TMU PhD candidate

    • figshare.com
    xlsx
    Updated Jan 5, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seyed Mahmoud Ghasempouri (2022). Question 2 for TMU PhD candidate [Dataset]. http://doi.org/10.6084/m9.figshare.17883914.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jan 5, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Seyed Mahmoud Ghasempouri
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Dear TMU PhD candidate,

    Please run principal component analysis (PCA) for dataset as below:

    a) Calculate Eigenvalue for PC1 and PC2.

    b) Draw the scatter plot.

    c) Which loading plots are in the opposite direction to the others?

    Thank You

  18. Carbonate (caco3) soil maps of the Upper Colorado River Basin

    • zenodo.org
    • data.niaid.nih.gov
    Updated Jul 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Travis Nauman; Travis Nauman (2024). Carbonate (caco3) soil maps of the Upper Colorado River Basin [Dataset]. http://doi.org/10.5281/zenodo.2546935
    Explore at:
    Dataset updated
    Jul 25, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Travis Nauman; Travis Nauman
    Area covered
    Colorado River
    Description

    The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

    Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

    Associated publication:

    Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

    Repository includes maps of carbonate content (caco3) as defined by United States soil survey program. Content is calculated on the fine earth fraction (<2mm).

    These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

    The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

    Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

    File Name Details:

    ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

    Elements are separated by underscore (_) in the following sequence:

    property_r_depth_cm_geometry_model_additional_elements.extension

    Example: caco3_r_0_cm_2D_QRF_bt.tif

    Indicates carbonate (caco3) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest that is has gone through transfomation and backtransformation (_bt) in the modeling process. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

    The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

    _95PI_h: Indicates the layer is the upper 95% prediction interval value.

    _95PI_l: Indicates the layer is the lower 95% prediction interval value.

    _95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

    References

    Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma.

  19. Z

    Soil pH (ph1to1h20, 1:1 water method) soil maps of the Upper Colorado River...

    • data.niaid.nih.gov
    Updated Jul 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Travis Nauman (2024). Soil pH (ph1to1h20, 1:1 water method) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2548132
    Explore at:
    Dataset updated
    Jul 25, 2024
    Dataset authored and provided by
    Travis Nauman
    Area covered
    Colorado River
    Description

    The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

    Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

    Associated publication:

    Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

    Repository includes maps of 1:1 soil pH as defined by United States soil survey program.

    These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

    The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

    Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

    File Name Details:

    ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

    Elements are separated by underscore (_) in the following sequence:

    property_r_depth_cm_geometry_model_additional_elements.extension

    Example: ph1to1h2o_r_0_cm_2D_QRF.tif

    Indicates soil pH in 1:1 water mixture (ph1to1h2o) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

    The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

    _95PI_h: Indicates the layer is the upper 95% prediction interval value.

    _95PI_l: Indicates the layer is the lower 95% prediction interval value.

    _95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

    References

    Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma

  20. f

    Data from: Detecting desertification in different years and rainfall regimes...

    • scielo.figshare.com
    • figshare.com
    tiff
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thiago Costa dos Santos; Adunias dos Santos Teixeira; Fabrício da Silva Terra; Luis Clenio Jário Moreira; Raul Shiso Toma (2023). Detecting desertification in different years and rainfall regimes by 2D Scatter Plot [Dataset]. http://doi.org/10.6084/m9.figshare.19904126.v1
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    SciELO journals
    Authors
    Thiago Costa dos Santos; Adunias dos Santos Teixeira; Fabrício da Silva Terra; Luis Clenio Jário Moreira; Raul Shiso Toma
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT The desertification process causes soil degradation and a reduction in vegetation. The absence of visualisation techniques and the broad spatial and temporal dimension of the data hampers the identification of desertification and rapid decision-making by multidisciplinary teams. The 2D Scatter Plot is a two-dimensional visual analysis of reflectances in the red (630 - 690 nm) and near-infrared (760 - 900 nm) bands to visualise the spectral response of the vegetation. The hypothesis of this study is that visualising the reflectances of the vegetation by means of a 2D scatter plot will allow desertification to be inferred. The aim of this study was to identify desertified areas and characterise the spatial and temporal dynamics of the vegetation and soil during dry (DP) and rainy (RP) periods between 2000 and 2008, using a 2D scatter plot. The 2D scatter plot generated by the Envi® 4.8 software and the reflectances in bands 3 and 4 of the TM5 sensor were used within communities in the Irauçuba hub (Ceará, Brazil). The concentration densities of the near-infrared reflectances of the vegetation pixels were observed. Each community presented pixel concentrations with reflectances of less than 0.4 (40%) during each of the periods under evaluation, indicating little vegetation development, with further degradation caused by deforestation, the use of fire and overgrazing. The 2D scatter plot was able to show vegetation with low reflectance in the near infrared during both dry and rainy periods between 2000 and 2008, thereby inferring the occurrence of desertification.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hennes Rave (2024). Scatterplots [Dataset]. https://ieee-dataport.org/documents/scatterplots

Scatterplots

Explore at:
Dataset updated
Apr 2, 2024
Authors
Hennes Rave
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

trends

Search
Clear search
Close search
Google apps
Main menu