93 datasets found

i
Scatterplots
ieee-dataport.org
Updated Apr 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hennes Rave (2024). Scatterplots [Dataset]. https://ieee-dataport.org/documents/scatterplots
Explore at:
Dataset updated
Apr 2, 2024
Authors
Hennes Rave
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
trends
f
Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm
plos.figshare.com
docx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pbio.1002128
Dataset updated
May 31, 2023
Dataset provided by
PLOS Biology
Authors
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.
f
A Study of Factors Related to Readership of Scientific Articles
stemfellowship.figshare.com
png
Updated Jan 29, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tony Xu; Shayan Khalili; Cynthia Deng (2017). A Study of Factors Related to Readership of Scientific Articles [Dataset]. http://doi.org/10.6084/m9.figshare.4595443.v1
Explore at:
pngAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.4595443.v1
Dataset updated
Jan 29, 2017
Dataset provided by
STEM Fellowship Big Data Challenge
Authors
Tony Xu; Shayan Khalili; Cynthia Deng
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This report analysed the relationship between the number of Twitter and Mendeley views of scientific articles and the following factors: the GDP per capita of the country where the views came from, the title length, the publisher, the journal, and the subject of the article, in order to determine which factors affected how many views scientific articles accumulate. The purpose of this report was to help future researchers gain the most views possible on their published articles. The data provided by Altmetric containing 550,000 json files, was extracted and displayed into several data frames, each one containing a different variable. The variables were then compared to views articles gained using a combination of bubble graphs, linear regression models, and scatter plots. From the results gathered, it could be seen that although the title length, the GDP per capita, and the subject of an article all affected the total views an article amassed, the journal and publisher had no notable effect on the views. In particular, title length within 41-100 characters had the most noticeable effect on the number of readers per article. Subjects, while showing that the more articles published about them brought in more total views, did not have a strong effect on the average number of viewers per article.
m
Ultimate_Analysis
data.mendeley.com
Updated Jan 28, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Akara Kijkarncharoensin (2022). Ultimate_Analysis [Dataset]. http://doi.org/10.17632/t8x96g88p3.2
Explore at:
Unique identifier
https://doi.org/10.17632/t8x96g88p3.2
Dataset updated
Jan 28, 2022
Authors
Akara Kijkarncharoensin
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This database studies the performance inconsistency on the biomass HHV ultimate analysis. The research null hypothesis is the consistency in the rank of a biomass HHV model. Fifteen biomass models are trained and tested in four datasets. In each dataset, the rank invariability of these 15 models indicates the performance consistency.

The database includes the datasets and source codes to analyze the performance consistency of the biomass HHV. These datasets are stored in tabular on an excel workbook. The source codes are the biomass HHV machine learning model through the MATLAB Objected Orient Program (OOP). These machine learning models consist of eight regressions, four supervised learnings, and three neural networks.

An excel workbook, "BiomassDataSetUltimate.xlsx," collects the research datasets in six worksheets. The first worksheet, "Ultimate," contains 908 HHV data from 20 pieces of literature. The names of the worksheet column indicate the elements of the ultimate analysis on a % dry basis. The HHV column refers to the higher heating value in MJ/kg. The following worksheet, "Full Residuals," backups the model testing's residuals based on the 20-fold cross-validations. The article (Kijkarncharoensin & Innet, 2021) verifies the performance consistency through these residuals. The other worksheets present the literature datasets implemented to train and test the model performance in many pieces of literature.

A file named "SourceCodeUltimate.rar" collects the MATLAB machine learning models implemented in the article. The list of the folders in this file is the class structure of the machine learning models. These classes extend the features of the original MATLAB's Statistics and Machine Learning Toolbox to support, e.g., the k-fold cross-validation. The MATLAB script, name "runStudyUltimate.m," is the article's main program to analyze the performance consistency of the biomass HHV model through the ultimate analysis. The script instantly loads the datasets from the excel workbook and automatically fits the biomass model through the OOP classes.

The first section of the MATLAB script generates the most accurate model by optimizing the model's higher parameters. It takes a few hours for the first run to train the machine learning model via the trial and error process. The trained models can be saved in MATLAB .mat file and loaded back to the MATLAB workspace. The remaining script, separated by the script section break, performs the residual analysis to inspect the performance consistency. Furthermore, the figure of the biomass data in the 3D scatter plot, and the box plots of the prediction residuals are exhibited. Finally, the interpretations of these results are examined in the author's article.

Reference : Kijkarncharoensin, A., & Innet, S. (2022). Performance inconsistency of the Biomass Higher Heating Value (HHV) Models derived from Ultimate Analysis [Manuscript in preparation]. University of the Thai Chamber of Commerce.
f
pone.0294455.t001 - Impact of debt, reserves, and political stability on Sri...
figshare.com
xls
Updated Nov 17, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Candauda Arachchige Saliya (2023). pone.0294455.t001 - Impact of debt, reserves, and political stability on Sri Lanka’s financial crisis [Dataset]. http://doi.org/10.1371/journal.pone.0294455.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0294455.t001
Dataset updated
Nov 17, 2023
Dataset provided by
PLOS ONE
Authors
Candauda Arachchige Saliya
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Sri Lanka
Description
pone.0294455.t001 - Impact of debt, reserves, and political stability on Sri Lanka’s financial crisis
data-scatter-plots
figshare.com
hdf
Updated Jan 17, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Katharina Buelow (2019). data-scatter-plots [Dataset]. http://doi.org/10.6084/m9.figshare.7599809.v1
Explore at:
hdfAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7599809.v1
Dataset updated
Jan 17, 2019
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Katharina Buelow
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Data used for the scatter-plots in the ems-2018 extended abstract
Flow manipulation in a Hele-Shaw cell with an electrically-controlled...
zenodo.org
data.niaid.nih.gov
tiff
Updated Jul 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carl Brown; Carl Brown (2024). Flow manipulation in a Hele-Shaw cell with an electrically-controlled viscous obstruction [Dataset]. http://doi.org/10.5281/zenodo.11173025
Explore at:
tiffAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.11173025
Dataset updated
Jul 5, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Carl Brown; Carl Brown
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Apr 5, 2024
Description
The dataset named “Dataset: Flow manipulation in a Hele-Shaw cell with an electrically-controlled viscous obstruction” consists of Raw time-averaged images, which are generated by sequence of 100 frames extracted from experimental videos captured at various voltages (5V, 10V, 15V, 20V, and 50V), and saved as .tif files. These images were analysed to produce the data used in figure 2 and 3 of the article. The dataset also includes two Excel files named as “Figure 2_Experimental data.xlsx” and “Figure 3_Experimental data.xlsx”. These excel files contain the data used to create the experimental plots shown in Figure 2C, and Figure 3 of the research article respectively.

In the “Figure 2C_Experimental Data.xlsx” excel file, each sheet corresponds to a different voltage value shown in the figure, and contains three columns: A, B, and C. which represents the X-location, Y-location, and orientation angle (in degrees) of the experimental plot (red rods in the figure) respectively. This plot is overlaid on the model data (black rods in the figure) and displayed in Figure 2C given in the article.

The “Figure 3_Experimental data.xlsx” file contains three sheets for each voltage (5V, 10V, 15V, 20V, and 50V) and each of these three sheets provide data at three different X-locations (X=579, X= 1079, and X= 1779) as a function of Y-location as shown in the Figure 3 of the article. Each sheet has five columns: A, B, C, D, and E. These columns represent the X-location, Y-location, Orientation angle (in degrees), Coherency, and Error in the orientation angle (in degrees), respectively. These data points are used to create the experimental scatter plot shown in Figure 3 of the article.
Data from: Climate change threatens crop diversity at low latitudes
zenodo.org
bin, tiff
Updated Mar 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sara Heikonen; Sara Heikonen; Matias Heino; Matias Heino; Mika Jalava; Mika Jalava; Stefan Siebert; Stefan Siebert; Daniel Viviroli; Daniel Viviroli; Matti Kummu; Matti Kummu (2025). Data from: Climate change threatens crop diversity at low latitudes [Dataset]. http://doi.org/10.5281/zenodo.14801623
Explore at:
bin, tiffAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14801623
Dataset updated
Mar 4, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Sara Heikonen; Sara Heikonen; Matias Heino; Matias Heino; Mika Jalava; Mika Jalava; Stefan Siebert; Stefan Siebert; Daniel Viviroli; Daniel Viviroli; Matti Kummu; Matti Kummu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains data on the effect of global warming (1.5°C to 4°C) on the current production of 30 major food crops, as well as on the potential diversity of food crops on the current global cropland.

The journal article that this dataset is supplement to: https://doi.org/10.1038/s43016-025-01135-w

Contents

Source data files (.xlsx) for bar plots and scatter plots in the journal article “Climate change threatens crop diversity at low latitudes"

Source data files (.tiff) for map image figures in the journal article “Climate change threatens crop diversity at low latitudes". Note: these are parts of the other .tiff files listed below, and it is mentioned in the end of the file descriptions which layer produces each map figure image in the article and supplement.

Raster files (.tiff) of the lowest warming levels that would push 25%, 50%, and 75% of current food crop production in grid cell outside the SCS

Raster files (.tiff) of the total potential diversity for all food crops and for food crop groups, covering baseline climate conditions and the four global warming levels

Raster files (.tiff) of the change in potential diversity compared to baseline for all food crops and for food crop groups at the four global warming levels.

Raster files (.tiff) of the areas within and outside the crop specific Safe Climatic Spaces of 30 food crop types

Raster files (.tiff) of the difference in areas within the crop specific Safe Climatic Spaces when using seasonal versus annual method for defining the Safe Climatic Space

See Data description.docx for detailed descriptions of the data files.
d
Data from: Trade-offs between growth rate, tree size and lifespan of...
datadryad.org
data.niaid.nih.gov
+2more
zip
Updated May 26, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Christof Bigler (2016). Trade-offs between growth rate, tree size and lifespan of mountain pine (Pinus montana) in the Swiss National Park [Dataset]. http://doi.org/10.5061/dryad.d2680
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.d2680
Dataset updated
May 26, 2016
Dataset provided by
Dryad
Authors
Christof Bigler
Time period covered
2016
Area covered
Switzerland, Swiss National Park, canton of Grisons
Description
A within-species trade-off between growth rates and lifespan has been observed across different taxa of trees, however, there is some uncertainty whether this trade-off also applies to shade-intolerant tree species. The main objective of this study was to investigate the relationships between radial growth, tree size and lifespan of shade-intolerant mountain pines. For 200 dead standing mountain pines (Pinus montana) located along gradients of aspect, slope steepness and elevation in the Swiss National Park, radial annual growth rates and lifespan were reconstructed. While early growth (i.e. mean tree-ring width over the first 50 years) correlated positively with diameter at the time of tree death, a negative correlation resulted with lifespan, i.e. rapidly growing mountain pines face a trade-off between reaching a large diameter at the cost of early tree death. Slowly growing mountain pines may reach a large diameter and a long lifespan, but risk to die young at a small size. Early gro...
Z
Silt content (silttotal) soil maps of the Upper Colorado River Basin
data.niaid.nih.gov
zenodo.org
Updated Jul 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Travis Nauman (2024). Silt content (silttotal) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2549860
Explore at:
Dataset updated
Jul 25, 2024
Dataset authored and provided by
Travis Nauman
Area covered
Colorado River
Description
The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

Associated publication:

Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

Repository includes maps of silt content (siltdtotal) as defined by United States soil survey program. Silt content is estimated by percent weight of the <2mm portion of the soil.

These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

This data should be used in combination with a soil depth or depth to restriction layer map (both layers that will be released soon as part of this project) to eliminate areas mapped at deeper depths than the soil actually goes. This is a limitation of this data which will hopefully be updated in future updates.

The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

File Name Details:

ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

Elements are separated by underscore (_) in the following sequence:

property_r_depth_cm_geometry_model_additional_elements.extension

Example: silttotal_r_0_cm_2D_QRF.tif

Indicates silt content (silttotal) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

_95PI_h: Indicates the layer is the upper 95% prediction interval value.

_95PI_l: Indicates the layer is the lower 95% prediction interval value.

_95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

References

Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma
f
Correlations.
figshare.com
xls
Updated Nov 17, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Candauda Arachchige Saliya (2023). Correlations. [Dataset]. http://doi.org/10.1371/journal.pone.0294455.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0294455.t003
Dataset updated
Nov 17, 2023
Dataset provided by
PLOS ONE
Authors
Candauda Arachchige Saliya
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This study attempts to explore the impact of external debt ($Debt), foreign reserves ($Reserves), and political stability & absence of violence/terrorism (PS&AVT) on the current financial crisis in Sri Lanka. Using data from 1996 to 2022 obtained from the World Bank (WB) and the Central Bank of Sri Lanka (CBSL), a regression analysis is conducted, with a composite variable named "CRISIS," which accounts for interest rate, inflation, currency devaluation adjusted to GDP growth, as the dependent variable. The findings indicate that, collectively, these predictors significantly contribute to explaining the variance in the financial crisis, although their impact is relatively minor. While the direct influence of PS&AVT on the financial crisis is not statistically significant, it indirectly affects the crisis through its considerable impact on debt and reserves. Granger causality tests showed predictive value for $Debt and $Reserve in relation to CRISIS, but the reverse relationship was not significant. Regression analysis using the error term and scatter plots supports the absence of endogeneity issues in the model. These findings suggest that while external debt and foreign reserves are more directly related to financial crises, political stability and the absence of violence/terrorism can influence the crisis indirectly through their effects on debt accumulation and reserve levels. This study represents a pioneering effort in investigating the impact of external debt, foreign reserves, and political stability on the financial crises in Sri Lanka. By utilizing a comprehensive dataset and applying a regression analysis, it sheds light on the complex interactions between these variables and their influence on the country’s financial stability.
Z
Organic matter content (om) soil maps of the Upper Colorado River Basin
data.niaid.nih.gov
zenodo.org
Updated Jul 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Travis Nauman (2024). Organic matter content (om) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2550935
Explore at:
Dataset updated
Jul 22, 2024
Dataset authored and provided by
Travis Nauman
Area covered
Colorado River
Description
The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

Associated publication:

Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

UPDATE: WE FOUND A RENDERING ERROR IN MANY AREAS OF THE 5 CM MAP. WE HAVE RECREATED THE MAP AND INCLUDED IN THIS VERSION OF THE REPOSITORY.

Repository includes maps of organic matter content (% wt) as defined by United States soil survey program.

These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

This data should be used in combination with a soil depth or depth to restriction layer map (both layers that will be released soon as part of this project) to eliminate areas mapped at deeper depths than the soil actually goes. This is a limitation of this data which will hopefully be updated in future updates.

The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

File Name Details:

ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (_CV_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000). Predictions are also evaluated with the U.S. soil survey laboratory database soil organic carbon (SOC) data. The SOC measurements were coverted to OM matter values using the common 1.724 conversion factor. The converted OM values are compared to predicted OM values using an accuracy plot (OM_SOC_plots.tif).

Elements are separated by underscore (_) in the following sequence:

property_r_depth_cm_geometry_model_additional_elements.extension

Example: om_r_0_cm_2D_QRF_bt.tif

Indicates soil organic matter content (om) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions. The _bt indicates that the map has been back transformed from ln or sqrt transformation used in modeling.

The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

_95PI_h: Indicates the layer is the upper 95% prediction interval value.

_95PI_l: Indicates the layer is the lower 95% prediction interval value.

_95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

References

Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma
Z
Gypsum content (gypsum) soil maps of the Upper Colorado River Basin
data.niaid.nih.gov
zenodo.org
Updated Jul 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Travis Nauman (2024). Gypsum content (gypsum) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2550911
Explore at:
Dataset updated
Jul 25, 2024
Dataset authored and provided by
Travis Nauman
Area covered
Colorado River
Description
The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

Associated publication:

Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

Repository includes maps of soil gypsum content (% wt of the <20 mm fraction) as defined by United States soil survey program.

These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

This data should be used in combination with a soil depth or depth to restriction layer map (both layers that will be released soon as part of this project) to eliminate areas mapped at deeper depths than the soil actually goes. This is a limitation of this data which will hopefully be updated in future updates.

The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

File Name Details:

ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

Elements are separated by underscore (_) in the following sequence:

property_r_depth_cm_geometry_model_additional_elements.extension

Example: gypsum_r_0_cm_2D_QRF_bt.tif

Indicates soil gypsum content (gypsum) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions. The _bt indicates that the map has been back transformed from ln or sqrt transformation used in modeling.

The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

_95PI_h: Indicates the layer is the upper 95% prediction interval value.

_95PI_l: Indicates the layer is the lower 95% prediction interval value.

_95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

References

Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma
d
Selective Raster Graphics - Dataset - data.govt.nz - discover and use data
catalogue.data.govt.nz
Updated May 24, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2018). Selective Raster Graphics - Dataset - data.govt.nz - discover and use data [Dataset]. https://catalogue.data.govt.nz/dataset/oai-figshare-com-article-6349496
Explore at:
Dataset updated
May 24, 2018
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This report explores ways to render specific components of an R plot in raster format, when the overall format of the plot is vector. For example, we demonstrate ways to draw raster data symbols within a PDF scatter plot. A general solution is provided by the grid.rasterize function from the R package ‘rasterize’.
f
Additional file 1 of Logistic regression has similar performance to...
springernature.figshare.com
commons.datacite.org
zip
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anita L. Lynam; John M. Dennis; Katharine R. Owen; Richard A. Oram; Angus G. Jones; Beverley M. Shields; Lauric A. Ferrat (2023). Additional file 1 of Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults [Dataset]. http://doi.org/10.6084/m9.figshare.12569144.v1
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.12569144.v1
Dataset updated
May 30, 2023
Dataset provided by
figshare
Authors
Anita L. Lynam; John M. Dennis; Katharine R. Owen; Richard A. Oram; Angus G. Jones; Beverley M. Shields; Lauric A. Ferrat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Additional file 1: Figure S1. Flow diagram of participants through the model development stages. T1D: type 1 diabetes, T2D: type 2 diabetes. Figure S2. ROC AUC plots obtained using external validation dataset for seven prediction models. Legend: Solid lines: black = Support Vector Machine, dark grey = Logistic Regression, light grey = Random Forest. Dotted lines: black = Neural Network, dark grey = K-Nearest Neighbours, light grey = Gradient Boosting Machine. Figure S3. Correlation coefficient matrix and scatter plot of model predictions obtained from external test validation data.
Fine sand content (sandfine; 0.10 to 0.25 mm) soil maps of the Upper...
zenodo.org
Updated Jul 25, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Travis Nauman; Travis Nauman (2024). Fine sand content (sandfine; 0.10 to 0.25 mm) soil maps of the Upper Colorado River Basin [Dataset]. http://doi.org/10.5281/zenodo.2547610
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.2547610
Dataset updated
Jul 25, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Travis Nauman; Travis Nauman
Area covered
Colorado River
Description
The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

Associated publication:

Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

Repository includes maps of fine sand content (sandfine; 0.10 to 0.25 mm) as defined by United States soil survey program. Content is calculated on the fine earth fraction (<2mm).

These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

File Name Details:

ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

Elements are separated by underscore (_) in the following sequence:

property_r_depth_cm_geometry_model_additional_elements.extension

Example: sandfine_r_0_cm_2D_QRF.tif

Indicates fine sand content (sandfine) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

_95PI_h: Indicates the layer is the upper 95% prediction interval value.

_95PI_l: Indicates the layer is the lower 95% prediction interval value.

_95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

References

Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma.
Question 2 for TMU PhD candidate
figshare.com
xlsx
Updated Jan 5, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Seyed Mahmoud Ghasempouri (2022). Question 2 for TMU PhD candidate [Dataset]. http://doi.org/10.6084/m9.figshare.17883914.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.17883914.v1
Dataset updated
Jan 5, 2022
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Seyed Mahmoud Ghasempouri
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Dear TMU PhD candidate,

Please run principal component analysis (PCA) for dataset as below:

a) Calculate Eigenvalue for PC1 and PC2.

b) Draw the scatter plot.

c) Which loading plots are in the opposite direction to the others?

Thank You
Carbonate (caco3) soil maps of the Upper Colorado River Basin
zenodo.org
data.niaid.nih.gov
Updated Jul 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Travis Nauman; Travis Nauman (2024). Carbonate (caco3) soil maps of the Upper Colorado River Basin [Dataset]. http://doi.org/10.5281/zenodo.2546935
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.2546935
Dataset updated
Jul 25, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Travis Nauman; Travis Nauman
Area covered
Colorado River
Description
The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

Associated publication:

Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

Repository includes maps of carbonate content (caco3) as defined by United States soil survey program. Content is calculated on the fine earth fraction (<2mm).

These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

File Name Details:

ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

Elements are separated by underscore (_) in the following sequence:

property_r_depth_cm_geometry_model_additional_elements.extension

Example: caco3_r_0_cm_2D_QRF_bt.tif

Indicates carbonate (caco3) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest that is has gone through transfomation and backtransformation (_bt) in the modeling process. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

_95PI_h: Indicates the layer is the upper 95% prediction interval value.

_95PI_l: Indicates the layer is the lower 95% prediction interval value.

_95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

References

Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma.
Z
Soil pH (ph1to1h20, 1:1 water method) soil maps of the Upper Colorado River...
data.niaid.nih.gov
Updated Jul 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Travis Nauman (2024). Soil pH (ph1to1h20, 1:1 water method) soil maps of the Upper Colorado River Basin [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2548132
Explore at:
Dataset updated
Jul 25, 2024
Dataset authored and provided by
Travis Nauman
Area covered
Colorado River
Description
The data here were originally posted to facilitate timely and transparent peer review. The final public data release with formal metadata is now available from at the following location:

Nauman, T.W., and Duniway, M.C., 2020, Predictive soil property maps with prediction uncertainty at 30 meter resolution for the Colorado River Basin above Lake Mead: U.S. Geological Survey data release, https://doi.org/10.5066/P9SK0DO2.

Associated publication:

Nauman, T. W., and Duniway, M. C., 2020, A hybrid approach for predictive soil property mapping using conventional soil survey data: Soil Science Society of America Journal, v. 84, no. 4, p. 1170-1194. https://doi.org/10.1002/saj2.20080.

Repository includes maps of 1:1 soil pH as defined by United States soil survey program.

These data are preliminary or provisional and are subject to revision. They are being provided to meet the need for timely best science. The data have not received final approval by the U.S. Geological Survey (USGS) and are provided on the condition that neither the USGS nor the U.S. Government shall be held liable for any damages resulting from the authorized or unauthorized use of the data.

The creation and interpretation of this data is documented in the following article. Please note this article has not been reviewed yet and this citation will be updated as the peer review process proceeds.

Nauman, T. W., Duniway, M. C., In Preparation. Predictive reconstruction of soil survey property maps for field scale adaptive land management. Soil Science Society of America Journal.

File Name Details:

ACCURACY!! Please see manuscript and Github repository (https://github.com/naumi421/SoilReconProps) for full details on accuracy. We do provide cross validation (CV) accuracy plots in this repository for both the overall sample (NRCS field pedons plus NRCS laboratory pedons; file ending _CV_plots.tif) and for just the CV results at laboratory pedons (file ending _CV_SCD_plots.tif). These plots compare CV predictions with observed values relative to a 1:1 line. Values plotted near the 1:1 line are more accurate. Note that values are plotted in hex-bin density scatter plots because of the large number of observations (most are >3000).

Elements are separated by underscore (_) in the following sequence:

property_r_depth_cm_geometry_model_additional_elements.extension

Example: ph1to1h2o_r_0_cm_2D_QRF.tif

Indicates soil pH in 1:1 water mixture (ph1to1h2o) at 0 cm depth using a 2D model (separate model for each depth) employing a quantile regression forest. This file is the raster prediction map for this model. There may be additional GIS files associated with this file (e.g. pyramids) that have the same file name, but different extensions.

The following elements may also exist on the end of filenames indicating other spatial files that characterize a given model's uncertainty (see below).

_95PI_h: Indicates the layer is the upper 95% prediction interval value.

_95PI_l: Indicates the layer is the lower 95% prediction interval value.

_95PI_relwidth: Indicates the layer is the 95% relative prediction interval (RPI). The RPI is a standardization of the prediction interval that indicates that model is constraining uncertainty relative to the original sample. RPI values less than one represent uncertainty is being improved by the model relative to the original sample, and values less than 0.5 indicate low uncertainty in predictions. See paper listed above and also Nauman and Duniway (In revision) for more details on RPI.

References

Nauman, T. W., and Duniway, M. C., In Revision, Relative prediction intervals reveal larger uncertainty in 3D approaches to predictive digital soil mapping of soil properties with legacy data: Geoderma
f
Data from: Detecting desertification in different years and rainfall regimes...
scielo.figshare.com
figshare.com
tiff
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Thiago Costa dos Santos; Adunias dos Santos Teixeira; Fabrício da Silva Terra; Luis Clenio Jário Moreira; Raul Shiso Toma (2023). Detecting desertification in different years and rainfall regimes by 2D Scatter Plot [Dataset]. http://doi.org/10.6084/m9.figshare.19904126.v1
Explore at:
tiffAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.19904126.v1
Dataset updated
Jun 1, 2023
Dataset provided by
SciELO journals
Authors
Thiago Costa dos Santos; Adunias dos Santos Teixeira; Fabrício da Silva Terra; Luis Clenio Jário Moreira; Raul Shiso Toma
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ABSTRACT The desertification process causes soil degradation and a reduction in vegetation. The absence of visualisation techniques and the broad spatial and temporal dimension of the data hampers the identification of desertification and rapid decision-making by multidisciplinary teams. The 2D Scatter Plot is a two-dimensional visual analysis of reflectances in the red (630 - 690 nm) and near-infrared (760 - 900 nm) bands to visualise the spectral response of the vegetation. The hypothesis of this study is that visualising the reflectances of the vegetation by means of a 2D scatter plot will allow desertification to be inferred. The aim of this study was to identify desertified areas and characterise the spatial and temporal dynamics of the vegetation and soil during dry (DP) and rainy (RP) periods between 2000 and 2008, using a 2D scatter plot. The 2D scatter plot generated by the Envi® 4.8 software and the reflectances in bands 3 and 4 of the TM5 sensor were used within communities in the Irauçuba hub (Ceará, Brazil). The concentration densities of the near-infrared reflectances of the vegetation pixels were observed. Each community presented pixel concentrations with reflectances of less than 0.4 (40%) during each of the periods under evaluation, indicating little vegetation development, with further degradation caused by deforestation, the use of fire and overgrazing. The 2D scatter plot was able to show vegetation with low reflectance in the near infrared during both dry and rainy periods between 2000 and 2008, thereby inferring the occurrence of desertification.