100+ datasets found

P
IQR Dataset
paperswithcode.com
Updated May 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng (2022). IQR Dataset [Dataset]. https://paperswithcode.com/dataset/image-query-retrieval-dataset
Explore at:
Dataset updated
May 7, 2022
Authors
Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng
Description
IQR is proposed for the image-text retrieval task. We use 200,000 queries and the corresponding images as the annotated image-query pairs.
f
Descriptive statistics, mean ± SD, range, median and interquartile range...
plos.figshare.com
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hélène Follet; Delphine Farlay; Yohann Bala; Stéphanie Viguet-Carrin; Evelyne Gineyts; Brigitte Burt-Pichat; Julien Wegrzyn; Pierre Delmas; Georges Boivin; Roland Chapurlat (2023). Descriptive statistics, mean ± SD, range, median and interquartile range (IQR). [Dataset]. http://doi.org/10.1371/journal.pone.0055232.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0055232.t001
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Hélène Follet; Delphine Farlay; Yohann Bala; Stéphanie Viguet-Carrin; Evelyne Gineyts; Brigitte Burt-Pichat; Julien Wegrzyn; Pierre Delmas; Georges Boivin; Roland Chapurlat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Descriptive statistics, mean ± SD, range, median and interquartile range (IQR).
a
North America Boundaries
home-pugonline.hub.arcgis.com
Updated Oct 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The PUG User Group (2023). North America Boundaries [Dataset]. https://home-pugonline.hub.arcgis.com/datasets/north-america-boundaries
Explore at:
Dataset updated
Oct 23, 2023
Dataset authored and provided by
The PUG User Group
Area covered
North America,
Description
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a new, retrospective satellite-based precipitation dataset, constructed as a climate data record for hydrological and climate studies. The PERSIANN-CDR is available from 1983-present making the dataset the longest satellite based precipitation data record available. The precipitation maps are available at daily temporal resolution for the latitude band 60°S–60°N at 0.25 degrees. The maps shown here represent 30-year annual and seasonal median and interquartile range (IQR) of the PERSIANN-CDR dataset from 1984 – 2014. In the median precipitation maps, the mid-point value (or 50th percentile) for each pixel in is computed and plotted for the study area. The range of the data about the median is represented by the interquartile range (IQR), and shows the variability of the dataset. For these maps, winter = December – February, spring = March – May, summer = June – August, fall = September – November
Outlier Detection and Removal Dataset
kaggle.com
Updated Jul 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aamir Shahzad (2025). Outlier Detection and Removal Dataset [Dataset]. https://www.kaggle.com/datasets/aamir5659/outlier-detection-and-removal-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 9, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aamir Shahzad
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
📁 Files Included: Outlier_Loan_datase.csv – Raw dataset with outliers `.Final_Outliers_clean_dataset.csv (IQR + Z-score)

This dataset is designed for practicing outlier detection and data cleaning techniques.
It includes both the original (uncleaned) and cleaned versions of a financial dataset.
Simulation Data Set
catalog.data.gov
s.cnmilf.com
Updated Nov 12, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2020). Simulation Data Set [Dataset]. https://catalog.data.gov/dataset/simulation-data-set
Explore at:
Dataset updated
Nov 12, 2020
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”. Metadata (including data dictionary) • y: Vector of binary responses (1: adverse outcome, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate) Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).
f
Table 3. Mean, median and IQR of NCD risk factorsa.
plos.figshare.com
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Faisal S. Khan; Ismat Lotia-Farrukh; Aamir J. Khan; Saad Tariq Siddiqui; Sana Zehra Sajun; Amyn Abdul Malik; Aziza Burfat; Mohammad Hussham Arshad; Andrew J. Codlin; Belinda M. Reininger; Joseph B. McCormick; Nadeem Afridi; Susan P. Fisher-Hoch (2023). Table 3. Mean, median and IQR of NCD risk factorsa. [Dataset]. http://doi.org/10.1371/journal.pone.0056008.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0056008.t003
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Faisal S. Khan; Ismat Lotia-Farrukh; Aamir J. Khan; Saad Tariq Siddiqui; Sana Zehra Sajun; Amyn Abdul Malik; Aziza Burfat; Mohammad Hussham Arshad; Andrew J. Codlin; Belinda M. Reininger; Joseph B. McCormick; Nadeem Afridi; Susan P. Fisher-Hoch
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
a Age ≥15 yrs unless otherwise specified.
P
UDED Dataset
paperswithcode.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xavier Soria; Yachuan Li; Mohammad Rouhani; Angel D. Sappa, UDED Dataset [Dataset]. https://paperswithcode.com/dataset/uded
Explore at:
Authors
Xavier Soria; Yachuan Li; Mohammad Rouhani; Angel D. Sappa
Description
This dataset is a collection of 1, 2, or 3 images from: BIPED, BSDS500, BSDS300, DIV2K, WIRE-FRAME, CID, CITYSCAPES, ADE20K, MDBD, NYUD, THANGKA, PASCAL-Context, SET14, URBAN10, and the camera-man image. The image selection process consists on computing the Inter-Quartile Range (IQR) intensity value on all the images, images larger than 720×720 pixels were not considered. In dataset whose images are in HR, they were cut. We thank all the datasets owners to make them public. This dataset is just for Edge Detection not contour nor Boundary tasks.
Precipitation Interquartile Range Spring Estimation (PERSIANN) 1984-2014
noaa.hub.arcgis.com
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NOAA GeoPlatform (2024). Precipitation Interquartile Range Spring Estimation (PERSIANN) 1984-2014 [Dataset]. https://noaa.hub.arcgis.com/maps/c06721acf213414191847347fcbdff3b
Explore at:
Dataset updated
Dec 18, 2024
Dataset provided by
National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
Authors
NOAA GeoPlatform
Area covered

Description
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.
f
Characteristics of women, overall and according to BMI categories; data...
plos.figshare.com
xls
Updated Jun 6, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Julie A. Pasco; Geoffrey C. Nicholson; Sharon L. Brennan; Mark A. Kotowicz (2023). Characteristics of women, overall and according to BMI categories; data presented as mean (±SD), median (IQR) or frequency (%). [Dataset]. http://doi.org/10.1371/journal.pone.0029580.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0029580.t003
Dataset updated
Jun 6, 2023
Dataset provided by
PLOS ONE
Authors
Julie A. Pasco; Geoffrey C. Nicholson; Sharon L. Brennan; Mark A. Kotowicz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
*n = 1041 (35 missing data).BMI = body mass index (kg/m2); SD = standard deviation; IQR = interquartile range; EI energy intake (MJ/d); BMR = basal metabolic rate (MJ/d).
d
Hunter AWRA Hydrological Response Variables (HRV)
data.gov.au
researchdata.edu.au
+1more
zip
Updated Jun 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bioregional Assessment Program (2022). Hunter AWRA Hydrological Response Variables (HRV) [Dataset]. https://data.gov.au/data/dataset/a84b2431-24e3-4537-ae50-84f4e955ebdc
Explore at:
zip(1245038746)Available download formats
Dataset updated
Jun 28, 2022
Dataset authored and provided by
Bioregional Assessment Program
License
Attribution 2.5 (CC BY 2.5)https://creativecommons.org/licenses/by/2.5/
License information was derived automatically
Description
Abstract

The dataset was derived by the Bioregional Assessment Programme from multiple datasets. The source dataset is identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

Hydrological Response Variables (HRVs) are the hydrological characteristics of the system that potentially change due to coal resource development. These data refer to the HRVs related to the AWRA L and AWRA R models for the Hunter subregion for the 65 simulation nodes (63 within Hunter basin and 2 within Macquarie-Tuggerah Lake basin). The nine hydrological response variables (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed by the AWRA L and AWRA R models under CRDP and baseline conditions, respectively and the ACRD is the difference between the Baseline and CRDP.

Abbreviation meaning

AF - the annual streamflow volume (GL/year)

P01 - the daily streamflow rate at the first percentile (ML/day)

P01 - the daily streamflow rate at the first percentile (ML/day)

IQR - the inter-quartile range in daily streamflow (ML/day). That is, the difference between the daily streamflow rate at the 75th percentile and at the 25th percentile.

LFD - the number of low streamflow days per year. The threshold for low streamflow days is the 10th percentile from the simulated 90-year period (2013 to 2102)

LFS - the number of low streamflow spells per year (perennial streams only). A spell is defined as a period of contiguous days of streamflow below the 10th percentile threshold

LLFS - the length (days) of the longest low streamflow spell each year

P99 - the daily streamflow rate at the 99th percentile (ML/day)

FD - flood days, the number of days with streamflow greater than the 90th percentile from the simulated 90-year period (2013 to 2102)

ZFD - Zero flow days

Purpose

This is the dataset used for the Hunter 2.6.1 product to evaluate additional coal mine and coal resource development impacts on hydrological response variables at 65 simulation nodes.

Dataset History

The HUN AWRA model outputs were used to determine the impacts on the HRVs to produce these data. The nine HRVs (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and baseline conditions, respectively. The difference between CRDP and baseline is used for predicting ACRD impacts on hydrological response variables at 65 simulation nodes.

Dataset Citation

Bioregional Assessment Programme (2017) Hunter AWRA Hydrological Response Variables (HRV). Bioregional Assessment Derived Dataset. Viewed 13 March 2019, http://data.bioregionalassessments.gov.au/dataset/a84b2431-24e3-4537-ae50-84f4e955ebdc.

Dataset Ancestors

Derived From River Styles Spatial Layer for New South Wales

Derived From HUN AWRA-L simulation nodes_v01

Derived From Hunter River Salinity Scheme Discharge NSW EPA 2006-2012

Derived From HUN AWRA-R simulation nodes v01

Derived From Bioregional Assessment areas v06

Derived From GEODATA 9 second DEM and D8: Digital Elevation Model Version 3 and Flow Direction Grid 2008

Derived From Bioregional Assessment areas v04

Derived From HUN AWRA-R Gauge Station Cross Sections v01

Derived From Gippsland Project boundary

Derived From Natural Resource Management (NRM) Regions 2010

Derived From BA All Regions BILO cells in subregions shapefile

Derived From Hunter Surface Water data v2 20140724

Derived From HUN AWRA-R River Reaches Simulation v01

Derived From HUN AWRA-L simulation nodes v02

Derived From GEODATA TOPO 250K Series 3, File Geodatabase format (.gdb)

Derived From HUN AWRA-R Irrigation Area Extents and Crop Types v01

Derived From GEODATA TOPO 250K Series 3

Derived From NSW Catchment Management Authority Boundaries 20130917

Derived From Geological Provinces - Full Extent

Derived From BA SYD selected GA TOPO 250K data plus added map features

Derived From Bioregional_Assessment_Programme_Catchment Scale Land Use of Australia - 2014

Derived From Bioregional Assessment areas v03

Derived From IQQM Model Simulation Regulated Rivers NSW DPI HUN 20150615

Derived From HUN AWRA-R calibration catchments v01

Derived From Bioregional Assessment areas v05

Derived From BILO Gridded Climate Data: Daily Climate Data for each year from 1900 to 2012

Derived From National Surface Water sites Hydstra

Derived From Selected streamflow gauges within and near the Hunter subregion

Derived From ASRIS Continental-scale soil property predictions 2001

Derived From Hunter Surface Water data extracted 20140718

Derived From Mean Annual Climate Data of Australia 1981 to 2012

Derived From HUN AWRA-R calibration nodes v01

Derived From HUN AWRA-R Observed storage volumes Glenbawn Dam and Glennies Creek Dam

Derived From HUN AWRA-LR Model v01

Derived From HUN AWRA-L ASRIS soil properties v01

Derived From HUN AWRAR restricted input 01

Derived From Bioregional Assessment areas v01

Derived From Bioregional Assessment areas v02

Derived From Victoria - Seamless Geology 2014

Derived From HUN AWRA-L Site Station Cross Sections v01

Derived From HUN AWRA-R simulation catchments v01

Derived From HUN AWRA-R Simulation Node Cross Sections v01

Derived From Climate model 0.05x0.05 cells and cell centroids
Precipitation Interquartile Range Winter Estimation (PERSIANN) 1984-2014
noaa.hub.arcgis.com
Updated Dec 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NOAA GeoPlatform (2024). Precipitation Interquartile Range Winter Estimation (PERSIANN) 1984-2014 [Dataset]. https://noaa.hub.arcgis.com/maps/c38031dd1db6491d837e3b5e58c628d5
Explore at:
Dataset updated
Dec 18, 2024
Dataset provided by
National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
Authors
NOAA GeoPlatform
Area covered

Description
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.
w
Data from: GEOMACS (Geological and Oceanographic Model of Australias...
data.wu.ac.at
researchdata.edu.au
+1more
zip
Updated Jun 24, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CSIRO Oceans and Atmosphere - Information and Data Centre (2017). GEOMACS (Geological and Oceanographic Model of Australias Continental Shelf) Interquartile range [Dataset]. https://data.wu.ac.at/schema/data_gov_au/ZGRmZGQyYjktMjEwNC00OWUxLTk4OTQtNTM3OWQyY2YyNmU0
Explore at:
zipAvailable download formats
Dataset updated
Jun 24, 2017
Dataset provided by
CSIRO Oceans and Atmosphere - Information and Data Centre
License
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Area covered
Australia, cd30346d96a4fac7d77a1e77d04c5511ba04b2f6
Description
Geoscience Australias GEOMACS model was utilised to produce hindcast hourly time series of continental shelf (~20 to 300 m depth) bed shear stress (unit of measure: Pascal, Pa) on a 0.1 degree grid covering the period March 1997 to February 2008 (inclusive). The hindcast data represents the combined contribution to the bed shear stress by waves, tides, wind and density-driven circulation. Included in the parameters that will be calculated to represent the magnitude of the bulk of the data are the quartiles of the distribution; Q25, Q50 and Q75 (i.e. the values for which 25, 50 and 75 percent of the observations fall below). The interquartile range, , of the GEOMACS output takes the observations from between Q25 and Q75 to provide an accurate representation of the spread of observations. The interquartile range was shown to provide a more robust representation of the observations than the standard deviation, which produced highly skewed observations (Hughes and Harris 2008). This dataset is a contribution to the CERF Marine Biodiversity Hub and is hosted temporarily by CMAR on behalf of Geoscience Australia.
Global Oscillation Network Group (GONG) Quick-Reduce Inputs (iQR)
catalog.data.gov
Updated Oct 18, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NOAA National Centers for Environmental Information (Point of Contact); National Centers for Environmental Information (NCEI) / Space Weather Prediction Center (STP) (Point of Contact, Custodian); National Weather Service (NWS) (Resource Provider) (2024). Global Oscillation Network Group (GONG) Quick-Reduce Inputs (iQR) [Dataset]. https://catalog.data.gov/dataset/global-oscillation-network-group-gong-quick-reduce-inputs-iqr1
Explore at:
Dataset updated
Oct 18, 2024
Dataset provided by
National Weather Servicehttp://www.weather.gov/
National Centers for Environmental Informationhttps://www.ncei.noaa.gov/
National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
Space Weather Prediction Centerhttp://www.swpc.noaa.gov/
Description
Originally constructed in 1995, the Global Oscillation Network Group (GONG) is a network of six identical ground-based solar telescopes distributed around the Earth in order to obtain continuous observations of the Sun. Those sites are located in Big Bear, California (BB); Mauna Loa, Hawaii (ML); Learmonth, Australia (LE); Udaipur, India (UD); El Teide, Spain (TD); and Cerro Tololo, Chile (CT). Additionally, there are three engineering/testbed sites in Boulder, Colorado (TC, TE, and TS). Owned by the National Science Foundation, GONG is operated and maintained by the National Solar Observatory (NSO) with significant funding from NOAAâ€™s Space Weather Prediction Center (SWPC). Each minute, weather permitting, the GONG network observes the Sun at two spectral wavelengths: 676.78nm (a Ni I absorption line) and 656.28nm (the H-alpha absorption line).
i
DATASET : Striped red mullet landing per unit of effort and environmental...
sextant.ifremer.fr
seanoe.org
+2more
rel-canonical +2
Updated May 12, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CNRS/Univ Pau & Pays Adour, Laboratoire de Mathématiques et de leurs Applications de Pau - Fédération MIRA, UMR5142, 64600 Anglet, France ARC Centre of Excellence for Mathematical and Statistical Frontiers at School of Mathematical Science, QueenslandUniversity of Technology, Brisbane, Australia (2021). DATASET : Striped red mullet landing per unit of effort and environmental variables in the Bay of Biscay [Dataset]. https://sextant.ifremer.fr/record/seanoe:77179/
Explore at:
www:link-1.0-http--metadata-url, www:download-1.0-link--download, rel-canonicalAvailable download formats
Dataset updated
May 12, 2021
Dataset authored and provided by
CNRS/Univ Pau & Pays Adour, Laboratoire de Mathématiques et de leurs Applications de Pau - Fédération MIRA, UMR5142, 64600 Anglet, France ARC Centre of Excellence for Mathematical and Statistical Frontiers at School of Mathematical Science, QueenslandUniversity of Technology, Brisbane, Australia
Area covered

Description
####### # Data description #

This dataset have been constructed and used for scientific purpose, available in the paper "Detecting the effects of inter-annual and seasonal changes of environmental factors on the the striped red mullet population in the Bay of Biscay" authored by Kermorvant C., Caill-Milly N., Sous D., Paradinas I., Lissardy M. and Liquet B. and published in Journal of Sea Research. This file is an extraction from the SACROIS fisheries database created by Ifremer (for more information see https://sextant.ifremer.fr/record/3e177f76-96b0-42e2-8007-62210767dc07/) and from the Copernicus database. Biochemestry comes from the product GLOBAL_ANALYSIS_FORECAST_BIO_001_028 (https://resources.marine.copernicus.eu/?option=com_csw&view=details&product_id=GLOBAL_ANALYSIS_FORECAST_BIO_001_028). Temperature and salinity comes from GLOBAL_ANALYSIS_FORECAST_PHY_001_024 product (https://resources.marine.copernicus.eu/?option=com_csw&view=details&product_id=GLOBAL_ANALYSIS_FORECAST_PHY_001_024). As fisheries landing per unit of effort is only available per ICES rectangle and by month, environmental data have been aggregated accordingly.

######### # Colomns description # ############### rectangle - The 6 ICES statistical rectangles used in the study. time_m - Time in months, from the beginning to the end of the study. annee = year mois = month (from 1 to 12) Poids = Weight of red mullet landed valeur = Temps_peche = fishing time Nb_sequence = number of fishing sequences Moy / Med / Var / StD Quartil_1 / Quartil_3 / min / max / CV / IQR = statistical descriptors of landing by rectangle and by month log_cpue = log of Med colomn mean_surface_s = mean of surface salinity by month and by rectangle median_surface_s = median of surface salinity by month and by rectangle mean_surface_t = mean of surface temperature by month and by rectangle median_surface_t = median of surface temperature by month and by rectangle si / zeu /po4 / pyc / o2/ nppv / no3 and nh4 mean and median concentration by rectangle and by month pc3 / pc2 / pc1 - projections of previous biochemestry variables on the three first axes of a PCA
Italy: Mobility COVID-19
kaggle.com
Updated Mar 26, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mr. Rahman (2021). Italy: Mobility COVID-19 [Dataset]. https://www.kaggle.com/motiurse/italy-mobility-covid19/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 26, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Mr. Rahman
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Italy
Description
A live version of the data record, which will be kept up-to-date with new estimates, can be downloaded from the Humanitarian Data Exchange: https://data.humdata.org/dataset/covid-19-mobility-italy.

If you find the data helpful or you use the data for your research, please cite our work:

Pepe, E., Bajardi, P., Gauvin, L., Privitera, F., Lake, B., Cattuto, C., & Tizzoni, M. (2020). COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Scientific Data 7, 230 (2020).

The data record is structured into 4 comma-separated value (CSV) files, as follows:

id_provinces_IT.csv. Table of the administrative codes of the 107 Italian provinces. The fields of the table are:

COD_PROV is an integer field that is used to identify a province in all other data records;

SIGLA is a two-letters code that identifies the province according to the ISO_3166-2 standard (https://en.wikipedia.org/wiki/ISO_3166-2:IT);

DEN_PCM is the full name of the province.

OD_Matrix_daily_flows_norm_full_2020_01_18_2020_04_17.csv. The file contains the daily fraction of users’ moving between Italian provinces. Each line corresponds to an entry of matrix (i, j). The fields of the table are:

p1: COD_PROV of origin,

p2: COD_PROV of destination,

day: in the format yyyy-mm-dd.

median_q1_q3_rog_2020_01_18_2020_04_17.csv. The file contains median and interquartile range (IQR) of users’ radius of gyration in a province by week. Each entry of the table fields of the table are:

COD_PROV of the province;

SIGLA of the province;

DEN_PCM of the province;

week: median value of the radius of gyration on week week, with week in the format dd/mm-DD/MM where dd/mm and DD/MM are the first and the last day of the week, respectively.

week Q1 first quartile (Q1) of the distribution of the radius of gyration on week week,

week Q3 third quartile (Q3) of the distribution of the radius of gyration on week week,

average_network_degree_2020_01_18_2020_04_17.csv. The file contains daily time-series of the average degree 〈k〉 of the proximity network. Each entry of the table is a value of 〈k〉 on a given day. The fields of the table are:

COD_PROV of the province;

SIGLA of the province;

DEN_PCM of the province;

day in the format yyyy-mm-dd.

ESRI shapefiles of the Italian provinces updated to the most recent definition are available from the website of the Italian National Office of Statistics (ISTAT): https://www.istat.it/it/archivio/222527.
d
Namoi standard Hydrological Response Variables (HRVs)
data.gov.au
researchdata.edu.au
+1more
Updated Nov 20, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bioregional Assessment Program (2019). Namoi standard Hydrological Response Variables (HRVs) [Dataset]. https://data.gov.au/data/dataset/groups/189f4c7a-29e1-41f9-868d-b7f5184d829f
Explore at:
Dataset updated
Nov 20, 2019
Dataset provided by
Bioregional Assessment Program
Area covered
Namoi River
Description
Abstract

The dataset was derived by the Bioregional Assessment Programme from multiple datasets. The source dataset is identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

Hydrological Response Variables (HRVs) are the hydrological characteristics of the system that potentially change due to coal resource development. These data refer to the HRVs related to the AWRA-R model for the Namoi subregion for the 54 simulation nodes. The nine hydrological response variables (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and Baseline conditions, respectively and the ACRD is the difference between the Baseline and CRDP.

Abbreviation meaning

AF - the annual streamflow volume (GL/year)

P01 - the daily streamflow rate at the first percentile (ML/day)

P01 - the daily streamflow rate at the first percentile (ML/day)

IQR - the inter-quartile range in daily streamflow (ML/day). That is, the difference between the daily streamflow rate at the 75th percentile and at the 25th percentile.

LFD - the number of low streamflow days per year. The threshold for low streamflow days is the 10th percentile from the simulated 90-year period (2013 to 2102)

LFS - the number of low streamflow spells per year (perennial streams only). A spell is defined as a period of contiguous days of streamflow below the 10th percentile threshold

LLFS - the length (days) of the longest low streamflow spell each year

P99 - the daily streamflow rate at the 99th percentile (ML/day)

FD - flood days, the number of days with streamflow greater than the 90th percentile from the simulated 90-year period (2013 to 2102)

ZFD - Zero flow days

Purpose

This is the dataset used for the Namoi 2.6.1 product to evaluate additional coal mine and coal resource development impacts on hydrological response variables at 54 simulation nodes.

Dataset History

The Namoi AWRA-R model outputs were used to determine the impacts on the HRVs to produce these data. Readme files within the folders in the dataset provide an explanation on how the resource was created. The nine HRVs (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and Baseline conditions, respectively. The difference between CRDP and Baseline is used for predicting ACRD impacts on hydrological response variables at 54 simulation nodes.

Dataset Citation

Bioregional Assessment Programme (2017) Namoi standard Hydrological Response Variables (HRVs). Bioregional Assessment Derived Dataset. Viewed 11 December 2018, http://data.bioregionalassessments.gov.au/dataset/189f4c7a-29e1-41f9-868d-b7f5184d829f.

Dataset Ancestors

Derived From Historical Mining Footprints DTIRIS NAM 20150914

Derived From Namoi AWRA-R (restricted input data implementation)

Derived From River Styles Spatial Layer for New South Wales

Derived From Namoi Surface Water Mine Footprints - digitised

Derived From Namoi AWRA-R model implementation (post groundwater input)

Derived From National Surface Water sites Hydstra

Derived From Namoi AWRA-L model

Derived From Namoi Hydstra surface water time series v1 extracted 140814

Derived From GEODATA 9 second DEM and D8: Digital Elevation Model Version 3 and Flow Direction Grid 2008

Derived From Namoi Environmental Impact Statements - Mine footprints

Derived From Namoi Existing Mine Development Surface Water Footprints
Z
Dataset related to article "Association between cardiac troponin I and...
data.niaid.nih.gov
Updated Apr 28, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Giuseppe Moriello (2021). Dataset related to article "Association between cardiac troponin I and mortality in patients with COVID-19 " [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4723490
Explore at:
Dataset updated
Apr 28, 2021
Dataset provided by
Giuseppe Moriello
Federica Maura
Sara Maria Giulia Cioffi
Francesco Paolo Leone
Emanuela Morenghi
Maria Teresa Sandri
Barbara Barbieri
Michela Salvatici
Description
Background: Severe pneumonia is pathological manifestation of Coronavirus Disease 2019 (COVID-19), however complications have been reported in COVID-19 patients with a worst prognosis. Aim of this study was to evaluate the role of high sensitivity cardiac troponin I (hs-TnI) in patients with SARS-CoV-2 infection.

Methods: we retrospectively analysed hs-TnI values measured in 523 patients (median age 64 years, 68% men) admitted to a university hospital in Milan, Italy, and diagnosed COVID-19.

Results: A significant difference in hs-TnI concentrations was found between deceased patients (98 patients) vs discharged (425 patients) [36.05 ng/L IQR 16.5-94.9 vs 6.3 ng/L IQR 2.6-13.9, p < 0.001 respectively]. Hs-TnI measurements were independent predictors of mortality at multivariate analysis adjusted for confounding parameters such as age (HR 1.004 for each 10 point of troponin, 95% CI 1.002-1.006, p < 0.001). The survival rate, after one week, in patients with hs-TnI values under 6 ng/L was 97.94%, between 6 ng/L and the normal value was 90.87%, between the normal value and 40 ng/L was 86.98, and 59.27% over 40 ng/L.

Conclusion: Increase of hs-TnI associated with elevated mortality in patients with COVID-19. Troponin shows to be a useful biomarker of disease progression and worse prognosis in COVID-19 patients.
Median and interquartile range (IQR) of fasting and postprandial...
plos.figshare.com
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kim M. Huffman; Leanne M. Redman; Lawrence R. Landerman; Carl F. Pieper; Robert D. Stevens; Michael J. Muehlbauer; Brett R. Wenner; James R. Bain; Virginia B. Kraus; Christopher B. Newgard; Eric Ravussin; William E. Kraus (2023). Median and interquartile range (IQR) of fasting and postprandial concentrations of metabolites at baseline (n = 46).* [Dataset]. http://doi.org/10.1371/journal.pone.0028190.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0028190.t002
Dataset updated
May 31, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Kim M. Huffman; Leanne M. Redman; Lawrence R. Landerman; Carl F. Pieper; Robert D. Stevens; Michael J. Muehlbauer; Brett R. Wenner; James R. Bain; Virginia B. Kraus; Christopher B. Newgard; Eric Ravussin; William E. Kraus
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
*Those loading most heavily (component load ≥|0.5|) in principal component analyses are identified in bold.
Z
Data from: Diagnostic Value of Global Cardiac Strain in Patients With...
data.niaid.nih.gov
zenodo.org
Updated Jul 30, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paola Maria Cannaò (2021). Diagnostic Value of Global Cardiac Strain in Patients With Myocarditis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5147939
Explore at:
Dataset updated
Jul 30, 2021
Dataset provided by
Francesco Saverio Carbone
Francesco Sardanelli
Paola Maria Cannaò
Marco Alì
Francesco Secchi
Caterina Beatrice Monti
Description
Dataset from the article Secchi F, Monti CB, Alì M, Carbone FS, Cannaò PM, Sardanelli F. Diagnostic Value of Global Cardiac Strain in Patients With Myocarditis. J Comput Assist Tomogr. 2020 Jul/Aug;44(4):591-598. doi: 10.1097/RCT.0000000000001062. PMID: 32697530.

Abstract

Background: Cardiac strain represents an imaging biomarker of contractile dysfunction.

Purpose: The purpose of this study was to investigate the diagnostic value of cardiac strain obtained by feature-tracking cardiac magnetic resonance (MR) in acute myocarditis.

Materials and methods: Cardiac MR examinations of 46 patients with myocarditis and preserved ejection fraction at acute phase and follow-up were analyzed along with cardiac MR of 46 healthy age- and sex-matched controls. Global circumferential strain and global radial strain were calculated for each examination, along with myocardial edema and late gadolinium enhancement, and left ventricle functional parameters, through manual contouring of the myocardium. Correlations were assessed using Spearman ρ. Wilcoxon and Mann-Whitney U test were used to assess differences between data. Receiver operating characteristics curves and reproducibility were obtained to assess the diagnostic role of strain parameters.

Results: Global circumferential strain was significantly lower in controls (median, -20.4%; interquartile range [IQR], -23.4% to -18.7%) than patients in acute phase (-18.4%; IQR, -21.0% to -16.1%; P = 0.001) or at follow-up (-19.2%; IQR, -21.5% to -16.1%; P = 0.020). Global radial strain was significantly higher in controls (82.4%; IQR, 62.8%-104.9%) than in patients during the acute phase (65.8%; IQR, 52.9%-79.5%; P = 0.001). Correlations were found between global circumferential strain and global radial strain in all groups (acute, ρ = -0.580, P < 0.001; follow-up, ρ = -0.399, P = 0.006; controls, ρ = -0.609, P < 0.001), and between global circumferential strain and late gadolinium enhancement only in myocarditis patients (acute, ρ = 0.035, P = 0.024; follow-up, ρ = 0.307, P = 0.038).

Conclusions: Cardiac strain could potentially have a role in detecting acute myocarditis in low-risk acute myocarditis patients where cardiac MR is the main diagnosing technique.
Gender, Age, and Emotion Detection from Voice
kaggle.com
Updated May 29, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rohit Zaman (2021). Gender, Age, and Emotion Detection from Voice [Dataset]. https://www.kaggle.com/datasets/rohitzaman/gender-age-and-emotion-detection-from-voice/suggestions
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 29, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Rohit Zaman
Description
Context

Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.

Content

Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.

Acknowledgements

Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/

Facebook

Twitter

Click to copy link

Link copied

Cite

Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng (2022). IQR Dataset [Dataset]. https://paperswithcode.com/dataset/image-query-retrieval-dataset

IQR Dataset

Image-Query Retrieval Dataset

Explore at:

Dataset updated

May 7, 2022

Authors

Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng

Description

IQR is proposed for the image-text retrieval task. We use 200,000 queries and the corresponding images as the annotated image-query pairs.

Clear search

Close search

Google apps

Main menu

IQR Dataset

Descriptive statistics, mean ± SD, range, median and interquartile range...

North America Boundaries

Outlier Detection and Removal Dataset

Simulation Data Set

Table 3. Mean, median and IQR of NCD risk factorsa.

UDED Dataset

Precipitation Interquartile Range Spring Estimation (PERSIANN) 1984-2014

Characteristics of women, overall and according to BMI categories; data...

Hunter AWRA Hydrological Response Variables (HRV)

Abstract

Purpose

Dataset History

Dataset Citation

Dataset Ancestors

Precipitation Interquartile Range Winter Estimation (PERSIANN) 1984-2014

Data from: GEOMACS (Geological and Oceanographic Model of Australias...

Global Oscillation Network Group (GONG) Quick-Reduce Inputs (iQR)

DATASET : Striped red mullet landing per unit of effort and environmental...

####### # Data description #

Italy: Mobility COVID-19

Namoi standard Hydrological Response Variables (HRVs)

Abstract

Purpose

Dataset History

Dataset Citation

Dataset Ancestors

Dataset related to article "Association between cardiac troponin I and...

Median and interquartile range (IQR) of fasting and postprandial...

Data from: Diagnostic Value of Global Cardiac Strain in Patients With...

Gender, Age, and Emotion Detection from Voice

Context

Content

Acknowledgements

IQR Dataset

Image-Query Retrieval Dataset