100+ datasets found
  1. P

    IQR Dataset

    • paperswithcode.com
    Updated May 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng (2022). IQR Dataset [Dataset]. https://paperswithcode.com/dataset/image-query-retrieval-dataset
    Explore at:
    Dataset updated
    May 7, 2022
    Authors
    Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng
    Description

    IQR is proposed for the image-text retrieval task. We use 200,000 queries and the corresponding images as the annotated image-query pairs.

  2. f

    Descriptive statistics, mean ± SD, range, median and interquartile range...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hélène Follet; Delphine Farlay; Yohann Bala; Stéphanie Viguet-Carrin; Evelyne Gineyts; Brigitte Burt-Pichat; Julien Wegrzyn; Pierre Delmas; Georges Boivin; Roland Chapurlat (2023). Descriptive statistics, mean ± SD, range, median and interquartile range (IQR). [Dataset]. http://doi.org/10.1371/journal.pone.0055232.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Hélène Follet; Delphine Farlay; Yohann Bala; Stéphanie Viguet-Carrin; Evelyne Gineyts; Brigitte Burt-Pichat; Julien Wegrzyn; Pierre Delmas; Georges Boivin; Roland Chapurlat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Descriptive statistics, mean ± SD, range, median and interquartile range (IQR).

  3. a

    North America Boundaries

    • home-pugonline.hub.arcgis.com
    Updated Oct 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The PUG User Group (2023). North America Boundaries [Dataset]. https://home-pugonline.hub.arcgis.com/datasets/north-america-boundaries
    Explore at:
    Dataset updated
    Oct 23, 2023
    Dataset authored and provided by
    The PUG User Group
    Area covered
    North America,
    Description

    The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a new, retrospective satellite-based precipitation dataset, constructed as a climate data record for hydrological and climate studies. The PERSIANN-CDR is available from 1983-present making the dataset the longest satellite based precipitation data record available. The precipitation maps are available at daily temporal resolution for the latitude band 60°S–60°N at 0.25 degrees. The maps shown here represent 30-year annual and seasonal median and interquartile range (IQR) of the PERSIANN-CDR dataset from 1984 – 2014. In the median precipitation maps, the mid-point value (or 50th percentile) for each pixel in is computed and plotted for the study area. The range of the data about the median is represented by the interquartile range (IQR), and shows the variability of the dataset. For these maps, winter = December – February, spring = March – May, summer = June – August, fall = September – November

  4. Outlier Detection and Removal Dataset

    • kaggle.com
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aamir Shahzad (2025). Outlier Detection and Removal Dataset [Dataset]. https://www.kaggle.com/datasets/aamir5659/outlier-detection-and-removal-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 9, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aamir Shahzad
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    📁 Files Included: Outlier_Loan_datase.csv – Raw dataset with outliers `.Final_Outliers_clean_dataset.csv (IQR + Z-score)

    This dataset is designed for practicing outlier detection and data cleaning techniques.
    It includes both the original (uncleaned) and cleaned versions of a financial dataset.

  5. Simulation Data Set

    • catalog.data.gov
    • s.cnmilf.com
    Updated Nov 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2020). Simulation Data Set [Dataset]. https://catalog.data.gov/dataset/simulation-data-set
    Explore at:
    Dataset updated
    Nov 12, 2020
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”. Metadata (including data dictionary) • y: Vector of binary responses (1: adverse outcome, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate) Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).

  6. f

    Table 3. Mean, median and IQR of NCD risk factorsa.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Faisal S. Khan; Ismat Lotia-Farrukh; Aamir J. Khan; Saad Tariq Siddiqui; Sana Zehra Sajun; Amyn Abdul Malik; Aziza Burfat; Mohammad Hussham Arshad; Andrew J. Codlin; Belinda M. Reininger; Joseph B. McCormick; Nadeem Afridi; Susan P. Fisher-Hoch (2023). Table 3. Mean, median and IQR of NCD risk factorsa. [Dataset]. http://doi.org/10.1371/journal.pone.0056008.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Faisal S. Khan; Ismat Lotia-Farrukh; Aamir J. Khan; Saad Tariq Siddiqui; Sana Zehra Sajun; Amyn Abdul Malik; Aziza Burfat; Mohammad Hussham Arshad; Andrew J. Codlin; Belinda M. Reininger; Joseph B. McCormick; Nadeem Afridi; Susan P. Fisher-Hoch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    a Age ≥15 yrs unless otherwise specified.

  7. P

    UDED Dataset

    • paperswithcode.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xavier Soria; Yachuan Li; Mohammad Rouhani; Angel D. Sappa, UDED Dataset [Dataset]. https://paperswithcode.com/dataset/uded
    Explore at:
    Authors
    Xavier Soria; Yachuan Li; Mohammad Rouhani; Angel D. Sappa
    Description

    This dataset is a collection of 1, 2, or 3 images from: BIPED, BSDS500, BSDS300, DIV2K, WIRE-FRAME, CID, CITYSCAPES, ADE20K, MDBD, NYUD, THANGKA, PASCAL-Context, SET14, URBAN10, and the camera-man image. The image selection process consists on computing the Inter-Quartile Range (IQR) intensity value on all the images, images larger than 720×720 pixels were not considered. In dataset whose images are in HR, they were cut. We thank all the datasets owners to make them public. This dataset is just for Edge Detection not contour nor Boundary tasks.

  8. Precipitation Interquartile Range Spring Estimation (PERSIANN) 1984-2014

    • noaa.hub.arcgis.com
    Updated Dec 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOAA GeoPlatform (2024). Precipitation Interquartile Range Spring Estimation (PERSIANN) 1984-2014 [Dataset]. https://noaa.hub.arcgis.com/maps/c06721acf213414191847347fcbdff3b
    Explore at:
    Dataset updated
    Dec 18, 2024
    Dataset provided by
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    Authors
    NOAA GeoPlatform
    Area covered
    Description

    The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.

  9. f

    Characteristics of women, overall and according to BMI categories; data...

    • plos.figshare.com
    xls
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julie A. Pasco; Geoffrey C. Nicholson; Sharon L. Brennan; Mark A. Kotowicz (2023). Characteristics of women, overall and according to BMI categories; data presented as mean (±SD), median (IQR) or frequency (%). [Dataset]. http://doi.org/10.1371/journal.pone.0029580.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Julie A. Pasco; Geoffrey C. Nicholson; Sharon L. Brennan; Mark A. Kotowicz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    *n = 1041 (35 missing data).BMI = body mass index (kg/m2); SD = standard deviation; IQR = interquartile range; EI energy intake (MJ/d); BMR = basal metabolic rate (MJ/d).

  10. d

    Hunter AWRA Hydrological Response Variables (HRV)

    • data.gov.au
    • researchdata.edu.au
    • +1more
    zip
    Updated Jun 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bioregional Assessment Program (2022). Hunter AWRA Hydrological Response Variables (HRV) [Dataset]. https://data.gov.au/data/dataset/a84b2431-24e3-4537-ae50-84f4e955ebdc
    Explore at:
    zip(1245038746)Available download formats
    Dataset updated
    Jun 28, 2022
    Dataset authored and provided by
    Bioregional Assessment Program
    License

    Attribution 2.5 (CC BY 2.5)https://creativecommons.org/licenses/by/2.5/
    License information was derived automatically

    Description

    Abstract

    The dataset was derived by the Bioregional Assessment Programme from multiple datasets. The source dataset is identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

    Hydrological Response Variables (HRVs) are the hydrological characteristics of the system that potentially change due to coal resource development. These data refer to the HRVs related to the AWRA L and AWRA R models for the Hunter subregion for the 65 simulation nodes (63 within Hunter basin and 2 within Macquarie-Tuggerah Lake basin). The nine hydrological response variables (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed by the AWRA L and AWRA R models under CRDP and baseline conditions, respectively and the ACRD is the difference between the Baseline and CRDP.

    Abbreviation meaning

    AF - the annual streamflow volume (GL/year)

    P01 - the daily streamflow rate at the first percentile (ML/day)

    P01 - the daily streamflow rate at the first percentile (ML/day)

    IQR - the inter-quartile range in daily streamflow (ML/day). That is, the difference between the daily streamflow rate at the 75th percentile and at the 25th percentile.

    LFD - the number of low streamflow days per year. The threshold for low streamflow days is the 10th percentile from the simulated 90-year period (2013 to 2102)

    LFS - the number of low streamflow spells per year (perennial streams only). A spell is defined as a period of contiguous days of streamflow below the 10th percentile threshold

    LLFS - the length (days) of the longest low streamflow spell each year

    P99 - the daily streamflow rate at the 99th percentile (ML/day)

    FD - flood days, the number of days with streamflow greater than the 90th percentile from the simulated 90-year period (2013 to 2102)

    ZFD - Zero flow days

    Purpose

    This is the dataset used for the Hunter 2.6.1 product to evaluate additional coal mine and coal resource development impacts on hydrological response variables at 65 simulation nodes.

    Dataset History

    The HUN AWRA model outputs were used to determine the impacts on the HRVs to produce these data. The nine HRVs (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and baseline conditions, respectively. The difference between CRDP and baseline is used for predicting ACRD impacts on hydrological response variables at 65 simulation nodes.

    Dataset Citation

    Bioregional Assessment Programme (2017) Hunter AWRA Hydrological Response Variables (HRV). Bioregional Assessment Derived Dataset. Viewed 13 March 2019, http://data.bioregionalassessments.gov.au/dataset/a84b2431-24e3-4537-ae50-84f4e955ebdc.

    Dataset Ancestors

  11. Precipitation Interquartile Range Winter Estimation (PERSIANN) 1984-2014

    • noaa.hub.arcgis.com
    Updated Dec 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOAA GeoPlatform (2024). Precipitation Interquartile Range Winter Estimation (PERSIANN) 1984-2014 [Dataset]. https://noaa.hub.arcgis.com/maps/c38031dd1db6491d837e3b5e58c628d5
    Explore at:
    Dataset updated
    Dec 18, 2024
    Dataset provided by
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    Authors
    NOAA GeoPlatform
    Area covered
    Description

    The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.

  12. w

    Data from: GEOMACS (Geological and Oceanographic Model of Australias...

    • data.wu.ac.at
    • researchdata.edu.au
    • +1more
    zip
    Updated Jun 24, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CSIRO Oceans and Atmosphere - Information and Data Centre (2017). GEOMACS (Geological and Oceanographic Model of Australias Continental Shelf) Interquartile range [Dataset]. https://data.wu.ac.at/schema/data_gov_au/ZGRmZGQyYjktMjEwNC00OWUxLTk4OTQtNTM3OWQyY2YyNmU0
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 24, 2017
    Dataset provided by
    CSIRO Oceans and Atmosphere - Information and Data Centre
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Area covered
    Australia, cd30346d96a4fac7d77a1e77d04c5511ba04b2f6
    Description

    Geoscience Australias GEOMACS model was utilised to produce hindcast hourly time series of continental shelf (~20 to 300 m depth) bed shear stress (unit of measure: Pascal, Pa) on a 0.1 degree grid covering the period March 1997 to February 2008 (inclusive). The hindcast data represents the combined contribution to the bed shear stress by waves, tides, wind and density-driven circulation. Included in the parameters that will be calculated to represent the magnitude of the bulk of the data are the quartiles of the distribution; Q25, Q50 and Q75 (i.e. the values for which 25, 50 and 75 percent of the observations fall below). The interquartile range, , of the GEOMACS output takes the observations from between Q25 and Q75 to provide an accurate representation of the spread of observations. The interquartile range was shown to provide a more robust representation of the observations than the standard deviation, which produced highly skewed observations (Hughes and Harris 2008). This dataset is a contribution to the CERF Marine Biodiversity Hub and is hosted temporarily by CMAR on behalf of Geoscience Australia.

  13. Global Oscillation Network Group (GONG) Quick-Reduce Inputs (iQR)

    • catalog.data.gov
    Updated Oct 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOAA National Centers for Environmental Information (Point of Contact); National Centers for Environmental Information (NCEI) / Space Weather Prediction Center (STP) (Point of Contact, Custodian); National Weather Service (NWS) (Resource Provider) (2024). Global Oscillation Network Group (GONG) Quick-Reduce Inputs (iQR) [Dataset]. https://catalog.data.gov/dataset/global-oscillation-network-group-gong-quick-reduce-inputs-iqr1
    Explore at:
    Dataset updated
    Oct 18, 2024
    Dataset provided by
    National Weather Servicehttp://www.weather.gov/
    National Centers for Environmental Informationhttps://www.ncei.noaa.gov/
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    Space Weather Prediction Centerhttp://www.swpc.noaa.gov/
    Description

    Originally constructed in 1995, the Global Oscillation Network Group (GONG) is a network of six identical ground-based solar telescopes distributed around the Earth in order to obtain continuous observations of the Sun. Those sites are located in Big Bear, California (BB); Mauna Loa, Hawaii (ML); Learmonth, Australia (LE); Udaipur, India (UD); El Teide, Spain (TD); and Cerro Tololo, Chile (CT). Additionally, there are three engineering/testbed sites in Boulder, Colorado (TC, TE, and TS). Owned by the National Science Foundation, GONG is operated and maintained by the National Solar Observatory (NSO) with significant funding from NOAA’s Space Weather Prediction Center (SWPC). Each minute, weather permitting, the GONG network observes the Sun at two spectral wavelengths: 676.78nm (a Ni I absorption line) and 656.28nm (the H-alpha absorption line).

  14. i

    DATASET : Striped red mullet landing per unit of effort and environmental...

    • sextant.ifremer.fr
    • seanoe.org
    • +2more
    rel-canonical +2
    Updated May 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CNRS/Univ Pau & Pays Adour, Laboratoire de Mathématiques et de leurs Applications de Pau - Fédération MIRA, UMR5142, 64600 Anglet, France ARC Centre of Excellence for Mathematical and Statistical Frontiers at School of Mathematical Science, QueenslandUniversity of Technology, Brisbane, Australia (2021). DATASET : Striped red mullet landing per unit of effort and environmental variables in the Bay of Biscay [Dataset]. https://sextant.ifremer.fr/record/seanoe:77179/
    Explore at:
    www:link-1.0-http--metadata-url, www:download-1.0-link--download, rel-canonicalAvailable download formats
    Dataset updated
    May 12, 2021
    Dataset authored and provided by
    CNRS/Univ Pau & Pays Adour, Laboratoire de Mathématiques et de leurs Applications de Pau - Fédération MIRA, UMR5142, 64600 Anglet, France ARC Centre of Excellence for Mathematical and Statistical Frontiers at School of Mathematical Science, QueenslandUniversity of Technology, Brisbane, Australia
    Area covered
    Description
    ####### # Data description #

    This dataset have been constructed and used for scientific purpose, available in the paper "Detecting the effects of inter-annual and seasonal changes of environmental factors on the the striped red mullet population in the Bay of Biscay" authored by Kermorvant C., Caill-Milly N., Sous D., Paradinas I., Lissardy M. and Liquet B. and published in Journal of Sea Research. This file is an extraction from the SACROIS fisheries database created by Ifremer (for more information see https://sextant.ifremer.fr/record/3e177f76-96b0-42e2-8007-62210767dc07/) and from the Copernicus database. Biochemestry comes from the product GLOBAL_ANALYSIS_FORECAST_BIO_001_028 (https://resources.marine.copernicus.eu/?option=com_csw&view=details&product_id=GLOBAL_ANALYSIS_FORECAST_BIO_001_028). Temperature and salinity comes from GLOBAL_ANALYSIS_FORECAST_PHY_001_024 product (https://resources.marine.copernicus.eu/?option=com_csw&view=details&product_id=GLOBAL_ANALYSIS_FORECAST_PHY_001_024). As fisheries landing per unit of effort is only available per ICES rectangle and by month, environmental data have been aggregated accordingly.

    ######### # Colomns description # ############### rectangle - The 6 ICES statistical rectangles used in the study. time_m - Time in months, from the beginning to the end of the study. annee = year mois = month (from 1 to 12) Poids = Weight of red mullet landed valeur = Temps_peche = fishing time Nb_sequence = number of fishing sequences Moy / Med / Var / StD Quartil_1 / Quartil_3 / min / max / CV / IQR = statistical descriptors of landing by rectangle and by month log_cpue = log of Med colomn mean_surface_s = mean of surface salinity by month and by rectangle median_surface_s = median of surface salinity by month and by rectangle mean_surface_t = mean of surface temperature by month and by rectangle median_surface_t = median of surface temperature by month and by rectangle si / zeu /po4 / pyc / o2/ nppv / no3 and nh4 mean and median concentration by rectangle and by month pc3 / pc2 / pc1 - projections of previous biochemestry variables on the three first axes of a PCA
  15. Italy: Mobility COVID-19

    • kaggle.com
    Updated Mar 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mr. Rahman (2021). Italy: Mobility COVID-19 [Dataset]. https://www.kaggle.com/motiurse/italy-mobility-covid19/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 26, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mr. Rahman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Italy
    Description

    A live version of the data record, which will be kept up-to-date with new estimates, can be downloaded from the Humanitarian Data Exchange: https://data.humdata.org/dataset/covid-19-mobility-italy.

    If you find the data helpful or you use the data for your research, please cite our work:

    Pepe, E., Bajardi, P., Gauvin, L., Privitera, F., Lake, B., Cattuto, C., & Tizzoni, M. (2020). COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Scientific Data 7, 230 (2020).

    The data record is structured into 4 comma-separated value (CSV) files, as follows:

    id_provinces_IT.csv. Table of the administrative codes of the 107 Italian provinces. The fields of the table are:

    COD_PROV is an integer field that is used to identify a province in all other data records;

    SIGLA is a two-letters code that identifies the province according to the ISO_3166-2 standard (https://en.wikipedia.org/wiki/ISO_3166-2:IT);

    DEN_PCM is the full name of the province.

    OD_Matrix_daily_flows_norm_full_2020_01_18_2020_04_17.csv. The file contains the daily fraction of users’ moving between Italian provinces. Each line corresponds to an entry of matrix (i, j). The fields of the table are:

    p1: COD_PROV of origin,

    p2: COD_PROV of destination,

    day: in the format yyyy-mm-dd.

    median_q1_q3_rog_2020_01_18_2020_04_17.csv. The file contains median and interquartile range (IQR) of users’ radius of gyration in a province by week. Each entry of the table fields of the table are:

    COD_PROV of the province;

    SIGLA of the province;

    DEN_PCM of the province;

    week: median value of the radius of gyration on week week, with week in the format dd/mm-DD/MM where dd/mm and DD/MM are the first and the last day of the week, respectively.

    week Q1 first quartile (Q1) of the distribution of the radius of gyration on week week,

    week Q3 third quartile (Q3) of the distribution of the radius of gyration on week week,

    average_network_degree_2020_01_18_2020_04_17.csv. The file contains daily time-series of the average degree 〈k〉 of the proximity network. Each entry of the table is a value of 〈k〉 on a given day. The fields of the table are:

    COD_PROV of the province;

    SIGLA of the province;

    DEN_PCM of the province;

    day in the format yyyy-mm-dd.

    ESRI shapefiles of the Italian provinces updated to the most recent definition are available from the website of the Italian National Office of Statistics (ISTAT): https://www.istat.it/it/archivio/222527.

  16. d

    Namoi standard Hydrological Response Variables (HRVs)

    • data.gov.au
    • researchdata.edu.au
    • +1more
    Updated Nov 20, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bioregional Assessment Program (2019). Namoi standard Hydrological Response Variables (HRVs) [Dataset]. https://data.gov.au/data/dataset/groups/189f4c7a-29e1-41f9-868d-b7f5184d829f
    Explore at:
    Dataset updated
    Nov 20, 2019
    Dataset provided by
    Bioregional Assessment Program
    Area covered
    Namoi River
    Description

    Abstract

    The dataset was derived by the Bioregional Assessment Programme from multiple datasets. The source dataset is identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.

    Hydrological Response Variables (HRVs) are the hydrological characteristics of the system that potentially change due to coal resource development. These data refer to the HRVs related to the AWRA-R model for the Namoi subregion for the 54 simulation nodes. The nine hydrological response variables (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and Baseline conditions, respectively and the ACRD is the difference between the Baseline and CRDP.

    Abbreviation meaning

    AF - the annual streamflow volume (GL/year)

    P01 - the daily streamflow rate at the first percentile (ML/day)

    P01 - the daily streamflow rate at the first percentile (ML/day)

    IQR - the inter-quartile range in daily streamflow (ML/day). That is, the difference between the daily streamflow rate at the 75th percentile and at the 25th percentile.

    LFD - the number of low streamflow days per year. The threshold for low streamflow days is the 10th percentile from the simulated 90-year period (2013 to 2102)

    LFS - the number of low streamflow spells per year (perennial streams only). A spell is defined as a period of contiguous days of streamflow below the 10th percentile threshold

    LLFS - the length (days) of the longest low streamflow spell each year

    P99 - the daily streamflow rate at the 99th percentile (ML/day)

    FD - flood days, the number of days with streamflow greater than the 90th percentile from the simulated 90-year period (2013 to 2102)

    ZFD - Zero flow days

    Purpose

    This is the dataset used for the Namoi 2.6.1 product to evaluate additional coal mine and coal resource development impacts on hydrological response variables at 54 simulation nodes.

    Dataset History

    The Namoi AWRA-R model outputs were used to determine the impacts on the HRVs to produce these data. Readme files within the folders in the dataset provide an explanation on how the resource was created. The nine HRVs (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and Baseline conditions, respectively. The difference between CRDP and Baseline is used for predicting ACRD impacts on hydrological response variables at 54 simulation nodes.

    Dataset Citation

    Bioregional Assessment Programme (2017) Namoi standard Hydrological Response Variables (HRVs). Bioregional Assessment Derived Dataset. Viewed 11 December 2018, http://data.bioregionalassessments.gov.au/dataset/189f4c7a-29e1-41f9-868d-b7f5184d829f.

    Dataset Ancestors

  17. Z

    Dataset related to article "Association between cardiac troponin I and...

    • data.niaid.nih.gov
    Updated Apr 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Giuseppe Moriello (2021). Dataset related to article "Association between cardiac troponin I and mortality in patients with COVID-19 " [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4723490
    Explore at:
    Dataset updated
    Apr 28, 2021
    Dataset provided by
    Giuseppe Moriello
    Federica Maura
    Sara Maria Giulia Cioffi
    Francesco Paolo Leone
    Emanuela Morenghi
    Maria Teresa Sandri
    Barbara Barbieri
    Michela Salvatici
    Description

    Background: Severe pneumonia is pathological manifestation of Coronavirus Disease 2019 (COVID-19), however complications have been reported in COVID-19 patients with a worst prognosis. Aim of this study was to evaluate the role of high sensitivity cardiac troponin I (hs-TnI) in patients with SARS-CoV-2 infection.

    Methods: we retrospectively analysed hs-TnI values measured in 523 patients (median age 64 years, 68% men) admitted to a university hospital in Milan, Italy, and diagnosed COVID-19.

    Results: A significant difference in hs-TnI concentrations was found between deceased patients (98 patients) vs discharged (425 patients) [36.05 ng/L IQR 16.5-94.9 vs 6.3 ng/L IQR 2.6-13.9, p < 0.001 respectively]. Hs-TnI measurements were independent predictors of mortality at multivariate analysis adjusted for confounding parameters such as age (HR 1.004 for each 10 point of troponin, 95% CI 1.002-1.006, p < 0.001). The survival rate, after one week, in patients with hs-TnI values under 6 ng/L was 97.94%, between 6 ng/L and the normal value was 90.87%, between the normal value and 40 ng/L was 86.98, and 59.27% over 40 ng/L.

    Conclusion: Increase of hs-TnI associated with elevated mortality in patients with COVID-19. Troponin shows to be a useful biomarker of disease progression and worse prognosis in COVID-19 patients.

  18. Median and interquartile range (IQR) of fasting and postprandial...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kim M. Huffman; Leanne M. Redman; Lawrence R. Landerman; Carl F. Pieper; Robert D. Stevens; Michael J. Muehlbauer; Brett R. Wenner; James R. Bain; Virginia B. Kraus; Christopher B. Newgard; Eric Ravussin; William E. Kraus (2023). Median and interquartile range (IQR) of fasting and postprandial concentrations of metabolites at baseline (n = 46).* [Dataset]. http://doi.org/10.1371/journal.pone.0028190.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Kim M. Huffman; Leanne M. Redman; Lawrence R. Landerman; Carl F. Pieper; Robert D. Stevens; Michael J. Muehlbauer; Brett R. Wenner; James R. Bain; Virginia B. Kraus; Christopher B. Newgard; Eric Ravussin; William E. Kraus
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    *Those loading most heavily (component load ≥|0.5|) in principal component analyses are identified in bold.

  19. Z

    Data from: Diagnostic Value of Global Cardiac Strain in Patients With...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paola Maria Cannaò (2021). Diagnostic Value of Global Cardiac Strain in Patients With Myocarditis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5147939
    Explore at:
    Dataset updated
    Jul 30, 2021
    Dataset provided by
    Francesco Saverio Carbone
    Francesco Sardanelli
    Paola Maria Cannaò
    Marco Alì
    Francesco Secchi
    Caterina Beatrice Monti
    Description

    Dataset from the article Secchi F, Monti CB, Alì M, Carbone FS, Cannaò PM, Sardanelli F. Diagnostic Value of Global Cardiac Strain in Patients With Myocarditis. J Comput Assist Tomogr. 2020 Jul/Aug;44(4):591-598. doi: 10.1097/RCT.0000000000001062. PMID: 32697530.

    Abstract

    Background: Cardiac strain represents an imaging biomarker of contractile dysfunction.

    Purpose: The purpose of this study was to investigate the diagnostic value of cardiac strain obtained by feature-tracking cardiac magnetic resonance (MR) in acute myocarditis.

    Materials and methods: Cardiac MR examinations of 46 patients with myocarditis and preserved ejection fraction at acute phase and follow-up were analyzed along with cardiac MR of 46 healthy age- and sex-matched controls. Global circumferential strain and global radial strain were calculated for each examination, along with myocardial edema and late gadolinium enhancement, and left ventricle functional parameters, through manual contouring of the myocardium. Correlations were assessed using Spearman ρ. Wilcoxon and Mann-Whitney U test were used to assess differences between data. Receiver operating characteristics curves and reproducibility were obtained to assess the diagnostic role of strain parameters.

    Results: Global circumferential strain was significantly lower in controls (median, -20.4%; interquartile range [IQR], -23.4% to -18.7%) than patients in acute phase (-18.4%; IQR, -21.0% to -16.1%; P = 0.001) or at follow-up (-19.2%; IQR, -21.5% to -16.1%; P = 0.020). Global radial strain was significantly higher in controls (82.4%; IQR, 62.8%-104.9%) than in patients during the acute phase (65.8%; IQR, 52.9%-79.5%; P = 0.001). Correlations were found between global circumferential strain and global radial strain in all groups (acute, ρ = -0.580, P < 0.001; follow-up, ρ = -0.399, P = 0.006; controls, ρ = -0.609, P < 0.001), and between global circumferential strain and late gadolinium enhancement only in myocarditis patients (acute, ρ = 0.035, P = 0.024; follow-up, ρ = 0.307, P = 0.038).

    Conclusions: Cardiac strain could potentially have a role in detecting acute myocarditis in low-risk acute myocarditis patients where cardiac MR is the main diagnosing technique.

  20. Gender, Age, and Emotion Detection from Voice

    • kaggle.com
    Updated May 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit Zaman (2021). Gender, Age, and Emotion Detection from Voice [Dataset]. https://www.kaggle.com/datasets/rohitzaman/gender-age-and-emotion-detection-from-voice/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 29, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rohit Zaman
    Description

    Context

    Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.

    Content

    Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.

    Acknowledgements

    Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng (2022). IQR Dataset [Dataset]. https://paperswithcode.com/dataset/image-query-retrieval-dataset

IQR Dataset

Image-Query Retrieval Dataset

Explore at:
Dataset updated
May 7, 2022
Authors
Chunyu Xie; Heng Cai; Jincheng Li; Fanjing Kong; Xiaoyu Wu; Jianfei Song; Henrique Morimitsu; Lin Yao; Dexin Wang; Xiangzheng Zhang; Dawei Leng; Baochang Zhang; Xiangyang Ji; Yafeng Deng
Description

IQR is proposed for the image-text retrieval task. We use 200,000 queries and the corresponding images as the annotated image-query pairs.

Search
Clear search
Close search
Google apps
Main menu