IQR is proposed for the image-text retrieval task. We use 200,000 queries and the corresponding images as the annotated image-query pairs.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Descriptive statistics, mean ± SD, range, median and interquartile range (IQR).
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a new, retrospective satellite-based precipitation dataset, constructed as a climate data record for hydrological and climate studies. The PERSIANN-CDR is available from 1983-present making the dataset the longest satellite based precipitation data record available. The precipitation maps are available at daily temporal resolution for the latitude band 60°S–60°N at 0.25 degrees. The maps shown here represent 30-year annual and seasonal median and interquartile range (IQR) of the PERSIANN-CDR dataset from 1984 – 2014. In the median precipitation maps, the mid-point value (or 50th percentile) for each pixel in is computed and plotted for the study area. The range of the data about the median is represented by the interquartile range (IQR), and shows the variability of the dataset. For these maps, winter = December – February, spring = March – May, summer = June – August, fall = September – November
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
📁 Files Included:
Outlier_Loan_datase.csv
– Raw dataset with outliers
`.Final_Outliers_clean_dataset.csv (IQR + Z-score)
This dataset is designed for practicing outlier detection and data cleaning techniques.
It includes both the original (uncleaned) and cleaned versions of a financial dataset.
These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”. Metadata (including data dictionary) • y: Vector of binary responses (1: adverse outcome, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate) Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
a Age ≥15 yrs unless otherwise specified.
This dataset is a collection of 1, 2, or 3 images from: BIPED, BSDS500, BSDS300, DIV2K, WIRE-FRAME, CID, CITYSCAPES, ADE20K, MDBD, NYUD, THANGKA, PASCAL-Context, SET14, URBAN10, and the camera-man image. The image selection process consists on computing the Inter-Quartile Range (IQR) intensity value on all the images, images larger than 720×720 pixels were not considered. In dataset whose images are in HR, they were cut. We thank all the datasets owners to make them public. This dataset is just for Edge Detection not contour nor Boundary tasks.
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
*n = 1041 (35 missing data).BMI = body mass index (kg/m2); SD = standard deviation; IQR = interquartile range; EI energy intake (MJ/d); BMR = basal metabolic rate (MJ/d).
Attribution 2.5 (CC BY 2.5)https://creativecommons.org/licenses/by/2.5/
License information was derived automatically
The dataset was derived by the Bioregional Assessment Programme from multiple datasets. The source dataset is identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.
Hydrological Response Variables (HRVs) are the hydrological characteristics of the system that potentially change due to coal resource development. These data refer to the HRVs related to the AWRA L and AWRA R models for the Hunter subregion for the 65 simulation nodes (63 within Hunter basin and 2 within Macquarie-Tuggerah Lake basin). The nine hydrological response variables (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed by the AWRA L and AWRA R models under CRDP and baseline conditions, respectively and the ACRD is the difference between the Baseline and CRDP.
Abbreviation meaning
AF - the annual streamflow volume (GL/year)
P01 - the daily streamflow rate at the first percentile (ML/day)
P01 - the daily streamflow rate at the first percentile (ML/day)
IQR - the inter-quartile range in daily streamflow (ML/day). That is, the difference between the daily streamflow rate at the 75th percentile and at the 25th percentile.
LFD - the number of low streamflow days per year. The threshold for low streamflow days is the 10th percentile from the simulated 90-year period (2013 to 2102)
LFS - the number of low streamflow spells per year (perennial streams only). A spell is defined as a period of contiguous days of streamflow below the 10th percentile threshold
LLFS - the length (days) of the longest low streamflow spell each year
P99 - the daily streamflow rate at the 99th percentile (ML/day)
FD - flood days, the number of days with streamflow greater than the 90th percentile from the simulated 90-year period (2013 to 2102)
ZFD - Zero flow days
This is the dataset used for the Hunter 2.6.1 product to evaluate additional coal mine and coal resource development impacts on hydrological response variables at 65 simulation nodes.
The HUN AWRA model outputs were used to determine the impacts on the HRVs to produce these data. The nine HRVs (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and baseline conditions, respectively. The difference between CRDP and baseline is used for predicting ACRD impacts on hydrological response variables at 65 simulation nodes.
Bioregional Assessment Programme (2017) Hunter AWRA Hydrological Response Variables (HRV). Bioregional Assessment Derived Dataset. Viewed 13 March 2019, http://data.bioregionalassessments.gov.au/dataset/a84b2431-24e3-4537-ae50-84f4e955ebdc.
Derived From River Styles Spatial Layer for New South Wales
Derived From HUN AWRA-L simulation nodes_v01
Derived From Hunter River Salinity Scheme Discharge NSW EPA 2006-2012
Derived From HUN AWRA-R simulation nodes v01
Derived From Bioregional Assessment areas v06
Derived From GEODATA 9 second DEM and D8: Digital Elevation Model Version 3 and Flow Direction Grid 2008
Derived From Bioregional Assessment areas v04
Derived From HUN AWRA-R Gauge Station Cross Sections v01
Derived From Gippsland Project boundary
Derived From Natural Resource Management (NRM) Regions 2010
Derived From BA All Regions BILO cells in subregions shapefile
Derived From Hunter Surface Water data v2 20140724
Derived From HUN AWRA-R River Reaches Simulation v01
Derived From HUN AWRA-L simulation nodes v02
Derived From GEODATA TOPO 250K Series 3, File Geodatabase format (.gdb)
Derived From HUN AWRA-R Irrigation Area Extents and Crop Types v01
Derived From GEODATA TOPO 250K Series 3
Derived From NSW Catchment Management Authority Boundaries 20130917
Derived From Geological Provinces - Full Extent
Derived From BA SYD selected GA TOPO 250K data plus added map features
Derived From Bioregional_Assessment_Programme_Catchment Scale Land Use of Australia - 2014
Derived From Bioregional Assessment areas v03
Derived From IQQM Model Simulation Regulated Rivers NSW DPI HUN 20150615
Derived From HUN AWRA-R calibration catchments v01
Derived From Bioregional Assessment areas v05
Derived From BILO Gridded Climate Data: Daily Climate Data for each year from 1900 to 2012
Derived From National Surface Water sites Hydstra
Derived From Selected streamflow gauges within and near the Hunter subregion
Derived From ASRIS Continental-scale soil property predictions 2001
Derived From Hunter Surface Water data extracted 20140718
Derived From Mean Annual Climate Data of Australia 1981 to 2012
Derived From HUN AWRA-R calibration nodes v01
Derived From HUN AWRA-R Observed storage volumes Glenbawn Dam and Glennies Creek Dam
Derived From HUN AWRA-LR Model v01
Derived From HUN AWRA-L ASRIS soil properties v01
Derived From HUN AWRAR restricted input 01
Derived From Bioregional Assessment areas v01
Derived From Bioregional Assessment areas v02
Derived From Victoria - Seamless Geology 2014
Derived From HUN AWRA-L Site Station Cross Sections v01
Derived From HUN AWRA-R simulation catchments v01
Derived From HUN AWRA-R Simulation Node Cross Sections v01
Derived From Climate model 0.05x0.05 cells and cell centroids
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Geoscience Australias GEOMACS model was utilised to produce hindcast hourly time series of continental shelf (~20 to 300 m depth) bed shear stress (unit of measure: Pascal, Pa) on a 0.1 degree grid covering the period March 1997 to February 2008 (inclusive). The hindcast data represents the combined contribution to the bed shear stress by waves, tides, wind and density-driven circulation. Included in the parameters that will be calculated to represent the magnitude of the bulk of the data are the quartiles of the distribution; Q25, Q50 and Q75 (i.e. the values for which 25, 50 and 75 percent of the observations fall below). The interquartile range, , of the GEOMACS output takes the observations from between Q25 and Q75 to provide an accurate representation of the spread of observations. The interquartile range was shown to provide a more robust representation of the observations than the standard deviation, which produced highly skewed observations (Hughes and Harris 2008). This dataset is a contribution to the CERF Marine Biodiversity Hub and is hosted temporarily by CMAR on behalf of Geoscience Australia.
Originally constructed in 1995, the Global Oscillation Network Group (GONG) is a network of six identical ground-based solar telescopes distributed around the Earth in order to obtain continuous observations of the Sun. Those sites are located in Big Bear, California (BB); Mauna Loa, Hawaii (ML); Learmonth, Australia (LE); Udaipur, India (UD); El Teide, Spain (TD); and Cerro Tololo, Chile (CT). Additionally, there are three engineering/testbed sites in Boulder, Colorado (TC, TE, and TS). Owned by the National Science Foundation, GONG is operated and maintained by the National Solar Observatory (NSO) with significant funding from NOAA’s Space Weather Prediction Center (SWPC). Each minute, weather permitting, the GONG network observes the Sun at two spectral wavelengths: 676.78nm (a Ni I absorption line) and 656.28nm (the H-alpha absorption line).
This dataset have been constructed and used for scientific purpose, available in the paper "Detecting the effects of inter-annual and seasonal changes of environmental factors on the the striped red mullet population in the Bay of Biscay" authored by Kermorvant C., Caill-Milly N., Sous D., Paradinas I., Lissardy M. and Liquet B. and published in Journal of Sea Research. This file is an extraction from the SACROIS fisheries database created by Ifremer (for more information see https://sextant.ifremer.fr/record/3e177f76-96b0-42e2-8007-62210767dc07/) and from the Copernicus database. Biochemestry comes from the product GLOBAL_ANALYSIS_FORECAST_BIO_001_028 (https://resources.marine.copernicus.eu/?option=com_csw&view=details&product_id=GLOBAL_ANALYSIS_FORECAST_BIO_001_028). Temperature and salinity comes from GLOBAL_ANALYSIS_FORECAST_PHY_001_024 product (https://resources.marine.copernicus.eu/?option=com_csw&view=details&product_id=GLOBAL_ANALYSIS_FORECAST_PHY_001_024). As fisheries landing per unit of effort is only available per ICES rectangle and by month, environmental data have been aggregated accordingly.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A live version of the data record, which will be kept up-to-date with new estimates, can be downloaded from the Humanitarian Data Exchange: https://data.humdata.org/dataset/covid-19-mobility-italy.
If you find the data helpful or you use the data for your research, please cite our work:
Pepe, E., Bajardi, P., Gauvin, L., Privitera, F., Lake, B., Cattuto, C., & Tizzoni, M. (2020). COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Scientific Data 7, 230 (2020).
The data record is structured into 4 comma-separated value (CSV) files, as follows:
id_provinces_IT.csv. Table of the administrative codes of the 107 Italian provinces. The fields of the table are:
COD_PROV is an integer field that is used to identify a province in all other data records;
SIGLA is a two-letters code that identifies the province according to the ISO_3166-2 standard (https://en.wikipedia.org/wiki/ISO_3166-2:IT);
DEN_PCM is the full name of the province.
OD_Matrix_daily_flows_norm_full_2020_01_18_2020_04_17.csv. The file contains the daily fraction of users’ moving between Italian provinces. Each line corresponds to an entry of matrix (i, j). The fields of the table are:
p1: COD_PROV of origin,
p2: COD_PROV of destination,
day: in the format yyyy-mm-dd.
median_q1_q3_rog_2020_01_18_2020_04_17.csv. The file contains median and interquartile range (IQR) of users’ radius of gyration in a province by week. Each entry of the table fields of the table are:
COD_PROV of the province;
SIGLA of the province;
DEN_PCM of the province;
week: median value of the radius of gyration on week week, with week in the format dd/mm-DD/MM where dd/mm and DD/MM are the first and the last day of the week, respectively.
week Q1 first quartile (Q1) of the distribution of the radius of gyration on week week,
week Q3 third quartile (Q3) of the distribution of the radius of gyration on week week,
average_network_degree_2020_01_18_2020_04_17.csv. The file contains daily time-series of the average degree 〈k〉 of the proximity network. Each entry of the table is a value of 〈k〉 on a given day. The fields of the table are:
COD_PROV of the province;
SIGLA of the province;
DEN_PCM of the province;
day in the format yyyy-mm-dd.
ESRI shapefiles of the Italian provinces updated to the most recent definition are available from the website of the Italian National Office of Statistics (ISTAT): https://www.istat.it/it/archivio/222527.
The dataset was derived by the Bioregional Assessment Programme from multiple datasets. The source dataset is identified in the Lineage field in this metadata statement. The processes undertaken to produce this derived dataset are described in the History field in this metadata statement.
Hydrological Response Variables (HRVs) are the hydrological characteristics of the system that potentially change due to coal resource development. These data refer to the HRVs related to the AWRA-R model for the Namoi subregion for the 54 simulation nodes. The nine hydrological response variables (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and Baseline conditions, respectively and the ACRD is the difference between the Baseline and CRDP.
Abbreviation meaning
AF - the annual streamflow volume (GL/year)
P01 - the daily streamflow rate at the first percentile (ML/day)
P01 - the daily streamflow rate at the first percentile (ML/day)
IQR - the inter-quartile range in daily streamflow (ML/day). That is, the difference between the daily streamflow rate at the 75th percentile and at the 25th percentile.
LFD - the number of low streamflow days per year. The threshold for low streamflow days is the 10th percentile from the simulated 90-year period (2013 to 2102)
LFS - the number of low streamflow spells per year (perennial streams only). A spell is defined as a period of contiguous days of streamflow below the 10th percentile threshold
LLFS - the length (days) of the longest low streamflow spell each year
P99 - the daily streamflow rate at the 99th percentile (ML/day)
FD - flood days, the number of days with streamflow greater than the 90th percentile from the simulated 90-year period (2013 to 2102)
ZFD - Zero flow days
This is the dataset used for the Namoi 2.6.1 product to evaluate additional coal mine and coal resource development impacts on hydrological response variables at 54 simulation nodes.
The Namoi AWRA-R model outputs were used to determine the impacts on the HRVs to produce these data. Readme files within the folders in the dataset provide an explanation on how the resource was created. The nine HRVs (AF, P99, FD, IQR, ZFD, P01, LFD, LFS, LLFS) were computed under CRDP and Baseline conditions, respectively. The difference between CRDP and Baseline is used for predicting ACRD impacts on hydrological response variables at 54 simulation nodes.
Bioregional Assessment Programme (2017) Namoi standard Hydrological Response Variables (HRVs). Bioregional Assessment Derived Dataset. Viewed 11 December 2018, http://data.bioregionalassessments.gov.au/dataset/189f4c7a-29e1-41f9-868d-b7f5184d829f.
Derived From Historical Mining Footprints DTIRIS NAM 20150914
Derived From Namoi AWRA-R (restricted input data implementation)
Derived From River Styles Spatial Layer for New South Wales
Derived From Namoi Surface Water Mine Footprints - digitised
Derived From Namoi AWRA-R model implementation (post groundwater input)
Derived From National Surface Water sites Hydstra
Derived From Namoi AWRA-L model
Derived From Namoi Hydstra surface water time series v1 extracted 140814
Derived From GEODATA 9 second DEM and D8: Digital Elevation Model Version 3 and Flow Direction Grid 2008
Derived From Namoi Environmental Impact Statements - Mine footprints
Derived From Namoi Existing Mine Development Surface Water Footprints
Background: Severe pneumonia is pathological manifestation of Coronavirus Disease 2019 (COVID-19), however complications have been reported in COVID-19 patients with a worst prognosis. Aim of this study was to evaluate the role of high sensitivity cardiac troponin I (hs-TnI) in patients with SARS-CoV-2 infection.
Methods: we retrospectively analysed hs-TnI values measured in 523 patients (median age 64 years, 68% men) admitted to a university hospital in Milan, Italy, and diagnosed COVID-19.
Results: A significant difference in hs-TnI concentrations was found between deceased patients (98 patients) vs discharged (425 patients) [36.05 ng/L IQR 16.5-94.9 vs 6.3 ng/L IQR 2.6-13.9, p < 0.001 respectively]. Hs-TnI measurements were independent predictors of mortality at multivariate analysis adjusted for confounding parameters such as age (HR 1.004 for each 10 point of troponin, 95% CI 1.002-1.006, p < 0.001). The survival rate, after one week, in patients with hs-TnI values under 6 ng/L was 97.94%, between 6 ng/L and the normal value was 90.87%, between the normal value and 40 ng/L was 86.98, and 59.27% over 40 ng/L.
Conclusion: Increase of hs-TnI associated with elevated mortality in patients with COVID-19. Troponin shows to be a useful biomarker of disease progression and worse prognosis in COVID-19 patients.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
*Those loading most heavily (component load ≥|0.5|) in principal component analyses are identified in bold.
Dataset from the article Secchi F, Monti CB, Alì M, Carbone FS, Cannaò PM, Sardanelli F. Diagnostic Value of Global Cardiac Strain in Patients With Myocarditis. J Comput Assist Tomogr. 2020 Jul/Aug;44(4):591-598. doi: 10.1097/RCT.0000000000001062. PMID: 32697530.
Abstract
Background: Cardiac strain represents an imaging biomarker of contractile dysfunction.
Purpose: The purpose of this study was to investigate the diagnostic value of cardiac strain obtained by feature-tracking cardiac magnetic resonance (MR) in acute myocarditis.
Materials and methods: Cardiac MR examinations of 46 patients with myocarditis and preserved ejection fraction at acute phase and follow-up were analyzed along with cardiac MR of 46 healthy age- and sex-matched controls. Global circumferential strain and global radial strain were calculated for each examination, along with myocardial edema and late gadolinium enhancement, and left ventricle functional parameters, through manual contouring of the myocardium. Correlations were assessed using Spearman ρ. Wilcoxon and Mann-Whitney U test were used to assess differences between data. Receiver operating characteristics curves and reproducibility were obtained to assess the diagnostic role of strain parameters.
Results: Global circumferential strain was significantly lower in controls (median, -20.4%; interquartile range [IQR], -23.4% to -18.7%) than patients in acute phase (-18.4%; IQR, -21.0% to -16.1%; P = 0.001) or at follow-up (-19.2%; IQR, -21.5% to -16.1%; P = 0.020). Global radial strain was significantly higher in controls (82.4%; IQR, 62.8%-104.9%) than in patients during the acute phase (65.8%; IQR, 52.9%-79.5%; P = 0.001). Correlations were found between global circumferential strain and global radial strain in all groups (acute, ρ = -0.580, P < 0.001; follow-up, ρ = -0.399, P = 0.006; controls, ρ = -0.609, P < 0.001), and between global circumferential strain and late gadolinium enhancement only in myocarditis patients (acute, ρ = 0.035, P = 0.024; follow-up, ρ = 0.307, P = 0.038).
Conclusions: Cardiac strain could potentially have a role in detecting acute myocarditis in low-risk acute myocarditis patients where cardiac MR is the main diagnosing technique.
Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.
Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.
Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/
IQR is proposed for the image-text retrieval task. We use 200,000 queries and the corresponding images as the annotated image-query pairs.