11 datasets found
  1. Gender, Age, and Emotion Detection from Voice

    • kaggle.com
    zip
    Updated May 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit Zaman (2021). Gender, Age, and Emotion Detection from Voice [Dataset]. https://www.kaggle.com/rohitzaman/gender-age-and-emotion-detection-from-voice
    Explore at:
    zip(967820 bytes)Available download formats
    Dataset updated
    May 29, 2021
    Authors
    Rohit Zaman
    Description

    Context

    Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.

    Content

    Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.

    Acknowledgements

    Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/

  2. Tonal languages from mozilla common voice 10

    • kaggle.com
    zip
    Updated Jan 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enrique Díaz-Ocampo (2023). Tonal languages from mozilla common voice 10 [Dataset]. https://www.kaggle.com/datasets/enriquedazocampo/tonal-languages-mozilla-common-voice
    Explore at:
    zip(23999125 bytes)Available download formats
    Dataset updated
    Jan 4, 2023
    Authors
    Enrique Díaz-Ocampo
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    The following dataset is intended to be used for gender recognition using audio files in uncontrolled environments from the Mozilla Common Voice Dataset 10.0. It consists of a table of descriptive statistical characteristics of the fundamental frequency of six tonal languages Chinese (China), Chinese (Hong Kong), Chinese (Taiwan), Thai, Vietnamese, and Punjabi. In addition, the estimation of the vocal tract of each of the speakers.

    This dataset contains 18 columns: 'client_id': id speaker from Mozilla Common Voice 'path': Name of the mp3 file 'sentence': The sentence spoken by the speaker 'age': Age in decades (teens, twenties, etc.) 'gender': Binary gender (male or female) 'duration': Duration of mp3 in seconds 'vocal_tract_length': Vocal tract length in cm. 'mean_F4': Mean of the fourth formant in Hz. 'min_pitch': Minimal pitch of the whole pitch contour in Hz. 'mean_pitch': Mean pitch of the whole pitch contour in Hz. 'q1_pitch': : First quartile of the whole pitch contour in Hz. 'median_pitch': : Median pitch of the whole pitch contour Hz. 'q3_pitch': : Third quartile of the whole pitch contour in Hz. 'max_pitch': : Max pitch of the whole pitch contour in Hz. 'stddev_pitch' : Standard deviation of the whole pitch contour in Hz. 'estimated_age': Nominal value (adult or teen) 'estimated_age_gender: Nominal value (adult-male, adult-female, teen-male and teen-female). 'language': Nominal value (Chinese (China), Chinese (Hong Kong), Chinese (Taiwan), Thai, Vietnamese, and Punjabi).

    The methodology for the extraction of these characteristics was the following:

    Only the audios from the valid.tsv file of the respective language were analyzed (this file is contained in the Mozilla Common Voice Dataset https://commonvoice.mozilla.org/en/datasets ) the voiced-speech was extracted using Praat's algorithm Vocal ToolKit (https://www.praatvocaltoolkit.com/extract-voiced-and-unvoiced.html)

    2) The vocal tract length was calculated with the Vocal Tool Kit algorithm ( https://www.praatvocaltoolkit.com/calculate-vocal-tract-length.html ) as follows: If the audio came from a teen, then the maximum formant was established at 8000, otherwise it was adjusted to 5000 Hz for men and 5500 for women. Finally, the mean of the fourth formant was calculated for the windows with voiced speech only.

    3) The fundamental frequency was calculated using the PRAAT Software in the To Pitch (ac) option and a) Time step (s) 0.0 (=auto) b) Pitch floor (Hz) 75.0 c) Max. number of candidates 15 d) Vey accurate=True e) Silence Threshold= 0.03 f) Voicing threshold= 0.45 g) Octave Cost= 0.01 h) Octave jump cost = 0.35 i) Voiced/ Unvoiced cost= 0.14 j) Pitch ceiling (Hz) = 350

    4) The statistical characteristics of the fundamental frequency were calculated only in the windows that were detected as voiced speech.

  3. Italy: Mobility COVID-19

    • kaggle.com
    Updated Mar 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mr. Rahman (2021). Italy: Mobility COVID-19 [Dataset]. https://www.kaggle.com/motiurse/italy-mobility-covid19/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 26, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mr. Rahman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Italy
    Description

    A live version of the data record, which will be kept up-to-date with new estimates, can be downloaded from the Humanitarian Data Exchange: https://data.humdata.org/dataset/covid-19-mobility-italy.

    If you find the data helpful or you use the data for your research, please cite our work:

    Pepe, E., Bajardi, P., Gauvin, L., Privitera, F., Lake, B., Cattuto, C., & Tizzoni, M. (2020). COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Scientific Data 7, 230 (2020).

    The data record is structured into 4 comma-separated value (CSV) files, as follows:

    id_provinces_IT.csv. Table of the administrative codes of the 107 Italian provinces. The fields of the table are:

    COD_PROV is an integer field that is used to identify a province in all other data records;

    SIGLA is a two-letters code that identifies the province according to the ISO_3166-2 standard (https://en.wikipedia.org/wiki/ISO_3166-2:IT);

    DEN_PCM is the full name of the province.

    OD_Matrix_daily_flows_norm_full_2020_01_18_2020_04_17.csv. The file contains the daily fraction of users’ moving between Italian provinces. Each line corresponds to an entry of matrix (i, j). The fields of the table are:

    p1: COD_PROV of origin,

    p2: COD_PROV of destination,

    day: in the format yyyy-mm-dd.

    median_q1_q3_rog_2020_01_18_2020_04_17.csv. The file contains median and interquartile range (IQR) of users’ radius of gyration in a province by week. Each entry of the table fields of the table are:

    COD_PROV of the province;

    SIGLA of the province;

    DEN_PCM of the province;

    week: median value of the radius of gyration on week week, with week in the format dd/mm-DD/MM where dd/mm and DD/MM are the first and the last day of the week, respectively.

    week Q1 first quartile (Q1) of the distribution of the radius of gyration on week week,

    week Q3 third quartile (Q3) of the distribution of the radius of gyration on week week,

    average_network_degree_2020_01_18_2020_04_17.csv. The file contains daily time-series of the average degree 〈k〉 of the proximity network. Each entry of the table is a value of 〈k〉 on a given day. The fields of the table are:

    COD_PROV of the province;

    SIGLA of the province;

    DEN_PCM of the province;

    day in the format yyyy-mm-dd.

    ESRI shapefiles of the Italian provinces updated to the most recent definition are available from the website of the Italian National Office of Statistics (ISTAT): https://www.istat.it/it/archivio/222527.

  4. Voice Gender recognition in Spanish language

    • kaggle.com
    zip
    Updated Jan 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Enrique Díaz-Ocampo (2023). Voice Gender recognition in Spanish language [Dataset]. https://www.kaggle.com/datasets/enriquedazocampo/spanish-gender-recognition-mozilla
    Explore at:
    zip(16862585 bytes)Available download formats
    Dataset updated
    Jan 16, 2023
    Authors
    Enrique Díaz-Ocampo
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    The following dataset is intended to be used for gender recognition using audio files in uncontrolled environments from the Mozilla Common Voice Dataset 10.0. It consists of a table of descriptive statistical characteristics of the fundamental frequency of Spanish language. In addition, the estimation of the vocal tract of each of the speakers.

    This dataset contains 18 columns: 'client_id': id speaker from Mozilla Common Voice 'path': Name of the mp3 file 'age': Age in decades (teens, twenties, etc.) 'gender': Binary gender (male or female) 'duration': Duration of mp3 in seconds 'vocal_tract_length': Vocal tract length in cm. 'mean_F4': Mean of the fourth formant in Hz. 'min_pitch': Minimal pitch of the whole pitch contour in Hz. 'mean_pitch': Mean pitch of the whole pitch contour in Hz. 'q1_pitch': : First quartile of the whole pitch contour in Hz. 'median_pitch': : Median pitch of the whole pitch contour Hz. 'q3_pitch': : Third quartile of the whole pitch contour in Hz. 'max_pitch': : Max pitch of the whole pitch contour in Hz. 'stddev_pitch' : Standard deviation of the whole pitch contour in Hz. 'estimated_age': Nominal value (adult or teen) 'estimated_age_gender: Nominal value (adult-male, adult-female, teen-male and teen-female). 'language': Nominal value (Chinese (China), Chinese (Hong Kong), Chinese (Taiwan), Thai, Vietnamese, and Punjabi).

    The methodology for the extraction of these characteristics was the following:

    Only the audios from the valid.tsv file of the respective language were analyzed (this file is contained in the Mozilla Common Voice Dataset https://commonvoice.mozilla.org/en/datasets ) the voiced-speech was extracted using Praat's algorithm Vocal ToolKit (https://www.praatvocaltoolkit.com/extract-voiced-and-unvoiced.html)

    2) The vocal tract length was calculated with the Vocal Tool Kit algorithm ( https://www.praatvocaltoolkit.com/calculate-vocal-tract-length.html ) as follows: If the audio came from a teen, then the maximum formant was established at 8000, otherwise it was adjusted to 5000 Hz for men and 5500 for women. Finally, the mean of the fourth formant was calculated for the windows with voiced speech only.

    3) The fundamental frequency was calculated using the PRAAT Software in the To Pitch (ac) option and a) Time step (s) 0.0 (=auto) b) Pitch floor (Hz) 75.0 c) Max. number of candidates 15 d) Vey accurate=True e) Silence Threshold= 0.03 f) Voicing threshold= 0.45 g) Octave Cost= 0.01 h) Octave jump cost = 0.35 i) Voiced/ Unvoiced cost= 0.14 j) Pitch ceiling (Hz) = 350

    4) The statistical characteristics of the fundamental frequency were calculated only in the windows that were detected as voiced speech.

  5. U.S. National Ice Center Arctic Sea Ice Charts and Climatologies in Gridded...

    • data.nasa.gov
    Updated Apr 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). U.S. National Ice Center Arctic Sea Ice Charts and Climatologies in Gridded Format, 1972 - 2007, Version 1 - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/u-s-national-ice-center-arctic-sea-ice-charts-and-climatologies-in-gridded-format-1972-200-3531a
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Area covered
    United States, Arctic
    Description

    Notice: Due to funding limitations, this data set was recently changed to a “Basic” Level of Service. Learn more about what this means for users and how you can share your story here: Level of Service Update for Data Products.NOTE: The data product titled U.S. National Ice Center Arctic and Antarctic Sea Ice Concentration and Climatologies in Gridded Format supersedes this product. It begins with charts from January 2003 and is updated weekly. The U.S. National Ice Center (NIC) is an inter-agency sea ice analysis and forecasting center comprised of the Department of Commerce/NOAA, the Department of Defense/U.S. Navy, and the Department of Homeland Security/U.S. Coast Guard components. Since 1972, NIC has produced Arctic and Antarctic sea ice charts. This data set is comprised of Arctic sea ice concentration climatology derived from the NIC weekly or biweekly operational ice-chart time series. The charts used in the climatology are from 1972 through 2007; and the monthly climatology products are median, maximum, minimum, first quartile, and third quartile concentrations, as well as frequency of occurrence of ice at any concentration for the entire period of record as well as for 10-year and 5-year periods. These climatologies and the charts from which they are derived are provided in the 25-km Equal-Area Scalable Earth Grid (EASE-Grid) binary (.bin) format. The climatologies are also available in ArcGIS geodatabases (.mdb), and GIF format browse files (.gif) are also provided.To view the browse files and compare climatological periods visually, choose Search Database under the Download Data tab.

  6. U.S. National Ice Center Arctic Sea Ice Charts and Climatologies in Gridded...

    • catalog.data.gov
    • search.dataone.org
    • +4more
    Updated Nov 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NSIDC;NOAA (2025). U.S. National Ice Center Arctic Sea Ice Charts and Climatologies in Gridded Format, 1972 - 2007, Version 1 [Dataset]. https://catalog.data.gov/dataset/u-s-national-ice-center-arctic-sea-ice-charts-and-climatologies-in-gridded-format-1972-200
    Explore at:
    Dataset updated
    Nov 14, 2025
    Dataset provided by
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    National Snow and Ice Data Center
    Area covered
    Arctic
    Description

    Notice: Due to funding limitations, this data set was recently changed to a “Basic” Level of Service. Learn more about what this means for users and how you can share your story here: Level of Service Update for Data Products.NOTE: The data product titled U.S. National Ice Center Arctic and Antarctic Sea Ice Concentration and Climatologies in Gridded Format supersedes this product. It begins with charts from January 2003 and is updated weekly. The U.S. National Ice Center (NIC) is an inter-agency sea ice analysis and forecasting center comprised of the Department of Commerce/NOAA, the Department of Defense/U.S. Navy, and the Department of Homeland Security/U.S. Coast Guard components. Since 1972, NIC has produced Arctic and Antarctic sea ice charts. This data set is comprised of Arctic sea ice concentration climatology derived from the NIC weekly or biweekly operational ice-chart time series. The charts used in the climatology are from 1972 through 2007; and the monthly climatology products are median, maximum, minimum, first quartile, and third quartile concentrations, as well as frequency of occurrence of ice at any concentration for the entire period of record as well as for 10-year and 5-year periods. These climatologies and the charts from which they are derived are provided in the 25-km Equal-Area Scalable Earth Grid (EASE-Grid) binary (.bin) format. The climatologies are also available in ArcGIS geodatabases (.mdb), and GIF format browse files (.gif) are also provided.To view the browse files and compare climatological periods visually, choose Search Database under the Download Data tab.

  7. d

    Data from: Assessing bundles of ecosystem services from regional to...

    • datadryad.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Jul 1, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emilie Crouzat; Maud Mouchet; Francis Turkelboom; Coline Byczek; Jeroen Meersmans; Frederic Berger; Pieter Johannes Verkerk; Sandra Lavorel (2016). Assessing bundles of ecosystem services from regional to landscape scale: insights from the French Alps [Dataset]. http://doi.org/10.5061/dryad.3qk15
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 1, 2016
    Dataset provided by
    Dryad
    Authors
    Emilie Crouzat; Maud Mouchet; Francis Turkelboom; Coline Byczek; Jeroen Meersmans; Frederic Berger; Pieter Johannes Verkerk; Sandra Lavorel
    Time period covered
    Jun 30, 2015
    Area covered
    French Alps, Alps
    Description

    Main land use classesThis file contains GIS information on the main land use classes over the study area (French Alps) at a resolution of 1*1km. Classes come from aggregated Corine Land Cover 2006 classes.land_use.zipRichness in ecosystem servicesThis file contains GIS information on the aggregated distribution of ecosystem services (ES) over the study area (French Alps). Sixteen ES were included as binary datasets (presence/absence - threshold at third quartile) to calculate the number of present ES at a resolution of 1*1km.ES_richness.zipSelf-Organising Map on ecosystem service distributionsThis file contains GIS information on the self-organising map obtained from ecosystem service (ES) distributions over the study area (French Alps) at a resolution of 1*1km. Sixteen ES were included to cluster the area into five groups presenting similar ecological profiles. The Self-Organising Map was obtained using Kohonen's algorithms.SOM.zip

  8. R

    Experimental results for the paper "A fractal-based decomposition framework...

    • entrepot.recherche.data.gouv.fr
    text/markdown, tsv +1
    Updated Mar 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas Firmin; Thomas Firmin (2025). Experimental results for the paper "A fractal-based decomposition framework for continuous optimization" [Dataset]. http://doi.org/10.57745/0JEUEK
    Explore at:
    text/markdown(7338), zip(1312228931), tsv(1566), tsv(107482), tsv(100730)Available download formats
    Dataset updated
    Mar 3, 2025
    Dataset provided by
    Recherche Data Gouv
    Authors
    Thomas Firmin; Thomas Firmin
    License

    https://spdx.org/licenses/etalab-2.0.htmlhttps://spdx.org/licenses/etalab-2.0.html

    Description

    This dataset contains three summaries. These were generated through continuous optimization of the CEC2020 and SOCO2011 benchmarks, as well as for a real-world application involving portfolio optimization with the SP500 dataset. The objective of these experiments is to analyze the performances and behaviors of a family of optimization algorithms, we named: 'fractal-based decomposition algorithms'. They hierarchically decompose a continuous search space using a self-similar and self-recurrent geometrical object. These algorithms can be described by five elementary building blocks: fractal, tree search, scoring, exploration and exploitation search components. To obtain these experimental results, we used 11 different algorithms instantiated with our Python package named 'Zellij'. Each of these algorithms has a unique combination of building blocks. Thus, one can analyze the sensitivity to the five components, dimensionality, and problem definition. The summaries are provided as CSV files. Each file contains 10 features (8 for summary_sp500.tab): the dimensionality of the benchmark function, the optimization algorithm, the benchmark function and basic statistics regarding the errors computed using the raw data (minimum, maximum, mean, standard error, median, first, and third quartile). Summaries are named according to the three experiments: summary_cec2020.tab, summary_socco2011.tab, summary_sp500.tab. Raw data is provided as a compressed ZIP file, named raw_data.zip, it contains three folders for the three experiments. Each experiment folder contains subfolders for each algorithm, dimension, and benchmark function. These are organized as follows: /experiment_{name}/{algorithm}/D{dimension}_{function}_save/outputs/all_evaluations.csv. A README.md file is provided for further technical details.

  9. n

    CARDS Monthly Statistics (MONADS)

    • access.uat.earthdata.nasa.gov
    • cmr.earthdata.nasa.gov
    not provided
    Updated Dec 31, 2001
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2001). CARDS Monthly Statistics (MONADS) [Dataset]. https://access.uat.earthdata.nasa.gov/collections/C1245063963-NOAA_NCEI
    Explore at:
    not provided(1 KB)Available download formats
    Dataset updated
    Dec 31, 2001
    Time period covered
    Jan 1, 1948 - Dec 31, 2001
    Area covered
    Earth
    Description

    CARDS Monthly Statistics is digital data set DSI-6307, archived at the National Climatic Data Center (NCDC). This data set uses data from Comprehensive Aerological Data Set (CARDS) (DSI-6305), also archived at NCDC. DSI-6307 is similar in concept and format to Monthly Aerological Data Set (MONADS) (DSI-6220), another digital data set archived at NCDC. DSI-6305 and DSI-6220 are monthly upper air statistics. DSI-6307 data are for surface, tropopause, and mandatory pressure levels. At each level, monthly statistical parameters are provided for geopotential height or pressure, temperature, relative humidity, specific humidity, dew point temperature, wind speed, zonal wind speed, and meridional wind speed. Those statistical parameters are: mean value; standard deviation; minimum value; maximum value; first, second, and third quartile values, and number of non-missing observations used in the calculations. Data are global, from 1948 through 2001. NCDC maintains this data set in archive but no longer updates nor actively distributes it. It has been superseded by the Integrated Global Radiosonde Archive (IGRA) (C00616).

  10. d

    Data from: Taxonomic and numerical sufficiency in depth- and...

    • datadryad.org
    • search.dataone.org
    zip
    Updated Nov 1, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Zuschin; Rafal Nawrot; Mathias Harzhauser; Oleg Mandic; Adam Tomašových (2016). Taxonomic and numerical sufficiency in depth- and salinity-controlled marine paleocommunities [Dataset]. http://doi.org/10.5061/dryad.r7s92
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 1, 2016
    Dataset provided by
    Dryad
    Authors
    Martin Zuschin; Rafal Nawrot; Mathias Harzhauser; Oleg Mandic; Adam Tomašových
    Time period covered
    Oct 31, 2016
    Description

    Supplementary figure 1Rank abundance distributions for habitats at three taxonomic levelsSuppl_fig_1.pdfSupplementary figure 2Evenness and species richness of the four habitats at three taxonomic levels.Suppl_fig_2.pdfSupplementary figure 3Distribution of p-values from Mantel test for Spearman correlation between dissimilarity matrices representing different taxonomic and numerical levels. A-C, Correlation between taxonomic levels at different numerical resolutions. D-F, Correlation between proportional abundance data and higher levels of numerical transformation. Filled points represent median p-values across 1000 subsampling iterations, empty points are outliers that lie beyond 1.5 times the interquartile range from the upper quartile.Suppl_fig_3.pdfSupplementary figure 4NMDS ordination of a double-standardized subsample of the total dataset comparing individual habitats along the depth- and salinity gradient for species and families using proportional abundances and presence/absence ...

  11. n

    Arctic Ocean and Climate Atlas (1950-1989)

    • access.earthdata.nasa.gov
    • cmr.earthdata.nasa.gov
    Updated Jun 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Arctic Ocean and Climate Atlas (1950-1989) [Dataset]. https://access.earthdata.nasa.gov/collections/C1214587227-SCIOPS
    Explore at:
    Dataset updated
    Jun 24, 2019
    Time period covered
    Jan 1, 1950 - Dec 31, 1989
    Area covered
    Arctic Ocean
    Description

    This CD-ROM atlas contains statistics of 45 years of summer (July through September) temperature and salinity data. All temperatures in this atlas are shown as potential temperatures. Salinities are reported in standard salinity units. Densities are reported as potential density. Minimum, maximum, mean, standard deviation, skewness, kurtosis, 1st quartile, median, and 3rd quartile of the temperature and salinity data were calculated for each decade in the data set and over each 200 km by 200 km grid cell within the central Arctic region. In the Nordic and Siberian Seas, the statistics were calculated over 50 km by 50 km cells, as well as 100 km by 100 km cells, due to the smaller spatial scales in this region. These statistics were compiled for depths (in meters below the ocean surface) of 5, 10, 25, 50, 75, 100, 150, 200, 250, 300, 400, 500, 750, 1000, 1500, 2000, 2500, 3000, 3500, and 4000. The ocean bottom bathymetry was defined using the National Geophysical Data Center ETOPO5 digital data set. The decadal periods are: 1950-1959, 1960-1969, 1970-1979, 1980-1989. The number of original data points used to calculate the statistics are recorded for each location and a planar fit to the data, at each depth, was also generated to indicate linear trends within each cell. The three coefficients of the planar fit and the root-mean-square error of the fit are also recorded in the atlas. It was agreed to publish these statistics because most of the Russian original data could not be released. The CD-ROM atlas also contains a complete set of statistics for the periods, June, October and November. Statistics for December through May are available in the atlas for the winter period.

  12. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rohit Zaman (2021). Gender, Age, and Emotion Detection from Voice [Dataset]. https://www.kaggle.com/rohitzaman/gender-age-and-emotion-detection-from-voice
Organization logo

Gender, Age, and Emotion Detection from Voice

Extracted statistical features from audios and added labels to form the datasets

Explore at:
36 scholarly articles cite this dataset (View in Google Scholar)
zip(967820 bytes)Available download formats
Dataset updated
May 29, 2021
Authors
Rohit Zaman
Description

Context

Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.

Content

Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.

Acknowledgements

Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/

Search
Clear search
Close search
Google apps
Main menu