Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.
Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.
Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/
Both the number of dust explosion accidents and the resulting number of casualties have increased dramatically in recent years. To reduce this risk of dust explosions, we use the functional resonance analysis method (FRAM) to analyze the cause of the dust explosion accident at the Kunshan factory and propose barrier measures to prevent such accidents. The functional units that changed in the production system during the accident and how these functional units coupled to eventually cause the dust explosion were examined and explained. In addition, barrier measures were developed for functional units that changed during production and emergency systems defined to block the propagation of changes between functions and prevent resonance. Through case study, the identification of key functional parameters in both triggering the initial explosion and in then allowing its spread are key to define barriers to prevent a recurrence of such an event. FRAM uses system function coupling instead of traditional linear causality to explain the accident process, and develops barrier measures for changing function units, providing a novel thinking strategy and method for the analysis of accidents and their prevention.
The research project 'Information strategies - analysis and evaluation of trials to spread information on developing countries through multimedia' (Informationsstrategier - Analys och utvärdering av försök med spridning av u-landsinformation genom multimedia) was requested to evaluate an information campaign on developing countries generally, and Tanzania especially. The campaign was namned Karibu-project and data was gathered in several surveys. The aim of this investigation was to define and measure variables assumed to be important conditions for the exposure of information about developing countries, concern for developing countries, etc. The survey includes information on the respondent´s interest in different subject fields in mass media, opinion on the future prospects for different groups in society, and threaths against the future. Furthermore the respondent was asked about the reason of why some countries are poor, her/his attitude to different kinds of help to developing countries, and reason for being concerned with/not concerned with developing countries. Other questions dealt with the respondent´s knowledge about developing countries and judgement of the policy held by the different political parties. Most of these questions can also be found in the study World picture of the opinion leaders (SSD 0126), and a smaller number in Daily press journalists and the foreign information (SSD 0170). The research project ´Information strategies - analysis and evaluation of trials to spread information on developing countries through multimedia´ (Informationsstrategier - Analys och utvärdering av försök med spridning av u-landsinformation genom multimedia) was requested to evaluate an information campaign on developing countries generally, and Tanzania especially. The campaign was namned Karibu-project and data was gathered in several surveys. The aim of this investigation was to define and measure variables assumed to be important conditions for the exposure of information about developing countries, concern for developing countries, etc. The survey includes information on the respondent´s interest in different subject fields in mass media, opinion on the future prospects for different groups in society, and threaths against the future. Furthermore the respondent was asked about the reason of why some countries are poor, her/his attitude to different kinds of help to developing countries, and reason for being concerned with/not concerned with developing countries. Other questions dealt with the respondent´s knowledge about developing countries and judgement of the policy held by the different political parties. Most of these questions can also be found in the study World picture of the opinion leaders (SSD 0126), and a smaller number in Daily press journalists and the foreign information (SSD 0170). Self-administered questionnaire: paperSelf-administered questionnaire: paper Självadministrerat frågeformulär: papperSjälvadministrerat frågeformulär: papper
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Our target was to predict gender, age and emotion from audio. We found audio labeled datasets on Mozilla and RAVDESS. So by using R programming language 20 statistical features were extracted and then after adding the labels these datasets were formed. Audio files were collected from "Mozilla Common Voice" and “Ryerson AudioVisual Database of Emotional Speech and Song (RAVDESS)”.
Datasets contains 20 feature columns and 1 column for denoting the label. The 20 statistical features were extracted through the Frequency Spectrum Analysis using R programming Language. They are: 1) meanfreq - The mean frequency (in kHz) is a pitch measure, that assesses the center of the distribution of power across frequencies. 2) sd - The standard deviation of frequency is a statistical measure that describes a dataset’s dispersion relative to its mean and is calculated as the variance’s square root. 3) median - The median frequency (in kHz) is the middle number in the sorted, ascending, or descending list of numbers. 4) Q25 - The first quartile (in kHz), referred to as Q1, is the median of the lower half of the data set. This means that about 25 percent of the data set numbers are below Q1, and about 75 percent are above Q1. 5) Q75 - The third quartile (in kHz), referred to as Q3, is the central point between the median and the highest distributions. 6) IQR - The interquartile range (in kHz) is a measure of statistical dispersion, equal to the difference between 75th and 25th percentiles or between upper and lower quartiles. 7) skew - The skewness is the degree of distortion from the normal distribution. It measures the lack of symmetry in the data distribution. 8) kurt - The kurtosis is a statistical measure that determines how much the tails of distribution vary from the tails of a normal distribution. It is actually the measure of outliers present in the data distribution. 9) sp.ent - The spectral entropy is a measure of signal irregularity that sums up the normalized signal’s spectral power. 10) sfm - The spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used for digital signal processing to characterize an audio spectrum. Spectral flatness is usually measured in decibels, which, instead of being noise-like, offers a way to calculate how tone-like a sound is. 11) mode - The mode frequency is the most frequently observed value in a data set. 12) centroid - The spectral centroid is a metric used to describe a spectrum in digital signal processing. It means where the spectrum’s center of mass is centered. 13) meanfun - The meanfun is the average of the fundamental frequency measured across the acoustic signal. 14) minfun - The minfun is the minimum fundamental frequency measured across the acoustic signal 15) maxfun - The maxfun is the maximum fundamental frequency measured across the acoustic signal. 16) meandom - The meandom is the average of dominant frequency measured across the acoustic signal. 17) mindom - The mindom is the minimum of dominant frequency measured across the acoustic signal. 18) maxdom - The maxdom is the maximum of dominant frequency measured across the acoustic signal 19) dfrange - The dfrange is the range of dominant frequency measured across the acoustic signal. 20) modindx - the modindx is the modulation index, which calculates the degree of frequency modulation expressed numerically as the ratio of the frequency deviation to the frequency of the modulating signal for a pure tone modulation.
Gender and Age Audio Data Souce: Link: https://commonvoice.mozilla.org/en Emotion Audio Data Souce: Link : https://smartlaboratory.org/ravdess/