16 datasets found
  1. h

    Emilia-Dataset

    • huggingface.co
    Updated Aug 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amphion (2024). Emilia-Dataset [Dataset]. https://huggingface.co/datasets/amphion/Emilia-Dataset
    Explore at:
    Dataset updated
    Aug 27, 2024
    Dataset authored and provided by
    Amphion
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

    This is the official repository 👑 for the Emilia dataset and the source code for the Emilia-Pipe speech data preprocessing pipeline.

      News 🔥
    

    2025/02/26: The Emilia-Large dataset, featuring over 200,000 hours of data, is now available!!! Emilia-Large combines the original 101k-hour Emilia dataset (licensed under CC BY-NC 4.0) with the brand-new 114k-hour Emilia-YODAS… See the full description on the dataset page: https://huggingface.co/datasets/amphion/Emilia-Dataset.

  2. h

    Emo-Emilia

    • huggingface.co
    Updated Feb 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ASLP-lab (2025). Emo-Emilia [Dataset]. https://huggingface.co/datasets/ASLP-lab/Emo-Emilia
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 25, 2025
    Authors
    ASLP-lab
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    C2SER: Paper | Code | HuggingFace

      Emo-Emilia Dataset
    

    To better simulate real-world context, we introduce a new SER test set, Emo-Emilia. Specifically, we apply the automated labeling approach to annotate Emilia, a large-scale multilingual and diverse speech generation resource with over 100,000 hours of speech data that captures a wide range of emotional contexts. We then manually verify the accuracy of the emotion labels. Each utterance is checked by at least two experts to ensure… See the full description on the dataset page: https://huggingface.co/datasets/ASLP-lab/Emo-Emilia.

  3. e

    Emilia Bubenikova - g-index of ^ based on highly cited papers

    • exaly.com
    csv, json
    Updated Nov 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Emilia Bubenikova - g-index of ^ based on highly cited papers [Dataset]. https://exaly.com/author/8642579/emilia-bubenikova/g-index
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Nov 1, 2025
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    The graph shows the changes in ^'s g-index and the corresponding percentile for the sake of comparison with all authors. The g-index is a scientometric index similar to the h-index but puts more weight on the sum of citations. The g-index of an author is g if the author has published at least g papers with total citations of g2.

  4. C

    Geological map of the Emilia-Romagna Apennines 1:10.000 - Paper edition

    • ckan.mobidatalab.eu
    Updated Jun 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoDatiGovIt RNDT (2023). Geological map of the Emilia-Romagna Apennines 1:10.000 - Paper edition [Dataset]. https://ckan.mobidatalab.eu/dataset/carta-geologica-dellappennino-emiliano-romagnolo-1-10-000-edizione-cartacea
    Explore at:
    Dataset updated
    Jun 27, 2023
    Dataset provided by
    GeoDatiGovIt RNDT
    Area covered
    Apennine Mountains, Emilia-Romagna
    Description

    With the creation of the geological map on a scale of 1:10,000, the Region intended to equip itself with a detailed cognitive tool, such as to represent the reference base for targeted analyzes and insights into specific areas and themes. It constitutes the indispensable premise of any planning and intervention design, both public and private; it is the basis for the preparation of urban plans, for an effective soil protection policy, for the planning of extractive activities, for the planning of the use and of surface and deep water resources, for the protection of groundwater from pollution, for civil protection, etc. Each sheet is accompanied by: a detailed legend, a list of conventional signs, one or more geological sections and often by diagrams of various kinds (tectonic, stratigraphic, etc.). The methodological and scientific coordination of the entire project was carried out by the Regional Geological Office, by professors from 7 university institutes (Departments or Institutes of Geology of Bologna, Modena, Parma, Pavia, Pisa, Florence and Padua) and by researchers from the CNR of Pisa. The field survey and the cartographic drafting were mainly carried out by young professional geologists.

  5. p

    Classified ads newspaper publishers Business Data for Province of Reggio...

    • poidata.io
    csv, json
    Updated Nov 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Business Data Provider (2025). Classified ads newspaper publishers Business Data for Province of Reggio Emilia, Italy [Dataset]. https://www.poidata.io/report/classified-ads-newspaper-publisher/italy/province-of-reggio-emilia
    Explore at:
    json, csvAvailable download formats
    Dataset updated
    Nov 27, 2025
    Dataset authored and provided by
    Business Data Provider
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2025
    Area covered
    Province of Reggio Emilia
    Variables measured
    Website URL, Phone Number, Review Count, Business Name, Email Address, Business Hours, Customer Rating, Business Address, Business Categories, Geographic Coordinates
    Description

    Comprehensive dataset containing 1 verified Classified ads newspaper publisher businesses in Province of Reggio Emilia, Italy with complete contact information, ratings, reviews, and location data.

  6. g

    Map of the natural background at 1:250,000 scale of the Emilia-Romagna plain...

    • gimi9.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Map of the natural background at 1:250,000 scale of the Emilia-Romagna plain - Nickel [Ni] | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_r_emiro-2016-07-11t180126
    Explore at:
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Area covered
    Emilia-Romagna
    Description

    It represents the area distribution in the subsoil (90-140 cm deep) of the nickel content in soils for agricultural use. This depth is considered representative of the natural background content ('pedo-geochemical content' according to ISO/DIS 19258, 2005). The cartographic units are represented by groups of polygons belonging to concentration classes. Factors that regulate the natural content of metals in soils are: origin of the sediment in which the soil was formed, texture, and evolutionary degree. For nickel as well as for chromium the dominant factor is the origin of the sediments that originate the soil. The paper has an original setting as it uses genetic-environmental interpretation for the evaluation and geographical extension of geochemical data, instead of the more traditional geostatistical analysis. Concentration values are obtained by the XRF (X-ray Fluorescence Spectrometry) analytical method in order to determine the total content.

  7. e

    Application papers — Organic carbon stored in the soils of the Apennines...

    • data.europa.eu
    arcgis map service +4
    Updated Nov 18, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Regione Emilia Romagna (2013). Application papers — Organic carbon stored in the soils of the Apennines between 0-100 cm [Dataset]. https://data.europa.eu/data/datasets/r_emiro_2013-11-18t130557?locale=en
    Explore at:
    html, xml, pdf, word doc, arcgis map serviceAvailable download formats
    Dataset updated
    Nov 18, 2013
    Dataset authored and provided by
    Regione Emilia Romagna
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The paper of organic carbon stored in the soils of the Emilia-Romagna Apennines, is understood as the first processing through polygons that describe the average content of organic carbon expressed in Mg*ha-1 in the first 100 cm of soil plus the content of the organic surface horizons in the case of forest soils. It provides a spatial data. The CO content in the first 100 cm of soil and organic horizons in forest soils considers the distribution of different types of soil and the incidence of non-soil areas, understood as areas occupied by surface water, urban and infrastructure. The representation of the territory takes place through a knitted structure consisting of cells with 1Km side. The value attributed takes into account the zero contribution due to non-soil areas. The attribution of the value to the cell takes into account the distribution of soils according to the Land Charter of the Emilia-Romagna Region (scale 1:250,000, ed. 1994 and updated.); the distinction between land-occupied and non-soil-occupied areas (i.e. urban, infrastructure or surface water) derives from the Land Use Charter 2003 on a scale of 1:25,000 drawn up by the Geographic Information Systems Service of the Emilia-Romagna Region.

  8. C

    Regional Technical Paper - 1:10.000 (WMS)

    • ckan.mobidatalab.eu
    wms
    Updated Apr 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoDatiGovIt RNDT (2023). Regional Technical Paper - 1:10.000 (WMS) [Dataset]. https://ckan.mobidatalab.eu/tl/dataset/regional-technical-map-1-10-000-wms
    Explore at:
    wmsAvailable download formats
    Dataset updated
    Apr 27, 2023
    Dataset provided by
    GeoDatiGovIt RNDT
    Description

    WMS of the Regional Technical Paper 1:10.000 - Emilia-Romagna Region

  9. ROST (ROmanian Stories and other Texts)

    • kaggle.com
    zip
    Updated Feb 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sanda-Maria Avram (2024). ROST (ROmanian Stories and other Texts) [Dataset]. https://www.kaggle.com/datasets/sandamariaavram/rost-romanian-stories-and-other-texts
    Explore at:
    zip(788838 bytes)Available download formats
    Dataset updated
    Feb 8, 2024
    Authors
    Sanda-Maria Avram
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    ROST: A dataset containing 400 Romanian texts written by 10 authors

    Context:

    This dataset was introduced for the work presented in the paper called "A comparison of several AI techniques for authorship attribution on Romanian texts". Here, several AI techniques were compared for classifying literary texts written by multiple authors by taking into account a limited number of speech parts (prepositions, adverbs, and conjunctions). The compared methods are Artificial Neural Networks, Multi Expression Programming, k-Nearest Neighbour, Support Vector Machines, and Decision Trees with C5.0.

    The source code is available at https://github.com/sanda-avram/ROST-source-code.

    Content:

    The dataset contains stories, short stories, fairy tales, novels, articles, and sketches written by Ion Creangă, Barbu Ştefănescu Delavrancea, Mihai Eminescu, Nicolae Filimon, Emil Gârleanu, Petre Ispirescu, Mihai Oltean, Emilia Plugaru, Liviu Rebreanu, Ioan Slavici.

    • ROST-csv/ - dataset representations as vectors of occurrence frequencies of Inflexible Parts of Speech (IPoS)

    • IPoS/ - lists of Inflexible Parts of Speech (IPoS); the initial ones and the ones that were used because they appeared in the texts

    • ROST-details.csv - contains detailed information about the year of publishing, type of writing, used file name, author and title of the text, and (website) source of the text

    • ROST-stats.csv - contains statistics for all 400 considered texts, pertaining to the number of occurring:

      • unique prepositions from usedPrepositions.txt
      • unique prepositions and adverbs from usedPrepositionsAndAdverbs.txt
      • unique prepositions, adverbs, and conjunctions from usedPrepositionsAdverbsAndConjunctions.txt
      • new lines
      • words
      • chars
      • unique words
      • unique chars

    Research papers:

    Avram, Sanda Maria, and Mihai Oltean. "A comparison of several AI techniques for authorship attribution on Romanian texts." arXiv preprint arXiv:2211.05180 (2022). The paper introduces the dataset and compares multiple AI techniques trained to recognize the authors of the texts, based on a number of speech parts (prepositions, adverbs, and conjunctions). The compared methods are Artificial Neural Networks, Support Vector Machines, Multi Expression Programming, Decision Trees with C5.0, and k-Nearest Neighbour

    License

    MIT License

    Copyright (c) 2022 Sanda Avram

    Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

    The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

    THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES, OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

  10. Issues raised about methodology.

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jul 7, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marijn Muurling; Anna M. G. Pasmooij; Ivan Koychev; Dora Roik; Lutz Froelich; Emilia Schwertner; Dorota Religa; Carla Abdelnour; Mercè Boada; Monica Almici; Samantha Galluzzi; Sandra Cardoso; Alexandre de Mendonça; Andrew P. Owens; Sajini Kuruppu; Martha Therese Gjestsen; Ioulietta Lazarou; Mara Gkioka; Magda Tsolaki; Ana Diaz; Dianne Gove; Pieter Jelle Visser; Dag Aarsland; Federica Lucivero; Casper de Boer (2023). Issues raised about methodology. [Dataset]. http://doi.org/10.1371/journal.pone.0285807.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jul 7, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Marijn Muurling; Anna M. G. Pasmooij; Ivan Koychev; Dora Roik; Lutz Froelich; Emilia Schwertner; Dorota Religa; Carla Abdelnour; Mercè Boada; Monica Almici; Samantha Galluzzi; Sandra Cardoso; Alexandre de Mendonça; Andrew P. Owens; Sajini Kuruppu; Martha Therese Gjestsen; Ioulietta Lazarou; Mara Gkioka; Magda Tsolaki; Ana Diaz; Dianne Gove; Pieter Jelle Visser; Dag Aarsland; Federica Lucivero; Casper de Boer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionClinical research with remote monitoring technologies (RMTs) has multiple advantages over standard paper-pencil tests, but also raises several ethical concerns. While several studies have addressed the issue of governance of big data in clinical research from the legal or ethical perspectives, the viewpoint of local research ethics committee (REC) members is underrepresented in the current literature. The aim of this study is therefore to find which specific ethical challenges are raised by RECs in the context of a large European study on remote monitoring in all syndromic stages of Alzheimer’s disease, and what gaps remain.MethodsDocuments describing the REC review process at 10 sites in 9 European countries from the project Remote Assessment of Disease and Relapse–Alzheimer’s Disease (RADAR-AD) were collected and translated. Main themes emerging in the documents were identified using a qualitative analysis approach.ResultsFour main themes emerged after analysis: data management, participant’s wellbeing, methodological issues, and the issue of defining the regulatory category of RMTs. Review processes differed across sites: process duration varied from 71 to 423 days, some RECs did not raise any issues, whereas others raised up to 35 concerns, and the approval of a data protection officer was needed in half of the sites.DiscussionThe differences in the ethics review process of the same study protocol across different local settings suggest that a multi-site study would benefit from a harmonization in research ethics governance processes. More specifically, some best practices could be included in ethical reviews across institutional and national contexts, such as the opinion of an institutional data protection officer, patient advisory board reviews of the protocol and plans for how ethical reflection is embedded within the study.

  11. C

    Regional Technical Paper - 1:5.000 - MonoFull - DBTR2008 (WMS)

    • ckan.mobidatalab.eu
    wms
    Updated May 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoDatiGovIt RNDT (2023). Regional Technical Paper - 1:5.000 - MonoFull - DBTR2008 (WMS) [Dataset]. https://ckan.mobidatalab.eu/lt/dataset/regional-technical-paper-1-5-000-monofull-dbtr2008-wms
    Explore at:
    wmsAvailable download formats
    Dataset updated
    May 3, 2023
    Dataset provided by
    GeoDatiGovIt RNDT
    Description

    WMS of the Regional Technical Paper 1:5.000 taken from DBTR2008 (Full Version) - Emilia-Romagna Region.

  12. C

    Papers of chemical-physical properties - pH values in the soils of the...

    • ckan.mobidatalab.eu
    wms, zip
    Updated May 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoDatiGovIt RNDT (2023). Papers of chemical-physical properties - pH values ​​in the soils of the Emilia-Romagna plain between 0-30 cm [Dataset]. https://ckan.mobidatalab.eu/dataset/maps-of-physical-chemical-properties-values-of-ph-in-the-soils-of-the-plain-emilia-romagnola-t
    Explore at:
    wms, zipAvailable download formats
    Dataset updated
    May 3, 2023
    Dataset provided by
    GeoDatiGovIt RNDT
    Area covered
    Emilia-Romagna
    Description

    Soil pH is a fundamental property capable of influencing many physical, chemical and biological processes. It regulates the availability of many nutrients for plants, it influences the activity of the microorganisms responsible for the decomposition of organic matter and most of the chemical transformations that take place in the soil, it has a determining role in influencing the mobility and therefore the bioavailability of heavy metals . Furthermore, some physical characteristics of the soil are influenced by the pH, such as permeability, the stability of the aggregates, the degree of compaction and the dispersion of the clayey fraction. The map represents the areal distribution in lowland soils of the pH value in the surface layer (0- 30cm). The map was drawn up starting from data extrapolated from the Soil Database of the Emilia-Romagna Region for the period 1974-2017.

  13. Data and Appendix for Original Research Paper "Emotions and Prior Outcomes...

    • figshare.com
    pdf
    Updated Jul 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emilia Czerniawska; Maciej Pastwa; Kamil Imbir (2025). Data and Appendix for Original Research Paper "Emotions and Prior Outcomes Shape Risk Propensity and the Dynamics of Decision Making in BART" [Dataset]. http://doi.org/10.6084/m9.figshare.29590559.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 17, 2025
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Emilia Czerniawska; Maciej Pastwa; Kamil Imbir
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Decisions under uncertainty often emerge from the interaction of affective and cognitive processes. Using the Balloon Analogue Risk Task (BART), this study investigated how incidental emotions (happiness, sadness, anger, fear) and prior outcomes shape risk-taking. Sixty-six participants performed the BART while exposed to images evoking emotions or neutral affect. Results revealed that exposure to anger- and fear-evoking stimuli significantly reduced risk-taking, suggesting these highly arousing negative emotions may disrupt engagement and promote avoidance behaviors. Additionally, participants demonstrated heightened risk propensity and prolonged decision times following a successful trial, indicating a cognitive reframing of subsequent decisions after gains. These findings highlight how emotional and contextual cues jointly shape risky behavior in uncertain environments, advancing our understanding of affect-cognition interplay in decision processes.

  14. C

    Regional Technical Paper - 1:5.000 - MonoLight - DBTR2008 (WMS)

    • ckan.mobidatalab.eu
    wms
    Updated May 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoDatiGovIt RNDT (2023). Regional Technical Paper - 1:5.000 - MonoLight - DBTR2008 (WMS) [Dataset]. https://ckan.mobidatalab.eu/dataset/regional-technical-paper-1-5-000-monolight-dbtr2008-wms
    Explore at:
    wmsAvailable download formats
    Dataset updated
    May 2, 2023
    Dataset provided by
    GeoDatiGovIt RNDT
    Description

    WMS of the Regional Technical Map 1:5.000 taken from DBTR2008 (Light version) - Emilia-Romagna Region

  15. Inclusion and exclusion criteria for assessing the retrieved papers.

    • plos.figshare.com
    xls
    Updated Jun 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ana Luiza Dallora; Peter Anderberg; Ola Kvist; Emilia Mendes; Sandra Diaz Ruiz; Johan Sanmartin Berglund (2023). Inclusion and exclusion criteria for assessing the retrieved papers. [Dataset]. http://doi.org/10.1371/journal.pone.0220242.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 17, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Ana Luiza Dallora; Peter Anderberg; Ola Kvist; Emilia Mendes; Sandra Diaz Ruiz; Johan Sanmartin Berglund
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Inclusion and exclusion criteria for assessing the retrieved papers.

  16. NumDB-dataset

    • figshare.com
    zip
    Updated Nov 29, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alessandro Piscopo; Emilia Kacprzak (2018). NumDB-dataset [Dataset]. http://doi.org/10.6084/m9.figshare.6205814.v4
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 29, 2018
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Alessandro Piscopo; Emilia Kacprzak
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    NumDB benchmark: set of tables originally extracted from DBpedia, from which different value samples have been selected and various degrees of errors have been added in order to simulate actual tables on the Web.The dataset has been created for Kacprzak, E., Giménez-García, J. M., Piscopo, A., Koesten, L., Ibáñez, L. D., Tennison, J., & Simperl, E. (2018, November). Making Sense of Numerical Data-Semantic Labelling of Web Tables. In European Knowledge Acquisition Workshop (pp. 163-178). Springer, Cham.A description of the data generation process is in the paper.

  17. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Amphion (2024). Emilia-Dataset [Dataset]. https://huggingface.co/datasets/amphion/Emilia-Dataset

Emilia-Dataset

Emilia

amphion/Emilia-Dataset

Explore at:
45 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Aug 27, 2024
Dataset authored and provided by
Amphion
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

This is the official repository 👑 for the Emilia dataset and the source code for the Emilia-Pipe speech data preprocessing pipeline.

  News 🔥

2025/02/26: The Emilia-Large dataset, featuring over 200,000 hours of data, is now available!!! Emilia-Large combines the original 101k-hour Emilia dataset (licensed under CC BY-NC 4.0) with the brand-new 114k-hour Emilia-YODAS… See the full description on the dataset page: https://huggingface.co/datasets/amphion/Emilia-Dataset.

Search
Clear search
Close search
Google apps
Main menu