5 datasets found
  1. d

    Recording Studios

    • datasets.ai
    • data.cityofnewyork.us
    • +3more
    55
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of New York (2024). Recording Studios [Dataset]. https://datasets.ai/datasets/recording-studios
    Explore at:
    55Available download formats
    Dataset updated
    Aug 6, 2024
    Dataset authored and provided by
    City of New York
    Description

    List of local recording studios

  2. The Best Sarcasm Annotated Dataset in Spanish

    • kaggle.com
    Updated Jun 21, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mika Hämäläinen (2020). The Best Sarcasm Annotated Dataset in Spanish [Dataset]. http://doi.org/10.34740/kaggle/dsv/1268848
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 21, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mika Hämäläinen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Content

    This dataset contains all utterances of two episodes of South Park (Latin American voices) and two episodes of Archer (Spanish voices). The order of the utterances is shuffled. Each utterance has been annotated based on whether it is sarcastic or not. Sarcastic expressions also contain further annotation based on different theories on sarcasm.

    This corpus is unique because it is annotated from primarily audiovisual media. It also contains a lot of negative examples of sentences that are meant to be humorous or outrageous, but not sarcastic. This data provides thus a closer to real life benchmark for any sarcasm detection system.

    Cite

    I annotated this data for my MA thesis, so please cite it if you use this data.

    Hämäläinen, Mika (2016). Reconocimiento automático del sarcasmo: ¡Esto va a funcionar bien!. Helsinki: University of Helsinki, Department of Modern Languages.

    Inspiration

    • Sarcasm detection
    • Prediction of the theoretical categories of sarcasm
  3. h

    elevenlabs

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    dd (2025). elevenlabs [Dataset]. https://huggingface.co/datasets/Sh1man/elevenlabs
    Explore at:
    Dataset updated
    May 11, 2025
    Authors
    dd
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Введение

    Набор данных Elevenlabs - это высококачественный (48 кГц) набор данных синтетической речи в формате mono wav общим обьемом по времени на 15ч 56м 47с 133мс

      Статистика
    

    Dataset Voice total (hrs)

    den4ikai Arcades 2.30

    den4ikai Alexandr Vlasov - Professional Voiceover 3.08

    den4ikai Ana 2.05

    den4ikai Saira 2.22

    fairy_tales_children Saira 1.21

    fairy_tales_children Soft Female Russian voice 4.27

    Total

    15.56h

  4. D

    Code for EchoTables (IKILeUS)

    • darus.uni-stuttgart.de
    Updated Feb 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nadeen Fathallah; Steffen Staab (2025). Code for EchoTables (IKILeUS) [Dataset]. http://doi.org/10.18419/DARUS-4774
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 28, 2025
    Dataset provided by
    DaRUS
    Authors
    Nadeen Fathallah; Steffen Staab
    License

    https://spdx.org/licenses/MIT.htmlhttps://spdx.org/licenses/MIT.html

    Time period covered
    Aug 1, 2022 - Nov 30, 2024
    Dataset funded by
    German Federal Ministry of Education and Research (BMBF)
    Description

    EchoTables is an innovative accessibility tool developed as part of the IKILeUS project at the University of Stuttgart. It is designed to improve the usability of tabular data for visually impaired users by converting structured tables into concise, auditory-friendly textual summaries. Traditional screen readers navigate tables linearly, which imposes a high cognitive load on users. EchoTables alleviates this issue by summarizing tables, facilitating quicker comprehension and more efficient information retrieval. Initially utilizing RUCAIBox (LLM), EchoTables transitioned to Mistral-7B, a more powerful open-source model, to enhance processing efficiency and scalability. The tool has been tested with widely used screen readers such as VoiceOver to ensure accessibility. EchoTables has been adapted to process diverse data sources, including lecture materials, assignments, and WikiTables, making it a valuable resource for students navigating complex datasets.

  5. r

    Data from: Ebook accessibility testing: accessibility ranking by ebook...

    • researchdata.edu.au
    docx
    Updated Jan 1, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ms Pam Schindler; Ms Pam Schindler; Ms Julie Oates; Ms Julie Oates (2018). Ebook accessibility testing: accessibility ranking by ebook platform [Dataset]. http://doi.org/10.14264/UQL.2018.666
    Explore at:
    docx(29637)Available download formats
    Dataset updated
    Jan 1, 2018
    Dataset provided by
    The University of Queensland
    Authors
    Ms Pam Schindler; Ms Pam Schindler; Ms Julie Oates; Ms Julie Oates
    License

    https://creativecommons.org/licenses/by_nc_nd/3.0/deed.enhttps://creativecommons.org/licenses/by_nc_nd/3.0/deed.en

    Time period covered
    Mar 1, 2017 - Dec 22, 2017
    Description

    The topic of the dataset is the accessibility of academic ebook platforms for users with a print disability. The dataset is a table in Word format, ranking 14 ebook platforms in bands from "more accessible platforms" to "less accessible platforms", based on testing done as part of the University of Queensland Library's Ebook Accessibility Project, conducted in 2017. The table includes scoring from the Jisc Ebook Accessibility Audit and observations regarding how each platform performed with the screenreaders NVDA and VoiceOver.

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
City of New York (2024). Recording Studios [Dataset]. https://datasets.ai/datasets/recording-studios

Recording Studios

Explore at:
55Available download formats
Dataset updated
Aug 6, 2024
Dataset authored and provided by
City of New York
Description

List of local recording studios

Search
Clear search
Close search
Google apps
Main menu