100+ datasets found
  1. poetry

    • huggingface.co
    Updated Nov 3, 2021
    merve (2021). poetry [Dataset]. https://huggingface.co/datasets/merve/poetry
    Dataset updated
    Nov 3, 2021
    Authors
    merve
    Description

    Dataset Card for poetry

      Dataset Summary
    

    It contains poems from subjects: Love, Nature and Mythology & Folklore that belong to two periods namely Renaissance and Modern

      Supported Tasks and Leaderboards
    

    [Needs More Information]

      Languages
    

    [Needs More Information]

      Dataset Structure

      Data Instances
    

    [Needs More Information]

      Data Fields
    

    Has 5 columns:

    Content Author Poem name Age Type

      Data Splits
    

    Only training… See the full description on the dataset page: https://huggingface.co/datasets/merve/poetry.

  2. 💬 Poem Dataset

    • kaggle.com
    zip
    Updated Apr 29, 2024
    mexwell (2024). 💬 Poem Dataset [Dataset]. https://www.kaggle.com/datasets/mexwell/poem-dataset
    Available download formats: zip (173379 bytes)
    Dataset updated
    Apr 29, 2024
    Authors
    mexwell
    Description

    About

    This dataset comprises a collection of 450 poems, curated to facilitate the analysis of emotional content in textual form. Each poem is labeled with one of six emotional classes: Anger, Disgust, Fear, Joy, Neutral, and Sadness. This classification enables the development and testing of models for sentiment analysis, emotional understanding, and literary studies. The dataset is designed to provide a diverse range of poetic expressions, making it a valuable resource for machine learning researchers and computational linguists interested in emotion detection and the nuances of poetic language.

    Applications:

    • Sentiment analysis
    • Emotional analysis in text
    • Literary studies
    • Training machine learning models for natural language processing (NLP)

    Classes:

    • Anger
    • Disgust
    • Fear
    • Joy
    • Neutral
    • Sadness

    Licence

    Academic Free License v3.0

    Acknowledgement

    Photo by Thought Catalog on Unsplash

  3. Gutenberg Poetry Corpus

    • kaggle.com
    zip
    Updated Apr 11, 2024
    + more versions
    DS-Buff (2024). Gutenberg Poetry Corpus [Dataset]. https://www.kaggle.com/datasets/thehung83/gutenberg-poetry-corpus
    Available download formats: zip (55496556 bytes)
    Dataset updated
    Apr 11, 2024
    Authors
    DS-Buff
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Allison Parrish's Gutenberg Poetry Corpus

    This corpus was originally published under the CC0 license by Allison Parrish. Please visit Allison's fantastic accompanying GitHub repository for usage inspiration as well as more information on how the data was mined, how to create your own version of the corpus, and examples of projects using it.

    This dataset contains 3,085,117 lines of poetry from hundreds of Project Gutenberg books. Each line has a corresponding gutenberg_id (1191 unique values) from Project Gutenberg.

    A row of data looks like this:

    {'s': 'And retreated, baffled, beaten,', 'gutenberg_id': 19}
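    A minimal sketch of reading rows shaped like the one above and grouping lines by book. It assumes the records are stored as one JSON object per line, which may not match the file layout in this Kaggle upload; the function name and the second sample record are hypothetical.

```python
import json
from collections import defaultdict


def group_lines_by_book(records):
    """Group poetry lines by gutenberg_id, preserving input order."""
    books = defaultdict(list)
    for raw in records:
        raw = raw.strip()
        if not raw:
            continue  # skip blank lines
        row = json.loads(raw)
        books[row["gutenberg_id"]].append(row["s"])
    return dict(books)


# The first record mirrors the row shown above; the second is a made-up placeholder.
sample_records = [
    '{"s": "And retreated, baffled, beaten,", "gutenberg_id": 19}',
    '{"s": "(hypothetical line from another book)", "gutenberg_id": 20}',
]

lines_by_book = group_lines_by_book(sample_records)
print(lines_by_book[19])
```

    Grouping by gutenberg_id is one way to rebuild per-book text from a line-level corpus; for the full file, the same function can be fed an open file handle instead of a list.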

  4. American,British,Indian Emotion poetry dataset

    • kaggle.com
    zip
    Updated Apr 6, 2024
    pkkazipeta143 (2024). American,British,Indian Emotion poetry dataset [Dataset]. https://www.kaggle.com/datasets/pkkazipeta143/americanbritishindian-emotion-poetry-dataset
    Available download formats: zip (4324938 bytes)
    Dataset updated
    Apr 6, 2024
    Authors
    pkkazipeta143
    Area covered
    United States, United Kingdom
    Description

    Capturing emotion from reviews and tweets is a well-studied task. Reviews and tweets are not rich in emotion, whereas poetry is, so capturing emotions from poetry is an interesting task. To this end, we collected poems from Poemhunter.com (we thank the website owners), created a dataset, and manually annotated the poems with five emotions: Fear, Sad, Surprise, Happy, and Angry. The dataset comprises three files:

    1. ABIEMO: poems by American, British, and Indian poets
    2. CAPEMO: augmented poems, created with the NLPAUG library to resolve the class imbalance problem (we thank the library developers)
    3. BAPEMO: extended augmented poems to resolve the class imbalance problem

    Along with the emotion, each poem is also assigned a country. The dataset can be used for poet style analysis, emotion analysis, analysis of country-wise differences in poetry, and so on.

  5. Arabic Poem Comprehensive Dataset (APCD)

    • kaggle.com
    zip
    Updated Nov 14, 2022
    Mohamed Khaled Elsafty (2022). Arabic Poem Comprehensive Dataset (APCD) [Dataset]. https://www.kaggle.com/datasets/mohamedkhaledelsafty/best-arabic-poem-comprehensive-dataset
    Available download formats: zip (183473042 bytes)
    Dataset updated
    Nov 14, 2022
    Authors
    Mohamed Khaled Elsafty
    Description

    Poem Comprehensive Dataset (PCD)

    Arabic PCD (APCD)

    I obtained this data from here.

    Description

    The Arabic dataset is scraped mainly from الموسوعة الشعرية and الديوان. After merging both, the total number of verses is 1,831,770. Each verse is labeled with its meter, the poet who wrote it, and the age in which it was written. There are 22 meters, 3,701 poets, and 11 ages: Pre-Islamic, Islamic, Umayyad, Mamluk, Abbasid, Ayyubid, Ottoman, Andalusian, the era between Umayyad and Abbasid, Fatimid, and the modern age. We are only interested in the 16 classic meters attributed to Al-Farahidi, which make up the majority of the dataset, around 1.7M verses in total. Note that the verses' diacritic states are not consistent: a verse can carry full diacritics, partial diacritics, or none at all.

  6. Poetry-Foundation-Poems

    • huggingface.co
    Updated Feb 21, 2025
    Şuayp Talha Kocabay (2025). Poetry-Foundation-Poems [Dataset]. https://huggingface.co/datasets/suayptalha/Poetry-Foundation-Poems
    Dataset updated
    Feb 21, 2025
    Authors
    Şuayp Talha Kocabay
    License

    https://choosealicense.com/licenses/agpl-3.0/

    Description

    From: https://www.kaggle.com/datasets/tgdivy/poetry-foundation-poems

    Poetry Foundation Poems Dataset

    Overview

    This dataset contains a collection of 13.9k poems sourced from the Poetry Foundation website. Each poem entry includes its title, author, and associated tags (if available). The dataset provides a robust resource for exploring poetry, analyzing thematic trends, or creating applications such as poem generators.

    Dataset Structure

    The dataset consists of the following columns: 1. Title:… See the full description on the dataset page: https://huggingface.co/datasets/suayptalha/Poetry-Foundation-Poems.

  7. English Poem Dataset

    • kaggle.com
    zip
    Updated Mar 30, 2024
    Abdelrahmane Khaldi (2024). English Poem Dataset [Dataset]. https://www.kaggle.com/datasets/abdelrahmanekhaldi/english-poem-dataset
    Available download formats: zip (6427313 bytes)
    Dataset updated
    Mar 30, 2024
    Authors
    Abdelrahmane Khaldi
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    The Poetry Foundation Emotion-Annotated Dataset is a collection of poems scraped from the Poetry Foundation website. It comprises four main columns: Title, Poem, Poet, and Genre. This dataset has been enriched by incorporating emotion annotations derived from a fine-tuned BERT model trained to classify emotions in text.

    Columns:

    • Title: This column contains the titles of the poems included in the dataset.
    • Poem: The Poem column stores the text of the poems scraped from the Poetry Foundation website.
    • Poet: This column lists the poets who authored the poems.
    • Genre: The Genre column represents the emotional classification assigned to each poem based on the text content.

    Emotion Annotation:

    The emotion annotation process employed a state-of-the-art BERT-based model specifically trained to recognize emotions in text. By leveraging this model, each poem was analyzed to identify the prevalent emotions conveyed within its text. These emotions were then mapped to corresponding emotional genres, providing insights into the overarching emotional themes of each poem.

    Dataset Application:

    The Poetry Foundation Emotion-Annotated Dataset offers a valuable resource for researchers, poets, literary enthusiasts, and AI practitioners interested in exploring the intersection of poetry and emotional expression. By associating emotional genres with individual poems, this dataset enables nuanced analyses of emotional themes and provides inspiration for further exploration in the realms of literature, psychology, and computational linguistics.

  8. Poem classification dataset

    • kaggle.com
    zip
    Updated May 16, 2024
    DjDonPablo (2024). Poem classification dataset [Dataset]. https://www.kaggle.com/datasets/djdonpablo/poem-classification-dataset
    Available download formats: zip (7156278 bytes)
    Dataset updated
    May 16, 2024
    Authors
    DjDonPablo
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    This dataset was made by scraping the Poetry Foundation website, for classification.

    It contains five different topics: nature, art & sciences, love, relationships, and religion, which are fairly evenly distributed.

  9. poems_dataset

    • huggingface.co
    Updated Apr 14, 2023
    + more versions
    Saad (2023). poems_dataset [Dataset]. https://huggingface.co/datasets/Ozziey/poems_dataset
    Dataset updated
    Apr 14, 2023
    Authors
    Saad
    License

    https://choosealicense.com/licenses/afl-3.0/

    Description

    Ozziey/poems_dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. English Poem Comprehensive Dataset (EPCD)

    • kaggle.com
    zip
    Updated Nov 14, 2022
    Mohamed Khaled Elsafty (2022). English Poem Comprehensive Dataset (EPCD) [Dataset]. https://www.kaggle.com/datasets/mohamedkhaledelsafty/english-poem-comprehensive-dataset-apcd
    Available download formats: zip (4329206 bytes)
    Dataset updated
    Nov 14, 2022
    Authors
    Mohamed Khaled Elsafty
    Description

    Poem Comprehensive Dataset (PCD)

    English PCD (EPCD)

    I obtained this data from here.

    Description

    The English dataset is scraped from many different web resources. It consists of 199,002 verses, each labeled with one of four meters: Iambic, Trochee, Dactyl, and Anapaestic. The Iambic class dominates the dataset: there are 186,809 Iambic verses, 5,418 Trochee verses, 5,378 Anapaestic verses, and 1,397 Dactyl verses.
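    To make the imbalance concrete, the sketch below computes each meter's share of the verse counts quoted above, plus inverse-frequency class weights. The weighting scheme (total / (number of classes × class count)) is a common heuristic chosen here for illustration, not something the dataset authors prescribe.

```python
# Verse counts per meter, as quoted in the dataset description.
verse_counts = {
    "Iambic": 186_809,
    "Trochee": 5_418,
    "Anapaestic": 5_378,
    "Dactyl": 1_397,
}

total = sum(verse_counts.values())

# Fraction of the dataset that each meter accounts for.
shares = {meter: n / total for meter, n in verse_counts.items()}

# Inverse-frequency weights: rarer classes receive larger weights.
class_weights = {
    meter: total / (len(verse_counts) * n) for meter, n in verse_counts.items()
}

for meter in verse_counts:
    print(f"{meter:<11} share={shares[meter]:.3%} weight={class_weights[meter]:.2f}")
```

    The four counts sum to the stated 199,002 verses, so the per-class figures are consistent with the total.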

  11. Automatic Analysis of Rhythmic Poetry - Dataset - LDM

    • service.tib.eu
    Updated Dec 16, 2024
    (2024). Automatic Analysis of Rhythmic Poetry - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/automatic-analysis-of-rhythmic-poetry
    Dataset updated
    Dec 16, 2024
    Description

    Automatic analysis of rhythmic poetry with applications to generation and translation.

  12. Dataset: What the Eyes Reveal about (Reading) Poetry

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    txt
    Updated Dec 16, 2020
    Sebastian Wallot; Winfried Menninghaus (2020). Dataset: What the Eyes Reveal about (Reading) Poetry [Dataset]. http://doi.org/10.6084/m9.figshare.13387475.v1
    Available download formats: txt
    Dataset updated
    Dec 16, 2020
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Sebastian Wallot; Winfried Menninghaus
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    dataPOEM.csv

    The dataPOEM.csv data set contains data on the level of each poem.

    scoresAes = factor scores of moving, beauty, and melodious ratings.

    participant = participant number

    poemVersion = Version of poem presented: (A = original poem with rhyme and meter, B = poem variant with only rhyme, C = poem variant with only meter, D = poem variant without rhyme and meter)

    poemIdentity = poem number

    avgWFreq = average word frequency of poem

    totalGazeSlopeLineLength

    totalGazeWordMeanNAByWordLen

    totalGazeWordMeanNADiff

    order = order of presentation (1 = from A to D, 2 = from D to A; between participant factor)

    firstFixDurMS_MINFIX_AVG = first fixation duration

    totalGazeMS_MINFIX_AVG = total gaze durations

    fixDurMS_MINFIX_NUM = number of fixations

    sacLenMS_MINFIX_AVG = average saccade length

    percRegMS_MINFIX_AVG = percentage of regressive eye movements

    pupilDial_AVG = average pupil dilation

    blink_NUM_TotalRT = number of blinks relative to total reading time

    totalReadingTime = total reading time of the poem

    areaTT = total score of the Aesthetic Responsiveness Assessment questionnaire

    dataIntegrity = percentage of valid position measurements by eye tracker during reading of a poem

    moving = rating of how moving the poem was

    beauty = rating of how beautiful the poem was

    melodious = rating of how melodious the poem was

    dataROI.csv

    The dataROI.csv data set contains data on the level of each line within a poem.

    order = order of presentation (1 = from A to D, 2 = from D to A; between participant factor)

    participant = participant number

    poemIdentity = poem number

    lineNr = line number within poem

    poemVersion = Version of poem presented: (A = original poem with rhyme and meter, B = poem variant with only rhyme, C = poem variant with only meter, D = poem variant without rhyme and meter)

    verseEnd = whether a particular word/line was the last line of a stanza (0 = word/line within a stanza, 1 = last word/line of a stanza)

    BeginCloseRhyme = whether a particular line’s final word marked the opening or closing of a rhyme pair (1 = opening of rhyme, 2 = closing of rhyme)

    lastFix = whether a particular line or word was the last one of the poem (0 = word/line within a poem, 1 = last word/line of poem)

    totalGazeByWordNA = total gaze duration of final word of a line relative to word length

    gazeByLineLengthNA = total gaze duration of a line relative to line length

    dataIntegrity = percentage of valid position measurements by eye tracker during reading of a poem

  13. Shah Abdul Latif Bhittai Poetry Dataset

    • ieee-dataport.org
    Updated Aug 29, 2025
    AMBILE official (2025). Shah Abdul Latif Bhittai Poetry Dataset [Dataset]. https://ieee-dataport.org/documents/shah-abdul-latif-bhittai-poetry-dataset
    Dataset updated
    Aug 29, 2025
    Authors
    AMBILE official
    Description

    Tourism

  14. Open Poetry Vision Dataset

    • universe.roboflow.com
    zip
    Updated Apr 7, 2022
    + more versions
    Roboflow (2022). Open Poetry Vision Dataset [Dataset]. https://universe.roboflow.com/roboflow-gw7yv/open-poetry-vision/model/2
    Available download formats: zip
    Dataset updated
    Apr 7, 2022
    Dataset authored and provided by
    Roboflow (https://roboflow.com/)
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Variables measured
    Text Bounding Boxes
    Description

    Overview

    The Open Poetry Vision dataset is a synthetic dataset created by Roboflow for OCR tasks.

    It combines a random image from the Open Images Dataset with text primarily sampled from Gwern's GPT-2 Poetry project. Each image in the dataset contains between 1 and 5 strings in a variety of fonts and colors randomly positioned in the 512x512 canvas. The classes correspond to the font of the text.

    Example Image: https://i.imgur.com/sZT516a.png

    Use Cases

    A common OCR workflow is to use a neural network to isolate text for input into traditional optical character recognition software. This dataset could make a good starting point for an OCR project like business card parsing or automated paper form-processing.

    Alternatively, you could try your hand using this as a neural font identification dataset. Nvidia, amongst others, have had success with this task.

    Using this Dataset

    Use the fork button to copy this dataset to your own Roboflow account and export it with new preprocessing settings (perhaps resized for your model's desired format or converted to grayscale), or additional augmentations to make your model generalize better. This particular dataset would be very well suited for Roboflow's new advanced Bounding Box Only Augmentations.

    Version 5 of this dataset (classes_all_text-raw-images) has all classes remapped to be labeled as "text." This was accomplished by using Modify Classes as a preprocessing step.

    Version 6 of this dataset (classes_all_text-augmented-FAST) has all classes remapped to be labeled as "text." and was trained with Roboflow's Fast Model.

    Version 7 of this dataset (classes_all_text-augmented-ACCURATE) has all classes remapped to be labeled as "text." and was trained with Roboflow's Accurate Model.

    About Roboflow

    Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

    Developers reduce 50% of their code when using Roboflow's workflow, automate annotation quality assurance, save training time, and increase model reproducibility.


  15. Dataset of books called The poetry life : ten stories

    • workwithdata.com
    Updated Apr 17, 2025
    Work With Data (2025). Dataset of books called The poetry life : ten stories [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=The+poetry+life+%3A+ten+stories
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset is about books. It has 1 row and is filtered where the book is The poetry life : ten stories. It features 7 columns including author, publication date, language, and book publisher.

  16. Dataset of books called The poetry of praise

    • workwithdata.com
    Updated Apr 17, 2025
    Work With Data (2025). Dataset of books called The poetry of praise [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=The+poetry+of+praise
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset is about books. It has 2 rows and is filtered where the book is The poetry of praise. It features 7 columns including author, publication date, language, and book publisher.

  17. Chinese Poetry - Dataset - LDM

    • service.tib.eu
    Updated Dec 2, 2024
    (2024). Chinese Poetry - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/chinese-poetry
    Dataset updated
    Dec 2, 2024
    Area covered
    China
    Description

    The Chinese Poetry dataset is a dataset of Chinese poems used for language modeling.

  18. Dataset of book subjects that contain Poetry and the meaning of life :...

    • workwithdata.com
    Updated Nov 7, 2024
    Work With Data (2024). Dataset of book subjects that contain Poetry and the meaning of life : reading and writing poetry in language arts classrooms [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=j0-book&fop0=%3D&fval0=Poetry+and+the+meaning+of+life+:+reading+and+writing+poetry+in+language+arts+classrooms&j=1&j0=books
    Dataset updated
    Nov 7, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset is about book subjects. It has 3 rows and is filtered where the book is Poetry and the meaning of life : reading and writing poetry in language arts classrooms. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.

  19. Poetry Assessment EEG Dataset 1

    • openneuro.org
    Updated Sep 11, 2025
    + more versions
    Soma Chaudhuri; Joydeep Bhattacharya (2025). Poetry Assessment EEG Dataset 1 [Dataset]. http://doi.org/10.18112/openneuro.ds006648.v1.0.0
    Dataset updated
    Sep 11, 2025
    Dataset provided by
    OpenNeuro (https://openneuro.org/)
    Authors
    Soma Chaudhuri; Joydeep Bhattacharya
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Description

    Understanding how the brain engages with poetic language is key to advancing empirical research on aesthetic and creative cognition. This experiment involved 64-channel EEG recordings and behavioural ratings from 51 participants who read and evaluated 210 short English-language texts — 70 Haiku (nature-themed), 70 Senryu (emotion-themed), and 70 non-poetic Control texts. Each poem/text was rated on five subjective dimensions: Aesthetic Appeal, Vivid Imagery, Being Moved, Originality, and Creativity — using a 7-point scale.

    The full study involved 51 participants, and the data were divided into two BIDS-compliant datasets to ensure technical validation and facilitate upload to OpenNeuro.

    Poetry Assessment EEG Dataset 1 (this dataset) contains data from 47 participants whose continuous EEG recordings passed technical validation and were used in the primary analyses. In this dataset, the participants.tsv file maps anonymized BIDS IDs (sub-001 to sub-047) to the original participant codes used during data collection (P101–P151).

    Poetry Assessment EEG Dataset 2 includes the remaining 4 participants (P105, P141, P142, P146), whose EEG recordings were acquired in segments due to session interruptions and later concatenated during preprocessing. These participants were excluded from the PSD analysis to avoid potential artifacts but are included here for completeness and transparency.

    Dataset Structure and Navigation: Each subject folder contains four core EEG files:

    • channels.tsv – EEG channel metadata
    • eeg.json – EEG recording metadata
    • eeg.set – Raw EEG data (EEGLAB format)
    • events.tsv – Event markers aligned with poem presentation

    The /code/ directory includes:

    • Preprocessing.m – MATLAB preprocessing script
    • BioSemi64.loc – 64-channel coordinate file

    The /derivatives/ directory contains:

    Behavioural_Ratings/ – One .csv file per participant (e.g., P101.csv), including trial-by-trial ratings across five dimensions: Aesthetic Appeal, Vivid Imagery, Emotional Impact (labeled as 'being moved'), Originality, and Creativity.

    Psychometric_Responses/ – A single .csv file with demographic and trait-level questionnaire responses per participant, including: PANAS (mood), Openness, Curiosity, VVIQ (visual imagery), AVIQ (auditory imagery), MAAS (mindfulness), and AReA (aesthetic responsiveness).

    Also includes questionnaires.pdf with full questionnaire texts and scoring keys

    The /stimuli/ directory includes:

    All 210 texts used in the experiment: 70 Haiku (nature-themed poetry), 70 Senryu (emotion-themed poetry), 70 Control (non-poetic matched prose).

    Block-wise trial assignments for all seven blocks

    Resting-state EEG was recorded at the beginning and end of each session. These segments are embedded within the raw EEG files and can be identified using the following trigger codes in events.tsv:

    65285, 65286 → Resting state (before experiment); 65287, 65288 → Resting state (after experiment)
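    As a rough sketch of how these trigger codes could be located, the snippet below scans tab-separated event rows for the four resting-state codes. The column names `onset` and `value` follow common BIDS usage but are assumptions about this dataset's events.tsv, and the sample rows are invented for illustration.

```python
import csv
import io

# Trigger codes from the description, mapped to hypothetical labels.
REST_CODES = {
    65285: "rest_before",
    65286: "rest_before",
    65287: "rest_after",
    65288: "rest_after",
}


def find_resting_events(tsv_text, code_column="value"):
    """Return (onset, code, label) for rows whose code marks a resting state."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    hits = []
    for row in reader:
        try:
            code = int(float(row[code_column]))
        except (KeyError, TypeError, ValueError):
            continue  # missing column or non-numeric entry such as "n/a"
        if code in REST_CODES:
            hits.append((float(row["onset"]), code, REST_CODES[code]))
    return hits


# Invented sample rows; real onsets and columns will differ.
sample_tsv = (
    "onset\tduration\tvalue\n"
    "0.0\tn/a\t65285\n"
    "120.0\tn/a\t65286\n"
    "130.5\tn/a\t11\n"
    "3000.0\tn/a\t65287\n"
    "3120.0\tn/a\tn/a\n"
)

resting_events = find_resting_events(sample_tsv)
print(resting_events)
```

    If the codes are stored under a different column name, pass it via `code_column`.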

    Interested users may also consult Poetry Assessment EEG Dataset 2 to access recordings from the remaining 4 participants excluded from the main analyses. All preprocessing steps, event markers, and metadata structures were applied identically across both datasets (Poetry Assessment EEG Dataset 1 and Poetry Assessment EEG Dataset 2), ensuring consistency. This enables users to apply their own quality control pipelines and include these data if desired.

    Of note, the anonymized participant IDs (e.g., PXXX) are used consistently across all data modalities, enabling reliable cross-referencing between EEG data, behavioural ratings, and psychometric responses. Data collection took place at the Department of Psychology at Goldsmiths, University of London, UK. The project was approved by the Local Ethics Committee at the Department of Psychology, Goldsmiths University of London. The experiment was conducted in accordance with the Declaration of Helsinki.

    All EEG, behavioural, and psychometric data were anonymized. Participant identifiers were coded (P101–P151), and no names, dates of birth, or other direct identifiers are included.

  20. Malayalam Poem Syllable Duration Dataset

    • data.mendeley.com
    Updated Aug 23, 2021
    Jasir MP (2021). Malayalam Poem Syllable Duration Dataset [Dataset]. http://doi.org/10.17632/wh6fwmgccf.1
    Dataset updated
    Aug 23, 2021
    Authors
    Jasir MP
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset estimates the duration of Malayalam Poem syllables written in three Vruthas, Kakali, Manjari, and Keka.

merve/poetry

8 scholarly articles cite this dataset (Google Scholar).