100+ datasets found
  1. h

    poetry

    • huggingface.co
    Updated Oct 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    merve (2024). poetry [Dataset]. https://huggingface.co/datasets/merve/poetry
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 14, 2024
    Authors
    merve
    Description

    Dataset Card for poetry

      Dataset Summary
    

    It contains poems from subjects: Love, Nature and Mythology & Folklore that belong to two periods namely Renaissance and Modern

      Supported Tasks and Leaderboards
    

    [Needs More Information]

      Languages
    

    [Needs More Information]

      Dataset Structure
    
    
    
    
    
      Data Instances
    

    [Needs More Information]

      Data Fields
    

    Has 5 columns:

    Content Author Poem name Age Type

      Data Splits
    

    Only training… See the full description on the dataset page: https://huggingface.co/datasets/merve/poetry.

  2. H

    AraPoems: An Extensive Dataset of Arabic Poetry Associated with Verses,...

    • dataverse.harvard.edu
    • dataone.org
    Updated Sep 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Faisal Qarah (2024). AraPoems: An Extensive Dataset of Arabic Poetry Associated with Verses, Rhymes, Meters, and More [Dataset]. http://doi.org/10.7910/DVN/PJPWOY
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 14, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Faisal Qarah
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The largest Arabic poetry dataset that contains more than 2.09 million verses. The dataset is comprehensive and contains additional information associated for each verse such as poet's name, poem's title, era, meter, sub-meter, etc.

  3. h

    PoetryFoundationData

    • huggingface.co
    Updated Mar 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shahul Es (2023). PoetryFoundationData [Dataset]. https://huggingface.co/datasets/shahules786/PoetryFoundationData
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 13, 2023
    Authors
    Shahul Es
    Description

    This file contains nearly all poems from the Poetry Foundation Website. Content All poems have a title and author. Most poems are also labeled with the tags as available from the Poetry Foundation Website. The word cloud above shows the most used tags! Inspiration This dataset can be used for a variety of tasks related to poetry writing.

  4. Blackout Poetry Dataset

    • kaggle.com
    Updated Jan 9, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aditeya Baral (2022). Blackout Poetry Dataset [Dataset]. https://www.kaggle.com/aditeyabaral/blackout-poetry-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 9, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aditeya Baral
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Blackout Poetry Dataset

    A blackout poetry dataset constructed from publicly available short stories and large poems. The dataset consists of two variants: 8K and 16K examples of passages along with a poem generated from the passage and the indices of the words in the passage from which words in the poem have been selected. The dataset also contains perplexity scores for each of the poems indicating the language quality of the poems.

    The dataset was constructed synthetically, and hence contains multiple poor poems and frequent grammatical errors. However, it is a great starting point for the task of applying machine learning to blackout poetry generation.

    The dataset was first introduced in MAPLE – MAsking words to generate blackout Poetry using sequence-to-sequence LEarning.

    Content

    The dataset has two variants: - 8K (sampled poems from the 16K dataset with the lowest perplexity scores) - 16K

    Both variants contain data in the following format:

    passagepoemindices
    Did the CIA tell the FBI that it knows the wor...cia fbi the biggest weapon[2, 5, 9, 24, 25]
    A vigilante lacking of heroic qualities that
    ...lacking qualities that damn criminals[2, 5, 6, 11, 12]

    The passage is the text from which the poem is generated. The poem is the generated poem. The indices are the indices of the words in the text that are chosen for the poem.

    Acknowledgements

    This dataset was generated synthetically using Liza Daly's pattern matching based blackout poetry generation.

  5. h

    prompt-poem-dataset-20240921_004141

    • huggingface.co
    Updated Sep 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vida Tayebati (2024). prompt-poem-dataset-20240921_004141 [Dataset]. https://huggingface.co/datasets/VidaEdco/prompt-poem-dataset-20240921_004141
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 21, 2024
    Authors
    Vida Tayebati
    Description

    VidaEdco/prompt-poem-dataset-20240921_004141 dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. P

    CCPM Dataset

    • paperswithcode.com
    Updated Jun 4, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wenhao Li; Fanchao Qi; Maosong Sun; Xiaoyuan Yi; Jiarui Zhang (2021). CCPM Dataset [Dataset]. https://paperswithcode.com/dataset/ccpm
    Explore at:
    Dataset updated
    Jun 4, 2021
    Authors
    Wenhao Li; Fanchao Qi; Maosong Sun; Xiaoyuan Yi; Jiarui Zhang
    Description

    Introduction

    CCPM is a large Chinese classical poetry matching dataset that can be used for poetry matching, understanding and translation.

    The main task of this dataset is: given a description in modern Chinese, the model is supposed to select one line of Chinese classical poetry from four candidates that semantically match the given description most.

    Size

    It contains 27,218 instances in total, which are split into training (21,778), validation (2,720) and test (2,720) sets.

    Format

    Each instance is composed of translation (the description in modern Chinese, a string), choice (four candidate lines of Chinese classical poetry, a list) and answer (the index of the correct line, an integer between 0 and 3).

  7. Open Poetry Vision Dataset

    • universe.roboflow.com
    zip
    Updated Apr 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roboflow (2022). Open Poetry Vision Dataset [Dataset]. https://universe.roboflow.com/roboflow-gw7yv/open-poetry-vision/model/2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 7, 2022
    Dataset authored and provided by
    Roboflowhttps://roboflow.com/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Text Bounding Boxes
    Description

    Overview

    The Open Poetry Vision dataset is a synthetic dataset created by Roboflow for OCR tasks.

    It combines a random image from the Open Images Dataset with text primarily sampled from Gwern's GPT-2 Poetry project. Each image in the dataset contains between 1 and 5 strings in a variety of fonts and colors randomly positioned in the 512x512 canvas. The classes correspond to the font of the text.

    Example Image: https://i.imgur.com/sZT516a.png" alt="Example Image">

    Use Cases

    A common OCR workflow is to use a neural network to isolate text for input into traditional optical character recognition software. This dataset could make a good starting point for an OCR project like business card parsing or automated paper form-processing.

    Alternatively, you could try your hand using this as a neural font identification dataset. Nvidia, amongst others, have had success with this task.

    Using this Dataset

    Use the fork button to copy this dataset to your own Roboflow account and export it with new preprocessing settings (perhaps resized for your model's desired format or converted to grayscale), or additional augmentations to make your model generalize better. This particular dataset would be very well suited for Roboflow's new advanced Bounding Box Only Augmentations.

    Version 5 of this dataset (classes_all_text-raw-images) has all classes remapped to be labeled as "text." This was accomplished by using Modify Classes as a preprocessing step.

    Version 6 of this dataset (classes_all_text-augmented-FAST) has all classes remapped to be labeled as "text." and was trained with Roboflow's Fast Model.

    Version 7 of this dataset (classes_all_text-augmented-ACCURATE) has all classes remapped to be labeled as "text." and was trained with Roboflow's Accurate Model.

    About Roboflow

    Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

    Developers reduce 50% of their code when using Roboflow's workflow, automate annotation quality assurance, save training time, and increase model reproducibility.

    Roboflow Workmark

  8. w

    Dataset of books about English poetry

    • workwithdata.com
    Updated Apr 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books about English poetry [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=j0-book_subject&fop0=%3D&fval0=English+poetry&j=1&j0=book_subjects
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 4,938 rows and is filtered where the book subjects is English poetry. It features 9 columns including author, publication date, language, and book publisher.

  9. S

    A dataset of poetry literature landscape of Chinese eminence mountains from...

    • scidb.cn
    Updated May 18, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Du Xiaohan; Hu Di; Li Daiwei; Zhou Sifan; Bai Tianyi (2021). A dataset of poetry literature landscape of Chinese eminence mountains from pre-Qin dynasty to Tang and Song Dynasties [Dataset]. http://doi.org/10.11922/sciencedb.j00001.00232
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 18, 2021
    Dataset provided by
    Science Data Bank
    Authors
    Du Xiaohan; Hu Di; Li Daiwei; Zhou Sifan; Bai Tianyi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Song dynasty
    Description

    The data set includes three Excel data sheets, namely, famous mountain table, poetry table and poet table. The famous mountain list includes fields such as famous mountain number, famous mountain type, social characteristics, famous mountain name and province; The poetry table includes fields such as poetry number, poetry name, author, Dynasty and creation time; The poet list includes the poet's number, name, alias, time of birth and time of death.

  10. d

    20C Poetry

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrew Piper (2023). 20C Poetry [Dataset]. http://doi.org/10.7910/DVN/YVN6IW
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Andrew Piper
    Description

    This is a table of word counts for a collection of 75,297 English-language poems.

  11. w

    Dataset of books about Italian poetry

    • workwithdata.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books about Italian poetry [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=j0-book_subject&fop0=%3D&fval0=Italian+poetry&j=1&j0=book_subjects
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 12 rows and is filtered where the book subjects is Italian poetry. It features 9 columns including author, publication date, language, and book publisher.

  12. h

    Hindi-Poetry-Dataset

    • huggingface.co
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Sajjad Rasool (2025). Hindi-Poetry-Dataset [Dataset]. https://huggingface.co/datasets/ReySajju742/Hindi-Poetry-Dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 12, 2025
    Authors
    Muhammad Sajjad Rasool
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Hindi Transliteration of Urdu Poetry Dataset

    Welcome to the Hindi Transliteration of Urdu Poetry Dataset! This dataset features Hindi transliterations of traditional Urdu poetry. Each entry in the dataset includes two columns:

    Title: The transliterated title of the poem in Hindi. Poem: The transliterated text of the Urdu poem rendered in Hindi script.

    This dataset is perfect for researchers and developers working on cross-script language processing, transliteration models, and… See the full description on the dataset page: https://huggingface.co/datasets/ReySajju742/Hindi-Poetry-Dataset.

  13. w

    Dataset of books called Poetry : reading, reacting, writing

    • workwithdata.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of books called Poetry : reading, reacting, writing [Dataset]. https://www.workwithdata.com/datasets/books?f=1&fcol0=book&fop0=%3D&fval0=Poetry+%3A+reading%2C+reacting%2C+writing
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about books. It has 1 row and is filtered where the book is Poetry : reading, reacting, writing. It features 7 columns including author, publication date, language, and book publisher.

  14. Z

    Dataset of limericks for computational poetics

    • data.niaid.nih.gov
    • zenodo.org
    Updated Nov 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yohei Igarashi (2021). Dataset of limericks for computational poetics [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5520077
    Explore at:
    Dataset updated
    Nov 24, 2021
    Dataset provided by
    Almas Abdibayev
    Yohei Igarashi
    Daniel Rockmore
    Allen Riddell
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Description

    Herein is a data set comprising 98k limericks scraped from the The Omnificent English Dictionary In Limerick Form - OEDILF. It is a subset of the full data set, filtered to pass a basic test of standard limerick form (i.e., ensuring five lines, no emojis, no symbols). Each limerick was written by a human contributor whose work has passed through a rigorous moderation. This dataset is released alongside two companion papers: "BPoMP: The Benchmark of Poetic Minimal Pairs – Limericks, Rhyme, and Narrative Coherence" (Abdibayev, Riddell, Rockmore, RANLP 2021) and "Automating the Detection of Poetic Features: The Limerick as Model Organism" (Abdibayev, Riddell, Igarashi, Rockmore, SIGHUM 2021). The dataset is primarily released for use by NLP researchers interested in studying formal structure of poetry and more generally, interested in computational poetics. Each limerick is accompanied by metadata: author information, id within the website and "is_limerick" field, which denotes if limerick was recognized by our custom filter that was built to check for formal limerick properties (this tagging was a goal of the SIGHUM paper and reflects the results reported there - see the paper for details). Thus, if "is_limerick"=True this is a true positive, "is_limerick"=False is (almost surely) a false negative. We identify 70% of these as limericks and provide the tagging as a benchmark for the community to improve upon. With these considerations in mind we hope that NLP community will use this dataset to study poetical knowledge of language models trained on large corpora as many of their properties still remain a mystery to the community at large. We are excited for the possibilities ahead!

    UPDATE: we released a new version of our dataset that contains all of the limericks that we planned to publish. Previous version (v2) was created using code that contained a bug which in turn lowered the number of available limericks.

  15. m

    Data from: Classical Arabic Poetry: Classification based on Era

    • data.mendeley.com
    Updated Nov 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariam Orabi (2021). Classical Arabic Poetry: Classification based on Era [Dataset]. http://doi.org/10.17632/mcj6vkg6zw.1
    Explore at:
    Dataset updated
    Nov 4, 2021
    Authors
    Mariam Orabi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the cleaned fragmented dataset described in the paper "Classical Arabic Poetry: Classification based on Era". The dataset was originally scraped from Adab.com in April 2020.

  16. Poetry

    • kaggle.com
    Updated Sep 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Likai Peng (2021). Poetry [Dataset]. https://www.kaggle.com/penglikai/poetry/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 28, 2021
    Dataset provided by
    Kaggle
    Authors
    Likai Peng
    Description

    Dataset

    This dataset was created by Likai Peng

    Contents

  17. S

    Dataset of imagery and sentiment in frontier poetry throughout history

    • scidb.cn
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiang Zudong; Li Lin; Li Chengcheng (2025). Dataset of imagery and sentiment in frontier poetry throughout history [Dataset]. http://doi.org/10.57760/sciencedb.25440
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 23, 2025
    Dataset provided by
    Science Data Bank
    Authors
    Jiang Zudong; Li Lin; Li Chengcheng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Frontier poetry is one of the most important themes in classical Chinese poetry, focusing on life and scenery in border regions. Imagery is a semantic composite of subjective and objective interactions, representing the objective objects of the poet's subjective emotions. The imagery system of frontier poetry exhibits significant regional convergence and cultural symbolism. This paper constructs a dataset of imagery sentiment in frontier poetry, which includes 40,000 frontier poems from the pre-Qin period to the present. It uses a combination of textual criticism and computational linguistics theories and methods to annotate and proofread the imagery and sentiments expressed in frontier poetry. This dataset not only provides rich research data for the study of frontier poetry, but also provides a macro perspective for in-depth exploration of the evolution of imagery sentiment in poetry.This dataset crawled 42,836 frontier poems from the Internet, covering war poems from the Book of Songs in the pre-Qin period to contemporary new poems, spanning the pre-Qin to modern and contemporary periods, striving to be complete, accurate, and reliable. The crawled data was cleaned and standardized, non-text symbols and redundant format tags were removed, a table of variant characters was established, and ancient texts were used to restore garbled characters through exegesis. Incorrectly identified poems were deleted, and finally, sentence segmentation and error correction were performed, with each sentence separated by commas and periods. In the end, a total of 42,807 high-quality frontier poems were obtained. Based on the collected poem texts, we constructed a data annotation system containing the encoding, author, name, imagery, and sentiment information of the poems. Each poem has a unique number, with the first two digits representing the dynasty number, such as “01” for the pre-Qin period, the middle four digits representing the author number, with poets sorted by their birth and death years, and the last two digits representing the serial number of the work, sorted by the first letter of the title. The imagery data of the poems and lyrics is annotated using a pre-trained model and manual review, while the sentiment is annotated manually.The final dataset consists of 11 CSV tables, with one table for each dynasty, and the files are named after the dynasty. Each data point consists of six parts: code, author, name, text, imagery, and sentiment.

  18. Hindi Poetry Dataset

    • kaggle.com
    zip
    Updated Jun 13, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    itsshavar (2020). Hindi Poetry Dataset [Dataset]. https://www.kaggle.com/shishu1421/hindi-poetry-dataset
    Explore at:
    zip(4136392 bytes)Available download formats
    Dataset updated
    Jun 13, 2020
    Authors
    itsshavar
    Description

    Context

    The data is scraped from a website consist of Gulzaar's pukhraaj , Rahat's Dhoop Bahut hai and Naaraz.

    Content

    These files are related to three poetry series of Gulzaar and Rahat Indauri. Series' are as follows: Pukhraaj Dhoop Bahut Hai Naaraz All these poetries are written in mixture of Hindi -Urdu words. These files are incremental and has good overlapping.

    Acknowledgements

    This is all possible because of these wonderful poets and also people who made them available online.

    Inspiration

    Scrapping the hindi poetry came in to my mind after the launch of OpenAI GPT-3. I decided to check the output on Hindi language mainly on poetries and the output was really good some of them were core Urdu words I was not able to understand them. But the overall experience was good. You can also use this dataset to explore more in field of Natural Language Generation and Analysis of Hindi-Urdu Literature.

  19. f

    Dataset: What the Eyes Reveal about (Reading) Poetry

    • figshare.com
    txt
    Updated Dec 16, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sebastian Wallot; Winfried Menninghaus (2020). Dataset: What the Eyes Reveal about (Reading) Poetry [Dataset]. http://doi.org/10.6084/m9.figshare.13387475.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Dec 16, 2020
    Dataset provided by
    figshare
    Authors
    Sebastian Wallot; Winfried Menninghaus
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    dataPOEM.csv

    The dataPOEM.csv data set contains data on the level of each poem.

    scoresAes = factor scores of moving, beauty, and melodious ratings.

    participant = participant number

    poemVersion = Version of poem presented: (A = original poem with rhyme and meter, B = poem variant with only rhyme, C = poem variant with only meter, D = poem variant without rhyme and meter)

    poemIdentity = poem number

    avgWFreq = average word frequency of poem

    totalGazeSlopeLineLength

    totalGazeWordMeanNAByWordLen

    totalGazeWordMeanNADiff

    order = order of presentation (1 = from A to D, 2 = from D to A; between participant factor)

    firstFixDurMS_MINFIX_AVG = first fixation duration

    totalGazeMS_MINFIX_AVG = total gaze durations

    fixDurMS_MINFIX_NUM = number of fixations

    sacLenMS_MINFIX_AVG = average saccade length

    percRegMS_MINFIX_AVG = percentage of regressive eye movements

    pupilDial_AVG = average pupil dilation

    blink_NUM_TotalRT = number of blinks relative to total reading time

    totalReadingTime = total reading time of the poem

    areaTT = total score of the Aesthetic Responsiveness Assessment questionnaire

    dataIntegrity = percentage of valid position measurements by eye tracker during reading of a poem

    moving = rating of how moving the poem was

    beauty = rating of how beautiful the poem was

    melodious = rating of how melodious the poem was

    dataROI.csv

    The dataROI.csv data set contains data on the level of each line within a poem.

    order = order of presentation (1 = from A to D, 2 = from D to A; between participant factor)

    participant = participant number

    poemIdentity = poem number

    lineNr = line number within poem

    poemVersion = Version of poem presented: (A = original poem with rhyme and meter, B = poem variant with only rhyme, C = poem variant with only meter, D = poem variant without rhyme and meter)

    verseEnd = wheter a particular word/line was the last line of a stanza (0 = word/line within a stanza, 1 = last word/line of a stanza)

    BeginCloseRhyme = whether a particular line’s final word marked the opening or closing of a rhyme pair (1 = opening of rhyme, 2 = closing of rhyme)

    lastFix = whether a particular line or word was the last one of the poem (0 = word/line within a poem, 1 = last word/line of poem)

    totalGazeByWordNA = total gaze duration of final word of a line relative to word length

    gazeByLineLengthNA = total gaze duration of a line relative to line length

    dataIntegrity = percentage of valid position measurements by eye tracker during reading of a poem

  20. Readership of poetry in the U.S. 2012-2017, by ethnicity

    • statista.com
    Updated Feb 14, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2019). Readership of poetry in the U.S. 2012-2017, by ethnicity [Dataset]. https://www.statista.com/statistics/971673/ethnic-groups-poetry-reading/
    Explore at:
    Dataset updated
    Feb 14, 2019
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    This statistic shows the share of adults reading poetry in the United States in 2012 and 2017, broken down by ethnicity. The data reveals that the share of surveyed Asian Americans in the U.S. reading poetry more than doubled in five years, increasing from 4.8 percent in 2012 to 12.6 percent in 2017. In fact, there was a significant increase in poetry readership among all surveyed ethnic groups.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
merve (2024). poetry [Dataset]. https://huggingface.co/datasets/merve/poetry

poetry

merve/poetry

Explore at:
4 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 14, 2024
Authors
merve
Description

Dataset Card for poetry

  Dataset Summary

It contains poems from subjects: Love, Nature and Mythology & Folklore that belong to two periods namely Renaissance and Modern

  Supported Tasks and Leaderboards

[Needs More Information]

  Languages

[Needs More Information]

  Dataset Structure





  Data Instances

[Needs More Information]

  Data Fields

Has 5 columns:

Content Author Poem name Age Type

  Data Splits

Only training… See the full description on the dataset page: https://huggingface.co/datasets/merve/poetry.

Search
Clear search
Close search
Google apps
Main menu