9 datasets found
  1. english_quotes

    • huggingface.co
    • opendatalab.com
    Updated Dec 19, 2021
    Cite
    Abir ELTAIEF (2021). english_quotes [Dataset]. http://doi.org/10.57967/hf/1053
    Dataset updated
    Dec 19, 2021
    Authors
    Abir ELTAIEF
    Description

    Dataset Card for English quotes

      I-Dataset Summary
    

    english_quotes is a dataset of all the quotes retrieved from goodreads quotes. This dataset can be used for multi-label text classification and text generation. The content of each quote is in English and concerns the domain of datasets for NLP and beyond.

      II-Supported Tasks and Leaderboards
    

    Multi-label text classification: The dataset can be used to train a model for text-classification, which consists of… See the full description on the dataset page: https://huggingface.co/datasets/Abirate/english_quotes.
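The multi-label setup mentioned above can be sketched roughly as follows. This is only an illustration: the sample records and the field names `quote` and `tags` are assumptions made here, not details taken from this listing. The sketch turns each record's tag list into a multi-hot target vector, the usual target format for multi-label classifiers.

```python
# Hypothetical records; the field names "quote" and "tags" are assumed for illustration.
records = [
    {"quote": "Sample quote one.", "tags": ["life", "humor"]},
    {"quote": "Sample quote two.", "tags": ["love"]},
    {"quote": "Sample quote three.", "tags": ["life", "love"]},
]


def build_label_index(rows):
    """Map each distinct tag to a column index, in sorted order."""
    labels = sorted({tag for row in rows for tag in row["tags"]})
    return {label: i for i, label in enumerate(labels)}


def multi_hot(tags, label_index):
    """Encode a tag list as a 0/1 vector with one slot per known label."""
    vector = [0] * len(label_index)
    for tag in tags:
        if tag in label_index:  # tags outside the index are ignored
            vector[label_index[tag]] = 1
    return vector


label_index = build_label_index(records)
targets = [multi_hot(row["tags"], label_index) for row in records]
```

Each row of `targets` can carry several 1s at once, which is what distinguishes multi-label from single-label classification.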

  2. english-quotes-dataset

    • huggingface.co
    Updated Sep 18, 2024
    Cite
    Ade Olubummo (2024). english-quotes-dataset [Dataset]. https://huggingface.co/datasets/adeo/english-quotes-dataset
    Dataset updated
    Sep 18, 2024
    Authors
    Ade Olubummo
    Description

    adeo/english-quotes-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. English Quotes

    • kaggle.com
    zip
    Updated Jul 18, 2025
    Cite
    Vijay J0shi (2025). English Quotes [Dataset]. https://www.kaggle.com/datasets/vijayj0shi/english-quotes
    Available download format: zip (444988 bytes)
    Dataset updated
    Jul 18, 2025
    Authors
    Vijay J0shi
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Vijay J0shi

    Released under MIT

    Contents

  4. english_quotes_paraphrase

    • huggingface.co
    Updated Oct 23, 2024
    Cite
    speed (2024). english_quotes_paraphrase [Dataset]. https://huggingface.co/datasets/speed/english_quotes_paraphrase
    Dataset updated
    Oct 23, 2024
    Authors
    speed
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset is a paraphrase of https://huggingface.co/datasets/Abirate/english_quotes using the google/gemma-2-2b-it model. The license follows the original dataset's Creative Commons Attribution 4.0 International License. Paraphrasing was conducted using text2dataset.

  5. Wikiquote Short English Quotes

    • kaggle.com
    zip
    Updated Jun 19, 2018
    Cite
    Seth Miller (2018). Wikiquote Short English Quotes [Dataset]. https://www.kaggle.com/datasets/fantop/wikiquote-short-english-quotes/code
    Available download format: zip (1259052 bytes)
    Dataset updated
    Jun 19, 2018
    Authors
    Seth Miller
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0) (https://creativecommons.org/licenses/by-sa/3.0/)
    License information was derived automatically

    Description

    Context

    There aren't any large, public datasets of quotes to be found online (at the time of writing). So I decided to create my own by parsing and cleaning up a Wikiquote data dump. To create your own dataset with different languages and cutoff lengths, check out my Github repository.

    Content

    quotes-100-en.json

    A JSON file containing English quotes shorter than 100 characters, scraped from Wikiquote.
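A length cutoff like the one described above might be applied along these lines. This is a minimal sketch under stated assumptions: the listing does not describe the schema of quotes-100-en.json, so a flat list of strings is assumed, and `filter_short_quotes` and the sample input are hypothetical names introduced here, not part of the author's repository.

```python
import json


def filter_short_quotes(quotes, max_chars=100):
    """Keep non-empty quotes strictly shorter than max_chars after trimming."""
    kept = []
    for quote in quotes:
        text = quote.strip()
        if text and len(text) < max_chars:
            kept.append(text)
    return kept


# Hypothetical input standing in for quotes parsed from a data dump.
raw_quotes = ["  Short and sweet.  ", "x" * 100, "", "Another brief line."]

short_quotes = filter_short_quotes(raw_quotes, max_chars=100)

# Serialize as a JSON array; ensure_ascii=False keeps non-ASCII characters readable.
serialized = json.dumps(short_quotes, ensure_ascii=False, indent=2)
```

Changing `max_chars` gives a different cutoff length, which is the kind of variation the description mentions.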

    Acknowledgments

    Huge thanks to all the contributors to Wikiquote, and the Wikimedia Foundation.

    Inspiration

    Analysis and interpretation of quotes from important historical figures.

  6. english_quotes_sanitized

    • huggingface.co
    Updated Jun 1, 2025
    Cite
    Alvaro Moran (2025). english_quotes_sanitized [Dataset]. https://huggingface.co/datasets/tengomucho/english_quotes_sanitized
    Dataset updated
    Jun 1, 2025
    Authors
    Alvaro Moran
    Description

    This dataset is the same as Abirate/english_quotes, but I sanitized the author and sanitized the text to avoid weird characters.

    from ftfy import fix_encoding
    from datasets import load_dataset

    def correct_encoding(examples):
        quote = examples["quote"]
        author = examples["author"]

        # remove trailing comma from authors and fix encoding
        author = author.rstrip(",")
        author = fix_encoding(author)
        examples["author"] = author

        # fix encoding
        quote = fix_encoding(quote)

    … See the full description on the dataset page: https://huggingface.co/datasets/tengomucho/english_quotes_sanitized.
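For readers without ftfy installed, the cleaning steps in the snippet above can be approximated with the standard library alone. This is a rough stand-in, not the author's method: `repair_mojibake` and `clean_record` are hypothetical helpers, and the round trip below only handles one kind of damage (UTF-8 bytes that were wrongly decoded as Windows-1252), which is much narrower than what ftfy covers.

```python
def repair_mojibake(text):
    """Undo UTF-8 text that was mis-decoded as Windows-1252.

    Returns the input unchanged when the round trip does not apply.
    """
    try:
        return text.encode("cp1252").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


def clean_record(record):
    """Return a cleaned copy: strip trailing commas from the author, repair both fields."""
    cleaned = dict(record)
    cleaned["author"] = repair_mojibake(record["author"].rstrip(","))
    cleaned["quote"] = repair_mojibake(record["quote"])
    return cleaned


# Hypothetical record; the quote contains a mis-decoded right single quotation mark.
sample = {"quote": "It\u00e2\u20ac\u2122s fine.", "author": "Jane Doe,"}
cleaned = clean_record(sample)
```

Working on a copy leaves the input record untouched, so the original and cleaned values can be compared afterwards.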
    
  7. english-quotes-text

    • huggingface.co
    Updated Sep 19, 2024
    Cite
    Andrea Soria (2024). english-quotes-text [Dataset]. https://huggingface.co/datasets/asoria/english-quotes-text
    Dataset updated
    Sep 19, 2024
    Authors
    Andrea Soria
    Description

    This dataset was created using english-quotes dataset and SQL Console: Query

  8. english_quotes_ja

    • huggingface.co
    Updated Oct 23, 2024
    Cite
    speed (2024). english_quotes_ja [Dataset]. https://huggingface.co/datasets/speed/english_quotes_ja
    Dataset updated
    Oct 23, 2024
    Authors
    speed
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    This dataset is a translation of https://huggingface.co/datasets/Abirate/english_quotes into Japanese using the llm-jp/llm-jp-3-3.7b-instruct model. The license follows the original dataset's Creative Commons Attribution 4.0 International License. The translation was performed using text2dataset.

  9. text-dataset-tiny-code-script-py-format

    • huggingface.co
    Updated Aug 12, 2025
    Cite
    ysn_rfd (2025). text-dataset-tiny-code-script-py-format [Dataset]. https://huggingface.co/datasets/ysn-rfd/text-dataset-tiny-code-script-py-format
    Dataset updated
    Aug 12, 2025
    Authors
    ysn_rfd
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    Uses tahamajs/medicine_ds_persian for the .parquet file.

    Uses Alijafarixcs2/persian-it-llama2-2k for the .parquet file.

    Uses Abirate/english_quotes for the .jsonl file.

    Several Markdown (.md) files have been added to the dataset; language: English.

      The biggest update is coming.

      PyTorch update

      Uploaded some images to the path pytorch_directml_cpu_optimized_low_end_pc

      17 months left ("The time was changed and postponed, sorry!")
    


Abirate/english_quotes

3 scholarly articles cite this dataset (View in Google Scholar)