9 datasets found

h
english_quotes
huggingface.co
opendatalab.com
Updated Dec 19, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abir ELTAIEF (2021). english_quotes [Dataset]. http://doi.org/10.57967/hf/1053
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.57967/hf/1053
Dataset updated
Dec 19, 2021
Authors
Abir ELTAIEF
Description
Dataset Card for English quotes

I-Dataset Summary

english_quotes is a dataset of all the quotes retrieved from goodreads quotes. This dataset can be used for multi-label text classification and text generation. The content of each quote is in English and concerns the domain of datasets for NLP and beyond.

II-Supported Tasks and Leaderboards

Multi-label text classification : The dataset can be used to train a model for text-classification, which consists of… See the full description on the dataset page: https://huggingface.co/datasets/Abirate/english_quotes.
h
english-quotes-dataset
huggingface.co
Updated Sep 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ade Olubummo (2024). english-quotes-dataset [Dataset]. https://huggingface.co/datasets/adeo/english-quotes-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 18, 2024
Authors
Ade Olubummo
Description
adeo/english-quotes-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
English Quotes
kaggle.com
zip
Updated Jul 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vijay J0shi (2025). English Quotes [Dataset]. https://www.kaggle.com/datasets/vijayj0shi/english-quotes
Explore at:
zip(444988 bytes)Available download formats
Dataset updated
Jul 18, 2025
Authors
Vijay J0shi
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Vijay J0shi

Released under MIT

Contents
h
english_quotes_paraphrase
huggingface.co
Updated Oct 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
speed (2024). english_quotes_paraphrase [Dataset]. https://huggingface.co/datasets/speed/english_quotes_paraphrase
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 23, 2024
Authors
speed
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is a paraphrase of https://huggingface.co/datasets/Abirate/english_quotes using the google/gemma-2-2b-it model. The license follows the original dataset's Creative Commons Attribution 4.0 International License. Paraphrasing was conducted using text2dataset.
Wikiquote Short English Quotes
kaggle.com
zip
Updated Jun 19, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Seth Miller (2018). Wikiquote Short English Quotes [Dataset]. https://www.kaggle.com/datasets/fantop/wikiquote-short-english-quotes/code
Explore at:
zip(1259052 bytes)Available download formats
Dataset updated
Jun 19, 2018
Authors
Seth Miller
License
Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
License information was derived automatically
Description
Context

There aren't any large, public datasets of quotes to be found online (at the time of writing). So I decided to create my own by parsing and cleaning up a Wikiquote data dump. To create your own dataset with different languages and cutoff lengths, check out my Github repository.

Content

quotes-100-en.json

A JSON file containing english quotes less than 100 characters, scraped from Wikiquote.

Acknowledgments

Huge thanks to all the contributors to Wikiquote, and the Wikimedia Foundation.

Inspiration

Analysis and interpretation of quotes from important historical figures.
h
english_quotes_sanitized
huggingface.co
Updated Jun 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alvaro Moran (2025). english_quotes_sanitized [Dataset]. https://huggingface.co/datasets/tengomucho/english_quotes_sanitized
Explore at:
Dataset updated
Jun 1, 2025
Authors
Alvaro Moran
Description
This dataset is the same as Abirate/english_quotes, but I sanitized the author and sanitized the text to avoid weird characters. from ftfy import fix_encoding from datasets import load_dataset

def correct_encoding(examples): quote = examples["quote"] author = examples["author"]

# remove trailing comma from authors and fix encoding author = author.rstrip(",") author = fix_encoding(author) examples["author"] = author # fix encoding quote = fix_encoding(quote)… See the full description on the dataset page: https://huggingface.co/datasets/tengomucho/english_quotes_sanitized.
h
english-quotes-text
huggingface.co
Updated Sep 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andrea Soria (2024). english-quotes-text [Dataset]. https://huggingface.co/datasets/asoria/english-quotes-text
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 19, 2024
Authors
Andrea Soria
Description
This dataset was created using english-quotes dataset and SQL Console: Query
h
english_quotes_ja
huggingface.co
Updated Oct 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
speed (2024). english_quotes_ja [Dataset]. https://huggingface.co/datasets/speed/english_quotes_ja
Explore at:
Dataset updated
Oct 23, 2024
Authors
speed
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is a translation of https://huggingface.co/datasets/Abirate/english_quotes into Japanese using the llm-jp/llm-jp-3-3.7b-instruct model. The license follows the original dataset's Creative Commons Attribution 4.0 International License. The translation was performed using text2dataset.
h
text-dataset-tiny-code-script-py-format
huggingface.co
Updated Aug 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ysn_rfd (2025). text-dataset-tiny-code-script-py-format [Dataset]. https://huggingface.co/datasets/ysn-rfd/text-dataset-tiny-code-script-py-format
Explore at:
Dataset updated
Aug 12, 2025
Authors
ysn_rfd
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
USED of tahamajs/medicine_ds_persian for .parquet file

USED of Alijafarixcs2/persian-it-llama2-2k for .parquet file

USED of Abirate/english_quotes for .jsonl file

SEVERAL Markdown (.md) files have been added to the dataset; Language: English.

The biggest update is coming. Pytroch UPDATE UPLOADED SOME IMAGES IN PATH pytorch_directml_cpu_optimized_low_end_pc 17 months left ("The time was changed and postponed, Sorry!.")
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Abir ELTAIEF (2021). english_quotes [Dataset]. http://doi.org/10.57967/hf/1053

english_quotes

Abirate/english_quotes

Explore at:

3 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Unique identifier

https://doi.org/10.57967/hf/1053

Dataset updated

Dec 19, 2021

Authors

Abir ELTAIEF

Description

Dataset Card for English quotes

  I-Dataset Summary

english_quotes is a dataset of all the quotes retrieved from goodreads quotes. This dataset can be used for multi-label text classification and text generation. The content of each quote is in English and concerns the domain of datasets for NLP and beyond.

  II-Supported Tasks and Leaderboards

Multi-label text classification : The dataset can be used to train a model for text-classification, which consists of… See the full description on the dataset page: https://huggingface.co/datasets/Abirate/english_quotes.

Clear search

Close search

Google apps

Main menu

english_quotes

english-quotes-dataset

English Quotes

Dataset

Contents

english_quotes_paraphrase

Wikiquote Short English Quotes

Context

Content

Acknowledgments

Inspiration

english_quotes_sanitized

english-quotes-text

english_quotes_ja

text-dataset-tiny-code-script-py-format

english_quotesSee More Versions

Abirate/english_quotes

english_quotes