10 datasets found

instruction-dataset-mini-with-generations
huggingface.co
Cite
Abdoulaye Diallo, instruction-dataset-mini-with-generations [Dataset]. https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations
Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Abdoulaye Diallo
Description
Dataset Card for instruction-dataset-mini-with-generations

This dataset has been created with distilabel.

Dataset Summary

This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations/raw/main/pipeline.yaml"

or explore the configuration: distilabel pipeline info… See the full description on the dataset page: https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations.
Combined Generations Wave 1 and TransPop surveys, United States, 2016-2018
icpsr.umich.edu
myumi.ch
Updated Aug 29, 2024
Cite
Meyer, Ilan H. (2024). Combined Generations Wave 1 and TransPop surveys, United States, 2016-2018 [Dataset]. http://doi.org/10.3886/ICPSR38421.v1
Available download formats: ascii, delimited, sas, spss, r, stata
Unique identifier
https://doi.org/10.3886/ICPSR38421.v1
Dataset updated
Aug 29, 2024
Dataset provided by
Inter-university Consortium for Political and Social Research (https://www.icpsr.umich.edu/web/pages/)
Authors
Meyer, Ilan H.
License
https://www.icpsr.umich.edu/web/ICPSR/studies/38421/terms
Time period covered
2016 - 2018
Area covered
United States
Description
This collection includes a combined dataset of the Generations study wave 1 (baseline) survey and the TransPop study transgender survey. The two studies have many overlapping variables, and they examined topics such as respondents' health outcomes and behaviors, experiences with discrimination, identity, and transition-related experiences. Data from these studies were merged to allow for analysis of the combined LGBT populations. This dataset has also been reweighted to be representative of these populations. The complete Generations study data (baseline, wave 2, and wave 3 survey data) can be found under study number 37166, and the complete TransPop study data (transgender and cisgender survey data) can be found under study number 37938. For detailed information on the Generations and TransPop studies, including related publications, please refer to their respective DSDR/ICPSR study pages.
Data from: Social Bonds Across Immigration Generations and the Immigrant...
catalog.data.gov
s.cnmilf.com
Updated Mar 12, 2025
Cite
National Institute of Justice (2025). Social Bonds Across Immigration Generations and the Immigrant School Enclave: A Multilevel Longitudinal Study of Student Violence, School Disorder, and Dropping Out, United States, 2002 [Dataset]. https://catalog.data.gov/dataset/social-bonds-across-immigration-generations-and-the-immigrant-school-enclave-a-multilevel--ac431
Dataset updated
Mar 12, 2025
Dataset provided by
National Institute of Justice (http://nij.ojp.gov/)
Area covered
United States
Description
These data are part of NACJD's Fast Track Release and are distributed as they were received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except for the removal of direct identifiers. Users should refer to the accompanying readme file for a brief description of the files available with this collection and consult the investigator(s) if further information is needed. This study consists of a secondary analysis of data from the Educational Longitudinal Study of 2002 (ELS) to investigate associations between immigration, misbehavior, victimization, disorder, and educational failure (i.e., dropping out). The study addressed six research questions. First, do school social bonds vary across immigration generations? Second, is student violence (i.e., misbehavior and victimization) explained by school social bonds across generations? Third, are student violence and school disorder related to the likelihood of dropping out for the children of immigrants? Fourth, do strong school social bonds mitigate the likelihood of dropping out for the children of immigrants? Fifth, are immigrant school enclaves associated with increased school social bonds among adolescents, decreased student violence and school disorder, and lower levels of dropping out? Sixth, does the intersection of race, ethnicity, and gender moderate the relationship between student violence and school social bonds for the children of immigrants? There are no data files available with this study. Only the syntax file used by the researcher is provided.
Audio listening time of Gen Z in the U.S. 2024, by platform
statista.com
Updated Jun 23, 2025
Cite
Statista (2025). Audio listening time of Gen Z in the U.S. 2024, by platform [Dataset]. https://www.statista.com/statistics/1541554/audio-listening-time-share-gen-z-united-states-platform/
Dataset updated
Jun 23, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2024
Area covered
United States
Description
According to the latest data gathered in the United States in 2024, teens and young adults spent most of their audio listening time with streaming music, that is, ** percent. Streaming music videos on YouTube is also a popular choice, with ** percent of audio time spent on the platform. AM/FM Radio closely followed with a share of ** percent of Gen Z audio time.
genz-slang-dataset
huggingface.co
Updated Oct 2, 2024
Cite
GMLB trio 2024 (2024). genz-slang-dataset [Dataset]. https://huggingface.co/datasets/MLBtrio/genz-slang-dataset
Dataset updated
Oct 2, 2024
Dataset authored and provided by
GMLB trio 2024
Description
Dataset Details

This dataset contains a rich collection of popular slang terms and acronyms used primarily by Generation Z. It includes detailed descriptions of each term, its context of use, and practical examples that demonstrate how the slang is used in real-life conversations. The dataset is designed to capture the unique and evolving language patterns of GenZ, reflecting their communication style in digital spaces such as social media, text messaging, and online forums. Each… See the full description on the dataset page: https://huggingface.co/datasets/MLBtrio/genz-slang-dataset.
U.S. mean disposable household income 2023, by generation
statista.com
ai-chatbox.pro
Updated May 15, 2025
Cite
Abigail Tierney (2025). U.S. mean disposable household income 2023, by generation [Dataset]. https://www.statista.com/topics/9997/generation-z-fashion-in-the-united-states/
Dataset updated
May 15, 2025
Dataset provided by
Statista (http://statista.com/)
Authors
Abigail Tierney
Description
In 2023, the disposable income of a household led by a Millennial in the United States was 97,866 U.S. dollars per year. Households led by someone born in Generation X, however, had a disposable income of around 113,886 U.S. dollars in 2023.
Favorite music genres for Gen Z in the U.S. in March 2023
statista.com
Updated Jun 23, 2025
Cite
Statista (2025). Favorite music genres for Gen Z in the U.S. in March 2023 [Dataset]. https://www.statista.com/statistics/1129893/favorite-music-genres-gen-z-united-states/
Dataset updated
Jun 23, 2025
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
Mar 2023
Area covered
United States
Description
According to data gathered in the United States in March 2023, Pop was the most popular genre for Generation Z: ** percent of Gen Z respondents named the genre among their favorites. Rap or Hip-Hop was second, mentioned by ** percent, while Rock rounded out the top three at ** percent.
gpt4all-j-prompt-generations
huggingface.co
opendatalab.com
Updated Apr 13, 2023
Cite
Nomic AI (2023). gpt4all-j-prompt-generations [Dataset]. https://huggingface.co/datasets/nomic-ai/gpt4all-j-prompt-generations
Dataset updated
Apr 13, 2023
Dataset authored and provided by
Nomic AI
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
Dataset Card for [GPT4All-J Prompt Generations]

Dataset Description

Dataset used to train GPT4All-J and GPT4All-J-LoRA. We release several versions of datasets:

v1.0: The original dataset we used to finetune GPT-J on
v1.1-breezy: A filtered dataset where we removed all instances of AI language model
v1.2-jazzy: A filtered dataset where we also removed instances like I'm sorry, I can't answer... and AI language model
v1.3-groovy: The v1.2 dataset with ShareGPT and Dolly… See the full description on the dataset page: https://huggingface.co/datasets/nomic-ai/gpt4all-j-prompt-generations.
Data from: AI-GEN
huggingface.co
Updated Aug 24, 2024
Cite
Tristan Hopkins (2024). AI-GEN [Dataset]. https://huggingface.co/datasets/tdh87/AI-GEN
Dataset updated
Aug 24, 2024
Authors
Tristan Hopkins
Description
tdh87/AI-GEN dataset hosted on Hugging Face and contributed by the HF Datasets community
deepscaler-hard-r1-qwen7b-n32
huggingface.co
Updated Apr 18, 2025
Cite
Khiem Pham (2025). deepscaler-hard-r1-qwen7b-n32 [Dataset]. https://huggingface.co/datasets/drproduck/deepscaler-hard-r1-qwen7b-n32
Dataset updated
Apr 18, 2025
Authors
Khiem Pham
Description
deepseek-r1-qwen-7b generations for deepscaler dataset

The original deepscaler dataset has been filtered:

We removed all synthetic data because their problem-answer pairs may not match.
Based on generations from qwen-7b (pre-o1), we removed problems that have 5/32 correct generations.

We then use deepseek-r1-qwen-7b to generate from this filtered dataset with num_generations=32.

We keep generations that finish. This translates to generations that have the second boxed.… See the full description on the dataset page: https://huggingface.co/datasets/drproduck/deepscaler-hard-r1-qwen7b-n32.
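
The filtering step in this description can be sketched in Python. This is a minimal illustration under stated assumptions, not the dataset author's code: the description does not say whether "5/32" is an exact count or a cutoff, so the sketch treats it as a hypothetical threshold (problems with at least 5 correct generations out of 32 are dropped). The names `keep_hard_problems`, `correct_counts`, and `threshold`, and the sample data, are invented for illustration.

```python
from typing import Dict, List


def keep_hard_problems(correct_counts: Dict[str, int], threshold: int = 5) -> List[str]:
    """Return ids of problems whose number of correct generations is below `threshold`.

    Assumption: `correct_counts` maps a problem id to how many of its 32
    sampled generations were judged correct. The cutoff of 5 is one reading
    of the "5/32" figure in the description, not a confirmed rule.
    """
    return [pid for pid, n_correct in correct_counts.items() if n_correct < threshold]


# Hypothetical per-problem counts of correct generations (out of 32).
counts = {"p1": 0, "p2": 4, "p3": 5, "p4": 32}
hard = keep_hard_problems(counts)
print(hard)
```

Exposing the cutoff as a parameter keeps the sketch usable if the intended rule turns out to be different.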