10 datasets found
  1. h

    instruction-dataset-mini-with-generations

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdoulaye Diallo, instruction-dataset-mini-with-generations [Dataset]. https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Abdoulaye Diallo
    Description

    Dataset Card for instruction-dataset-mini-with-generations

    This dataset has been created with distilabel.

      Dataset Summary
    

    This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations/raw/main/pipeline.yaml"

    or explore the configuration: distilabel pipeline info… See the full description on the dataset page: https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations.

  2. Combined Generations Wave 1 and TransPop surveys, United States, 2016-2018

    • icpsr.umich.edu
    • myumi.ch
    ascii, delimited, r +3
    Updated Aug 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meyer, Ilan H. (2024). Combined Generations Wave 1 and TransPop surveys, United States, 2016-2018 [Dataset]. http://doi.org/10.3886/ICPSR38421.v1
    Explore at:
    ascii, delimited, sas, spss, r, stataAvailable download formats
    Dataset updated
    Aug 29, 2024
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    Meyer, Ilan H.
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/38421/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/38421/terms

    Time period covered
    2016 - 2018
    Area covered
    United States
    Description

    This collection includes a combined dataset of the Generations study wave 1 (baseline) survey and the TransPop study transgender survey. The two studies have many overlapping variables, and they examined topics such as respondents' health outcomes and behaviors, experiences with discrimination, identity, and transition-related experiences. Data from these studies were merged to allow for analysis of the combined LGBT populations. This dataset has also been reweighted to be representative of these populations. The complete Generations study data (baseline, wave 2, and wave 3 survey data) can be found under study number 37166, and the complete TransPop study data (transgender and cisgender survey data) can be found under study number 37938. For detailed information on the Generations and TransPop studies, including related publications, please refer to their respective DSDR/ICPSR study pages.

  3. Data from: Social Bonds Across Immigration Generations and the Immigrant...

    • catalog.data.gov
    • s.cnmilf.com
    • +1more
    Updated Mar 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Justice (2025). Social Bonds Across Immigration Generations and the Immigrant School Enclave: A Multilevel Longitudinal Study of Student Violence, School Disorder, and Dropping Out, United States, 2002 [Dataset]. https://catalog.data.gov/dataset/social-bonds-across-immigration-generations-and-the-immigrant-school-enclave-a-multilevel--ac431
    Explore at:
    Dataset updated
    Mar 12, 2025
    Dataset provided by
    National Institute of Justicehttp://nij.ojp.gov/
    Area covered
    United States
    Description

    These data are part of NACJD's Fast Track Release and are distributed as they there received from the data depositor. The files have been zipped by NACJD for release, but not checked or processed except of the removal of direct identifiers. Users should refer to the accompany readme file for a brief description of the files available with this collections and consult the investigator(s) if further information is needed. This study consists of a secondary analysis of data from the Educational Longitudinal Study of 2002 (ELS) to investigate associations between immigration, misbehavior, victimization, disorder, and educational failure (i.e., dropping out). Six research questions that were addressed in this study include: do school social bonds vary across immigration generations? Second, is student violence (i.e., misbehavior and victimization) explained by school social bonds across generations? Third, are student violence and school disorder related to the children immigrants' likelihood of dropping out? Fourth, are strong school social bonds mitigating the likelihood of dropping out for the children of immigrants? Fifth, are immigrant school enclaves associated with increased school social bonds among adolescents, decreased student violence and school disorder, and lower levels of dropping out? Sixth, does the intersection of race, ethnicity, and gender moderate the relationship between student violence and school social bonds for the children of immigrants?There are no data files available with this study. Only the syntax file used by the researcher is provided.

  4. Audio listening time of Gen Z in the U.S. 2024, by platform

    • statista.com
    Updated Jun 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Audio listening time of Gen Z in the U.S. 2024, by platform [Dataset]. https://www.statista.com/statistics/1541554/audio-listening-time-share-gen-z-united-states-platform/
    Explore at:
    Dataset updated
    Jun 23, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2024
    Area covered
    United States
    Description

    According to the latest data gathered in the United States in 2024, teens and young adults spent most of their audio listening time with streaming music, that is, ** percent. Streaming music videos on YouTube is also a popular choice, with ** percent of audio time spent on the platform. AM/FM Radio closely followed with a share of ** percent of Gen Z audio time.

  5. h

    genz-slang-dataset

    • huggingface.co
    Updated Oct 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GMLB trio 2024 (2024). genz-slang-dataset [Dataset]. https://huggingface.co/datasets/MLBtrio/genz-slang-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 2, 2024
    Dataset authored and provided by
    GMLB trio 2024
    Description

    Dataset Details

    This dataset contains a rich collection of popular slang terms and acronyms used primarily by Generation Z. It includes detailed descriptions of each term, its context of use, and practical examples that demonstrate how the slang is used in real-life conversations. The dataset is designed to capture the unique and evolving language patterns of GenZ, reflecting their communication style in digital spaces such as social media, text messaging, and online forums. Each… See the full description on the dataset page: https://huggingface.co/datasets/MLBtrio/genz-slang-dataset.

  6. U.S. mean disposable household income 2023, by generation

    • statista.com
    • ai-chatbox.pro
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abigail Tierney (2025). U.S. mean disposable household income 2023, by generation [Dataset]. https://www.statista.com/topics/9997/generation-z-fashion-in-the-united-states/
    Explore at:
    Dataset updated
    May 15, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Abigail Tierney
    Description

    In 2023, the disposable income of a household led by a Millennial in the United States was 97,866 U.S. dollars per year. Households led by someone born in Generation X, however, had a disposable income of around 113,886 U.S. dollars in 2023.

  7. Favorite music genres for Gen Z in the U.S. in March 2023

    • statista.com
    Updated Jun 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Favorite music genres for Gen Z in the U.S. in March 2023 [Dataset]. https://www.statista.com/statistics/1129893/favorite-music-genres-gen-z-united-states/
    Explore at:
    Dataset updated
    Jun 23, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2023
    Area covered
    United States
    Description

    According to data gathered in the United States in March 2023, Pop was the most popular genre for Generation Z. ** percent of Gen Z respondents included the genre to be among their favorites. Rap or Hip-Hop was second, being mentioned by a share of ** percent, while Rock concludes the top three, reaching ** percent.

  8. h

    gpt4all-j-prompt-generations

    • huggingface.co
    • opendatalab.com
    Updated Apr 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nomic AI (2023). gpt4all-j-prompt-generations [Dataset]. https://huggingface.co/datasets/nomic-ai/gpt4all-j-prompt-generations
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 13, 2023
    Dataset authored and provided by
    Nomic AI
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for [GPT4All-J Prompt Generations]

      Dataset Description
    

    Dataset used to train GPT4All-J and GPT4All-J-LoRA We release several versions of datasets

    v1.0: The original dataset we used to finetune GPT-J on v1.1-breezy: A filtered dataset where we removed all instances of AI language model v1.2-jazzy: A filtered dataset where we also removed instances like I'm sorry, I can't answer... and AI language model v1.3-groovy: The v1.2 dataset with ShareGPT and Dolly… See the full description on the dataset page: https://huggingface.co/datasets/nomic-ai/gpt4all-j-prompt-generations.

  9. h

    Data from: AI-GEN

    • huggingface.co
    Updated Aug 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tristan Hopkins (2024). AI-GEN [Dataset]. https://huggingface.co/datasets/tdh87/AI-GEN
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 24, 2024
    Authors
    Tristan Hopkins
    Description

    tdh87/AI-GEN dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    deepscaler-hard-r1-qwen7b-n32

    • huggingface.co
    Updated Apr 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Khiem Pham (2025). deepscaler-hard-r1-qwen7b-n32 [Dataset]. https://huggingface.co/datasets/drproduck/deepscaler-hard-r1-qwen7b-n32
    Explore at:
    Dataset updated
    Apr 18, 2025
    Authors
    Khiem Pham
    Description

    deepseek-r1-qwen-7b generations for deepscaler dataset

    The original deepscaler dataset has been filtered:

    we removed all synthetic data because their problem-answer may not match. based on generations from qwen-7b (pre-o1), we removed problems that has 5/32 correct generations.

    We then use deepseek-r1-qwen-7b to generate from this filtered dataset with num_generations=32.

    We keep generations that finish. This translates to generations that have the second boxed.… See the full description on the dataset page: https://huggingface.co/datasets/drproduck/deepscaler-hard-r1-qwen7b-n32.

  11. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Abdoulaye Diallo, instruction-dataset-mini-with-generations [Dataset]. https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations

instruction-dataset-mini-with-generations

vonewman/instruction-dataset-mini-with-generations

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Abdoulaye Diallo
Description

Dataset Card for instruction-dataset-mini-with-generations

This dataset has been created with distilabel.

  Dataset Summary

This dataset contains a pipeline.yaml which can be used to reproduce the pipeline that generated it in distilabel using the distilabel CLI: distilabel pipeline run --config "https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations/raw/main/pipeline.yaml"

or explore the configuration: distilabel pipeline info… See the full description on the dataset page: https://huggingface.co/datasets/vonewman/instruction-dataset-mini-with-generations.

Search
Clear search
Close search
Google apps
Main menu