5 datasets found
  1. olmoe-0125-1b-7b-preference-mix

    • huggingface.co
    Updated Jan 31, 2025
    Cite: Ai2 (2025). olmoe-0125-1b-7b-preference-mix [Dataset]. https://huggingface.co/datasets/allenai/olmoe-0125-1b-7b-preference-mix
    Explore at: Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated: Jan 31, 2025
    Dataset provided by: Allen Institute for AI (http://allenai.org/)
    Authors: Ai2
    Description

    OLMoE-1B-7B-0125-Instruct

    Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:

    Reused prompts from the SFT mix (ai2-adapt-dev/sft_v3.9_used_on_policy_p0_olmoe_1b-7b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmoe-0125-1b-7b-preference-mix.

  2. olmo-2-0325-32b-preference-mix

    • huggingface.co
    Updated Mar 14, 2025
    Cite: Ai2 (2025). olmo-2-0325-32b-preference-mix [Dataset]. https://huggingface.co/datasets/allenai/olmo-2-0325-32b-preference-mix
    Dataset updated: Mar 14, 2025
    Dataset provided by: Allen Institute for AI (http://allenai.org/)
    Authors: Ai2
    Description

    OLMo 2 0325 32B Preference Mixture

    Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:

    Reused prompts from the SFT mix (allenai/sft_v3.9_used_off_policy_prompts-olmo32) Reused prompts from… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-0325-32b-preference-mix.

  3. olmo-2-1124-13b-preference-mix

    • huggingface.co
    Updated Nov 26, 2024
    Cite: Ai2 (2024). olmo-2-1124-13b-preference-mix [Dataset]. https://huggingface.co/datasets/allenai/olmo-2-1124-13b-preference-mix
    Explore at: Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated: Nov 26, 2024
    Dataset provided by: Allen Institute for AI (http://allenai.org/)
    Authors: Ai2
    License: https://choosealicense.com/licenses/odc-by/

    Description

    OLMo 2 1124 13B Preference Mixture

    Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:

    Reused prompts from the SFT mix (via ai2-adapt-dev/sft_v3.9_used_on_policy_po_olmo2_13b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-1124-13b-preference-mix.

  4. olmo-2-1124-7b-preference-mix

    • huggingface.co
    Updated Nov 26, 2024
    Cite: Ai2 (2024). olmo-2-1124-7b-preference-mix [Dataset]. https://huggingface.co/datasets/allenai/olmo-2-1124-7b-preference-mix
    Explore at: Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated: Nov 26, 2024
    Dataset provided by: Allen Institute for AI (http://allenai.org/)
    Authors: Ai2
    License: https://choosealicense.com/licenses/odc-by/

    Description

    OLMo 2 1124 7B Preference Mixture

    Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:

    Reused prompts from the SFT mix (via ai2-adapt-dev/sft_v3.9_used_on_policy_po_olmo2_7b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-1124-7b-preference-mix.

  5. olmo-2-0425-1b-preference-mix

    • huggingface.co
    Updated Apr 30, 2025
    Cite: Ai2 (2025). olmo-2-0425-1b-preference-mix [Dataset]. https://huggingface.co/datasets/allenai/olmo-2-0425-1b-preference-mix
    Dataset updated: Apr 30, 2025
    Dataset provided by: Allen Institute for AI (http://allenai.org/)
    Authors: Ai2
    Description

    OLMo 2 0425 1B Preference Mixture

    Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:

    Reused prompts from the SFT mix (allenai/sft_v3.9_used_off_policy_prompts-olmo32) Reused prompts from… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-0425-1b-preference-mix.

