Facebook
TwitterOLMoE-1B-7B-0125-Instruct
Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:
Reused prompts from the SFT mix (ai2-adapt-dev/sft_v3.9_used_on_policy_p0_olmoe_1b-7b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmoe-0125-1b-7b-preference-mix.
Facebook
TwitterOLMo 2 0325 32B Preference Mixture
Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:
Reused prompts from the SFT mix (allenai/sft_v3.9_used_off_policy_prompts-olmo32) Reused prompts from… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-0325-32b-preference-mix.
Facebook
Twitterhttps://choosealicense.com/licenses/odc-by/https://choosealicense.com/licenses/odc-by/
OLMo 2 1124 13B Preference Mixture
Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu
Reused prompts from the SFT mix (via ai2-adapt-dev/sft_v3.9_used_on_policy_po_olmo2_13b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-1124-13b-preference-mix.
Facebook
Twitterhttps://choosealicense.com/licenses/odc-by/https://choosealicense.com/licenses/odc-by/
OLMo 2 1124 7B Preference Mixture
Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:
Reused prompts from the SFT mix (via ai2-adapt-dev/sft_v3.9_used_on_policy_po_olmo2_7b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-1124-7b-preference-mix.
Facebook
TwitterOLMo 2 0425 1B Preference Mixture
Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:
Reused prompts from the SFT mix (allenai/sft_v3.9_used_off_policy_prompts-olmo32) Reused prompts from… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmo-2-0425-1b-preference-mix.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterOLMoE-1B-7B-0125-Instruct
Note that this collection is licensed under ODC-BY-1.0 license; different licenses apply to subsets of the data. Some portions of the dataset are non-commercial. We present the mixture as a research artifact. This mix is made up of the following on-policy preference datasets generated using a synthetic data generation pipeline similar to Tulu 3:
Reused prompts from the SFT mix (ai2-adapt-dev/sft_v3.9_used_on_policy_p0_olmoe_1b-7b and… See the full description on the dataset page: https://huggingface.co/datasets/allenai/olmoe-0125-1b-7b-preference-mix.