100+ datasets found
  1. MagpieLM-DPO-Data-v0.1

    • huggingface.co
    Updated Sep 18, 2024
    Cite
    Magpie Alignment (2024). MagpieLM-DPO-Data-v0.1 [Dataset]. https://huggingface.co/datasets/Magpie-Align/MagpieLM-DPO-Data-v0.1
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Sep 18, 2024
    Dataset authored and provided by
    Magpie Alignment
    License

    https://choosealicense.com/licenses/llama3.1/

    Description

    Project Web: https://magpie-align.github.io/
    Arxiv Technical Report: https://arxiv.org/abs/2406.08464
    Codes: https://github.com/magpie-align/magpie

      🧐 Dataset Details
    

    The Magpie Team generates this dataset for direct preference optimization. This dataset was used to train Magpie-Align/MagpieLM-4B-Chat-v0.1. This dataset is a combination of two datasets:

    Half of the dataset (100K) is from Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1. Another half of the dataset (100K) uses… See the full description on the dataset page: https://huggingface.co/datasets/Magpie-Align/MagpieLM-DPO-Data-v0.1.

  2. Bodies having appointed a Data Protection Officer (DPO/DPO)

    • data.europa.eu
    csv, excel xlsx
    Updated Jun 3, 2025
    Cite
    CNIL (2025). Bodies having appointed a Data Protection Officer (DPO/DPO) [Dataset]. https://data.europa.eu/data/datasets/5c926a7a634f410578005c68
    Available download formats: excel xlsx (17482080), csv (31863536)
    Dataset updated
    Jun 3, 2025
    Dataset provided by
    National Commission on Informatics and Liberty
    Authors
    CNIL
    License

    https://www.etalab.gouv.fr/licence-ouverte-open-licence

    Description

    The General Data Protection Regulation (GDPR) provides, since 25 May 2018, for the mandatory designation of a Data Protection Officer (DPO) in public services and, under certain conditions, by companies and associations.

    The delegate, also known as the Data Protection Officer (DPO), is responsible for ensuring that the processing of personal data by the body that designated him or her complies with the GDPR. Whether internal or external, the delegate may also be appointed on behalf of several bodies.

    To carry out his or her tasks effectively, the delegate must:

    - have specific professional qualities and knowledge;
    - benefit from the material and organisational resources and the positioning needed to carry out those tasks effectively and independently.

    To learn more about the role of delegate: https://www.cnil.fr/fr/devenir-delegue-la-protection-des-donnees.

    In accordance with the applicable texts, the CNIL shall publish in an open and easily reusable format the name and professional contact details of the bodies that have appointed a Data Protection Officer, as well as the means of contacting the Data Protection Officer.

    **Warning 1:** The published data, including the public contact details of delegates, are extracted from the designations of delegates as received by the CNIL via its dedicated teleservice. Any delegate may request the modification of the contact details published directly to the CNIL’s Data Protection Officers Service.

    **Warning 2:** Any re-use of published data which would have the nature of personal data (telephone number, e-mail address, etc.) presupposes, on the part of the re-user, verification of the full fulfilment of his/her obligations under the GDPR, in particular in terms of informing the delegates concerned and respecting their other rights as defined by the European Regulation. Otherwise, the re-user would in particular be exposed to the penalties provided for in the GDPR.

  3. DPO as a Service Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jun 17, 2025
    + more versions
    Cite
    Data Insights Market (2025). DPO as a Service Report [Dataset]. https://www.datainsightsmarket.com/reports/dpo-as-a-service-522987
    Available download formats: pdf, doc, ppt
    Dataset updated
    Jun 17, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Data Protection Officer (DPO) as a Service market is experiencing robust growth, driven by increasing data privacy regulations like GDPR and CCPA, and the rising complexity of managing data compliance across diverse geographical locations and business operations. The market, estimated at $500 million in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033. This growth is fueled by a growing awareness of data breaches and associated financial and reputational risks, prompting organizations, particularly SMEs lacking internal expertise, to outsource DPO responsibilities. Key market drivers include the escalating volume of sensitive data, the expanding scope of data protection laws, and the increasing demand for specialized DPO services, such as data mapping, breach response planning, and employee training. Several trends are shaping the market landscape. The increasing adoption of cloud-based DPO solutions enhances accessibility and scalability. Furthermore, the integration of artificial intelligence (AI) and machine learning (ML) technologies within DPO services promises to improve efficiency and accuracy in data protection activities. However, challenges remain, including concerns around data security and confidentiality when outsourcing sensitive information and the variability in service offerings across providers. Despite these restraints, the ongoing expansion of data privacy legislation globally, coupled with the escalating demand for specialized expertise, ensures sustained market expansion throughout the forecast period. Major players like Deloitte, PwC, and KPMG are leveraging their existing consulting expertise to gain a significant foothold, while specialized DPO-as-a-service providers are focusing on niche solutions and innovative technological integrations to stand out.
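As a rough check on the figures quoted above, the following sketch compounds the 2025 estimate at the stated CAGR. It assumes eight annual compounding periods between 2025 and 2033, which the description does not state explicitly; the function and constant names are illustrative only, and the result is an implied figure rather than one published in the report.

```python
def project_value(base, annual_rate, years):
    """Compound a base value at a fixed annual growth rate."""
    return base * (1 + annual_rate) ** years

BASE_2025_MUSD = 500.0   # 2025 estimate from the description, in millions of USD
CAGR = 0.15              # 15% compound annual growth rate from the description
YEARS = 2033 - 2025      # assumption: eight compounding periods

projected_2033 = project_value(BASE_2025_MUSD, CAGR, YEARS)
print(f"Implied 2033 market size: ${projected_2033:,.0f} million")
```

Under these assumptions the implied 2033 market size is roughly $1.53 billion, about three times the 2025 estimate.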

  4. Italy: share of companies with Data Protection Officer 2018, by industry

    • statista.com
    Updated Jan 9, 2024
    Cite
    Statista (2024). Italy: share of companies with Data Protection Officer 2018, by industry [Dataset]. https://www.statista.com/statistics/998633/share-of-companies-with-dpo-by-industry-in-italy/
    Dataset updated
    Jan 9, 2024
    Dataset authored and provided by
    Statista (http://statista.com/)
    Time period covered
    2018
    Area covered
    Italy
    Description

    The chart shows the share of companies with a DPO (Data Protection Officer) in Italy in 2018, broken down by industrial sector. As the graph highlights, Data Protection Officers were most common in the banking and insurance sectors.

  5. dpo-training-data

    • huggingface.co
    Updated Aug 15, 2007
    Cite
    Zeynep (2007). dpo-training-data [Dataset]. https://huggingface.co/datasets/Tandogan/dpo-training-data
    Dataset updated
    Aug 15, 2007
    Authors
    Zeynep
    Description

    Tandogan/dpo-training-data dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Share of Italian companies with Data Protection Officer 2017-2019

    • statista.com
    Updated Feb 7, 2023
    Cite
    Statista (2023). Share of Italian companies with Data Protection Officer 2017-2019 [Dataset]. https://www.statista.com/statistics/998439/share-of-companies-with-data-protection-officer-in-italy/
    Dataset updated
    Feb 7, 2023
    Dataset authored and provided by
    Statista (http://statista.com/)
    Area covered
    Italy
    Description

    The timeline shows the share of companies with a DPO (Data Protection Officer) in Italy from 2017 to 2019. As the graph highlights, the share of companies with a DPO increased significantly. Meanwhile, the share of companies with no interest at all in the role of data protection officer decreased from 15 to 10 percent.

  7. DPO-Data-Mistral-Ours

    • huggingface.co
    Updated Aug 11, 2024
    + more versions
    Cite
    DPO (2024). DPO-Data-Mistral-Ours [Dataset]. https://huggingface.co/datasets/StepControlled/DPO-Data-Mistral-Ours
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Aug 11, 2024
    Authors
    DPO
    Description

    StepControlled/DPO-Data-Mistral-Ours dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. LLM-QE-DPO-Training-Data

    • huggingface.co
    Updated Mar 12, 2025
    Cite
    chengpingan (2025). LLM-QE-DPO-Training-Data [Dataset]. https://huggingface.co/datasets/chengpingan/LLM-QE-DPO-Training-Data
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Mar 12, 2025
    Authors
    chengpingan
    Description

    chengpingan/LLM-QE-DPO-Training-Data dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. Programmes in Promoting the Adoption of Digital Technology among the Elderly...

    • data.gov.hk
    Cite
    data.gov.hk, Programmes in Promoting the Adoption of Digital Technology among the Elderly by DPO [Dataset]. https://data.gov.hk/en-data/dataset/hk-dpo-dpo_hp-encouraging-ict-adoption-among-the-elderly
    Dataset provided by
    data.gov.hk
    Description

    List of programmes funded by the DPO to promote the adoption of digital technology among the elderly. Over the years, the DPO has been striving to implement various programmes that encourage the elderly to use digital technology to improve their quality of life and stay connected to the community and family.

  10. ChemPref-DPO-for-Chemistry-data-en

    • huggingface.co
    Updated Apr 11, 2024
    + more versions
    Cite
    AI4Chem (2024). ChemPref-DPO-for-Chemistry-data-en [Dataset]. https://huggingface.co/datasets/AI4Chem/ChemPref-DPO-for-Chemistry-data-en
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Apr 11, 2024
    Dataset authored and provided by
    AI4Chem
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Citation

    @misc{zhang2024chemllm,
      title={ChemLLM: A Chemical Large Language Model},
      author={Di Zhang and Wei Liu and Qian Tan and Jingdan Chen and Hang Yan and Yuliang Yan and Jiatong Li and Weiran Huang and Xiangyu Yue and Dongzhan Zhou and Shufei Zhang and Mao Su and Hansen Zhong and Yuqiang Li and Wanli Ouyang},
      year={2024},
      eprint={2402.06852},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
    }

  11. SPADATAS DPO Questionnaire

    • explore.openaire.eu
    Updated May 19, 2023
    + more versions
    Cite
    SPADATAS (2023). SPADATAS DPO Questionnaire [Dataset]. http://doi.org/10.5281/zenodo.15014992
    Dataset updated
    May 19, 2023
    Authors
    SPADATAS
    Description

    People in educational roles need to acquire the digital skills and competencies set out in DigCompEdu regarding data literacy and the responsible use of digital competence. With this tool, schools can analyze the digital skills and data literacy competencies that help protect the privacy and security of students' data, as well as map their academic data management processes to detect gaps and foster updates to their data treatment policies. The questionnaire is available in English, Croatian, Portuguese, Slovenian, and Spanish.

  12. DPO

    • webbook.nist.gov
    Updated Mar 21, 2018
    + more versions
    Cite
    National Institute of Standards and Technology (2018). DPO [Dataset]. https://webbook.nist.gov/cgi/formula?ID=B1001987
    Dataset updated
    Mar 21, 2018
    Dataset provided by
    National Institute of Standards and Technology (http://www.nist.gov/)
    License

    https://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRD

    Description

    This page, "DPO", is part of the NIST Chemistry WebBook. This site and its contents are part of the NIST Standard Reference Data Program.

  13. IĮ "DPO" - turnover, revenue, profit | Okredo

    • okredo.com
    Updated Jul 11, 2025
    Cite
    Okredo (2025). IĮ "DPO" - turnover, revenue, profit | Okredo [Dataset]. https://okredo.com/en-lt/company/ii-dpo-300538728/finance
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    Okredo
    License

    https://okredo.com/en-lt/general-rules

    Time period covered
    2022 - 2024
    Area covered
    Lithuania
    Description

    IĮ "DPO" financial data: profit, annual turnover, paid taxes, sales revenue, equity, assets (long-term and short-term), profitability indicators.

  14. MB "Dpo" - turnover, revenue, profit | Okredo

    • okredo.com
    Updated Jun 23, 2025
    + more versions
    Cite
    Okredo (2025). MB "Dpo" - turnover, revenue, profit | Okredo [Dataset]. https://okredo.com/en-lt/company/mb-dpo-307090900/finance
    Dataset updated
    Jun 23, 2025
    Dataset authored and provided by
    Okredo
    License

    https://okredo.com/en-lt/general-rules

    Time period covered
    2022 - 2024
    Area covered
    Lithuania
    Description

    MB "Dpo" financial data: profit, annual turnover, paid taxes, sales revenue, equity, assets (long-term and short-term), profitability indicators.

  15. Ultrafeedback Binarized

    • kaggle.com
    • opendatalab.com
    • +1more
    Updated Nov 23, 2023
    Cite
    The Devastator (2023). Ultrafeedback Binarized [Dataset]. https://www.kaggle.com/datasets/thedevastator/ultra-fine-binary-preference-learning/suggestions?status=pending&yourSuggestions=true
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Nov 23, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Ultrafeedback Binarized

    Predicting Binary Preferences with SFT, PPO and DPO

    By Huggingface Hub [source]

    About this dataset

    This dataset contains data for ultra-fine-grained binary preference learning tasks. It features three distinct datasets: SFT, PPO, and DPO. These datasets provide insight into user preferences via prompts, chosen and rejected messages, and scores assigned to each option. The data is well suited to analyzing user sentiment towards different input prompts and which responses users find more desirable or satisfying. Such analysis can offer a deeper understanding of how people think, which can improve applications that rely on artificial intelligence, such as recommendation systems or automated customer service programs. A better understanding of human decision-making processes in turn supports the development of more interpretable machine learning models that operate closer to our own logic.

    How to use the dataset

    This dataset can be used to train and evaluate models for ultra-fine-grained binary preference learning tasks. The data is organized into three files: SFT, PPO, and DPO. Each file contains a series of prompts, chosen and rejected messages, and scores for each option. With this data, you can train a model that can predict user preferences consistently and accurately across multiple settings.

    Here are the steps to work with this dataset:

    1. Read through the prompts in each file and understand what the task is asking of the user.
    2. Review both the chosen and rejected messages, together with their accompanying scores, to understand how they influence or are influenced by other factors such as emotion or sentiment.
    3. Using your understanding of the task from steps 1 and 2, create a model that accurately predicts user preference for any pair of options given in an ultra-fine-grained binary preference learning task (SFT, PPO or DPO).
    4. Validate your model against other predictions using unseen data from all three files (SFT, PPO and DPO). This will help you determine whether your model accurately predicts user preferences in different contexts.

    With these steps you should have an understanding of how best to use this dataset in order to build models which reliably predict how users will respond when presented with a choice between two options in an ultra-fine grained binary preference learning scenario!

    Research Ideas

    • Training a model or algorithm based on machine learning and natural language processing methods to determine user preferences between ultra-fine-grained options.
    • Developing a supervised learning algorithm that uses the information from the prompt, chosen option, rejected option, message and score variables to identify factors that influence user preference selection for ultra-fine-grained tasks.
    • Utilizing policy-optimization methods such as PPO (Proximal Policy Optimization) or DPO (Direct Preference Optimization) to create policies for effectively selecting between ultra-fine-grained options in different domains, via interactive experiments with real user data collected from this dataset

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

    Columns

    File: test_sft.csv

    | Column name    | Description                                           |
    |:---------------|:------------------------------------------------------|
    | prompt         | The prompt that was given to the user. (String)       |
    | chosen         | The message that the user chose. (String)             |
    | rejected       | The message that the user rejected. (String)          |
    | messages       | The messages that were presented to the user. (List)  |
    | score_chosen   | The score assigned to the chosen message. (Integer)   |
    | score_rejected | The score assigned to the rejected message. (Integer) |
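The column layout above can be read with Python's standard `csv` module. The sketch below is illustrative only: the two sample rows are invented rather than taken from the dataset, the `messages` column is left out for brevity, and the helper names `load_preference_pairs` and `score_margin` are hypothetical.

```python
import csv
import io

# Hypothetical two-row sample using the column names listed for test_sft.csv.
# The values are invented for illustration and are not taken from the dataset.
SAMPLE_CSV = """prompt,chosen,rejected,score_chosen,score_rejected
What is 2+2?,4,5,9,2
Name a primary color.,Red,Purple,8,8
"""

def load_preference_pairs(text):
    """Parse CSV text into a list of dicts, converting the scores to integers."""
    pairs = []
    for row in csv.DictReader(io.StringIO(text)):
        row["score_chosen"] = int(row["score_chosen"])
        row["score_rejected"] = int(row["score_rejected"])
        pairs.append(row)
    return pairs

def score_margin(pair):
    """Return how much higher the chosen message scored than the rejected one."""
    return pair["score_chosen"] - pair["score_rejected"]

pairs = load_preference_pairs(SAMPLE_CSV)
margins = [score_margin(p) for p in pairs]
print(margins)
```

A margin of zero marks a pair whose two messages received the same score, which may be worth filtering out before training a preference model.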

    File: train_sft.csv | Column name | Description ...

  16. Data Protection Service Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Feb 9, 2025
    Cite
    Archive Market Research (2025). Data Protection Service Report [Dataset]. https://www.archivemarketresearch.com/reports/data-protection-service-14668
    Available download formats: ppt, doc, pdf
    Dataset updated
    Feb 9, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Market Analysis for Data Protection Services

    The global data protection services market is expected to reach $128,110 million by 2033, growing at a CAGR of 18.0% during the forecast period. The increasing adoption of digital technologies, the rise of remote work, and the stringent regulatory landscape are driving the demand for these services. The market is segmented into services (Designated Data Protection Officer (DPO), Outsourced Data Subject Access (SAR) Services), applications (SMEs, large enterprises), and regions (North America, Europe, Asia Pacific, Middle East & Africa, South America). Leading companies in the market include EY, IBM, PwC, Deloitte, and Data Privacy and Security Services Ltd.

    The growth of the data protection services market is primarily driven by the increasing data breaches and cyber threats and the adoption of cloud-based services and the Internet of Things (IoT). The increasing awareness of data privacy and security regulations is also contributing to the market's growth. The market is expected to witness significant growth in emerging economies as governments and businesses focus on implementing data protection and privacy regulations. The adoption of artificial intelligence (AI) and machine learning (ML) technologies is expected to further enhance the efficiency and effectiveness of data protection services.


  17. Global DPO as a Service Market Business Opportunities 2025-2032

    • statsndata.org
    excel, pdf
    Updated Jun 2025
    Cite
    Stats N Data (2025). Global DPO as a Service Market Business Opportunities 2025-2032 [Dataset]. https://www.statsndata.org/report/dpo-as-a-service-market-284588
    Available download formats: pdf, excel
    Dataset updated
    Jun 2025
    Dataset authored and provided by
    Stats N Data
    License

    https://www.statsndata.org/how-to-order

    Area covered
    Global
    Description

    The DPO as a Service market is rapidly evolving, emerging as a vital solution for organizations navigating the complexities of data protection and privacy compliance. As businesses continue to grapple with stringent regulations like the GDPR and CCPA, outsourcing the role of a Data Protection Officer (DPO) has gaine

  18. Dpo Import Data in August - Seair.co.in

    • seair.co.in
    + more versions
    Cite
    Seair Exim, Dpo Import Data in August - Seair.co.in [Dataset]. https://www.seair.co.in
    Available download formats: .bin, .xml, .csv, .xls
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    Afghanistan, Belgium, French Southern Territories, Malta, Nepal, Marshall Islands, Egypt, Macedonia (the former Yugoslav Republic of), Denmark, Kuwait
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  19. Dpo Import Data in November - Seair.co.in

    • seair.co.in
    Updated Nov 22, 2016
    + more versions
    Cite
    Seair Exim (2016). Dpo Import Data in November - Seair.co.in [Dataset]. https://www.seair.co.in
    Available download formats: .bin, .xml, .csv, .xls
    Dataset updated
    Nov 22, 2016
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    Sudan, Isle of Man, Lesotho, Norway, Bouvet Island, Cyprus, Curaçao, Ukraine, United States Minor Outlying Islands, Kuwait
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  20. List of new datasets in Consolidated Annual Open Data Plans of Bureaux and...

    • data.gov.hk
    Cite
    data.gov.hk, List of new datasets in Consolidated Annual Open Data Plans of Bureaux and Departments [Dataset]. https://data.gov.hk/en-data/dataset/hk-dpo-datagovhk1-aodp-new-dataset-list
    Dataset provided by
    data.gov.hk
    Description

    The dataset provides a consolidated list of new datasets planned to be opened up according to the Consolidated Annual Open Data Plans of Bureaux and Departments.
