33 datasets found
  1. h

    legalbench

    • huggingface.co
    Updated Aug 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neel Guha (2023). legalbench [Dataset]. https://huggingface.co/datasets/nguha/legalbench
    Explore at:
    Dataset updated
    Aug 21, 2023
    Authors
    Neel Guha
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    LegalBench is a collection of benchmark tasks for evaluating legal reasoning in large language models.

  2. h

    legalbench

    • huggingface.co
    • opendatalab.com
    Updated Sep 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Omkar Kabde (2025). legalbench [Dataset]. https://huggingface.co/datasets/omkar334/legalbench
    Explore at:
    Dataset updated
    Sep 20, 2025
    Authors
    Omkar Kabde
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    Dataset Card for Dataset Name

    Homepage: https://hazyresearch.stanford.edu/legalbench/ Repository: https://github.com/HazyResearch/legalbench/ Paper: https://arxiv.org/abs/2308.11462

      Dataset Description
    
    
    
    
    
    
    
      Dataset Summary
    

    The LegalBench project is an ongoing open science effort to collaboratively curate tasks for evaluating legal reasoning in English large language models (LLMs). The benchmark currently consists of 162 tasks gathered from 40… See the full description on the dataset page: https://huggingface.co/datasets/omkar334/legalbench.

  3. h

    legalbench-entire

    • huggingface.co
    Updated Jun 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prithviraj Anil Maurya (2024). legalbench-entire [Dataset]. https://huggingface.co/datasets/prithviraj-maurya/legalbench-entire
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 21, 2024
    Authors
    Prithviraj Anil Maurya
    Description

    prithviraj-maurya/legalbench-entire dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    legalbench

    • huggingface.co
    Updated Sep 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarthak Jain (2025). legalbench [Dataset]. https://huggingface.co/datasets/sarthak-wiz01/legalbench
    Explore at:
    Dataset updated
    Sep 29, 2025
    Authors
    Sarthak Jain
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

      Dataset Description
    

    LegalBench is an open science effort to collaboratively curate tasks for evaluating legal reasoning in English large language models (LLMs). The benchmark consists of 162 tasks gathered from 40 contributors, covering a wide range of legal domains, task structures, and difficulty levels.

      Homepage
    

    Website:… See the full description on the dataset page: https://huggingface.co/datasets/sarthak-wiz01/legalbench.

  5. h

    legalbench-qa-contractnli

    • huggingface.co
    Updated Sep 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Antonio Menta (2025). legalbench-qa-contractnli [Dataset]. https://huggingface.co/datasets/amentaphd/legalbench-qa-contractnli
    Explore at:
    Dataset updated
    Sep 14, 2025
    Authors
    Antonio Menta
    Description

    amentaphd/legalbench-qa-contractnli dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    legalbench_corporate_lobbying

    • huggingface.co
    Updated Mar 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2024). legalbench_corporate_lobbying [Dataset]. https://huggingface.co/datasets/mteb/legalbench_corporate_lobbying
    Explore at:
    Dataset updated
    Mar 31, 2024
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    LegalBenchCorporateLobbying An MTEB dataset Massive Text Embedding Benchmark

    The dataset includes bill titles and bill summaries related to corporate lobbying.

    Task category t2t

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench/viewer/corporate_lobbying

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset using the following code: import mteb

    task =… See the full description on the dataset page: https://huggingface.co/datasets/mteb/legalbench_corporate_lobbying.

  7. h

    legalbench-function_of_decision_section

    • huggingface.co
    Updated Aug 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuel Berger (2025). legalbench-function_of_decision_section [Dataset]. https://huggingface.co/datasets/manuelberger/legalbench-function_of_decision_section
    Explore at:
    Dataset updated
    Aug 31, 2025
    Authors
    Manuel Berger
    Description

    Dataset Card for "legalbench-function_of_decision_section"

    More Information needed

  8. h

    ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task is a subset of ContractNLI, and consists of determining whether a clause from an NDA clause provides that Confidential Information may include verbally conveyed information.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an… See the full description on the dataset page: https://huggingface.co/datasets/mteb/ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification.

  9. h

    legalbench_instruct

    • huggingface.co
    Updated Mar 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Equall (2024). legalbench_instruct [Dataset]. https://huggingface.co/datasets/Equall/legalbench_instruct
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 8, 2024
    Dataset authored and provided by
    Equall
    Description

    Equall/legalbench_instruct dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    CUADGoverningLawLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). CUADGoverningLawLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADGoverningLawLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CUADGoverningLawLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task was constructed from the CUAD dataset. It consists of determining if the clause specifies which state/country’s law governs the contract.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset using the following code: import mteb… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADGoverningLawLegalBenchClassification.

  11. h

    CUADNoSolicitOfEmployeesLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). CUADNoSolicitOfEmployeesLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADNoSolicitOfEmployeesLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CUADNoSolicitOfEmployeesLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task was constructed from the CUAD dataset. It consists of determining if the clause restricts a party's soliciting or hiring employees and/or contractors from the counterparty, whether during the contract or after the contract ends (or both).

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADNoSolicitOfEmployeesLegalBenchClassification.
    
  12. h

    SCDBPAuditsLegalBenchClassification

    • huggingface.co
    Updated Jun 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). SCDBPAuditsLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/SCDBPAuditsLegalBenchClassification
    Explore at:
    Dataset updated
    Jun 21, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SCDBPAuditsLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This is a binary classification task in which the LLM must determine if a supply chain disclosure meets the following coding criteria: 'Does the above statement disclose whether the retail seller or manufacturer performs any type of audit, or reserves the right to audit?'

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench… See the full description on the dataset page: https://huggingface.co/datasets/mteb/SCDBPAuditsLegalBenchClassification.

  13. h

    CUADLicenseGrantLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). CUADLicenseGrantLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADLicenseGrantLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CUADLicenseGrantLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task was constructed from the CUAD dataset. It consists of determining if the clause contains a license granted by one party to its counterparty.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset using the following code: import… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADLicenseGrantLegalBenchClassification.

  14. h

    ContractNLISharingWithEmployeesLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). ContractNLISharingWithEmployeesLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/ContractNLISharingWithEmployeesLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ContractNLISharingWithEmployeesLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task is a subset of ContractNLI, and consists of determining whether a clause from an NDA clause provides that the Receiving Party may share some Confidential Information with some of Receiving Party's employees.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You… See the full description on the dataset page: https://huggingface.co/datasets/mteb/ContractNLISharingWithEmployeesLegalBenchClassification.

  15. h

    PersonalJurisdictionLegalBenchClassification

    • huggingface.co
    Updated May 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). PersonalJurisdictionLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/PersonalJurisdictionLegalBenchClassification
    Explore at:
    Dataset updated
    May 7, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PersonalJurisdictionLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    Given a fact pattern describing the set of contacts between a plaintiff, defendant, and forum, determine if a court in that forum could excercise personal jurisdiction over the defendant.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this… See the full description on the dataset page: https://huggingface.co/datasets/mteb/PersonalJurisdictionLegalBenchClassification.

  16. h

    LegalReasoningCausalityLegalBenchClassification

    • huggingface.co
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). LegalReasoningCausalityLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/LegalReasoningCausalityLegalBenchClassification
    Explore at:
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    LegalReasoningCausalityLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    Given an excerpt from a district court opinion, classify if it relies on statistical evidence in its reasoning.

    Task category t2c

    Domains Legal, Written Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset using the following code: import mteb

    task =… See the full description on the dataset page: https://huggingface.co/datasets/mteb/LegalReasoningCausalityLegalBenchClassification.

  17. h

    Diversity2LegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). Diversity2LegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/Diversity2LegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diversity2LegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    Given a set of facts about the citizenships of plaintiffs and defendants and the amounts associated with claims, determine if the criteria for diversity jurisdiction have been met (variant 2).

    Task categoryt2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset… See the full description on the dataset page: https://huggingface.co/datasets/mteb/Diversity2LegalBenchClassification.

  18. h

    DefinitionClassificationLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). DefinitionClassificationLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/DefinitionClassificationLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    DefinitionClassificationLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task consists of determining whether or not a sentence from a Supreme Court opinion offers a definition of a term.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset using the following code: import mteb

    task =… See the full description on the dataset page: https://huggingface.co/datasets/mteb/DefinitionClassificationLegalBenchClassification.

  19. h

    ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task is a subset of ContractNLI, and consists of determining whether a clause from an NDA clause provides that the Receiving Party may retain some Confidential Information even after the return or destruction of Confidential Information.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench… See the full description on the dataset page: https://huggingface.co/datasets/mteb/ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification.

  20. h

    CUADEffectiveDateLegalBenchClassification

    • huggingface.co
    Updated May 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massive Text Embedding Benchmark (2025). CUADEffectiveDateLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADEffectiveDateLegalBenchClassification
    Explore at:
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Massive Text Embedding Benchmark
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CUADEffectiveDateLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

    This task was constructed from the CUAD dataset. It consists of determining if the clause specifies the date upon which the agreement becomes effective.

    Task category t2c

    Domains Legal, Written

    Reference https://huggingface.co/datasets/nguha/legalbench

      How to evaluate on this task
    

    You can evaluate an embedding model on this dataset using the following code: import… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADEffectiveDateLegalBenchClassification.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Neel Guha (2023). legalbench [Dataset]. https://huggingface.co/datasets/nguha/legalbench

legalbench

nguha/legalbench

Explore at:
Dataset updated
Aug 21, 2023
Authors
Neel Guha
License

https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

Description

LegalBench is a collection of benchmark tasks for evaluating legal reasoning in large language models.

Search
Clear search
Close search
Google apps
Main menu