33 datasets found

legalbench
huggingface.co
Updated Aug 21, 2023
Cite
Neel Guha (2023). legalbench [Dataset]. https://huggingface.co/datasets/nguha/legalbench
Dataset updated
Aug 21, 2023
Authors
Neel Guha
License
https://choosealicense.com/licenses/other/
Description
LegalBench is a collection of benchmark tasks for evaluating legal reasoning in large language models.
legalbench
huggingface.co
opendatalab.com
Updated Sep 20, 2025
Cite
Omkar Kabde (2025). legalbench [Dataset]. https://huggingface.co/datasets/omkar334/legalbench
Dataset updated
Sep 20, 2025
Authors
Omkar Kabde
License
https://choosealicense.com/licenses/other/
Description
Dataset Card for Dataset Name

Homepage: https://hazyresearch.stanford.edu/legalbench/
Repository: https://github.com/HazyResearch/legalbench/
Paper: https://arxiv.org/abs/2308.11462

Dataset Description

Dataset Summary

The LegalBench project is an ongoing open science effort to collaboratively curate tasks for evaluating legal reasoning in English large language models (LLMs). The benchmark currently consists of 162 tasks gathered from 40… See the full description on the dataset page: https://huggingface.co/datasets/omkar334/legalbench.
legalbench-entire
huggingface.co
Updated Jun 21, 2024
Cite
Prithviraj Anil Maurya (2024). legalbench-entire [Dataset]. https://huggingface.co/datasets/prithviraj-maurya/legalbench-entire
Explore at:
Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
Dataset updated
Jun 21, 2024
Authors
Prithviraj Anil Maurya
Description
prithviraj-maurya/legalbench-entire dataset hosted on Hugging Face and contributed by the HF Datasets community
legalbench
huggingface.co
Updated Sep 29, 2025
Cite
Sarthak Jain (2025). legalbench [Dataset]. https://huggingface.co/datasets/sarthak-wiz01/legalbench
Dataset updated
Sep 29, 2025
Authors
Sarthak Jain
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Dataset Description

LegalBench is an open science effort to collaboratively curate tasks for evaluating legal reasoning in English large language models (LLMs). The benchmark consists of 162 tasks gathered from 40 contributors, covering a wide range of legal domains, task structures, and difficulty levels.

Homepage

Website:… See the full description on the dataset page: https://huggingface.co/datasets/sarthak-wiz01/legalbench.
legalbench-qa-contractnli
huggingface.co
Updated Sep 14, 2025
Cite
Antonio Menta (2025). legalbench-qa-contractnli [Dataset]. https://huggingface.co/datasets/amentaphd/legalbench-qa-contractnli
Dataset updated
Sep 14, 2025
Authors
Antonio Menta
Description
amentaphd/legalbench-qa-contractnli dataset hosted on Hugging Face and contributed by the HF Datasets community
legalbench_corporate_lobbying
huggingface.co
Updated Mar 31, 2024
Cite
Massive Text Embedding Benchmark (2024). legalbench_corporate_lobbying [Dataset]. https://huggingface.co/datasets/mteb/legalbench_corporate_lobbying
Dataset updated
Mar 31, 2024
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
LegalBenchCorporateLobbying An MTEB dataset Massive Text Embedding Benchmark

The dataset includes bill titles and bill summaries related to corporate lobbying.

Task category t2t

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench/viewer/corporate_lobbying

How to evaluate on this task

You can evaluate an embedding model on this dataset using the following code: import mteb

task =… See the full description on the dataset page: https://huggingface.co/datasets/mteb/legalbench_corporate_lobbying.
legalbench-function_of_decision_section
huggingface.co
Updated Aug 31, 2025
Cite
Manuel Berger (2025). legalbench-function_of_decision_section [Dataset]. https://huggingface.co/datasets/manuelberger/legalbench-function_of_decision_section
Dataset updated
Aug 31, 2025
Authors
Manuel Berger
Description
Dataset Card for "legalbench-function_of_decision_section"

More Information needed
ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task is a subset of ContractNLI, and consists of determining whether a clause from an NDA provides that Confidential Information may include verbally conveyed information.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an… See the full description on the dataset page: https://huggingface.co/datasets/mteb/ContractNLIInclusionOfVerballyConveyedInformationLegalBenchClassification.
legalbench_instruct
huggingface.co
Updated Mar 8, 2024
Cite
Equall (2024). legalbench_instruct [Dataset]. https://huggingface.co/datasets/Equall/legalbench_instruct
Explore at:
Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
Dataset updated
Mar 8, 2024
Dataset authored and provided by
Equall
Description
Equall/legalbench_instruct dataset hosted on Hugging Face and contributed by the HF Datasets community
CUADGoverningLawLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). CUADGoverningLawLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADGoverningLawLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
CUADGoverningLawLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task was constructed from the CUAD dataset. It consists of determining if the clause specifies which state/country’s law governs the contract.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this dataset using the following code: import mteb… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADGoverningLawLegalBenchClassification.
CUADNoSolicitOfEmployeesLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). CUADNoSolicitOfEmployeesLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADNoSolicitOfEmployeesLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
CUADNoSolicitOfEmployeesLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task was constructed from the CUAD dataset. It consists of determining if the clause restricts a party's soliciting or hiring employees and/or contractors from the counterparty, whether during the contract or after the contract ends (or both).

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADNoSolicitOfEmployeesLegalBenchClassification.
SCDBPAuditsLegalBenchClassification
huggingface.co
Updated Jun 21, 2025
Cite
Massive Text Embedding Benchmark (2025). SCDBPAuditsLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/SCDBPAuditsLegalBenchClassification
Dataset updated
Jun 21, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
SCDBPAuditsLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This is a binary classification task in which the LLM must determine if a supply chain disclosure meets the following coding criteria: 'Does the above statement disclose whether the retail seller or manufacturer performs any type of audit, or reserves the right to audit?'

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench… See the full description on the dataset page: https://huggingface.co/datasets/mteb/SCDBPAuditsLegalBenchClassification.
CUADLicenseGrantLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). CUADLicenseGrantLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADLicenseGrantLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
CUADLicenseGrantLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task was constructed from the CUAD dataset. It consists of determining if the clause contains a license granted by one party to its counterparty.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this dataset using the following code: import… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADLicenseGrantLegalBenchClassification.
ContractNLISharingWithEmployeesLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). ContractNLISharingWithEmployeesLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/ContractNLISharingWithEmployeesLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ContractNLISharingWithEmployeesLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task is a subset of ContractNLI, and consists of determining whether a clause from an NDA provides that the Receiving Party may share some Confidential Information with some of Receiving Party's employees.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You… See the full description on the dataset page: https://huggingface.co/datasets/mteb/ContractNLISharingWithEmployeesLegalBenchClassification.
PersonalJurisdictionLegalBenchClassification
huggingface.co
Updated May 7, 2025
Cite
Massive Text Embedding Benchmark (2025). PersonalJurisdictionLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/PersonalJurisdictionLegalBenchClassification
Dataset updated
May 7, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
PersonalJurisdictionLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

Given a fact pattern describing the set of contacts between a plaintiff, defendant, and forum, determine if a court in that forum could exercise personal jurisdiction over the defendant.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this… See the full description on the dataset page: https://huggingface.co/datasets/mteb/PersonalJurisdictionLegalBenchClassification.
LegalReasoningCausalityLegalBenchClassification
huggingface.co
Updated May 6, 2025
Cite
Massive Text Embedding Benchmark (2025). LegalReasoningCausalityLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/LegalReasoningCausalityLegalBenchClassification
Dataset updated
May 6, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
LegalReasoningCausalityLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

Given an excerpt from a district court opinion, classify if it relies on statistical evidence in its reasoning.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this dataset using the following code: import mteb

task =… See the full description on the dataset page: https://huggingface.co/datasets/mteb/LegalReasoningCausalityLegalBenchClassification.
Diversity2LegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). Diversity2LegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/Diversity2LegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Diversity2LegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

Given a set of facts about the citizenships of plaintiffs and defendants and the amounts associated with claims, determine if the criteria for diversity jurisdiction have been met (variant 2).

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this dataset… See the full description on the dataset page: https://huggingface.co/datasets/mteb/Diversity2LegalBenchClassification.
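
As a rough illustration of the kind of rule the Diversity2 task description refers to, the sketch below checks the two textbook criteria for federal diversity jurisdiction: complete diversity of citizenship and an amount in controversy exceeding $75,000. It is a simplified assumption, not the dataset's labeling logic: the `Party` class and both function names are hypothetical, and claim-aggregation rules are ignored by taking a single amount.

```python
from dataclasses import dataclass

# Statutory threshold: the amount in controversy must EXCEED $75,000.
AMOUNT_THRESHOLD = 75_000


@dataclass(frozen=True)
class Party:
    """Hypothetical record for a litigant; not a type from the dataset."""
    name: str
    citizenship: str


def has_complete_diversity(plaintiffs, defendants):
    """True when no plaintiff shares a citizenship with any defendant."""
    plaintiff_states = {p.citizenship for p in plaintiffs}
    defendant_states = {d.citizenship for d in defendants}
    return plaintiff_states.isdisjoint(defendant_states)


def diversity_jurisdiction(plaintiffs, defendants, amount_in_controversy):
    """Simplified check: complete diversity AND amount strictly above threshold.

    Assumption: a single amount is supplied; aggregation of multiple claims
    is deliberately left out of this sketch.
    """
    return (
        has_complete_diversity(plaintiffs, defendants)
        and amount_in_controversy > AMOUNT_THRESHOLD
    )


if __name__ == "__main__":
    alice = Party("Alice", "California")
    bob = Party("Bob", "New York")
    print(diversity_jurisdiction([alice], [bob], 80_000))
```

A claim of exactly $75,000 fails this check because the amount must exceed the threshold rather than merely meet it.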
DefinitionClassificationLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). DefinitionClassificationLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/DefinitionClassificationLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
DefinitionClassificationLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task consists of determining whether or not a sentence from a Supreme Court opinion offers a definition of a term.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this dataset using the following code: import mteb

task =… See the full description on the dataset page: https://huggingface.co/datasets/mteb/DefinitionClassificationLegalBenchClassification.
ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task is a subset of ContractNLI, and consists of determining whether a clause from an NDA provides that the Receiving Party may retain some Confidential Information even after the return or destruction of Confidential Information.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench… See the full description on the dataset page: https://huggingface.co/datasets/mteb/ContractNLIPermissiblePostAgreementPossessionLegalBenchClassification.
CUADEffectiveDateLegalBenchClassification
huggingface.co
Updated May 11, 2025
Cite
Massive Text Embedding Benchmark (2025). CUADEffectiveDateLegalBenchClassification [Dataset]. https://huggingface.co/datasets/mteb/CUADEffectiveDateLegalBenchClassification
Dataset updated
May 11, 2025
Dataset authored and provided by
Massive Text Embedding Benchmark
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
CUADEffectiveDateLegalBenchClassification An MTEB dataset Massive Text Embedding Benchmark

This task was constructed from the CUAD dataset. It consists of determining if the clause specifies the date upon which the agreement becomes effective.

Task category t2c

Domains Legal, Written

Reference https://huggingface.co/datasets/nguha/legalbench

How to evaluate on this task

You can evaluate an embedding model on this dataset using the following code: import… See the full description on the dataset page: https://huggingface.co/datasets/mteb/CUADEffectiveDateLegalBenchClassification.
