6 datasets found

RhetoricalRole: A dataset to structure indian legal judgements
legal-nlp-ekstep.github.io
Cite
Thoughtworks, Ekstep, RhetoricalRole: A dataset to structure indian legal judgements [Dataset]. https://legal-nlp-ekstep.github.io/Competitions/Rhetorical-Role/
Dataset authored and provided by
Thoughtworks, Ekstep
Description
Predict the structure of Indian Court Judgements using sentence rhetorical roles.
Spanish Legal Domain Corpora
zenodo.org
data.niaid.nih.gov
zip
Updated Nov 4, 2022
Cite
Asier Gutiérrez-Fandiño; Jordi Armengol-Estapé; Aitor Gonzalez-Agirre; Marta Villegas (2022). Spanish Legal Domain Corpora [Dataset]. http://doi.org/10.5281/zenodo.5495529
Available download formats: zip
Unique identifier
https://doi.org/10.5281/zenodo.5495529
Dataset updated
Nov 4, 2022
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Asier Gutiérrez-Fandiño; Jordi Armengol-Estapé; Aitor Gonzalez-Agirre; Marta Villegas
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Spanish Legal Domain Corpora

A collection of corpora from the Spanish legal domain.

More legal domain resources: https://github.com/PlanTL-GOB-ES/lm-legal-es

Citation

@misc{gutierrezfandino2021legal,
  title={Spanish Legalese Language Model and Corpora},
  author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Aitor Gonzalez-Agirre and Marta Villegas},
  year={2021},
  eprint={2110.12201},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

Copyright

Copyright (c) 2021 Secretaría de Estado de Digitalización e Inteligencia Artificial
ger_da_lir
huggingface.co
Updated Feb 6, 2024
Cite
Jina AI (2024). ger_da_lir [Dataset]. https://huggingface.co/datasets/jinaai/ger_da_lir
Dataset updated
Feb 6, 2024
Dataset authored and provided by
Jina AI
Description
A legal retrieval dataset in German: https://github.com/lavis-nlp/GerDaLIR
MAUD v1
zenodo.org
data.niaid.nih.gov
zip
Updated Jul 15, 2024
Cite
The Atticus Project (2024). MAUD v1 [Dataset]. http://doi.org/10.5281/zenodo.7500064
Available download formats: zip
Unique identifier
https://doi.org/10.5281/zenodo.7500064
Dataset updated
Jul 15, 2024
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
The Atticus Project
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Merger Agreement Understanding Dataset (MAUD) v1 is a corpus of 47,000+ labels in 152 merger agreements. The agreements were manually labeled under the supervision of experienced lawyers to identify, in each agreement, 92 questions used by the 2021 American Bar Association (ABA) Public Target Deal Points Study.

MAUD is curated and maintained by The Atticus Project, Inc. to support NLP research and development in legal contract review.

ReadMe and Datasheet are published here. Code for replicating the results, together with the model trained on CUAD, is published on GitHub here.
ID_REG_MD_KG
huggingface.co
Updated Nov 4, 2008
Cite
Azzindani (2008). ID_REG_MD_KG [Dataset]. https://huggingface.co/datasets/Azzindani/ID_REG_MD_KG
Dataset updated
Nov 4, 2008
Authors
Azzindani
License
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description

🏛️ Indonesian Legal RAG Dataset with Knowledge Graph Enhancement

📖 What is this dataset?

This dataset contains Indonesian legal documents enhanced with Knowledge Graph features for building better RAG (Retrieval-Augmented Generation) systems. It includes regulations, laws, and legal documents from Indonesia with smart scoring and relationship mapping.

🎯 Perfect for: Legal AI, Indonesian NLP, RAG systems, legal research, and chatbots

✨ Key… See the full description on the dataset page: https://huggingface.co/datasets/Azzindani/ID_REG_MD_KG.
ID_REG_MD_Embed
huggingface.co
Cite
Azzindani, ID_REG_MD_Embed [Dataset]. https://huggingface.co/datasets/Azzindani/ID_REG_MD_Embed
Authors
Azzindani
License
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description

🏛️ Indonesian Legal RAG Dataset with Embeddings & TF-IDF Vectors

📖 What is this dataset?

This dataset contains Indonesian legal documents with pre-computed embeddings and TF-IDF vectors for building RAG (Retrieval-Augmented Generation) systems. It includes regulations, laws, and legal documents from Indonesia with ready-to-use vector representations.

🎯 Perfect for: Legal AI, Indonesian NLP, RAG systems, semantic search, and legal chatbots…

See the full description on the dataset page: https://huggingface.co/datasets/Azzindani/ID_REG_MD_Embed.
RhetoricalRole: A dataset to structure indian legal judgements


3 scholarly articles cite this dataset (View in Google Scholar)
