Predict the stucture of Indian Court Judgements using sentence rhetorical roles.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Spanish Legal Domain Corpora
A collection of corpora of Spanish legal domain.
More legal domain resources: https://github.com/PlanTL-GOB-ES/lm-legal-es
Citation
@misc{gutierrezfandino2021legal,
title={Spanish Legalese Language Model and Corpora},
author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Aitor Gonzalez-Agirre and Marta Villegas},
year={2021},
eprint={2110.12201},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Copyright
Copyright (c) 2021 Secretaría de Estado de Digitalización e Inteligencia Artificial
A legal retrieval dataset in Germnan https://github.com/lavis-nlp/GerDaLIR
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Merger Agreement Understanding Dataset (MAUD) v1 is a corpus of 47,000+ labels in 152 merger agreements that have been manually labeled under the supervision of experienced lawyers to identify 92 questions in each agreement used by the 2021 American Bar Association (ABA) Public Target Deal Points Study.
MAUD is curated and maintained by The Atticus Project, Inc. to support NLP research and development in legal contract review.
ReadMe and Datasheet are published here. Code for replicating the results, together with the model trained on CUAD, is published on Github here.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
📧 Github 🔗 LinkedIn
🏛️ Indonesian Legal RAG Dataset with Knowledge Graph Enhancement
📖 What is this dataset?
This dataset contains Indonesian legal documents enhanced with Knowledge Graph features for building better RAG (Retrieval-Augmented Generation) systems. It includes regulations, laws, and legal documents from Indonesia with smart scoring and relationship mapping. 🎯 Perfect for: Legal AI, Indonesian NLP, RAG systems, legal research, and chatbots
✨ Key… See the full description on the dataset page: https://huggingface.co/datasets/Azzindani/ID_REG_MD_KG.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
📧 Github 🔗 LinkedIn
🏛️ Indonesian Legal RAG Dataset with Embeddings & TF-IDF Vectors
📖 What is this dataset?
This dataset contains Indonesian legal documents with pre-computed embeddings and TF-IDF vectors for building RAG (Retrieval-Augmented Generation) systems. It includes regulations, laws, and legal documents from Indonesia with ready-to-use vector representations. 🎯 Perfect for: Legal AI, Indonesian NLP, RAG systems, semantic search, and legal chatbots… See the full description on the dataset page: https://huggingface.co/datasets/Azzindani/ID_REG_MD_Embed.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Predict the stucture of Indian Court Judgements using sentence rhetorical roles.