6 datasets found
  1. g

    RhetoricalRole: A dataset to structure indian legal judgements

    • legal-nlp-ekstep.github.io
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thoughtworks, Ekstep, RhetoricalRole: A dataset to structure indian legal judgements [Dataset]. https://legal-nlp-ekstep.github.io/Competitions/Rhetorical-Role/
    Explore at:
    Dataset authored and provided by
    Thoughtworks, Ekstep
    Description

    Predict the stucture of Indian Court Judgements using sentence rhetorical roles.

  2. Spanish Legal Domain Corpora

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Nov 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asier Gutiérrez-Fandiño; Asier Gutiérrez-Fandiño; Jordi Armengol-Estapé; Jordi Armengol-Estapé; Aitor Gonzalez-Agirre; Marta Villegas; Marta Villegas; Aitor Gonzalez-Agirre (2022). Spanish Legal Domain Corpora [Dataset]. http://doi.org/10.5281/zenodo.5495529
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 4, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Asier Gutiérrez-Fandiño; Asier Gutiérrez-Fandiño; Jordi Armengol-Estapé; Jordi Armengol-Estapé; Aitor Gonzalez-Agirre; Marta Villegas; Marta Villegas; Aitor Gonzalez-Agirre
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Spanish Legal Domain Corpora

    A collection of corpora of Spanish legal domain.

    More legal domain resources: https://github.com/PlanTL-GOB-ES/lm-legal-es

    Citation

    @misc{gutierrezfandino2021legal,
       title={Spanish Legalese Language Model and Corpora}, 
       author={Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Aitor Gonzalez-Agirre and Marta Villegas},
       year={2021},
       eprint={2110.12201},
       archivePrefix={arXiv},
       primaryClass={cs.CL}
    }

    Copyright

    Copyright (c) 2021 Secretaría de Estado de Digitalización e Inteligencia Artificial

  3. h

    ger_da_lir

    • huggingface.co
    Updated Feb 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jina AI (2024). ger_da_lir [Dataset]. https://huggingface.co/datasets/jinaai/ger_da_lir
    Explore at:
    Dataset updated
    Feb 6, 2024
    Dataset authored and provided by
    Jina AI
    Description

    A legal retrieval dataset in Germnan https://github.com/lavis-nlp/GerDaLIR

  4. MAUD v1

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Jul 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Atticus Project; The Atticus Project (2024). MAUD v1 [Dataset]. http://doi.org/10.5281/zenodo.7500064
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 15, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    The Atticus Project; The Atticus Project
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Merger Agreement Understanding Dataset (MAUD) v1 is a corpus of 47,000+ labels in 152 merger agreements that have been manually labeled under the supervision of experienced lawyers to identify 92 questions in each agreement used by the 2021 American Bar Association (ABA) Public Target Deal Points Study.

    MAUD is curated and maintained by The Atticus Project, Inc. to support NLP research and development in legal contract review.

    ReadMe and Datasheet are published here. Code for replicating the results, together with the model trained on CUAD, is published on Github here.

  5. h

    ID_REG_MD_KG

    • huggingface.co
    Updated Nov 4, 2008
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Azzindani (2008). ID_REG_MD_KG [Dataset]. https://huggingface.co/datasets/Azzindani/ID_REG_MD_KG
    Explore at:
    Dataset updated
    Nov 4, 2008
    Authors
    Azzindani
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    📧 Github 🔗 LinkedIn

      🏛️ Indonesian Legal RAG Dataset with Knowledge Graph Enhancement
    
    
    
    
    
      📖 What is this dataset?
    

    This dataset contains Indonesian legal documents enhanced with Knowledge Graph features for building better RAG (Retrieval-Augmented Generation) systems. It includes regulations, laws, and legal documents from Indonesia with smart scoring and relationship mapping. 🎯 Perfect for: Legal AI, Indonesian NLP, RAG systems, legal research, and chatbots

      ✨ Key… See the full description on the dataset page: https://huggingface.co/datasets/Azzindani/ID_REG_MD_KG.
    
  6. h

    ID_REG_MD_Embed

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Azzindani, ID_REG_MD_Embed [Dataset]. https://huggingface.co/datasets/Azzindani/ID_REG_MD_Embed
    Explore at:
    Authors
    Azzindani
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    📧 Github 🔗 LinkedIn

      🏛️ Indonesian Legal RAG Dataset with Embeddings & TF-IDF Vectors
    
    
    
    
    
      📖 What is this dataset?
    

    This dataset contains Indonesian legal documents with pre-computed embeddings and TF-IDF vectors for building RAG (Retrieval-Augmented Generation) systems. It includes regulations, laws, and legal documents from Indonesia with ready-to-use vector representations. 🎯 Perfect for: Legal AI, Indonesian NLP, RAG systems, semantic search, and legal chatbots… See the full description on the dataset page: https://huggingface.co/datasets/Azzindani/ID_REG_MD_Embed.

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Thoughtworks, Ekstep, RhetoricalRole: A dataset to structure indian legal judgements [Dataset]. https://legal-nlp-ekstep.github.io/Competitions/Rhetorical-Role/

RhetoricalRole: A dataset to structure indian legal judgements

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
Dataset authored and provided by
Thoughtworks, Ekstep
Description

Predict the stucture of Indian Court Judgements using sentence rhetorical roles.

Search
Clear search
Close search
Google apps
Main menu