2 datasets found
  1. h

    aws-public-pdf-chunked-dataset

    • huggingface.co
    Updated May 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    semih kalkandelen (2025). aws-public-pdf-chunked-dataset [Dataset]. https://huggingface.co/datasets/semihk1/aws-public-pdf-chunked-dataset
    Explore at:
    Dataset updated
    May 25, 2025
    Authors
    semih kalkandelen
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    📚 AWS PDF Chunk Dataset

    This dataset consists of chunked text extracted from all publicly available PDF documents on the Amazon Web Services (AWS) official website. The data includes whitepapers, user guides, technical documentation, and best practice manuals—covering virtually every AWS service, concept, and architecture in depth. It is designed to serve as a high-quality knowledge base for use in embedding generation, vector databases, and retrieval-augmented generation (RAG)… See the full description on the dataset page: https://huggingface.co/datasets/semihk1/aws-public-pdf-chunked-dataset.

  2. e

    GeoDAE — public base (WFS service)

    • data.europa.eu
    wfs
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GeoDAE — public base (WFS service) [Dataset]. https://data.europa.eu/data/datasets/87d621bd-fc7b-4b51-92b7-7515ac1c66c7
    Explore at:
    wfsAvailable download formats
    Description

    Service WFS — GeoDAE is the national database of external automated defibrillators, listed in France.

    This sheet concerns the extraction of public data, as provided for in the Decree of 29 October 2019 on the operation of the national database of external automated defibrillators (EAD), published in the OJ of 13 November 2019.

    The public or limited dissemination rules are specified in Annexes 1, 2 and 3 to this Order. Only open access data is disseminated in open data.

    PDF link of the authenticated Official Journal No 0263 of 13/11/2019 https://www.legifrance.gouv.fr/download/pdf?id=7dFR0QFiwf-MSw4c2oQJQ1o7HqWR6wDUo19VGpmA_28=

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
semih kalkandelen (2025). aws-public-pdf-chunked-dataset [Dataset]. https://huggingface.co/datasets/semihk1/aws-public-pdf-chunked-dataset

aws-public-pdf-chunked-dataset

AWS All-in-One Docs Dataset

semihk1/aws-public-pdf-chunked-dataset

Explore at:
Dataset updated
May 25, 2025
Authors
semih kalkandelen
License

https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

Description

📚 AWS PDF Chunk Dataset

This dataset consists of chunked text extracted from all publicly available PDF documents on the Amazon Web Services (AWS) official website. The data includes whitepapers, user guides, technical documentation, and best practice manuals—covering virtually every AWS service, concept, and architecture in depth. It is designed to serve as a high-quality knowledge base for use in embedding generation, vector databases, and retrieval-augmented generation (RAG)… See the full description on the dataset page: https://huggingface.co/datasets/semihk1/aws-public-pdf-chunked-dataset.

Search
Clear search
Close search
Google apps
Main menu