60 datasets found
  1. Huggingface_Uploader

    • huggingface.co
    Updated Feb 17, 2025
    Cite
    Ktiseos Nyx (2025). Huggingface_Uploader [Dataset]. https://huggingface.co/datasets/EarthnDusk/Huggingface_Uploader
    Dataset updated
    Feb 17, 2025
    Dataset authored and provided by
    Ktiseos Nyx
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    🚀 Hugging Face Uploader: Streamline Your Model Sharing! 🚀

    This tool provides a user-friendly way to upload files directly to your Hugging Face repositories. Whether you prefer the interactive environment of a Jupyter Notebook or the command-line efficiency of a Python script, we've got you covered. We've designed it to streamline your workflow and make sharing your models, datasets, and spaces easier than ever before! Will be more consistently updated here:… See the full description on the dataset page: https://huggingface.co/datasets/EarthnDusk/Huggingface_Uploader.

  2. Data from: dataset-creation

    • huggingface.co
    Updated Jul 23, 2025
    Cite
    uv scripts (2025). dataset-creation [Dataset]. https://huggingface.co/datasets/uv-scripts/dataset-creation
    Dataset updated
    Jul 23, 2025
    Dataset authored and provided by
    uv scripts
    Description

    Dataset Creation Scripts

    Ready-to-run scripts for creating Hugging Face datasets from local files.

      Available Scripts
    
    
    
    
    
      📄 pdf-to-dataset.py
    

    Convert directories of PDF files into Hugging Face datasets. Features:

    📁 Uploads PDFs as dataset objects for flexible processing
    🏷️ Automatic labeling from folder structure
    🚀 Zero configuration - just point at your PDFs
    📤 Direct upload to Hugging Face Hub

    Usage:

    Basic usage

    uv run pdf-to-dataset.py /path/to/pdfs… See the full description on the dataset page: https://huggingface.co/datasets/uv-scripts/dataset-creation.

  3. tmp-file-upload

    • huggingface.co
    Updated Nov 26, 2024
    + more versions
    Cite
    Albert Villanova del Moral (2024). tmp-file-upload [Dataset]. https://huggingface.co/datasets/albertvillanova/tmp-file-upload
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 26, 2024
    Authors
    Albert Villanova del Moral
    Description

    albertvillanova/tmp-file-upload dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. test-uploading-jsonl-file-for-preview-dataset-final

    • huggingface.co
    + more versions
    Cite
    Mahnoor Malik, test-uploading-jsonl-file-for-preview-dataset-final [Dataset]. https://huggingface.co/datasets/MahnoorMalik/test-uploading-jsonl-file-for-preview-dataset-final
    Authors
    Mahnoor Malik
    Description

    MahnoorMalik/test-uploading-jsonl-file-for-preview-dataset-final dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. nlistral-7b-results

    • huggingface.co
    Updated Jul 27, 2025
    Cite
    Jordan (2025). nlistral-7b-results [Dataset]. https://huggingface.co/datasets/jd0g/nlistral-7b-results
    Dataset updated
    Jul 27, 2025
    Authors
    Jordan
    Description

    Experimental Results

    This directory stores the output files from running inference and evaluation scripts.

      Uploading Results to Hugging Face
    

    To back up or share your results, you can upload the entire results/ directory to a Hugging Face dataset repository:

    Ensure your .env file has your HF_TOKEN

    Build the Docker image if needed: docker build -t mistral-nli-ft .

    Upload results to the default dataset (jd0g/Mistral-NLI-Results) or your own

    docker run --rm
    -v… See the full description on the dataset page: https://huggingface.co/datasets/jd0g/nlistral-7b-results.

  6. file-storage-7485

    • huggingface.co
    Updated Jul 30, 2025
    Cite
    How Okd R (2025). file-storage-7485 [Dataset]. https://huggingface.co/datasets/xhowold/file-storage-7485
    Dataset updated
    Jul 30, 2025
    Authors
    How Okd R
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    File Storage Dataset

    This dataset is used for file storage purposes.

      Files
    

    This dataset contains uploaded files organized in the uploads directory.

  7. DH-FaceVid-Sample

    • huggingface.co
    Cite
    hefeng, DH-FaceVid-Sample [Dataset]. https://huggingface.co/datasets/jjuik2014/DH-FaceVid-Sample
    Authors
    hefeng
    License

    https://choosealicense.com/licenses/other/

    Description

    English Version:
    Note: Due to network issues, we are still working on uploading the sample data to this Hugging Face repository. In the meantime, the sample data is available at: 📌 ModelScope Dataset: https://www.modelscope.cn/datasets/fh2678713685/DH-FaceVid-1K_Sample/files
    ⚠ Important: The uploaded files consist of two split archive parts (.tar.gz format). Users must download both parts, merge them, and then extract the final dataset.

      How to Merge & Extract:
    

    Download… See the full description on the dataset page: https://huggingface.co/datasets/jjuik2014/DH-FaceVid-Sample.
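    The description above is truncated, so the dataset's own merge steps are not shown here. As a rough sketch only, assuming the two parts are plain byte-wise splits of a single .tar.gz archive (for example, as produced by the `split` utility), merging and extracting could look like the following. The function names and the part file names in the usage note are hypothetical and are not taken from the dataset page.

```python
import shutil
import tarfile
from pathlib import Path


def merge_parts(part_paths, merged_path):
    """Concatenate split archive parts, in the order given, into one file."""
    merged_path = Path(merged_path)
    with merged_path.open("wb") as out:
        for part in part_paths:
            with Path(part).open("rb") as src:
                # Stream each part so large archives are not loaded into memory.
                shutil.copyfileobj(src, out)
    return merged_path


def extract_archive(archive_path, dest_dir):
    """Extract a gzip-compressed tar archive into dest_dir."""
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(dest_dir)
    return dest_dir
```

    With hypothetical part names, usage would be `extract_archive(merge_parts(["sample.tar.gz.part1", "sample.tar.gz.part2"], "sample.tar.gz"), "sample")`. The order of the parts matters: concatenating them out of order produces a corrupt archive. Only extract archives from sources you trust.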

  8. drug_development_supported_by_informatics

    • huggingface.co
    Updated Jun 16, 2025
    Cite
    Bri Parales (2025). drug_development_supported_by_informatics [Dataset]. https://huggingface.co/datasets/introvoyz041/drug_development_supported_by_informatics
    Dataset updated
    Jun 16, 2025
    Authors
    Bri Parales
    Description

    Dataset Card for introvoyz041/drug_development_supported_by_informatics

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 358
    Number of PDFs processed: 1
    Sample size per PDF: 100
    Created on: 2025-06-16 15:39:43

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to… See the full description on the dataset page: https://huggingface.co/datasets/introvoyz041/drug_development_supported_by_informatics.

  9. coldocs-fin-sample

    • huggingface.co
    Updated Oct 30, 2024
    Cite
    Nirant Kasliwal (2024). coldocs-fin-sample [Dataset]. https://huggingface.co/datasets/nirantk/coldocs-fin-sample
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 30, 2024
    Authors
    Nirant Kasliwal
    Description

    Dataset Card for nirantk/coldocs-fin-sample

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 200
    Number of PDFs processed: 1
    Sample size per PDF: 100
    Created on: 2024-10-30 01:19:36

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images… See the full description on the dataset page: https://huggingface.co/datasets/nirantk/coldocs-fin-sample.

  10. Data from: BHI

    • huggingface.co
    Updated Dec 21, 2024
    Cite
    Philip Hofmann (2024). BHI [Dataset]. https://huggingface.co/datasets/Phips/BHI
    Dataset updated
    Dec 21, 2024
    Authors
    Philip Hofmann
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BHI SISR Dataset

      Content
    

    HR Dataset Used Datasets Tiling BHI Filtering Files Upload

    Corresponding LR Sets Trained models

      HR Dataset
    

    The BHI SISR Dataset is intended for training single image super-resolution models. It is the result of tests on my BHI filtering method, which I described in a Hugging Face community blog post. In short: removing (by filtering) only the worst quality tiles from a training set has a way bigger… See the full description on the dataset page: https://huggingface.co/datasets/Phips/BHI.

  11. Finecode

    • huggingface.co
    Updated Mar 2, 2025
    Cite
    Jayan Kesavan (2025). Finecode [Dataset]. https://huggingface.co/datasets/jayan12k/Finecode
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 2, 2025
    Authors
    Jayan Kesavan
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    FineCode: A High-Quality Code Dataset

    Disclaimer: No big files uploaded... yet. The one upload is simply an example format and doesn't contain all the highest quality code or the final version.

      Overview
    

    FineCode is a meticulously curated dataset aimed at providing high-quality code for training and benchmarking code generation models. While many code datasets exist on Hugging Face, the quality of code varies significantly. FineCode seeks to address this by rigorously… See the full description on the dataset page: https://huggingface.co/datasets/jayan12k/Finecode.

  12. ufo

    • huggingface.co
    Updated Sep 19, 2024
    Cite
    Daniel van Strien (2024). ufo [Dataset]. https://huggingface.co/datasets/davanstrien/ufo
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 19, 2024
    Authors
    Daniel van Strien
    Description

    Dataset Card for davanstrien/ufo

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 212
    Number of PDFs processed: 109
    Sample size per PDF: 10
    Created on: 2024-09-19 20:46:12

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images Converter.… See the full description on the dataset page: https://huggingface.co/datasets/davanstrien/ufo.

  13. 9th-grade-chem

    • huggingface.co
    Updated May 27, 2025
    + more versions
    Cite
    Zohaib Saqib (2025). 9th-grade-chem [Dataset]. https://huggingface.co/datasets/zohaibterminator/9th-grade-chem
    Dataset updated
    May 27, 2025
    Authors
    Zohaib Saqib
    Description

    Dataset Card for zohaibterminator/9th-grade-chem

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 53
    Number of PDFs processed: 1
    Sample size per PDF: 100
    Created on: 2025-05-27 12:51:55

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images… See the full description on the dataset page: https://huggingface.co/datasets/zohaibterminator/9th-grade-chem.

  14. Open-Sora-Plan-v1.1.0

    • huggingface.co
    Updated Nov 3, 2024
    Cite
    linbin (2024). Open-Sora-Plan-v1.1.0 [Dataset]. https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.1.0
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 3, 2024
    Authors
    linbin
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Annotation

    We resized the dataset to 1080p for easier uploading. Therefore, the original annotation file might not match the video names. Please refer to https://github.com/PKU-YuanGroup/Open-Sora-Plan/issues/312#issuecomment-2197312973

      Pexels
    

    Pexels consists of multiple folders, but each folder exceeds the size limit for Huggingface uploads. Therefore, we divided each folder into 5 parts. You need to merge the 5 parts of each folder first, and then extract each… See the full description on the dataset page: https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.1.0.

  15. LightRAG-DAPO-ScalingLaws

    • huggingface.co
    Updated May 13, 2025
    Cite
    amand (2025). LightRAG-DAPO-ScalingLaws [Dataset]. https://huggingface.co/datasets/axondendriteplus/LightRAG-DAPO-ScalingLaws
    Dataset updated
    May 13, 2025
    Authors
    amand
    Description

    Dataset Card for axondendriteplus/LightRAG-DAPO-ScalingLaws

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 62
    Number of PDFs processed: 3
    Sample size per PDF: 100
    Created on: 2025-05-13 10:15:11

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to… See the full description on the dataset page: https://huggingface.co/datasets/axondendriteplus/LightRAG-DAPO-ScalingLaws.

  16. Statista

    • huggingface.co
    Updated Nov 12, 2024
    Cite
    Praneith Ranganath (2024). Statista [Dataset]. https://huggingface.co/datasets/Pran10/Statista
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 12, 2024
    Authors
    Praneith Ranganath
    Description

    Dataset Card for Pran10/Statista

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 250
    Number of PDFs processed: 12
    Sample size per PDF: 100
    Created on: 2024-11-12 01:04:56

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images Converter.… See the full description on the dataset page: https://huggingface.co/datasets/Pran10/Statista.

  17. britishhland

    • huggingface.co
    Updated Dec 1, 2024
    + more versions
    Cite
    Sebastien Grima (2024). britishhland [Dataset]. https://huggingface.co/datasets/sebgrima/britishhland
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 1, 2024
    Authors
    Sebastien Grima
    Description

    Dataset Card for sebgrima/britishhland

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 626
    Number of PDFs processed: 4
    Sample size per PDF: 100
    Created on: 2024-12-01 19:28:20

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images… See the full description on the dataset page: https://huggingface.co/datasets/sebgrima/britishhland.

  18. iln

    • huggingface.co
    Updated Nov 15, 2024
    Cite
    Melvin Wevers (2024). iln [Dataset]. https://huggingface.co/datasets/melvinwevers/iln
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 15, 2024
    Authors
    Melvin Wevers
    Description

    Dataset Card for melvinwevers/iln

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 48
    Number of PDFs processed: 3
    Sample size per PDF: 100
    Created on: 2024-11-15 14:17:07

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images Converter. Each… See the full description on the dataset page: https://huggingface.co/datasets/melvinwevers/iln.

  19. state-of-ai-2024

    • huggingface.co
    Updated Oct 11, 2024
    Cite
    Atita (2024). state-of-ai-2024 [Dataset]. https://huggingface.co/datasets/atitaarora/state-of-ai-2024
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 11, 2024
    Authors
    Atita
    Description

    Dataset Card for atitaarora/state-of-ai-2024

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 212
    Number of PDFs processed: 1
    Sample size per PDF: 100
    Created on: 2024-10-11 15:05:25

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page Images… See the full description on the dataset page: https://huggingface.co/datasets/atitaarora/state-of-ai-2024.

  20. Legal-AI-K-Hub

    • huggingface.co
    Updated May 15, 2025
    Cite
    amand (2025). Legal-AI-K-Hub [Dataset]. https://huggingface.co/datasets/axondendriteplus/Legal-AI-K-Hub
    Dataset updated
    May 15, 2025
    Authors
    amand
    Description

    Dataset Card for axondendriteplus/Legal-AI-K-Hub

      Dataset Description
    

    This dataset contains images converted from PDFs using the PDFs to Page Images Converter Space.

    Number of images: 4245
    Number of PDFs processed: 175
    Sample size per PDF: 100
    Created on: 2025-05-15 11:26:43

      Dataset Creation
    
    
    
    
    
      Source Data
    

    The images in this dataset were generated from user-uploaded PDF files.

      Processing Steps
    

    PDF files were uploaded to the PDFs to Page… See the full description on the dataset page: https://huggingface.co/datasets/axondendriteplus/Legal-AI-K-Hub.
