100+ datasets found
  1. h

    cli

    • huggingface.co
    Updated Apr 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    lacroix (2025). cli [Dataset]. https://huggingface.co/datasets/skip113/cli
    Explore at:
    Dataset updated
    Apr 26, 2025
    Authors
    lacroix
    Description

    skip113/cli dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. h

    ml-dataset-cli-test

    • huggingface.co
    Updated Aug 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hakan Kozaklı (2025). ml-dataset-cli-test [Dataset]. https://huggingface.co/datasets/Codyfederer/ml-dataset-cli-test
    Explore at:
    Dataset updated
    Aug 6, 2025
    Authors
    Hakan Kozaklı
    Description

    Codyfederer/ml-dataset-cli-test dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. h

    mmcows

    • huggingface.co
    Updated Mar 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NEIS Lab @ Purdue (2025). mmcows [Dataset]. http://doi.org/10.57967/hf/5965
    Explore at:
    Dataset updated
    Mar 4, 2025
    Dataset authored and provided by
    NEIS Lab @ Purdue
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    MmCows: A Multimodal Dataset for Dairy Cattle Monitoring

    Details of the dataset and benchmarks are available here. For a quick overview of the dataset, please check this video.

      Instruction for downloading
    
    
    
    
    
      1. Install requirements
    

    pip install huggingface_hub

    See the file structure here for the next step.

      2. Download a file individually
    

    To download visual_data.zip to your local-dir, use command line: huggingface-cli download
    neis-lab/mmcows \… See the full description on the dataset page: https://huggingface.co/datasets/neis-lab/mmcows.

  4. h

    CLI-v2

    • huggingface.co
    Updated Mar 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amir Mohseni (2025). CLI-v2 [Dataset]. https://huggingface.co/datasets/AmirMohseni/CLI-v2
    Explore at:
    Dataset updated
    Mar 30, 2025
    Authors
    Amir Mohseni
    Description

    AmirMohseni/CLI-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    aiornot

    • huggingface.co
    Updated Jan 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Competitions (2023). aiornot [Dataset]. https://huggingface.co/datasets/competitions/aiornot
    Explore at:
    Dataset updated
    Jan 25, 2023
    Dataset authored and provided by
    Competitions
    Description

    Dataset Card for aiornot

    Dataset for the aiornot competition. By accessing this dataset, you accept the rules of the AI or Not competition. Please note that dataset may contain images which are not considered safe for work.

      Usage
    
    
    
    
    
      With Hugging Face Datasets 🤗
    

    You can download and use this dataset using the datasets library. 📝 Note: You must be logged in to you Hugging Face account for the snippet below to work. You can do this with huggingface-cli login or… See the full description on the dataset page: https://huggingface.co/datasets/competitions/aiornot.

  6. h

    Beginner

    • huggingface.co
    Updated Aug 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The citation is currently not available for this dataset.
    Explore at:
    Dataset updated
    Aug 26, 2025
    Dataset authored and provided by
    Roman Galactic Exoplanet Survey Project Infrastructure Team
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Experienced

    ML training data for the Roman Microlensing Data CHallenge 2025 - Beginner tier.

      Uploading
    

    CLI:

    Install the Hugging Face CLI

    brew install huggingface-cli

    Login with your Hugging Face credentials

    hf auth login

    Push your dataset files

    hf upload RGES-PIT/Beginner . --repo-type=dataset

    Python: from huggingface_hub import HfApi

    api = HfApi(token=os.getenv("HF_TOKEN")) api.upload_folder( folder_path="/path/to/local/dataset"… See the full description on the dataset page: https://huggingface.co/datasets/RGES-PIT/Beginner.

  7. h

    browsecomp-plus-indexes

    • huggingface.co
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tevatron (2025). browsecomp-plus-indexes [Dataset]. https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Tevatron
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    BM25, embedding index used in BrowseComp-Plus. For downloading the index: huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="bm25/*" --local-dir ./indexes huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-0.6b/*" --local-dir ./indexes huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-4b/*" --local-dir ./indexes huggingface-cli download… See the full description on the dataset page: https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes.

  8. h

    Wireframe

    • huggingface.co
    Updated Aug 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li Hao (2025). Wireframe [Dataset]. https://huggingface.co/datasets/lh9171338/Wireframe
    Explore at:
    Dataset updated
    Aug 31, 2025
    Authors
    Li Hao
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Wireframe Dataset

    This is the Wireframe dataset hosted on Hugging Face Hub.

      Summary
    

    Wireframe dataset with image annotations including line segments.The dataset is stored as jsonl files (train/metadata.jsonl, test/metadata.jsonl) and images. Number of samples:

    Train: 5,000 Test: 462

      Download
    

    Download with huggingface-hub

    python3 -m pip install huggingface-hub huggingface-cli download --repo-type dataset lh9171338/Wireframe --local-dir ./

    Download with Git… See the full description on the dataset page: https://huggingface.co/datasets/lh9171338/Wireframe.

  9. h

    MOSEv2

    • huggingface.co
    Updated Aug 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FudanCVL (2025). MOSEv2 [Dataset]. https://huggingface.co/datasets/FudanCVL/MOSEv2
    Explore at:
    Dataset updated
    Aug 8, 2025
    Dataset authored and provided by
    FudanCVL
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

    🔥 Evaluation Server | 🏠 Homepage | 📄 Paper | 🔗 GitHub

      Download
    

    We recommend using huggingface-cli to download: pip install -U "huggingface_hub[cli]" huggingface-cli download FudanCVL/MOSEv2 --repo-type dataset --local-dir ./MOSEv2 --local-dir-use-symlinks False --max-workers 16

      Dataset Summary
    

    MOSEv2 is a comprehensive video object segmentation dataset designed to advance… See the full description on the dataset page: https://huggingface.co/datasets/FudanCVL/MOSEv2.

  10. h

    GuardReasonerTrain

    • huggingface.co
    Updated May 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    yueliu1999 (2025). GuardReasonerTrain [Dataset]. https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 19, 2025
    Authors
    yueliu1999
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    GuardReasonerTrain

    GuardReasonerTrain is the training data for R-SFT of GuardReasoner, as described in the paper GuardReasoner: Towards Reasoning-based LLM Safeguards. Code: https://github.com/yueliu1999/GuardReasoner/

      Usage
    

    from datasets import load_dataset

    Login using e.g. huggingface-cli login to access this dataset

    ds = load_dataset("yueliu1999/GuardReasonerTrain")

      Citation
    

    If you use this dataset, please cite our paper. @article{GuardReasoner… See the full description on the dataset page: https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain.

  11. h

    ktda-datasets

    • huggingface.co
    Updated Dec 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    XavierJiezou (2024). ktda-datasets [Dataset]. https://huggingface.co/datasets/XavierJiezou/ktda-datasets
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 8, 2024
    Authors
    XavierJiezou
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    KTDA-Datasets

    This dataset card aims to describe the datasets used in the KTDA.

      Install
    

    pip install huggingface-hub

      Usage
    

    Step 1: Download datasets

    huggingface-cli download --repo-type dataset XavierJiezou/ktda-datasets --local-dir data --include grass.zip huggingface-cli download --repo-type dataset XavierJiezou/ktda-datasets --local-dir data --include cloud.zip

    Step 2: Extract datasets

    unzip grass.zip -d grass unzip cloud.zip -d l8_biome… See the full description on the dataset page: https://huggingface.co/datasets/XavierJiezou/ktda-datasets.

  12. h

    Vizdate

    • huggingface.co
    Updated Mar 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erik Firdaus (2025). Vizdate [Dataset]. https://huggingface.co/datasets/Vinzero/Vizdate
    Explore at:
    Dataset updated
    Mar 30, 2025
    Authors
    Erik Firdaus
    Description

    from datasets import load_dataset

      Login using e.g. huggingface-cli login to access this dataset
    

    ds = load_dataset("huggingface/transformers-metadata", "frameworks")

  13. h

    TrialPanorama-database

    • huggingface.co
    Updated Aug 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zifeng Wang (2025). TrialPanorama-database [Dataset]. https://huggingface.co/datasets/zifeng-ai/TrialPanorama-database
    Explore at:
    Dataset updated
    Aug 4, 2025
    Authors
    Zifeng Wang
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Quick start

    The easiest way to download the dataset to your local is to use huggingface-cli. The specific command you can use is huggingface-cli download zifeng-ai/TrialPanorama-database --local-dir LOCAL_DIR --repo-type dataset

    where LOCAL_DIR should be replaced with the target directory you want to save your dataset to.

      Update history
    

    Aug.4 2025: updated tables with the full set of studies

    Dataset website: https://ryanwangzf.github.io/projects/trialpanorama… See the full description on the dataset page: https://huggingface.co/datasets/zifeng-ai/TrialPanorama-database.

  14. h

    Experienced

    • huggingface.co
    Updated Aug 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roman Galactic Exoplanet Survey Project Infrastructure Team (2025). Experienced [Dataset]. https://huggingface.co/datasets/RGES-PIT/Experienced
    Explore at:
    Dataset updated
    Aug 26, 2025
    Dataset authored and provided by
    Roman Galactic Exoplanet Survey Project Infrastructure Team
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Experienced

    ML training data for the Roman Microlensing Data CHallenge 2025 - Experienced tier.

      Uploading
    

    CLI:

    Install the Hugging Face CLI

    brew install huggingface-cli

    Login with your Hugging Face credentials

    hf auth login

    Push your dataset files

    hf upload RGES-PIT/Experienced . --repo-type=dataset

    Python: from huggingface_hub import HfApi

    api = HfApi(token=os.getenv("HF_TOKEN")) api.upload_folder( folder_path="/path/to/local/dataset"… See the full description on the dataset page: https://huggingface.co/datasets/RGES-PIT/Experienced.

  15. h

    HealthyCT

    • huggingface.co
    Updated Mar 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qi Chen (2024). HealthyCT [Dataset]. https://huggingface.co/datasets/qicq1c/HealthyCT
    Explore at:
    Dataset updated
    Mar 28, 2024
    Authors
    Qi Chen
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Summary

    Healthy CT data for abdominal organs (liver, pancreas and kidney) are filtered out from public dataset.

      Downloading Instructions
    
    
    
    
    
      1- Install the Hugging Face library:
    

    pip install -U "huggingface_hub[cli]"

      2- Download the dataset:
    

    mkdir HealthyCT cd HealthyCT huggingface-cli download qicq1c/HealthyCT --repo-type dataset --local-dir . --cache-dir ./cache

    [Optional] Resume downloading

    In case you had a previous interrupted download… See the full description on the dataset page: https://huggingface.co/datasets/qicq1c/HealthyCT.

  16. h

    fastmap_sfm

    • huggingface.co
    Updated May 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haochen Wang (2025). fastmap_sfm [Dataset]. https://huggingface.co/datasets/whc/fastmap_sfm
    Explore at:
    Dataset updated
    May 7, 2025
    Authors
    Haochen Wang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Fastmap evaluation suite.

    You only need the databases to run fastmap. Download the images if you want to produce colored point cloud. Download the subset of data you want to your local directory. huggingface-cli download whc/fastmap_sfm --repo-type dataset --local-dir ./ --include 'databases/tnt_*' 'ground_truths/tnt_*'

    or use the python interface from huggingface_hub import hf_hub_download, snapshot_download snapshot_download( repo_id="whc/fastmap_sfm", repo_type='dataset'… See the full description on the dataset page: https://huggingface.co/datasets/whc/fastmap_sfm.

  17. h

    IQA-PyTorch-Datasets

    • huggingface.co
    Updated Feb 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chaofeng Chen (2024). IQA-PyTorch-Datasets [Dataset]. https://huggingface.co/datasets/chaofengc/IQA-PyTorch-Datasets
    Explore at:
    Dataset updated
    Feb 18, 2024
    Authors
    Chaofeng Chen
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Description

    This is the dataset repository used in the pyiqa toolbox. Please refer to Awesome Image Quality Assessment for details of each dataset Example commandline script with huggingface-cli: huggingface-cli download chaofengc/IQA-PyTorch-Datasets live.tgz --local-dir ./datasets --repo-type dataset cd datasets tar -xzvf live.tgz

      Disclaimer for This Dataset Collection
    

    This collection of datasets is compiled and maintained for academic, research, and educational… See the full description on the dataset page: https://huggingface.co/datasets/chaofengc/IQA-PyTorch-Datasets.

  18. h

    world_model_tokenized_data

    • huggingface.co
    Updated Jun 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    1X (2024). world_model_tokenized_data [Dataset]. https://huggingface.co/datasets/1x-technologies/world_model_tokenized_data
    Explore at:
    Dataset updated
    Jun 20, 2024
    Dataset authored and provided by
    1X
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Area covered
    World
    Description

    1X World Model Compression Challenge Dataset

    This repository hosts the dataset for the 1X World Model Compression Challenge. huggingface-cli download 1x-technologies/worldmodel --repo-type dataset --local-dir data

      Updates Since v1.1
    

    Train/Val v2.0 (~100 hours), replacing v1.1 Test v2.0 dataset for the Compression Challenge Faces blurred for privacy New raw video dataset (CC-BY-NC-SA 4.0) at worldmodel_raw_data Example scripts now split into: cosmos_video_decoder.py —… See the full description on the dataset page: https://huggingface.co/datasets/1x-technologies/world_model_tokenized_data.

  19. h

    pansharpening-datasets

    • huggingface.co
    Updated Dec 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    XavierJiezou (2024). pansharpening-datasets [Dataset]. https://huggingface.co/datasets/XavierJiezou/pansharpening-datasets
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 17, 2024
    Authors
    XavierJiezou
    Description

    Pansharpening-Datasets

    This dataset card aims to describe the datasets used in the Pansharpening.

      Install
    

    pip install huggingface-hub

      Usage
    

    Step 1: Download datasets

    huggingface-cli download --repo-type dataset XavierJiezou/pansharpening-datasets --local-dir data --include PanBench.zip

    Step 2: Extract datasets

    unzip PanBench.zip -d PanBench

      Citation
    

    @Article{cmfnet, AUTHOR = {Wang, Shiying and Zou, Xuechao and Li, Kai and Xing, Junliang and… See the full description on the dataset page: https://huggingface.co/datasets/XavierJiezou/pansharpening-datasets.

  20. h

    coco-2017-mirror

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pedro Cuenca, coco-2017-mirror [Dataset]. https://huggingface.co/datasets/pcuenq/coco-2017-mirror
    Explore at:
    Authors
    Pedro Cuenca
    License

    https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/

    Description

    COCO 2017 mirror

    This is a just mirror of the raw COCO dataset files, for convenience. You have to download it using something like: pip install huggingface_hub

    huggingface-cli download --local-dir coco-2017 pcuenq/coco-2017-mirror

    And then unzip the files before use.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
lacroix (2025). cli [Dataset]. https://huggingface.co/datasets/skip113/cli

cli

skip113/cli

Explore at:
Dataset updated
Apr 26, 2025
Authors
lacroix
Description

skip113/cli dataset hosted on Hugging Face and contributed by the HF Datasets community

Search
Clear search
Close search
Google apps
Main menu