Codyfederer/ml-dataset-cli-test dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
MmCows: A Multimodal Dataset for Dairy Cattle Monitoring
Details of the dataset and benchmarks are available here. For a quick overview of the dataset, please check this video.
Instructions for downloading
1. Install requirements
pip install huggingface_hub
See the file structure here for the next step.
2. Download a file individually
To download visual_data.zip to your local directory, use the command line:
huggingface-cli download neis-lab/mmcows \… See the full description on the dataset page: https://huggingface.co/datasets/neis-lab/mmcows.
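Alternatively, the same file can be fetched from Python with huggingface_hub; a minimal sketch, where the target directory is a placeholder and only the file name visual_data.zip comes from the card above:
from huggingface_hub import hf_hub_download

# Fetch a single file from the dataset repo (local_dir is a placeholder)
hf_hub_download(
    repo_id="neis-lab/mmcows",
    filename="visual_data.zip",
    repo_type="dataset",
    local_dir="./mmcows",
)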
AmirMohseni/CLI-v2 dataset hosted on Hugging Face and contributed by the HF Datasets community
Dataset Card for aiornot
Dataset for the aiornot competition. By accessing this dataset, you accept the rules of the AI or Not competition. Please note that the dataset may contain images which are not considered safe for work.
Usage
With Hugging Face Datasets 🤗
You can download and use this dataset using the datasets library. 📝 Note: You must be logged in to your Hugging Face account for the snippet below to work. You can do this with huggingface-cli login or… See the full description on the dataset page: https://huggingface.co/datasets/competitions/aiornot.
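As a sketch of that snippet, assuming the standard datasets API and a prior login (the split names are not given in this excerpt):
from datasets import load_dataset

# Requires a prior huggingface-cli login; the competition gates access
ds = load_dataset("competitions/aiornot")
print(ds)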
CC0 1.0 https://choosealicense.com/licenses/cc0-1.0/
Beginner
ML training data for the Roman Microlensing Data Challenge 2025 - Beginner tier.
Uploading
CLI:
brew install huggingface-cli
hf auth login
hf upload RGES-PIT/Beginner . --repo-type=dataset
Python:
import os
from huggingface_hub import HfApi

api = HfApi(token=os.getenv("HF_TOKEN"))
api.upload_folder(
    folder_path="/path/to/local/dataset"… See the full description on the dataset page: https://huggingface.co/datasets/RGES-PIT/Beginner.
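A completed version of that call might look like the following; everything past the truncation (the repo_id and the closing arguments) is an assumption inferred from the CLI command above:
import os
from huggingface_hub import HfApi

api = HfApi(token=os.getenv("HF_TOKEN"))
api.upload_folder(
    folder_path="/path/to/local/dataset",  # assumed placeholder path
    repo_id="RGES-PIT/Beginner",           # taken from the CLI command above
    repo_type="dataset",
)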
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
BM25 and embedding indexes used in BrowseComp-Plus. To download the indexes:
huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="bm25/*" --local-dir ./indexes
huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-0.6b/*" --local-dir ./indexes
huggingface-cli download Tevatron/browsecomp-plus-indexes --repo-type=dataset --include="qwen3-embedding-4b/*" --local-dir ./indexes
huggingface-cli download… See the full description on the dataset page: https://huggingface.co/datasets/Tevatron/browsecomp-plus-indexes.
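The same selective download can be done from Python; a sketch using snapshot_download with allow_patterns, mirroring the --include filters above:
from huggingface_hub import snapshot_download

# Download only the BM25 index; swap the pattern for the embedding indexes
snapshot_download(
    repo_id="Tevatron/browsecomp-plus-indexes",
    repo_type="dataset",
    allow_patterns=["bm25/*"],
    local_dir="./indexes",
)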
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
Wireframe Dataset
This is the Wireframe dataset hosted on Hugging Face Hub.
Summary
Wireframe dataset with image annotations including line segments. The dataset is stored as JSONL files (train/metadata.jsonl, test/metadata.jsonl) and images. Number of samples:
Train: 5,000
Test: 462
Download
Download with huggingface-hub
python3 -m pip install huggingface-hub
huggingface-cli download --repo-type dataset lh9171338/Wireframe --local-dir ./
Download with Git… See the full description on the dataset page: https://huggingface.co/datasets/lh9171338/Wireframe.
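Once downloaded, the JSONL metadata can be read line by line; a minimal sketch (the record fields are not documented in this excerpt, so it only prints the keys):
import json

with open("train/metadata.jsonl") as f:
    for line in f:
        record = json.loads(line)
        print(record.keys())  # e.g. image path and line-segment annotations
        break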
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes
🔥 Evaluation Server | 🏠 Homepage | 📄 Paper | 🔗 GitHub
Download
We recommend using huggingface-cli to download:
pip install -U "huggingface_hub[cli]"
huggingface-cli download FudanCVL/MOSEv2 --repo-type dataset --local-dir ./MOSEv2 --local-dir-use-symlinks False --max-workers 16
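An equivalent Python call, as a sketch whose parameters simply mirror the CLI flags above:
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="FudanCVL/MOSEv2",
    repo_type="dataset",
    local_dir="./MOSEv2",
    max_workers=16,  # parallel downloads, as in the CLI command
)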
Dataset Summary
MOSEv2 is a comprehensive video object segmentation dataset designed to advance… See the full description on the dataset page: https://huggingface.co/datasets/FudanCVL/MOSEv2.
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
GuardReasonerTrain
GuardReasonerTrain is the training data for R-SFT of GuardReasoner, as described in the paper GuardReasoner: Towards Reasoning-based LLM Safeguards. Code: https://github.com/yueliu1999/GuardReasoner/
Usage
from datasets import load_dataset

# Login using e.g. huggingface-cli login to access this dataset
ds = load_dataset("yueliu1999/GuardReasonerTrain")
Citation
If you use this dataset, please cite our paper. @article{GuardReasoner… See the full description on the dataset page: https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
KTDA-Datasets
This dataset card aims to describe the datasets used in the KTDA.
Install
pip install huggingface-hub
Usage
huggingface-cli download --repo-type dataset XavierJiezou/ktda-datasets --local-dir data --include grass.zip
huggingface-cli download --repo-type dataset XavierJiezou/ktda-datasets --local-dir data --include cloud.zip
unzip grass.zip -d grass
unzip cloud.zip -d l8_biome… See the full description on the dataset page: https://huggingface.co/datasets/XavierJiezou/ktda-datasets.
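If you prefer staying in Python, a sketch that downloads and extracts one of the archives (the second archive works the same way):
import zipfile
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="XavierJiezou/ktda-datasets",
    filename="grass.zip",
    repo_type="dataset",
    local_dir="data",
)
with zipfile.ZipFile(path) as zf:
    zf.extractall("data/grass")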
from datasets import load_dataset
# Login using e.g. huggingface-cli login to access this dataset
ds = load_dataset("huggingface/transformers-metadata", "frameworks")
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Quick start
The easiest way to download the dataset to your local machine is to use huggingface-cli. The specific command is:
huggingface-cli download zifeng-ai/TrialPanorama-database --local-dir LOCAL_DIR --repo-type dataset
where LOCAL_DIR should be replaced with the target directory you want to save the dataset to.
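A Python equivalent of that command, as a sketch:
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="zifeng-ai/TrialPanorama-database",
    repo_type="dataset",
    local_dir="LOCAL_DIR",  # replace with your target directory
)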
Update history
Aug. 4, 2025: updated tables with the full set of studies
Dataset website: https://ryanwangzf.github.io/projects/trialpanorama… See the full description on the dataset page: https://huggingface.co/datasets/zifeng-ai/TrialPanorama-database.
CC0 1.0 https://choosealicense.com/licenses/cc0-1.0/
Experienced
ML training data for the Roman Microlensing Data Challenge 2025 - Experienced tier.
Uploading
CLI:
brew install huggingface-cli
hf auth login
hf upload RGES-PIT/Experienced . --repo-type=dataset
Python:
import os
from huggingface_hub import HfApi

api = HfApi(token=os.getenv("HF_TOKEN"))
api.upload_folder(
    folder_path="/path/to/local/dataset"… See the full description on the dataset page: https://huggingface.co/datasets/RGES-PIT/Experienced.
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Dataset Summary
Healthy CT data for abdominal organs (liver, pancreas, and kidney) are filtered from public datasets.
Downloading Instructions
1- Install the Hugging Face library:
pip install -U "huggingface_hub[cli]"
2- Download the dataset:
mkdir HealthyCT
cd HealthyCT
huggingface-cli download qicq1c/HealthyCT --repo-type dataset --local-dir . --cache-dir ./cache
[Optional] Resume downloading
In case you had a previous interrupted download… See the full description on the dataset page: https://huggingface.co/datasets/qicq1c/HealthyCT.
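To resume, re-running the same download is usually enough, since huggingface_hub reuses files already completed in the cache; a Python sketch of the same download:
from huggingface_hub import snapshot_download

# Re-running resumes: completed files are reused from ./cache
snapshot_download(
    repo_id="qicq1c/HealthyCT",
    repo_type="dataset",
    local_dir=".",
    cache_dir="./cache",
)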
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
Fastmap evaluation suite.
You only need the databases to run fastmap. Download the images if you want to produce colored point clouds. Download the subset of the data you want to your local directory:
huggingface-cli download whc/fastmap_sfm --repo-type dataset --local-dir ./ --include 'databases/tnt_*' 'ground_truths/tnt_*'
or use the Python interface:
from huggingface_hub import hf_hub_download, snapshot_download
snapshot_download(
    repo_id="whc/fastmap_sfm",
    repo_type='dataset'… See the full description on the dataset page: https://huggingface.co/datasets/whc/fastmap_sfm.
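A completed form of that call might look like this; the allow_patterns filter is an assumption carried over from the CLI --include flags above:
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="whc/fastmap_sfm",
    repo_type="dataset",
    local_dir="./",
    allow_patterns=["databases/tnt_*", "ground_truths/tnt_*"],  # assumed, mirrors --include
)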
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
This is the dataset repository used in the pyiqa toolbox. Please refer to Awesome Image Quality Assessment for details of each dataset. Example command-line script with huggingface-cli:
huggingface-cli download chaofengc/IQA-PyTorch-Datasets live.tgz --local-dir ./datasets --repo-type dataset
cd datasets
tar -xzvf live.tgz
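The same fetch-and-extract flow in Python, as a sketch:
import tarfile
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="chaofengc/IQA-PyTorch-Datasets",
    filename="live.tgz",
    repo_type="dataset",
    local_dir="./datasets",
)
with tarfile.open(path, "r:gz") as tar:
    tar.extractall("./datasets")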
Disclaimer for This Dataset Collection
This collection of datasets is compiled and maintained for academic, research, and educational… See the full description on the dataset page: https://huggingface.co/datasets/chaofengc/IQA-PyTorch-Datasets.
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
1X World Model Compression Challenge Dataset
This repository hosts the dataset for the 1X World Model Compression Challenge.
huggingface-cli download 1x-technologies/worldmodel --repo-type dataset --local-dir data
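Or, from Python, a minimal sketch of the same download:
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="1x-technologies/worldmodel",
    repo_type="dataset",
    local_dir="data",
)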
Updates Since v1.1
Train/Val v2.0 (~100 hours), replacing v1.1
Test v2.0 dataset for the Compression Challenge
Faces blurred for privacy
New raw video dataset (CC-BY-NC-SA 4.0) at worldmodel_raw_data
Example scripts now split into: cosmos_video_decoder.py —… See the full description on the dataset page: https://huggingface.co/datasets/1x-technologies/world_model_tokenized_data.
Pansharpening-Datasets
This dataset card aims to describe the datasets used in the Pansharpening.
Install
pip install huggingface-hub
Usage
huggingface-cli download --repo-type dataset XavierJiezou/pansharpening-datasets --local-dir data --include PanBench.zip
unzip PanBench.zip -d PanBench
Citation
@Article{cmfnet,
  AUTHOR = {Wang, Shiying and Zou, Xuechao and Li, Kai and Xing, Junliang and… See the full description on the dataset page: https://huggingface.co/datasets/XavierJiezou/pansharpening-datasets.
Other https://choosealicense.com/licenses/other/
COCO 2017 mirror
This is just a mirror of the raw COCO dataset files, for convenience. You have to download it using something like:
pip install huggingface_hub
huggingface-cli download --local-dir coco-2017 pcuenq/coco-2017-mirror
And then unzip the files before use.
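A Python sketch of the same flow, assuming the repo contains the standard COCO zip archives (the exact file names are not listed here):
import zipfile
from pathlib import Path
from huggingface_hub import snapshot_download

local = snapshot_download(repo_id="pcuenq/coco-2017-mirror", repo_type="dataset", local_dir="coco-2017")
for z in Path(local).glob("*.zip"):  # unzip every archive next to itself
    with zipfile.ZipFile(z) as zf:
        zf.extractall(Path(local) / z.stem)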