This is a mirror of the example dataset from the paper "The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation" by Manco et al.
Project page on GitHub: https://github.com/mulab-mir/song-describer-dataset
Dataset on Zenodo: https://zenodo.org/records/10072001
Explore the dataset on your local machine:
import datasets
from renumics import spotlight
ds = datasets.load_dataset('renumics/song-describer-dataset')
spotlight.show(ds)
Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Dataset Card
This dataset contains a single Hugging Face split, named 'all_samples'. Each sample contains a single Hugging Face feature, named "sample". Samples are instances of plaid.containers.sample.Sample. Mesh objects included in samples follow the CGNS standard and can be converted to Muscat.Containers.Mesh.Mesh. Example of commands:
from datasets import load_dataset
from plaid.containers.sample import Sample
import pickle
hf_dataset =… See the full description on the dataset page: https://huggingface.co/datasets/PLAID-datasets/2D_ElastoPlastoDynamics.
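The decode step implied by the imports above (unpickling each row's "sample" feature) can be sketched without downloading anything. This is a minimal, self-contained illustration: the dict below is a hypothetical stand-in for a real plaid Sample, and no plaid or dataset download is assumed.

```python
import pickle

# Hypothetical stand-in row: the card says every row of the 'all_samples'
# split has one feature, "sample", holding a pickled object.
row = {"sample": pickle.dumps({"mesh": "CGNS tree placeholder", "time": 0.0})}

# Decoding mirrors what you would do with hf_dataset[i]["sample"].
decoded = pickle.loads(row["sample"])
print(decoded["time"])  # 0.0
```

With the real dataset, the bytes decode to a plaid.containers.sample.Sample rather than a plain dict.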
https://choosealicense.com/licenses/other/
Terms of Use
By using the dataset, you agree to comply with the dataset license (CC BY 4.0 Deed).
Download Instructions
To download one file, please use:
from huggingface_hub import hf_hub_download
local_directory = 'LOCAL_DIRECTORY'
filepath = 'FILE_PATH'
repo_id = "climateset/climateset"
repo_type = "dataset"
hf_hub_download(repo_id=repo_id… See the full description on the dataset page: https://huggingface.co/datasets/climateset/climateset.
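When hf_hub_download is given a local_dir, the file is written under that directory at its in-repo relative path, so the landing location is predictable before the download runs. A small sketch using the card's own placeholders ('LOCAL_DIRECTORY' and 'FILE_PATH' are left as-is, not real paths):

```python
import os

# Placeholders from the card; with hf_hub_download(..., local_dir=local_directory)
# the file ends up at local_directory joined with its in-repo filepath.
local_directory = 'LOCAL_DIRECTORY'
filepath = 'FILE_PATH'
expected_location = os.path.join(local_directory, filepath)
print(expected_location)
```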
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset Card for LAMBADA
Dataset Summary
LAMBADA evaluates the capabilities of computational models for text understanding by means of a word prediction task. LAMBADA is a collection of narrative passages sharing the characteristic that human subjects can guess their last word if they are exposed to the whole passage, but not if they only see the last sentence preceding the target word. To succeed on LAMBADA, computational models cannot simply rely on local… See the full description on the dataset page: https://huggingface.co/datasets/cimec/lambada.
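The last-word prediction setup described above can be sketched with a small helper: the model sees everything up to the final word and must predict that word. The passage below is invented for illustration and is not drawn from the dataset.

```python
def split_target(passage: str) -> tuple[str, str]:
    """Split a passage into its context and final target word,
    mirroring LAMBADA's evaluation: models see the context and
    must predict the target."""
    context, _, target = passage.rstrip().rpartition(" ")
    return context, target

context, target = split_target("she reached the door and slowly turned the handle")
print(target)  # handle
```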
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Partial data from SimXRD (the original dataset is too large to be shared on Hugging Face). Sample data provided for reviewers.
import mlcroissant as mlc
url = "https://huggingface.co/datasets/caobin/SimXRDreview/raw/main/simxrd_croissant.json"
dataset_info = mlc.Dataset(url).metadata.to_json()
print(dataset_info)
from dataset.parse import load_dataset, bar_progress  # defined in our github :… See the full description on the dataset page: https://huggingface.co/datasets/caobin/SimXRDreview.
MIT License: https://opensource.org/licenses/MIT
License information was derived automatically
Fastmap evaluation suite.
You only need the databases to run fastmap; download the images if you want to produce a colored point cloud. Download the subset of data you want to your local directory:
huggingface-cli download whc/fastmap_sfm --repo-type dataset --local-dir ./ --include 'databases/tnt_*' 'ground_truths/tnt_*'
or use the Python interface:
from huggingface_hub import hf_hub_download, snapshot_download
snapshot_download( repo_id="whc/fastmap_sfm", repo_type='dataset'… See the full description on the dataset page: https://huggingface.co/datasets/whc/fastmap_sfm.
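The --include arguments in the CLI command act as shell-style globs over in-repo paths (snapshot_download exposes the same idea via its allow_patterns parameter). A small sketch of that selection logic; the file names below are hypothetical examples, not a listing of the repo:

```python
from fnmatch import fnmatch

patterns = ["databases/tnt_*", "ground_truths/tnt_*"]
repo_files = [
    "databases/tnt_Barn.db",        # matched: a database, enough to run fastmap
    "ground_truths/tnt_Barn.json",  # matched: its ground truth
    "images/tnt_Barn/0001.jpg",     # not matched: images only needed for color
]
selected = [f for f in repo_files if any(fnmatch(f, p) for p in patterns)]
print(selected)
```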
CelebA dataset
A copy of the CelebA dataset: https://mmlab.ie.cuhk.edu.hk/projects/CelebA.html
How to use
Download data
huggingface-cli download --local-dir /path/to/datasets/celeba --repo-type dataset Yuehao/celeba
unzip /path/to/datasets/celeba/img_align_celeba.zip -d /path/to/datasets/celeba
Load data via torchvision.datasets.CelebA
torchvision.datasets.CelebA(root='/path/to/datasets')
How to install?
!pip install datasets -q
from huggingface_hub import snapshot_download
import pandas as pd
import matplotlib.pyplot as plt
snapshot_download(repo_id="Aborevsky01/CLEVR-BT-DB", repo_type="dataset", local_dir='path-to-your-local-dir')
!unzip [path-to-your-local-dir]/[type-of-task]/images.zip
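After unzipping, the matplotlib import above is presumably used to preview images. A minimal sketch of that step: a random array stands in for one extracted image (the real call would be plt.imread on a file under the unzipped images directory, whose names are not given by the card), and the Agg backend is used so this runs headlessly.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless backend; no display needed
import matplotlib.pyplot as plt

# Stand-in for one extracted image; replace with
# plt.imread('<path-to-your-local-dir>/<type-of-task>/images/<file>')
img = np.random.rand(64, 64, 3)

plt.imshow(img)
plt.axis("off")
plt.savefig("clevr_preview.png", bbox_inches="tight")
```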