81 datasets found

google-colab
huggingface.co
Cite
Taylor Christian, google-colab [Dataset]. https://huggingface.co/datasets/taylorbobaylor/google-colab
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Taylor Christian
Description
taylorbobaylor/google-colab dataset hosted on Hugging Face and contributed by the HF Datasets community
test-for-colab
huggingface.co
Updated Sep 12, 2024
Cite
Diego Crescenti (2024). test-for-colab [Dataset]. https://huggingface.co/datasets/dcrescentiai/test-for-colab
Croissant
Dataset updated
Sep 12, 2024
Authors
Diego Crescenti
Description
dcrescentiai/test-for-colab dataset hosted on Hugging Face and contributed by the HF Datasets community
my-colab-upload
huggingface.co
Updated Jul 27, 2025
Cite
Nitin ekka (2025). my-colab-upload [Dataset]. https://huggingface.co/datasets/Nitin12340/my-colab-upload
Dataset updated
Jul 27, 2025
Authors
Nitin ekka
Description
Nitin12340/my-colab-upload dataset hosted on Hugging Face and contributed by the HF Datasets community
Colab
huggingface.co
Updated Apr 13, 2024
Cite
Polomanner (2024). Colab [Dataset]. https://huggingface.co/datasets/Poloman/Colab
Dataset updated
Apr 13, 2024
Authors
Polomanner
License
https://choosealicense.com/licenses/openrail/
Description
Poloman/Colab dataset hosted on Hugging Face and contributed by the HF Datasets community
gigaspeech
huggingface.co
opendatalab.com
+ more versions
Cite
SpeechColab, gigaspeech [Dataset]. https://huggingface.co/datasets/speechcolab/gigaspeech
Dataset authored and provided by
SpeechColab
License
Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
License information was derived automatically
Description
GigaSpeech is an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervised training. Around 40,000 hours of transcribed audio is first collected from audiobooks, podcasts and YouTube, covering both read and spontaneous speaking styles, and a variety of topics, such as arts, science, sports, etc. A new forced alignment and segmentation pipeline is proposed to create sentence segments suitable for speech recognition training, and to filter out segments with low-quality transcription. For system training, GigaSpeech provides five subsets of different sizes, 10h, 250h, 1000h, 2500h, and 10000h. For our 10,000-hour XL training subset, we cap the word error rate at 4% during the filtering/validation stage, and for all our other smaller training subsets, we cap it at 0%. The DEV and TEST evaluation sets, on the other hand, are re-processed by professional human transcribers to ensure high transcription quality.
colab
huggingface.co
Updated Feb 8, 2024
Cite
ITAMAR CDAMASCENO (2024). colab [Dataset]. https://huggingface.co/datasets/itamarcard/colab
Croissant
Dataset updated
Feb 8, 2024
Authors
ITAMAR CDAMASCENO
License
https://choosealicense.com/licenses/openrail/
Description
itamarcard/colab dataset hosted on Hugging Face and contributed by the HF Datasets community
ragas-golden-dataset-colab
huggingface.co
Updated May 11, 2025
Cite
Don Branson (2025). ragas-golden-dataset-colab [Dataset]. https://huggingface.co/datasets/dwb2023/ragas-golden-dataset-colab
Dataset updated
May 11, 2025
Authors
Don Branson
Description
dwb2023/ragas-golden-dataset-colab dataset hosted on Hugging Face and contributed by the HF Datasets community
files-colab
huggingface.co
Updated Jun 26, 2025
Cite
Sajjad Algburi (2025). files-colab [Dataset]. https://huggingface.co/datasets/Sajjadalgburi/files-colab
Dataset updated
Jun 26, 2025
Authors
Sajjad Algburi
Description
Sajjadalgburi/files-colab dataset hosted on Hugging Face and contributed by the HF Datasets community
cagliostro-colab-ui
huggingface.co
Updated Mar 4, 2025
+ more versions
Cite
Victoria (2025). cagliostro-colab-ui [Dataset]. https://huggingface.co/datasets/viksi01/cagliostro-colab-ui
Dataset updated
Mar 4, 2025
Authors
Victoria
Description
viksi01/cagliostro-colab-ui dataset hosted on Hugging Face and contributed by the HF Datasets community
google-collab
huggingface.co
Updated Jun 12, 2025
Cite
JORGE EMILIO DE ALMEIDA NETO (2025). google-collab [Dataset]. https://huggingface.co/datasets/jorgeean1777/google-collab
Dataset updated
Jun 12, 2025
Authors
JORGE EMILIO DE ALMEIDA NETO
Description
jorgeean1777/google-collab dataset hosted on Hugging Face and contributed by the HF Datasets community
evolved-math-problems-from-colab
huggingface.co
Updated Jan 15, 2019
Cite
Masaru Nagaishi (2019). evolved-math-problems-from-colab [Dataset]. https://huggingface.co/datasets/Man-snow/evolved-math-problems-from-colab
Dataset updated
Jan 15, 2019
Authors
Masaru Nagaishi
Description
Man-snow/evolved-math-problems-from-colab dataset hosted on Hugging Face and contributed by the HF Datasets community
sadtalker-colab-assets
huggingface.co
Updated Jul 27, 2025
Cite
Maleeha Asghar (2025). sadtalker-colab-assets [Dataset]. https://huggingface.co/datasets/maleehaasghar/sadtalker-colab-assets
Dataset updated
Jul 27, 2025
Authors
Maleeha Asghar
Description
maleehaasghar/sadtalker-colab-assets dataset hosted on Hugging Face and contributed by the HF Datasets community
n8n-from-colab
huggingface.co
Updated Jun 2, 2025
Cite
OmarElSayed (2025). n8n-from-colab [Dataset]. https://huggingface.co/datasets/omarelsayeed/n8n-from-colab
Dataset updated
Jun 2, 2025
Authors
OmarElSayed
Description
n8n - Secure Workflow Automation for Technical Teams

n8n is a workflow automation platform that gives technical teams the flexibility of code with the speed of no-code. With 400+ integrations, native AI capabilities, and a fair-code license, n8n lets you build powerful automations while maintaining full control over your data and deployments.

Key Capabilities

Code When You Need It: Write JavaScript/Python, add npm packages, or use the visual interface
AI-Native…

See the full description on the dataset page: https://huggingface.co/datasets/omarelsayeed/n8n-from-colab.
cagliostro-colab-ui
huggingface.co
Updated Jun 29, 2023
Cite
Ssanto (2023). cagliostro-colab-ui [Dataset]. https://huggingface.co/datasets/Jokoasa/cagliostro-colab-ui
Croissant
Dataset updated
Jun 29, 2023
Authors
Ssanto
Description
Jokoasa/cagliostro-colab-ui dataset hosted on Hugging Face and contributed by the HF Datasets community
wav2vec2-base-lj-demo-colab
huggingface.co
Updated Oct 5, 2022
Cite
Mohamed Illiyas (2022). wav2vec2-base-lj-demo-colab [Dataset]. https://huggingface.co/datasets/mohamed-illiyas/wav2vec2-base-lj-demo-colab
Croissant
Dataset updated
Oct 5, 2022
Authors
Mohamed Illiyas
Description
mohamed-illiyas/wav2vec2-base-lj-demo-colab dataset hosted on Hugging Face and contributed by the HF Datasets community
playpen-data
huggingface.co
Updated Jun 14, 2025
Cite
Computational-Linguistics-Potsdam (2025). playpen-data [Dataset]. https://huggingface.co/datasets/colab-potsdam/playpen-data
Dataset updated
Jun 14, 2025
Dataset authored and provided by
Computational-Linguistics-Potsdam
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Interactions Dataset

We created the interactions dataset from all model interactions recorded in https://github.com/clembench/clembench-runs.git for version v2.0. The dataset is structured as a conversational dataset in which each sample specifies a list of messages. These messages usually alternate between roles, that is, between a user and an assistant, and carry textual content. Furthermore, we added to each sample a meta annotation that informs about game, experiment, task_id… See the full description on the dataset page: https://huggingface.co/datasets/colab-potsdam/playpen-data.
musicnet_jukebox_embeddings
huggingface.co
Updated Oct 26, 2024
Cite
Jon Flynn (2024). musicnet_jukebox_embeddings [Dataset]. https://huggingface.co/datasets/jonflynn/musicnet_jukebox_embeddings
Croissant
Dataset updated
Oct 26, 2024
Authors
Jon Flynn
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Jukebox Embeddings for MusicNet Dataset

Repo with Colab notebook used to extract the embeddings.

Overview

This dataset extends the MusicNet Dataset by providing embeddings for each audio file.

Original MusicNet Dataset

Link to original dataset

Jukebox Embeddings

Embeddings are derived from OpenAI's Jukebox model, following the approach described in Castellon et al. (2021), with some modifications that follow Spotify's Llark paper:

Source: Output of… See the full description on the dataset page: https://huggingface.co/datasets/jonflynn/musicnet_jukebox_embeddings.
cagliostro-colab-ui
huggingface.co
Updated May 31, 2023
Cite
BroJoko (2023). cagliostro-colab-ui [Dataset]. https://huggingface.co/datasets/JokoSusiloA/cagliostro-colab-ui
Croissant
Dataset updated
May 31, 2023
Authors
BroJoko
Description
JokoSusiloA/cagliostro-colab-ui dataset hosted on Hugging Face and contributed by the HF Datasets community
cagliostro-colab-ui-sktch1
huggingface.co
Updated Aug 20, 2023
+ more versions
Cite
depana (2023). cagliostro-colab-ui-sktch1 [Dataset]. https://huggingface.co/datasets/kiluade/cagliostro-colab-ui-sktch1
Croissant
Dataset updated
Aug 20, 2023
Authors
depana
Description
kiluade/cagliostro-colab-ui-sktch1 dataset hosted on Hugging Face and contributed by the HF Datasets community
cagliostro-colab-ui-gym-track-jacket
huggingface.co
Updated Jul 5, 2023
Cite
depana (2023). cagliostro-colab-ui-gym-track-jacket [Dataset]. https://huggingface.co/datasets/kiluade/cagliostro-colab-ui-gym-track-jacket
Croissant
Dataset updated
Jul 5, 2023
Authors
depana
Description
kiluade/cagliostro-colab-ui-gym-track-jacket dataset hosted on Hugging Face and contributed by the HF Datasets community