81 datasets found
  1. h

    google-colab

    • huggingface.co
    Cite
    Taylor Christian, google-colab [Dataset]. https://huggingface.co/datasets/taylorbobaylor/google-colab
    Authors
    Taylor Christian
    Description

    taylorbobaylor/google-colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. h

    test-for-colab

    • huggingface.co
    Updated Sep 12, 2024
    Cite
    Diego Crescenti (2024). test-for-colab [Dataset]. https://huggingface.co/datasets/dcrescentiai/test-for-colab
    Dataset updated
    Sep 12, 2024
    Authors
    Diego Crescenti
    Description

    dcrescentiai/test-for-colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. h

    my-colab-upload

    • huggingface.co
    Updated Jul 27, 2025
    Cite
    Nitin ekka (2025). my-colab-upload [Dataset]. https://huggingface.co/datasets/Nitin12340/my-colab-upload
    Dataset updated
    Jul 27, 2025
    Authors
    Nitin ekka
    Description

    Nitin12340/my-colab-upload dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. h

    Colab

    • huggingface.co
    Updated Apr 13, 2024
    Cite
    Polomanner (2024). Colab [Dataset]. https://huggingface.co/datasets/Poloman/Colab
    Dataset updated
    Apr 13, 2024
    Authors
    Polomanner
    License

    https://choosealicense.com/licenses/openrail/

    Description

    Poloman/Colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    gigaspeech

    • huggingface.co
    • opendatalab.com
    Cite
    SpeechColab, gigaspeech [Dataset]. https://huggingface.co/datasets/speechcolab/gigaspeech
    Dataset authored and provided by
    SpeechColab
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    GigaSpeech is an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervised training. Around 40,000 hours of transcribed audio is first collected from audiobooks, podcasts and YouTube, covering both read and spontaneous speaking styles, and a variety of topics, such as arts, science, sports, etc. A new forced alignment and segmentation pipeline is proposed to create sentence segments suitable for speech recognition training, and to filter out segments with low-quality transcription. For system training, GigaSpeech provides five subsets of different sizes, 10h, 250h, 1000h, 2500h, and 10000h. For our 10,000-hour XL training subset, we cap the word error rate at 4% during the filtering/validation stage, and for all our other smaller training subsets, we cap it at 0%. The DEV and TEST evaluation sets, on the other hand, are re-processed by professional human transcribers to ensure high transcription quality.

  6. h

    colab

    • huggingface.co
    Updated Feb 8, 2024
    Cite
    ITAMAR CDAMASCENO (2024). colab [Dataset]. https://huggingface.co/datasets/itamarcard/colab
    Dataset updated
    Feb 8, 2024
    Authors
    ITAMAR CDAMASCENO
    License

    https://choosealicense.com/licenses/openrail/

    Description

    itamarcard/colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    ragas-golden-dataset-colab

    • huggingface.co
    Updated May 11, 2025
    Cite
    Don Branson (2025). ragas-golden-dataset-colab [Dataset]. https://huggingface.co/datasets/dwb2023/ragas-golden-dataset-colab
    Dataset updated
    May 11, 2025
    Authors
    Don Branson
    Description

    dwb2023/ragas-golden-dataset-colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    files-colab

    • huggingface.co
    Updated Jun 26, 2025
    Cite
    Sajjad Algburi (2025). files-colab [Dataset]. https://huggingface.co/datasets/Sajjadalgburi/files-colab
    Dataset updated
    Jun 26, 2025
    Authors
    Sajjad Algburi
    Description

    Sajjadalgburi/files-colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    cagliostro-colab-ui

    • huggingface.co
    Updated Mar 4, 2025
    Cite
    Victoria (2025). cagliostro-colab-ui [Dataset]. https://huggingface.co/datasets/viksi01/cagliostro-colab-ui
    Dataset updated
    Mar 4, 2025
    Authors
    Victoria
    Description

    viksi01/cagliostro-colab-ui dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    google-collab

    • huggingface.co
    Updated Jun 12, 2025
    Cite
    JORGE EMILIO DE ALMEIDA NETO (2025). google-collab [Dataset]. https://huggingface.co/datasets/jorgeean1777/google-collab
    Dataset updated
    Jun 12, 2025
    Authors
    JORGE EMILIO DE ALMEIDA NETO
    Description

    jorgeean1777/google-collab dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    evolved-math-problems-from-colab

    • huggingface.co
    Updated Jan 15, 2019
    Cite
    Masaru Nagaishi (2019). evolved-math-problems-from-colab [Dataset]. https://huggingface.co/datasets/Man-snow/evolved-math-problems-from-colab
    Dataset updated
    Jan 15, 2019
    Authors
    Masaru Nagaishi
    Description

    Man-snow/evolved-math-problems-from-colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    sadtalker-colab-assets

    • huggingface.co
    Updated Jul 27, 2025
    Cite
    Maleeha Asghar (2025). sadtalker-colab-assets [Dataset]. https://huggingface.co/datasets/maleehaasghar/sadtalker-colab-assets
    Dataset updated
    Jul 27, 2025
    Authors
    Maleeha Asghar
    Description

    maleehaasghar/sadtalker-colab-assets dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    n8n-from-colab

    • huggingface.co
    Updated Jun 2, 2025
    Cite
    OmarElSayed (2025). n8n-from-colab [Dataset]. https://huggingface.co/datasets/omarelsayeed/n8n-from-colab
    Dataset updated
    Jun 2, 2025
    Authors
    OmarElSayed
    Description

    n8n - Secure Workflow Automation for Technical Teams

    n8n is a workflow automation platform that gives technical teams the flexibility of code with the speed of no-code. With 400+ integrations, native AI capabilities, and a fair-code license, n8n lets you build powerful automations while maintaining full control over your data and deployments.

      Key Capabilities
    

    Code When You Need It: Write JavaScript/Python, add npm packages, or use the visual interface AI-Native… See the full description on the dataset page: https://huggingface.co/datasets/omarelsayeed/n8n-from-colab.

  14. h

    cagliostro-colab-ui

    • huggingface.co
    Updated Jun 29, 2023
    Cite
    Ssanto (2023). cagliostro-colab-ui [Dataset]. https://huggingface.co/datasets/Jokoasa/cagliostro-colab-ui
    Dataset updated
    Jun 29, 2023
    Authors
    Ssanto
    Description

    Jokoasa/cagliostro-colab-ui dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    wav2vec2-base-lj-demo-colab

    • huggingface.co
    Updated Oct 5, 2022
    Cite
    Mohamed Illiyas (2022). wav2vec2-base-lj-demo-colab [Dataset]. https://huggingface.co/datasets/mohamed-illiyas/wav2vec2-base-lj-demo-colab
    Dataset updated
    Oct 5, 2022
    Authors
    Mohamed Illiyas
    Description

    mohamed-illiyas/wav2vec2-base-lj-demo-colab dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    playpen-data

    • huggingface.co
    Updated Jun 14, 2025
    Cite
    Computational-Linguistics-Potsdam (2025). playpen-data [Dataset]. https://huggingface.co/datasets/colab-potsdam/playpen-data
    Dataset updated
    Jun 14, 2025
    Dataset authored and provided by
    Computational-Linguistics-Potsdam
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Description

    Interactions Dataset

    We created the interactions dataset from all model interactions recorded in https://github.com/clembench/clembench-runs.git for version v2.0. The dataset is structured as a conversational dataset that contains samples that specify a list of messages. These messages usually iterate on roles, that is, between a user and an assistant, and carry textual content. Furthermore, we added to each sample a meta annotation that informs about game, experiment, task_id… See the full description on the dataset page: https://huggingface.co/datasets/colab-potsdam/playpen-data.

  17. h

    musicnet_jukebox_embeddings

    • huggingface.co
    Updated Oct 26, 2024
    Cite
    Jon Flynn (2024). musicnet_jukebox_embeddings [Dataset]. https://huggingface.co/datasets/jonflynn/musicnet_jukebox_embeddings
    Dataset updated
    Oct 26, 2024
    Authors
    Jon Flynn
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Jukebox Embeddings for MusicNet Dataset

    Repo with Colab notebook used to extract the embeddings.

      Overview
    

    This dataset extends the MusicNet Dataset by providing embeddings for each audio file.

      Original MusicNet Dataset
    

    Link to original dataset

      Jukebox Embeddings
    

    Embeddings are derived from OpenAI's Jukebox model, following the approach described in Castellon et al. (2021) with some modifications followed in Spotify's Llark paper:

    Source: Output of… See the full description on the dataset page: https://huggingface.co/datasets/jonflynn/musicnet_jukebox_embeddings.

  18. h

    cagliostro-colab-ui

    • huggingface.co
    Updated May 31, 2023
    Cite
    BroJoko (2023). cagliostro-colab-ui [Dataset]. https://huggingface.co/datasets/JokoSusiloA/cagliostro-colab-ui
    Dataset updated
    May 31, 2023
    Authors
    BroJoko
    Description

    JokoSusiloA/cagliostro-colab-ui dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    cagliostro-colab-ui-sktch1

    • huggingface.co
    Updated Aug 20, 2023
    Cite
    depana (2023). cagliostro-colab-ui-sktch1 [Dataset]. https://huggingface.co/datasets/kiluade/cagliostro-colab-ui-sktch1
    Dataset updated
    Aug 20, 2023
    Authors
    depana
    Description

    kiluade/cagliostro-colab-ui-sktch1 dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    cagliostro-colab-ui-gym-track-jacket

    • huggingface.co
    Updated Jul 5, 2023
    Cite
    depana (2023). cagliostro-colab-ui-gym-track-jacket [Dataset]. https://huggingface.co/datasets/kiluade/cagliostro-colab-ui-gym-track-jacket
    Dataset updated
    Jul 5, 2023
    Authors
    depana
    Description

    kiluade/cagliostro-colab-ui-gym-track-jacket dataset hosted on Hugging Face and contributed by the HF Datasets community

