58 datasets found
  1. ClipModel

    • kaggle.com
    Updated Apr 22, 2023
    Cite
    Meher Deepak-2005 (2023). ClipModel [Dataset]. https://www.kaggle.com/datasets/meherdeepak2005/clipmodel/code
    Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 22, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Meher Deepak-2005
    Description

    Dataset

    This dataset was created by Meher Deepak-2005

    Contents

  2. clip-vit

    • kaggle.com
    Updated Apr 19, 2023
    Cite
    XuChenLong (2023). clip-vit [Dataset]. https://www.kaggle.com/datasets/xuchenlong/clip-vit
    Dataset updated
    Apr 19, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    XuChenLong
    Description

    Dataset

    This dataset was created by XuChenLong

    Contents

  3. clip_l.safetensors

    • kaggle.com
    Updated Apr 30, 2025
    Cite
    Heqing_HappyStar (2025). clip_l.safetensors [Dataset]. https://www.kaggle.com/datasets/heqinghappystar/clip-l-safetensors/discussion
    Dataset updated
    Apr 30, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Heqing_HappyStar
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Heqing_HappyStar

    Released under CC0: Public Domain

    Contents

  4. OpenAI CLIP

    • kaggle.com
    Updated Mar 10, 2021
    Cite
    Bruno G. do Amaral (2021). OpenAI CLIP [Dataset]. https://www.kaggle.com/bguberfain/openai-clip/code
    Dataset updated
    Mar 10, 2021
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Bruno G. do Amaral
    Description

    Dataset

    This dataset was created by Bruno G. do Amaral

    Contents

  5. clip-vit-base-patch32

    • kaggle.com
    Updated Sep 16, 2023
    Cite
    LYLyyds (2023). clip-vit-base-patch32 [Dataset]. https://www.kaggle.com/datasets/lylyyds/clip-vit-base-patch32
    Dataset updated
    Sep 16, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    LYLyyds
    Description

    Dataset

    This dataset was created by LYLyyds

    Contents

  6. clip-frames-288

    • kaggle.com
    Updated Sep 17, 2022
    Cite
    Camaro (2022). clip-frames-288 [Dataset]. https://www.kaggle.com/datasets/bamps53/clip-frames-288/code
    Dataset updated
    Sep 17, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Camaro
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Camaro

    Released under CC0: Public Domain

    Contents

  7. OpenAI Clip Model and Processor

    • kaggle.com
    Updated Apr 28, 2023
    Cite
    Vinicius Suaiden (2023). OpenAI Clip Model and Processor [Dataset]. https://www.kaggle.com/datasets/viniciussuaiden/openai-clip-model-and-processor/code
    Dataset updated
    Apr 28, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Vinicius Suaiden
    Description

    Dataset

    This dataset was created by Vinicius Suaiden

    Contents

  8. OpenAI-CLIP weights

    • kaggle.com
    Updated Aug 4, 2022
    Cite
    The Devastator (2022). OpenAI-CLIP weights [Dataset]. https://www.kaggle.com/datasets/thedevastator/openaiclip-weights
    Dataset updated
    Aug 4, 2022
    Dataset provided by
    Kaggle
    Authors
    The Devastator
    Description

    This dataset contains the official pretrained weights of CLIP, released by OpenAI.

  9. LJSpeech sr16k Dataset

    • kaggle.com
    Updated Sep 13, 2023
    Cite
    Awsaf (2023). LJSpeech sr16k Dataset [Dataset]. https://www.kaggle.com/datasets/awsaf49/ljspeech-sr16k-dataset
    Dataset updated
    Sep 13, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Awsaf
    Description

    The LJ Speech Dataset

    This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours.

    The texts were published between 1884 and 1964, and are in the public domain. The audio was recorded in 2016-17 by the LibriVox project and is also in the public domain.

    File Format

    Metadata is provided in transcripts.csv. This file consists of one record per line, delimited by the pipe character (0x7c). The fields are:

    • ID: the name of the corresponding .wav file
    • Transcription: words spoken by the reader (UTF-8)
    • Normalized Transcription: transcription with numbers, ordinals, and monetary units expanded into full words (UTF-8)
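
The record layout described above can be read with Python's standard csv module. This is a minimal sketch, not the dataset author's loader: the `Transcript` type, the `read_transcripts` helper, and the two sample records are hypothetical, and it assumes every record has exactly three pipe-separated fields.

```python
import csv
import io
from typing import NamedTuple


class Transcript(NamedTuple):
    clip_id: str      # name of the corresponding .wav file
    text: str         # words spoken by the reader
    normalized: str   # numbers, ordinals, and monetary units spelled out


def read_transcripts(stream) -> list[Transcript]:
    """Parse pipe-delimited records, one per line, into Transcript tuples."""
    # QUOTE_NONE: transcriptions may contain literal quote characters,
    # so quotes must not be interpreted as field wrappers.
    reader = csv.reader(stream, delimiter="|", quoting=csv.QUOTE_NONE)
    # Rows that do not have exactly three fields are skipped (an assumption).
    return [Transcript(*row) for row in reader if len(row) == 3]


# Hypothetical two-record sample standing in for transcripts.csv.
sample = io.StringIO(
    'LJ000-0001|He paid $5 in 1884.|He paid five dollars in eighteen eighty-four.\n'
    'LJ000-0002|She said "no."|She said "no."\n'
)
records = read_transcripts(sample)
```

For the real file, the same function could be called on `open("transcripts.csv", encoding="utf-8", newline="")`, since the fields are stated to be UTF-8.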

    Each audio file is a single-channel 16-bit PCM WAV with a sample rate of 22050 Hz (about 22 kHz).
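
The audio properties stated above can be confirmed from a file's WAV header using the standard wave module. Because the listing title mentions "sr16k", checking the header is a cheap way to see which sample rate a downloaded file actually has. The sketch below is illustrative only: `wav_properties` is a hypothetical helper, and the in-memory silent clip stands in for a real file from the dataset.

```python
import io
import wave


def wav_properties(stream) -> dict:
    """Return channel count, bit depth, and sample rate of a WAV stream."""
    with wave.open(stream, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "bits": wav.getsampwidth() * 8,   # sample width is reported in bytes
            "sample_rate": wav.getframerate(),
        }


# Build a one-second silent clip in memory with the stated properties,
# as a stand-in for a real file from the dataset.
buffer = io.BytesIO()
with wave.open(buffer, "wb") as out:
    out.setnchannels(1)                    # single channel
    out.setsampwidth(2)                    # 2 bytes = 16-bit PCM
    out.setframerate(22050)                # sample rate in Hz
    out.writeframes(b"\x00\x00" * 22050)   # 22050 silent frames
buffer.seek(0)

props = wav_properties(buffer)
```

`wave.open` also accepts a file path, so `wav_properties("some_clip.wav")` would work the same way on a file on disk.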

    Statistics

    • Total Clips: 13,100
    • Total Words: 225,715
    • Total Characters: 1,308,678
    • Total Duration: 23:55:17
    • Mean Clip Duration: 6.57 sec
    • Min Clip Duration: 1.11 sec
    • Max Clip Duration: 10.10 sec
    • Mean Words per Clip: 17.23
    • Distinct Words: 13,821
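
The two means in the list above follow from the totals. As a quick worked check in Python (variable names are illustrative; the figures are copied from the list):

```python
total_clips = 13_100
total_words = 225_715

# Total duration 23:55:17 converted to seconds.
hours, minutes, seconds = 23, 55, 17
total_seconds = hours * 3600 + minutes * 60 + seconds

mean_clip_duration = total_seconds / total_clips   # seconds per clip
mean_words_per_clip = total_words / total_clips

print(f"{total_seconds} s total")
print(f"{mean_clip_duration:.2f} s per clip")
print(f"{mean_words_per_clip:.2f} words per clip")
```

Both values round to the listed 6.57 seconds and 17.23 words.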

    Miscellaneous

    The audio clips range in length from approximately 1 second to 10 seconds. They were segmented automatically based on silences in the recording. Clip boundaries generally align with sentence or clause boundaries, but not always. The text was matched to the audio manually, and a QA pass was done to ensure that the text accurately matched the words spoken in the audio.

    The original LibriVox recordings were distributed as 128 kbps MP3 files. As a result, they may contain artifacts introduced by the MP3 encoding.

    The following abbreviations appear in the text. They may be expanded as follows:

      Abbreviation  Expansion
      Mr.           Mister
      Mrs.          Misess (*)
      Dr.           Doctor
      No.           Number
      St.           Saint
      Co.           Company
      Jr.           Junior
      Maj.          Major
      Gen.          General
      Drs.          Doctors
      Rev.          Reverend
      Lt.           Lieutenant
      Hon.          Honorable
      Sgt.          Sergeant
      Capt.         Captain
      Esq.          Esquire
      Ltd.          Limited
      Col.          Colonel
      Ft.           Fort
    

    (*) There is no standard expansion for "Mrs."

    19 of the transcriptions contain non-ASCII characters (for example, LJ016-0257 contains "raison d'être").

    Example code using this dataset to train a speech synthesis model can be found at: github.com/keithito/tacotron.

    For more information or to report errors, please email kito@kito.us.
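
The abbreviation table above can be applied to a transcription with a simple lookup. The following is a minimal sketch, not the normalization code used to produce the dataset: `ABBREVIATIONS`, `expand_abbreviations`, and the sample sentence are hypothetical, and the match is case-sensitive with no attempt to disambiguate (for example, "St." is always expanded to "Saint").

```python
import re

# Expansions copied from the table above; "Misess" is the listing's own spelling.
ABBREVIATIONS = {
    "Mr.": "Mister", "Mrs.": "Misess", "Dr.": "Doctor", "No.": "Number",
    "St.": "Saint", "Co.": "Company", "Jr.": "Junior", "Maj.": "Major",
    "Gen.": "General", "Drs.": "Doctors", "Rev.": "Reverend",
    "Lt.": "Lieutenant", "Hon.": "Honorable", "Sgt.": "Sergeant",
    "Capt.": "Captain", "Esq.": "Esquire", "Ltd.": "Limited",
    "Col.": "Colonel", "Ft.": "Fort",
}

# The leading \b keeps an abbreviation from matching inside a longer word.
_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(abbr) for abbr in ABBREVIATIONS) + r")"
)


def expand_abbreviations(text: str) -> str:
    """Replace each listed abbreviation with its expansion."""
    return _PATTERN.sub(lambda match: ABBREVIATIONS[match.group(1)], text)


expanded = expand_abbreviations("Mrs. Smith met Dr. Jones at Ft. Knox.")
```

Because each key includes its trailing period, "Mr." cannot match inside "Mrs.", so no ordering of the alternatives is needed.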

  10. Data from: Clip Weights

    • kaggle.com
    Updated Aug 4, 2022
    Cite
    The Devastator (2022). Clip Weights [Dataset]. https://www.kaggle.com/datasets/thedevastator/clip-weights
    Dataset updated
    Aug 4, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    The Devastator
    Description

    Dataset

    This dataset was created by The Devastator

    Contents

  11. open-clip-wheels

    • kaggle.com
    Updated Feb 16, 2023
    Cite
    Leonid Kulyk (2023). open-clip-wheels [Dataset]. https://www.kaggle.com/datasets/leonidkulyk/open-clip-wheels/discussion?sort=undefined
    Dataset updated
    Feb 16, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Leonid Kulyk
    Description

    Dataset

    This dataset was created by Leonid Kulyk

    Contents

  12. open-clip-models

    • kaggle.com
    Updated Feb 16, 2023
    Cite
    Leonid Kulyk (2023). open-clip-models [Dataset]. https://www.kaggle.com/leonidkulyk/open-clip-models/discussion
    Dataset updated
    Feb 16, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Leonid Kulyk
    Description

    Dataset

    This dataset was created by Leonid Kulyk

    Contents

  13. OpenAi-clip-base-p32

    • kaggle.com
    Updated Apr 17, 2023
    Cite
    The citation is currently not available for this dataset.
    Dataset updated
    Apr 17, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Kevin(Zeming) Wang
    Description

    Dataset

    This dataset was created by Kevin(Zeming) Wang

    Contents

  14. clip-picture

    • kaggle.com
    Updated Apr 1, 2021
    Cite
    smnd (2021). clip-picture [Dataset]. https://www.kaggle.com/datasets/smdlmr/clippicture
    Dataset updated
    Apr 1, 2021
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    smnd
    Description

    Dataset

    This dataset was created by smnd

    Contents

  15. clip model

    • kaggle.com
    Updated Mar 3, 2025
    + more versions
    Cite
    SeaLeopard (2025). clip model [Dataset]. https://www.kaggle.com/datasets/sealeopard/clip-model/code
    Dataset updated
    Mar 3, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    SeaLeopard
    Description

    Dataset

    This dataset was created by SeaLeopard

    Contents

  16. CLIP-PACKAGE-WEIGHT

    • kaggle.com
    Updated Dec 13, 2023
    Cite
    ForcewithMe (2023). CLIP-PACKAGE-WEIGHT [Dataset]. https://www.kaggle.com/datasets/forcewithme/clip-package-weight/discussion
    Dataset updated
    Dec 13, 2023
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    ForcewithMe
    Description

    Dataset

    This dataset was created by ForcewithMe

    Contents

  17. clip-stylegan

    • kaggle.com
    Updated May 1, 2024
    Cite
    The citation is currently not available for this dataset.
    Dataset updated
    May 1, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    kirito174
    Description

    Dataset

    This dataset was created by kirito174

    Contents

  18. Contrastive Language Image Pretraining by openai

    • kaggle.com
    Updated Apr 18, 2021
    Cite
    The citation is currently not available for this dataset.
    Dataset updated
    Apr 18, 2021
    Dataset provided by
    Kaggle
    Authors
    prakash
    Description

    Dataset

    This dataset was created by prakash

    Released under Data files © Original Authors

    Contents

  19. mix-loss-clip

    • kaggle.com
    Updated Apr 19, 2022
    Cite
    JiangShaoYin (2022). mix-loss-clip [Dataset]. https://www.kaggle.com/datasets/jiangshaoyin/mix-loss-clip/data
    Dataset updated
    Apr 19, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    JiangShaoYin
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by JiangShaoYin

    Released under CC0: Public Domain

    Contents

  20. clip-models

    • kaggle.com
    Updated Aug 17, 2022
    Cite
    Mathurin Aché (2022). clip-models [Dataset]. https://www.kaggle.com/datasets/mathurinache/clip-embedding-extractor
    Dataset updated
    Aug 17, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Mathurin Aché
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0): https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Mathurin Aché

    Released under CC BY-NC-SA 4.0

    Contents


ClipModel

OpenAI Clip model. Source: (GitHub) https://github.com/openai/CLIP

44 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Apr 22, 2023
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
Meher Deepak-2005
Description

Dataset

This dataset was created by Meher Deepak-2005

Contents
