84 datasets found
  1. Huggingface Modelhub

    • kaggle.com
    zip
    Updated Jun 19, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kartik Godawat (2021). Huggingface Modelhub [Dataset]. https://www.kaggle.com/crazydiv/huggingface-modelhub
    Explore at:
    zip(2274876 bytes)Available download formats
    Dataset updated
    Jun 19, 2021
    Authors
    Kartik Godawat
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    https://huggingface.co/landing/assets/transformers-docs/huggingface_logo.svg" alt="HuggingFace">

    Dataset containing metadata information of all the publicly uploaded models(10,000+) available on HuggingFace model hub Data was collected between 15-20th June 2021.

    Dataset was generated using huggingface_hub APIs provided by huggingface team.

    Update v3:

    • Added Downloads last month metric
    • Added library name

    Contents:

    • huggingface_models.csv : Primary file which contains metadata information like model name, tags, last modified and filenames
    • huggingface_modelcard_readme.csv : Detailed file containing README.md contents if available for a particular model. Content is in markdown format. modelId column joins both the files together. ### huggingface_models.csv
    • modelId: ID of the model as present on HF website
    • lastModified: Time when this model was last modified
    • tags: Tags associated with the model (provided by mantainer)
    • pipeline_tag: If exists, denotes which pipeline this model could be used with
    • files: List of available files in the model repo
    • publishedBy: Custom column derived from modelID, specifying who published this model
    • downloads_last_month: Number of times the model has been downloaded in last month.
    • library: Name of library the model belongs to eg: transformers, spacy, timm etc. ### huggingface_modelcard_readme.csv
    • modelId: ID of the model as available on HF website
    • modelCard: Readme contents of a model (referred to as modelCard in HuggingFace ecoystem). It contains useful information on how the model was trained, benchmarks and author notes. ### Inspiration: The idea of analyzing publicly available models on HugginFace struck me while I was attending a livesession of the amazing transformers course by @LysandreJik. Soon after, I tweeted the team and asked for permission to create such a dataset. Special shoutout to @osanseviero for encouraging and pointing me in the right direction.

    This is my first dataset upload on Kaggle. I hope you like it. :)

  2. h

    AST-Speech-Art

    • huggingface.co
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Art [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Art
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Art dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. Hugging Face Models Metadata

    • kaggle.com
    zip
    Updated Nov 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kumar Saksham (2023). Hugging Face Models Metadata [Dataset]. https://www.kaggle.com/datasets/everydaycodings/hugging-face-models-metadata/discussion
    Explore at:
    zip(8182909 bytes)Available download formats
    Dataset updated
    Nov 30, 2023
    Authors
    Kumar Saksham
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Description:

    This dataset provides metadata for various models available on Hugging Face, a popular platform for sharing and discovering natural language processing (NLP) and machine learning models. The dataset includes information such as model name, author, repository link, image URL, category, star ratings, download statistics, and the last update timestamp.

    Columns: 1. Name: Model name on Hugging Face. 2. Author: Author or organization associated with the model. 3. Repo Link: Link to the model's repository on Hugging Face. 4. Image URL: URL for the model's image/icon. 5. Category: The category or type of model (e.g., Text Generation, Automatic Speech Recognition). 6. Stars: Number of stars the model has received. 7. Downloads: Number of downloads for the model. 8. Last Updated: Timestamp indicating the last update of the model.

    This dataset is valuable for researchers, data scientists, and enthusiasts interested in exploring and analyzing the landscape of pre-trained models on Hugging Face.

  4. h

    AST-Speech-Technology

    • huggingface.co
    Updated Nov 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Technology [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Technology
    Explore at:
    Dataset updated
    Nov 29, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Technology dataset hosted on Hugging Face and contributed by the HF Datasets community

  5. h

    AST-Speech-Nature-and-Documentary

    • huggingface.co
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Nature-and-Documentary [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Nature-and-Documentary
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Nature-and-Documentary dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. h

    AST-Speech-Healty-and-Science

    • huggingface.co
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Healty-and-Science [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Healty-and-Science
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Healty-and-Science dataset hosted on Hugging Face and contributed by the HF Datasets community

  7. h

    AST-Speech-Business

    • huggingface.co
    Updated Dec 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Business [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Business
    Explore at:
    Dataset updated
    Dec 2, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Business dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. h

    AST-Speech-History

    • huggingface.co
    Updated Jun 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2023). AST-Speech-History [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-History
    Explore at:
    Dataset updated
    Jun 19, 2023
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-History dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    AST-Speech-Personal-Development

    • huggingface.co
    Updated Nov 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Personal-Development [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Personal-Development
    Explore at:
    Dataset updated
    Nov 29, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Personal-Development dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. h

    xd-violence-mini-audio-ast-chunked

    • huggingface.co
    Updated Nov 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Isaiah Freeman (2025). xd-violence-mini-audio-ast-chunked [Dataset]. https://huggingface.co/datasets/KingTechnician/xd-violence-mini-audio-ast-chunked
    Explore at:
    Dataset updated
    Nov 18, 2025
    Authors
    Isaiah Freeman
    Description

    KingTechnician/xd-violence-mini-audio-ast-chunked dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. h

    xd-violence-20pct-audio-ast-chunked-correct

    • huggingface.co
    Updated Nov 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Isaiah Freeman (2025). xd-violence-20pct-audio-ast-chunked-correct [Dataset]. https://huggingface.co/datasets/KingTechnician/xd-violence-20pct-audio-ast-chunked-correct
    Explore at:
    Dataset updated
    Nov 20, 2025
    Authors
    Isaiah Freeman
    Description

    KingTechnician/xd-violence-20pct-audio-ast-chunked-correct dataset hosted on Hugging Face and contributed by the HF Datasets community

  12. h

    AST-Music-Data-45K

    • huggingface.co
    Updated Nov 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo-Research (2025). AST-Music-Data-45K [Dataset]. https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-45K
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset authored and provided by
    Vyvo-Research
    Description

    Vyvo-Research/AST-Music-Data-45K dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    ast-titles

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rodrigo Asth, ast-titles [Dataset]. https://huggingface.co/datasets/rodrigoasth/ast-titles
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Rodrigo Asth
    Description

    rodrigoasth/ast-titles dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. h

    AST-Speech-Sport

    • huggingface.co
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-Sport [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Sport
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-Sport dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. h

    ast-deneme

    • huggingface.co
    Updated Dec 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aynur Susuz (2025). ast-deneme [Dataset]. https://huggingface.co/datasets/Aynursusuz/ast-deneme
    Explore at:
    Dataset updated
    Dec 2, 2025
    Authors
    Aynur Susuz
    Description

    Aynursusuz/ast-deneme dataset hosted on Hugging Face and contributed by the HF Datasets community

  16. h

    code-with-ast-sequence

    • huggingface.co
    Updated Sep 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShijiaDong (2025). code-with-ast-sequence [Dataset]. https://huggingface.co/datasets/ShijiaD/code-with-ast-sequence
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 20, 2025
    Authors
    ShijiaDong
    Description

    ShijiaD/code-with-ast-sequence dataset hosted on Hugging Face and contributed by the HF Datasets community

  17. h

    needles-in-images

    • huggingface.co
    Updated May 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agentic Science and Technology, FRI (2025). needles-in-images [Dataset]. https://huggingface.co/datasets/AST-FRI/needles-in-images
    Explore at:
    Dataset updated
    May 29, 2025
    Dataset authored and provided by
    Agentic Science and Technology, FRI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Academic & Commercial VQA Dataset

    This dataset contains visual question-answering (VQA) entries from multiple domains:

    Academic Papers Restaurant Menus Magazines Website ScreenShots Lecture ScreenShots

    Each entry includes a natural language question, an answer, an associated image (or images), and bounding box metadata that localizes the answer in the image.

      ๐Ÿ“ File Structure
    

    The final merged file is stored in:

    test.json

      Each entry contains fieldsโ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/AST-FRI/needles-in-images.
    
  18. h

    AST-Speech-True-and-Crime

    • huggingface.co
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo (2025). AST-Speech-True-and-Crime [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-True-and-Crime
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    Vyvo
    Description

    Vyvo/AST-Speech-True-and-Crime dataset hosted on Hugging Face and contributed by the HF Datasets community

  19. h

    workshop-ast-docstring

    • huggingface.co
    Updated Sep 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShijiaDong (2025). workshop-ast-docstring [Dataset]. https://huggingface.co/datasets/ShijiaD/workshop-ast-docstring
    Explore at:
    Dataset updated
    Sep 21, 2025
    Authors
    ShijiaDong
    Description

    ShijiaD/workshop-ast-docstring dataset hosted on Hugging Face and contributed by the HF Datasets community

  20. h

    AST-Music-Data-291K

    • huggingface.co
    Updated Nov 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vyvo-Research (2025). AST-Music-Data-291K [Dataset]. https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-291K
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset authored and provided by
    Vyvo-Research
    Description

    Speech-Music Merged Dataset

      Source Datasets
    

    Dataset Samples Speech Music

    AIGenLab/high-sound-and-low-music 91,054 0 91,054

    AIGenLab/Speech_Dataset 100,000 100,000 0

    AIGenLab/Music-Dataset 100,000 0 100,000

    TOTAL 291,054 100,000 191,054

      Dataset Info
    

    Total Samples: 291,054 Labels: speech, music Audio: 16kHz, Mono, WAV

      Usage
    

    from datasets import load_dataset

    dataset = load_dataset("AIGenLab/speech-music-merge", split="train")โ€ฆ See the full description on the dataset page: https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-291K.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Kartik Godawat (2021). Huggingface Modelhub [Dataset]. https://www.kaggle.com/crazydiv/huggingface-modelhub
Organization logo

Huggingface Modelhub

Dataset containing information on all the models on HuggingFace modelhub

Explore at:
8 scholarly articles cite this dataset (View in Google Scholar)
zip(2274876 bytes)Available download formats
Dataset updated
Jun 19, 2021
Authors
Kartik Godawat
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

https://huggingface.co/landing/assets/transformers-docs/huggingface_logo.svg" alt="HuggingFace">

Dataset containing metadata information of all the publicly uploaded models(10,000+) available on HuggingFace model hub Data was collected between 15-20th June 2021.

Dataset was generated using huggingface_hub APIs provided by huggingface team.

Update v3:

  • Added Downloads last month metric
  • Added library name

Contents:

  • huggingface_models.csv : Primary file which contains metadata information like model name, tags, last modified and filenames
  • huggingface_modelcard_readme.csv : Detailed file containing README.md contents if available for a particular model. Content is in markdown format. modelId column joins both the files together. ### huggingface_models.csv
  • modelId: ID of the model as present on HF website
  • lastModified: Time when this model was last modified
  • tags: Tags associated with the model (provided by mantainer)
  • pipeline_tag: If exists, denotes which pipeline this model could be used with
  • files: List of available files in the model repo
  • publishedBy: Custom column derived from modelID, specifying who published this model
  • downloads_last_month: Number of times the model has been downloaded in last month.
  • library: Name of library the model belongs to eg: transformers, spacy, timm etc. ### huggingface_modelcard_readme.csv
  • modelId: ID of the model as available on HF website
  • modelCard: Readme contents of a model (referred to as modelCard in HuggingFace ecoystem). It contains useful information on how the model was trained, benchmarks and author notes. ### Inspiration: The idea of analyzing publicly available models on HugginFace struck me while I was attending a livesession of the amazing transformers course by @LysandreJik. Soon after, I tweeted the team and asked for permission to create such a dataset. Special shoutout to @osanseviero for encouraging and pointing me in the right direction.

This is my first dataset upload on Kaggle. I hope you like it. :)

Search
Clear search
Close search
Google apps
Main menu