84 datasets found

Huggingface Modelhub
kaggle.com
zip
Updated Jun 19, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kartik Godawat (2021). Huggingface Modelhub [Dataset]. https://www.kaggle.com/crazydiv/huggingface-modelhub
Explore at:
zip(2274876 bytes)Available download formats
Dataset updated
Jun 19, 2021
Authors
Kartik Godawat
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
https://huggingface.co/landing/assets/transformers-docs/huggingface_logo.svg" alt="HuggingFace">

Dataset containing metadata information of all the publicly uploaded models(10,000+) available on HuggingFace model hub Data was collected between 15-20th June 2021.

Dataset was generated using huggingface_hub APIs provided by huggingface team.

Update v3:

Added Downloads last month metric

Added library name

Contents:

huggingface_models.csv : Primary file which contains metadata information like model name, tags, last modified and filenames

huggingface_modelcard_readme.csv : Detailed file containing README.md contents if available for a particular model. Content is in markdown format. modelId column joins both the files together. ### huggingface_models.csv

modelId: ID of the model as present on HF website

lastModified: Time when this model was last modified

tags: Tags associated with the model (provided by mantainer)

pipeline_tag: If exists, denotes which pipeline this model could be used with

files: List of available files in the model repo

publishedBy: Custom column derived from modelID, specifying who published this model

downloads_last_month: Number of times the model has been downloaded in last month.

library: Name of library the model belongs to eg: transformers, spacy, timm etc. ### huggingface_modelcard_readme.csv

modelId: ID of the model as available on HF website

modelCard: Readme contents of a model (referred to as modelCard in HuggingFace ecoystem). It contains useful information on how the model was trained, benchmarks and author notes. ### Inspiration: The idea of analyzing publicly available models on HugginFace struck me while I was attending a livesession of the amazing transformers course by @LysandreJik. Soon after, I tweeted the team and asked for permission to create such a dataset. Special shoutout to @osanseviero for encouraging and pointing me in the right direction.

This is my first dataset upload on Kaggle. I hope you like it. :)
h
AST-Speech-Art
huggingface.co
Updated Dec 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Art [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Art
Explore at:
Dataset updated
Dec 1, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Art dataset hosted on Hugging Face and contributed by the HF Datasets community
Hugging Face Models Metadata
kaggle.com
zip
Updated Nov 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kumar Saksham (2023). Hugging Face Models Metadata [Dataset]. https://www.kaggle.com/datasets/everydaycodings/hugging-face-models-metadata/discussion
Explore at:
zip(8182909 bytes)Available download formats
Dataset updated
Nov 30, 2023
Authors
Kumar Saksham
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Description:

This dataset provides metadata for various models available on Hugging Face, a popular platform for sharing and discovering natural language processing (NLP) and machine learning models. The dataset includes information such as model name, author, repository link, image URL, category, star ratings, download statistics, and the last update timestamp.

Columns: 1. Name: Model name on Hugging Face. 2. Author: Author or organization associated with the model. 3. Repo Link: Link to the model's repository on Hugging Face. 4. Image URL: URL for the model's image/icon. 5. Category: The category or type of model (e.g., Text Generation, Automatic Speech Recognition). 6. Stars: Number of stars the model has received. 7. Downloads: Number of downloads for the model. 8. Last Updated: Timestamp indicating the last update of the model.

This dataset is valuable for researchers, data scientists, and enthusiasts interested in exploring and analyzing the landscape of pre-trained models on Hugging Face.
h
AST-Speech-Technology
huggingface.co
Updated Nov 29, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Technology [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Technology
Explore at:
Dataset updated
Nov 29, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Technology dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Speech-Nature-and-Documentary
huggingface.co
Updated Dec 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Nature-and-Documentary [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Nature-and-Documentary
Explore at:
Dataset updated
Dec 1, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Nature-and-Documentary dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Speech-Healty-and-Science
huggingface.co
Updated Dec 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Healty-and-Science [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Healty-and-Science
Explore at:
Dataset updated
Dec 1, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Healty-and-Science dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Speech-Business
huggingface.co
Updated Dec 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Business [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Business
Explore at:
Dataset updated
Dec 2, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Business dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Speech-History
huggingface.co
Updated Jun 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2023). AST-Speech-History [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-History
Explore at:
Dataset updated
Jun 19, 2023
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-History dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Speech-Personal-Development
huggingface.co
Updated Nov 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Personal-Development [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Personal-Development
Explore at:
Dataset updated
Nov 29, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Personal-Development dataset hosted on Hugging Face and contributed by the HF Datasets community
h
xd-violence-mini-audio-ast-chunked
huggingface.co
Updated Nov 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Isaiah Freeman (2025). xd-violence-mini-audio-ast-chunked [Dataset]. https://huggingface.co/datasets/KingTechnician/xd-violence-mini-audio-ast-chunked
Explore at:
Dataset updated
Nov 18, 2025
Authors
Isaiah Freeman
Description
KingTechnician/xd-violence-mini-audio-ast-chunked dataset hosted on Hugging Face and contributed by the HF Datasets community
h
xd-violence-20pct-audio-ast-chunked-correct
huggingface.co
Updated Nov 20, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Isaiah Freeman (2025). xd-violence-20pct-audio-ast-chunked-correct [Dataset]. https://huggingface.co/datasets/KingTechnician/xd-violence-20pct-audio-ast-chunked-correct
Explore at:
Dataset updated
Nov 20, 2025
Authors
Isaiah Freeman
Description
KingTechnician/xd-violence-20pct-audio-ast-chunked-correct dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Music-Data-45K
huggingface.co
Updated Nov 26, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo-Research (2025). AST-Music-Data-45K [Dataset]. https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-45K
Explore at:
Dataset updated
Nov 26, 2025
Dataset authored and provided by
Vyvo-Research
Description
Vyvo-Research/AST-Music-Data-45K dataset hosted on Hugging Face and contributed by the HF Datasets community
h
ast-titles
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rodrigo Asth, ast-titles [Dataset]. https://huggingface.co/datasets/rodrigoasth/ast-titles
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Rodrigo Asth
Description
rodrigoasth/ast-titles dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Speech-Sport
huggingface.co
Updated Dec 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-Sport [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-Sport
Explore at:
Dataset updated
Dec 1, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-Sport dataset hosted on Hugging Face and contributed by the HF Datasets community
h
ast-deneme
huggingface.co
Updated Dec 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aynur Susuz (2025). ast-deneme [Dataset]. https://huggingface.co/datasets/Aynursusuz/ast-deneme
Explore at:
Dataset updated
Dec 2, 2025
Authors
Aynur Susuz
Description
Aynursusuz/ast-deneme dataset hosted on Hugging Face and contributed by the HF Datasets community
h
code-with-ast-sequence
huggingface.co
Updated Sep 20, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ShijiaDong (2025). code-with-ast-sequence [Dataset]. https://huggingface.co/datasets/ShijiaD/code-with-ast-sequence
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 20, 2025
Authors
ShijiaDong
Description
ShijiaD/code-with-ast-sequence dataset hosted on Hugging Face and contributed by the HF Datasets community
h
needles-in-images
huggingface.co
Updated May 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agentic Science and Technology, FRI (2025). needles-in-images [Dataset]. https://huggingface.co/datasets/AST-FRI/needles-in-images
Explore at:
Dataset updated
May 29, 2025
Dataset authored and provided by
Agentic Science and Technology, FRI
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Academic & Commercial VQA Dataset

This dataset contains visual question-answering (VQA) entries from multiple domains:

Academic Papers Restaurant Menus Magazines Website ScreenShots Lecture ScreenShots

Each entry includes a natural language question, an answer, an associated image (or images), and bounding box metadata that localizes the answer in the image.

📁 File Structure

The final merged file is stored in:

test.json

Each entry contains fields… See the full description on the dataset page: https://huggingface.co/datasets/AST-FRI/needles-in-images.
h
AST-Speech-True-and-Crime
huggingface.co
Updated Dec 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo (2025). AST-Speech-True-and-Crime [Dataset]. https://huggingface.co/datasets/Vyvo/AST-Speech-True-and-Crime
Explore at:
Dataset updated
Dec 1, 2025
Dataset authored and provided by
Vyvo
Description
Vyvo/AST-Speech-True-and-Crime dataset hosted on Hugging Face and contributed by the HF Datasets community
h
workshop-ast-docstring
huggingface.co
Updated Sep 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ShijiaDong (2025). workshop-ast-docstring [Dataset]. https://huggingface.co/datasets/ShijiaD/workshop-ast-docstring
Explore at:
Dataset updated
Sep 21, 2025
Authors
ShijiaDong
Description
ShijiaD/workshop-ast-docstring dataset hosted on Hugging Face and contributed by the HF Datasets community
h
AST-Music-Data-291K
huggingface.co
Updated Nov 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vyvo-Research (2025). AST-Music-Data-291K [Dataset]. https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-291K
Explore at:
Dataset updated
Nov 26, 2025
Dataset authored and provided by
Vyvo-Research
Description
Speech-Music Merged Dataset

Source Datasets

Dataset Samples Speech Music

AIGenLab/high-sound-and-low-music 91,054 0 91,054

AIGenLab/Speech_Dataset 100,000 100,000 0

AIGenLab/Music-Dataset 100,000 0 100,000

TOTAL 291,054 100,000 191,054

Dataset Info

Total Samples: 291,054 Labels: speech, music Audio: 16kHz, Mono, WAV

Usage

from datasets import load_dataset

dataset = load_dataset("AIGenLab/speech-music-merge", split="train")… See the full description on the dataset page: https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-291K.

Facebook

Twitter

Click to copy link

Link copied

Cite

Kartik Godawat (2021). Huggingface Modelhub [Dataset]. https://www.kaggle.com/crazydiv/huggingface-modelhub

Huggingface Modelhub

Dataset containing information on all the models on HuggingFace modelhub

Explore at:

8 scholarly articles cite this dataset (View in Google Scholar)

zip(2274876 bytes)Available download formats

Dataset updated

Jun 19, 2021

Authors

Kartik Godawat

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

https://huggingface.co/landing/assets/transformers-docs/huggingface_logo.svg" alt="HuggingFace">

Dataset containing metadata information of all the publicly uploaded models(10,000+) available on HuggingFace model hub Data was collected between 15-20th June 2021.

Dataset was generated using huggingface_hub APIs provided by huggingface team.

Update v3:

Added Downloads last month metric
Added library name

huggingface_models.csv : Primary file which contains metadata information like model name, tags, last modified and filenames
huggingface_modelcard_readme.csv : Detailed file containing README.md contents if available for a particular model. Content is in markdown format. modelId column joins both the files together. ### huggingface_models.csv
modelId: ID of the model as present on HF website
lastModified: Time when this model was last modified
tags: Tags associated with the model (provided by mantainer)
pipeline_tag: If exists, denotes which pipeline this model could be used with
files: List of available files in the model repo
publishedBy: Custom column derived from modelID, specifying who published this model
downloads_last_month: Number of times the model has been downloaded in last month.
library: Name of library the model belongs to eg: transformers, spacy, timm etc. ### huggingface_modelcard_readme.csv
modelId: ID of the model as available on HF website
modelCard: Readme contents of a model (referred to as modelCard in HuggingFace ecoystem). It contains useful information on how the model was trained, benchmarks and author notes. ### Inspiration: The idea of analyzing publicly available models on HugginFace struck me while I was attending a livesession of the amazing transformers course by @LysandreJik. Soon after, I tweeted the team and asked for permission to create such a dataset. Special shoutout to @osanseviero for encouraging and pointing me in the right direction.

This is my first dataset upload on Kaggle. I hope you like it. :)

Clear search

Close search

Google apps

Main menu

Huggingface Modelhub

Update v3:

Contents:

AST-Speech-Art

Hugging Face Models Metadata

AST-Speech-Technology

AST-Speech-Nature-and-Documentary

AST-Speech-Healty-and-Science

AST-Speech-Business

AST-Speech-History

AST-Speech-Personal-Development

xd-violence-mini-audio-ast-chunked

xd-violence-20pct-audio-ast-chunked-correct

AST-Music-Data-45K

ast-titles

AST-Speech-Sport

ast-deneme

code-with-ast-sequence

needles-in-images

AST-Speech-True-and-Crime

workshop-ast-docstring

AST-Music-Data-291K

Huggingface Modelhub

Dataset containing information on all the models on HuggingFace modelhub

Update v3:

Contents: