Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://huggingface.co/landing/assets/transformers-docs/huggingface_logo.svg" alt="HuggingFace">
Dataset containing metadata information of all the publicly uploaded models(10,000+) available on HuggingFace model hub Data was collected between 15-20th June 2021.
Dataset was generated using huggingface_hub APIs provided by huggingface team.
This is my first dataset upload on Kaggle. I hope you like it. :)
Facebook
TwitterVyvo/AST-Speech-Art dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description:
This dataset provides metadata for various models available on Hugging Face, a popular platform for sharing and discovering natural language processing (NLP) and machine learning models. The dataset includes information such as model name, author, repository link, image URL, category, star ratings, download statistics, and the last update timestamp.
Columns: 1. Name: Model name on Hugging Face. 2. Author: Author or organization associated with the model. 3. Repo Link: Link to the model's repository on Hugging Face. 4. Image URL: URL for the model's image/icon. 5. Category: The category or type of model (e.g., Text Generation, Automatic Speech Recognition). 6. Stars: Number of stars the model has received. 7. Downloads: Number of downloads for the model. 8. Last Updated: Timestamp indicating the last update of the model.
This dataset is valuable for researchers, data scientists, and enthusiasts interested in exploring and analyzing the landscape of pre-trained models on Hugging Face.
Facebook
TwitterVyvo/AST-Speech-Technology dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo/AST-Speech-Nature-and-Documentary dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo/AST-Speech-Healty-and-Science dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo/AST-Speech-Business dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo/AST-Speech-History dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo/AST-Speech-Personal-Development dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterKingTechnician/xd-violence-mini-audio-ast-chunked dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterKingTechnician/xd-violence-20pct-audio-ast-chunked-correct dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo-Research/AST-Music-Data-45K dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
Twitterrodrigoasth/ast-titles dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterVyvo/AST-Speech-Sport dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterAynursusuz/ast-deneme dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterShijiaD/code-with-ast-sequence dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Academic & Commercial VQA Dataset
This dataset contains visual question-answering (VQA) entries from multiple domains:
Academic Papers Restaurant Menus Magazines Website ScreenShots Lecture ScreenShots
Each entry includes a natural language question, an answer, an associated image (or images), and bounding box metadata that localizes the answer in the image.
๐ File Structure
The final merged file is stored in:
test.json
Each entry contains fieldsโฆ See the full description on the dataset page: https://huggingface.co/datasets/AST-FRI/needles-in-images.
Facebook
TwitterVyvo/AST-Speech-True-and-Crime dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterShijiaD/workshop-ast-docstring dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterSpeech-Music Merged Dataset
Source Datasets
Dataset Samples Speech Music
AIGenLab/high-sound-and-low-music 91,054 0 91,054
AIGenLab/Speech_Dataset 100,000 100,000 0
AIGenLab/Music-Dataset 100,000 0 100,000
TOTAL 291,054 100,000 191,054
Dataset Info
Total Samples: 291,054 Labels: speech, music Audio: 16kHz, Mono, WAV
Usage
from datasets import load_dataset
dataset = load_dataset("AIGenLab/speech-music-merge", split="train")โฆ See the full description on the dataset page: https://huggingface.co/datasets/Vyvo-Research/AST-Music-Data-291K.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://huggingface.co/landing/assets/transformers-docs/huggingface_logo.svg" alt="HuggingFace">
Dataset containing metadata information of all the publicly uploaded models(10,000+) available on HuggingFace model hub Data was collected between 15-20th June 2021.
Dataset was generated using huggingface_hub APIs provided by huggingface team.
This is my first dataset upload on Kaggle. I hope you like it. :)