100+ datasets found
  1. YouTube Trending Videos Dataset

    • kaggle.com
    zip
    Updated Dec 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). YouTube Trending Videos Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/youtube-trending-videos-dataset
    Explore at:
    zip(29769637 bytes)Available download formats
    Dataset updated
    Dec 19, 2023
    Authors
    The Devastator
    Area covered
    YouTube
    Description

    YouTube Trending Videos Dataset

    Exploring YouTube Trending Videos

    By dskl [source]

    About this dataset

    Moreover it also reveals various engagement metrics such as the number of views the video has received, likes and dislikes it has garnered from viewership. Additionally information related to comment count on particular videos enables analysis regarding viewer interaction and response. Furthermore this dataset describes whether comments or ratings are disabled for a particular video allowing examination into how these factors impact engagement.

    By exploring this dataset in-depth marketers can gain valuable insights into identifying trends in content popularity across different countries while taking into account timing considerations based on published day of week. It also opens up avenues for analyzing public sentiment towards specific videos based on likes vs dislikes ratios and comment count which further aids in devising suitable marketing strategies.

    Overall,this informative dataset serves as an invaluable asset for researchers,data analysts,and marketers alike who strive to gain deeper understanding about trending video patterns,relevant metrics influencing content virality,factors dictating viewer sentiments,and exploring new possibilities within digital marketing space leveraging YouTube's wide reach

    How to use the dataset

    How to Use This Dataset: A Guide

    In this guide, we will walk you through the different columns in the dataset and provide insights on how you can explore the popularity and engagement of these trending videos. Let's dive in!

    Column Descriptions:

    • title: The title of the video.
    • channel_title: The title of the YouTube channel that published the video.
    • publish_date: The date when the video was published on YouTube.
    • time_frame: The duration of time (e.g., 1 day, 6 hours) that the video has been trending on YouTube.
    • published_day_of_week: The day of week (e.g., Monday) when the video was published.
    • publish_country: The country where the video was published.
    • tags: The tags or keywords associated with the video.
    • views: The number of views received by a particular video
    • likes: Number o likes received per each videos
    • dislike: Number dislikes receives per an individual vidoe 11.comment_count: number of comments

    Popular Video Insights:

    To gain insights into popular videos based on this dataset, you can focus your analysis using these columns:

    title, channel_title, publish_date, time_frame, and** publish_country**.

    By analyzing these attributes together with other engagement metrics such as views ,likes,**dislikes,**comments),comment_count you can identify trends in what type content is most popular both globally or within specific countries.

    For instance: - You could analyze which channels are consistently publishing trending videos - Explore whether certain types of titles or tags are more likely to attract views and engagement. - Determine if certain days of the week or time frames have a higher likelihood of trending videos being published.

    Engagement Insights:

    To explore user engagement with the trending videos, you can focus your analysis on these columns:

    likes, dislikes, comment_count

    By analyzing these attributes you can get insights into how users are interacting with the content. For example: - You could compare the like and dislike ratios to identify positively received videos versus those that are more controversial. - Analyze comment counts to understand how users are engaging with the content and whether comments being disabled affects overall

    Research Ideas

    • Analyzing the popularity and engagement of trending videos: By analyzing the number of views, likes, dislikes, and comments, we can understand which types of videos are popular among YouTube users. We can also examine factors such as comment count and ratings disabled to see how viewers engage with trending videos.
    • Understanding video trends across different countries: By examining the publish country column, we can compare the popularity of trending videos in different countries. This can help content creators or marketers understand regional preferences and tailor their content strategy accordingly.
    • Studying the impact of video attributes on engagement: By exploring the relationship between video attributes (such as title, tags, publish day) and engagement metrics (views, likes), we can identify patterns or trends that influence a video's success on YouTube. This information can be...
  2. YouTube Datasets

    • brightdata.com
    .json, .csv, .xlsx
    Updated Jan 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2023). YouTube Datasets [Dataset]. https://brightdata.com/products/datasets/youtube
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Jan 9, 2023
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide, YouTube
    Description

    Use our YouTube profiles dataset to extract both business and non-business information from public channels and filter by channel name, views, creation date, or subscribers. Datapoints include URL, handle, banner image, profile image, name, subscribers, description, video count, create date, views, details, and more. You may purchase the entire dataset or a customized subset, depending on your needs. Popular use cases for this dataset include sentiment analysis, brand monitoring, influencer marketing, and more.

  3. Youtube Videos Dataset (~3400 videos)

    • kaggle.com
    zip
    Updated Mar 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rajat Chaudhari (2022). Youtube Videos Dataset (~3400 videos) [Dataset]. https://www.kaggle.com/datasets/rajatrc1705/youtube-videos-dataset
    Explore at:
    zip(661105 bytes)Available download formats
    Dataset updated
    Mar 19, 2022
    Authors
    Rajat Chaudhari
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    YouTube
    Description

    Context 📃

    I wanted to practice text classification using NLP techniques, so I thought why not practice it by generating the data myself! This way, I brushed up on my scraping techniques using Selenium, collected the data, cleaned it, and then started working on it. You can take a peek at my work Github Repository For This Dataset and Trained Models/ Results

    Content 📰

    The total number of videos scraped was 3600. I scraped the following things from each video: | link | title | description | category | | --- | --- | --- | --- | | Video ID | Category for which the video was scraped | Description of the video | Category for which the video was scraped. |

    I queried the videos for 4 categories:

    Travel Vlogs 🧳 Food 🥑 Art and Music 🎨 🎻 History 📜

    Acknowledgements 🙏

    I could have used a ready made API, but just for the fun of it, I scraped the data from Youtube using Selenium.

    Inspiration 🦋

    The data is not clean (for your enjoyment of cleaning the data!), has some missing values, and is imbalanced. Practice text classification on this dataset, you will have to learn different techniques for eg:- How to handle imbalanced classes..? While working on this dataset, you will learn a lot of different things and also get an opportunity to apply on this dataset.

  4. YouTube Dataset of all Data Science Channels🎓🧾

    • kaggle.com
    zip
    Updated Jun 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhishek0032 (2024). YouTube Dataset of all Data Science Channels🎓🧾 [Dataset]. https://www.kaggle.com/datasets/abhishek0032/youtube-dataset-all-data-scienceanalyst-channels
    Explore at:
    zip(732289 bytes)Available download formats
    Dataset updated
    Jun 21, 2024
    Authors
    Abhishek0032
    Area covered
    YouTube
    Description

    Description: This dataset contains detailed information about videos from various YouTube channels that specialize in data science and analytics. It includes metrics such as views, likes, comments, and publication dates. The dataset consists of 22862 rows, providing a robust sample for analyzing trends in content engagement, popularity of topics over time, and comparison of channels' performance.

    Column Descriptors:

    Channel_Name: The name of the YouTube channel. Title: The title of the video. Published_date: The date when the video was published. Views: The number of views the video has received. Like_count: The number of likes the video has received. Comment_Count: The number of comments on the video.

    This dataset contains information from the following YouTube channels:

    ['sentdex', 'freeCodeCamp.org' ,'CampusX', 'Darshil Parmar',' Keith Galli' ,'Alex The Analyst', 'Socratica' , Krish Naik', 'StatQuest with Josh Starmer', 'Nicholas Renotte', 'Leila Gharani', 'Rob Mulla' ,'Ryan Nolan Data', 'techTFQ', 'Dataquest' ,'WsCube Tech', 'Chandoo', 'Luke Barousse', 'Andrej Karpathy', 'Thu Vu data analytics', 'Guy in a Cube', 'Tableau Tim', 'codebasics', 'DeepLearningAI', 'Rishabh Mishra' 'ExcelIsFun', 'Kevin Stratvert' ' Ken Jee','Kaggle' , 'Tina Huang']

    This dataset can be used for various analyses, including but not limited to:

    Identifying the most popular videos and channels in the data science field.

    Understanding viewer engagement trends over time.

    Comparing the performance of different types of content across multiple channels.

    Performing a comparison between different channels to find the best-performing ones.

    Identifying the best videos to watch for specific topics in data science and analytics.

    Conducting a detailed analysis of your favorite YouTube channel to understand its content strategy and performance.

    Note: The data is current as of the date of extraction and may not reflect real-time changes on YouTube. For any analyses, ensure to consider the date when the data was last updated to maintain accuracy and relevance.

  5. T

    youtube_vis

    • tensorflow.org
    Updated Dec 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). youtube_vis [Dataset]. https://www.tensorflow.org/datasets/catalog/youtube_vis
    Explore at:
    Dataset updated
    Dec 6, 2022
    Description

    Youtube-vis is a video instance segmentation dataset. It contains 2,883 high-resolution YouTube videos, a per-pixel category label set including 40 common objects such as person, animals and vehicles, 4,883 unique video instances, and 131k high-quality manual annotations.

    The YouTube-VIS dataset is split into 2,238 training videos, 302 validation videos and 343 test videos.

    No files were removed or altered during preprocessing.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('youtube_vis', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

  6. h

    YouTube-Commons

    • huggingface.co
    Updated Apr 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PleIAs (2024). YouTube-Commons [Dataset]. https://huggingface.co/datasets/PleIAs/YouTube-Commons
    Explore at:
    Dataset updated
    Apr 17, 2024
    Dataset authored and provided by
    PleIAs
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    📺 YouTube-Commons 📺

    YouTube-Commons is a collection of audio transcripts of 2,063,066 videos shared on YouTube under a CC-By license.

      Content
    

    The collection comprises 22,709,724 original and automatically translated transcripts from 3,156,703 videos (721,136 individual channels). In total, this represents nearly 45 billion words (44,811,518,375). All the videos where shared on YouTube with a CC-BY license: the dataset provide all the necessary provenance information… See the full description on the dataset page: https://huggingface.co/datasets/PleIAs/YouTube-Commons.

  7. R

    Youtube 2 Dataset

    • universe.roboflow.com
    zip
    Updated May 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashita Teen Interns (2025). Youtube 2 Dataset [Dataset]. https://universe.roboflow.com/ashita-teen-interns/youtube-dataset-2
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 21, 2025
    Dataset authored and provided by
    Ashita Teen Interns
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    YouTube
    Variables measured
    Cars Bounding Boxes
    Description

    Youtube Dataset 2

    ## Overview
    
    Youtube Dataset 2 is a dataset for object detection tasks - it contains Cars annotations for 520 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
    
  8. Ken Jee YouTube Data

    • kaggle.com
    zip
    Updated Jan 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ken Jee (2022). Ken Jee YouTube Data [Dataset]. https://www.kaggle.com/datasets/kenjee/ken-jee-youtube-data
    Explore at:
    zip(6556461 bytes)Available download formats
    Dataset updated
    Jan 22, 2022
    Authors
    Ken Jee
    Area covered
    YouTube
    Description

    Context

    I've been creating videos on YouTube since November of 2017 (https://www.youtube.com/c/KenJee1) with the mission of making data science accessible to more people. One of the best ways to do this is to tell stories and working on projects. This is my attempt at my first community project. I am making my YouTube data available for everyone to help better understand the growth of my YouTube community and think about ways that it could be improved! I would love for everyone in the community feel like they had some hand in contributing to the channel.

    Announcement Video: https://youtu.be/YPph59-rTxA

    I will be sharing my favorite projects in a few of my videos (with permission of course), and would also like to give away a few small prizes to the top featured notebooks. I hope you have fun with the analysis, I'm interested in seeing what you find in the data!

    For those looking for a place to start, some things I'm thinking about are: - What are the themes of the comment data? - What types of video titles and thumbnails drive the most traffic? - Who is my core audience and what are they interested in? - What types of videos have lead to the most growth? - What type of content are people engaging with the most or watching the longest?

    Some advanced projects could be: - Creating a chat bot to respond to common comments with videos where I have addressed a topic - Pulling sentiment from thumbnails and titles and comparing that with performance

    Data I would like to add over time - Video descriptions - Video subtitles - Actual video data

    Content

    There are four files in this repo. The relevant data included in most of them is from Nov 2017 - Jan 2022. I gathered some of this data via the YouTube API and the rest from my specific analytics.

    1) Aggregated Metrics By Video - This has all the topline metrics from my channel from its start (around 2015 to Jan 22 2022). I didn't post my first video until around 2) Aggregated Metrics By Video with Country and Subscriber Status - This has the same data as aggregated metrics by video, but it includes dimensions for which country people are viewing from and if the viewers are subscribed to the channel or not. 3) Video Performance Over Time - This has the daily data from each of my videos. 4) All Comments - This is all of my comment data gathered from the YouTube API. I have anonymized the users so don't worry about your name showing up!

    Acknowledgements

    This obviously wouldn't be possible without all of the wonderful people who watch and interact with my videos! I'm incredibly grateful for you all and I'm so happy I can share this project with you!

    License

    I collected this data from the YouTube API and through my own google analytics. Thus use of it must uphold the YouTube API's terms of service: https://developers.google.com/youtube/terms/api-services-terms-of-service

  9. g

    YouTube Thumbnail Dataset

    • gts.ai
    json
    Updated May 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GTS (2024). YouTube Thumbnail Dataset [Dataset]. https://gts.ai/dataset-download/youtube-thumbnail-dataset/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    May 5, 2024
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    The YouTube Thumbnail Dataset features a large collection of top-performing video thumbnails from over 90 YouTube channels. It includes metadata such as video IDs, views, likes, comments, and duration for AI and media analysis applications.

  10. h

    OpenDV-YouTube-Language

    • huggingface.co
    Updated Mar 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OpenDriveLab (2024). OpenDV-YouTube-Language [Dataset]. https://huggingface.co/datasets/OpenDriveLab/OpenDV-YouTube-Language
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 24, 2024
    Dataset authored and provided by
    OpenDriveLab
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    OpenDV-YouTube

    This is the dataset repository of OpenDV-YouTube language annotations, including context and command. For more details, please refer to GenAD project and OpenDV-YouTube.

      Usage
    

    To use the annotations, you need to first download and prepare the data as instructed in OpenDV-YouTube. Note that we recommend to process the dataset in Linux environment since Windows may have issues with the file paths. You can use the following code to load in annotations… See the full description on the dataset page: https://huggingface.co/datasets/OpenDriveLab/OpenDV-YouTube-Language.

  11. R

    Data from: Youtube Videos Dataset

    • universe.roboflow.com
    zip
    Updated May 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Teeninterns project (2025). Youtube Videos Dataset [Dataset]. https://universe.roboflow.com/teeninterns-project-f20vg/youtube-videos-wcvme
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 25, 2025
    Dataset authored and provided by
    Teeninterns project
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Variables measured
    Cars Bounding Boxes
    Description

    Youtube Videos

    ## Overview
    
    Youtube Videos is a dataset for object detection tasks - it contains Cars annotations for 3,842 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  12. finevideo

    • huggingface.co
    Updated Sep 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugging Face FineVideo (2024). finevideo [Dataset]. https://huggingface.co/datasets/HuggingFaceFV/finevideo
    Explore at:
    Dataset updated
    Sep 12, 2024
    Dataset provided by
    Hugging Facehttps://huggingface.co/
    Authors
    Hugging Face FineVideo
    License

    https://choosealicense.com/licenses/cc/https://choosealicense.com/licenses/cc/

    Description

    FineVideo

    FineVideo Description Dataset Explorer Revisions Dataset Distribution

    How to download and use FineVideo Using datasets Using huggingface_hub Load a subset of the dataset

    Dataset StructureData Instances Data Fields

    Dataset Creation License CC-By Considerations for Using the Data Social Impact of Dataset Discussion of Biases

    Additional Information Credits Future Work Opting out of FineVideo Citation Information

    Terms of use for FineVideo… See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceFV/finevideo.

  13. R

    Real+fake+wild+youtube Dataset

    • universe.roboflow.com
    zip
    Updated Apr 2, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Smoke (2022). Real+fake+wild+youtube Dataset [Dataset]. https://universe.roboflow.com/smoke/real-fake-wild-youtube/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 2, 2022
    Dataset authored and provided by
    Smoke
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Variables measured
    Smoke Bounding Boxes
    Description

    Real+fake+wild+youtube

    ## Overview
    
    Real+fake+wild+youtube is a dataset for object detection tasks - it contains Smoke annotations for 1,340 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  14. R

    Sampel Youtube Dataset

    • universe.roboflow.com
    zip
    Updated Apr 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robby Mahendra (2024). Sampel Youtube Dataset [Dataset]. https://universe.roboflow.com/robby-mahendra-ugijg/sampel-youtube/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 25, 2024
    Dataset authored and provided by
    Robby Mahendra
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Variables measured
    Cigarette 2nwS Bounding Boxes
    Description

    Sampel Youtube

    ## Overview
    
    Sampel Youtube is a dataset for object detection tasks - it contains Cigarette 2nwS annotations for 450 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  15. UCF YouTube Action Data Set

    • kaggle.com
    zip
    Updated Oct 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmad (2023). UCF YouTube Action Data Set [Dataset]. https://www.kaggle.com/datasets/pypiahmad/ucf-youtube-action-data-set
    Explore at:
    zip(633314500 bytes)Available download formats
    Dataset updated
    Oct 29, 2023
    Authors
    Ahmad
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    The UCF11 dataset, previously known as the YouTube Action Data Set, is a comprehensive collection of 1,160 videos categorized into 11 distinct action classes. The actions represented include:

    1. Basketball shooting
    2. Biking/Cycling
    3. Diving
    4. Golf swinging
    5. Horseback riding
    6. Soccer juggling
    7. Swinging
    8. Tennis swinging
    9. Trampoline jumping
    10. Volleyball spiking
    11. Walking with a dog

    Dataset Structure:

    The dataset is organized into folders, each representing one of the 11 action categories. Within each category folder, videos are grouped into 25 groups, with each group containing more than four action clips. The folder structure is as follows:

    UCF11/
    │
    ├── Basketball/
    │  ├── Group01/
    │  │  ├── video01.mpg
    │  │  ├── video02.mpg
    │  │  └── ...
    │  ├── Group02/
    │  └── ...
    ├── Biking/
    ├── Diving/
    └── ...
    

    Each video file is named in a manner indicating its group and sequential number within the group. The videos within a group share common features such as the same actor, similar background, and viewpoint, which is conducive for studying action recognition under varied conditions.

    File Format:

    The videos are in ms mpeg4 format, and it's recommended to have the appropriate codec (e.g., K-lite Codec Pack) for access. In the updated version of UCF11, all videos are converted to 29.97 fps (mpg), and annotations have been refined to address previous inconsistencies.

    Usage:

    This dataset is well-suited for projects in computer vision, machine learning, and action recognition, providing a substantive resource for training and testing algorithms in real-world conditions.

    Related Publications: - Jingen Liu, Jiebo Luo, and Mubarak Shah, "Recognizing Realistic Actions from Videos 'in the Wild'", CVPR 2009. - Jingen Liu, Yang Yang, and Mubarak Shah, "Learning Semantic Visual Vocabularies using Diffusion Distance", CVPR 2009.

  16. i

    Netflix

    • ieee-dataport.org
    Updated Oct 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Danil Shamsimukhametov (2021). Netflix [Dataset]. https://ieee-dataport.org/documents/youtube-netflix-web-dataset-encrypted-traffic-classification
    Explore at:
    Dataset updated
    Oct 1, 2021
    Authors
    Danil Shamsimukhametov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    YouTube flows

  17. R

    Youtube Dataset

    • universe.roboflow.com
    zip
    Updated Jun 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    THESISSidewalkData (2024). Youtube Dataset [Dataset]. https://universe.roboflow.com/thesissidewalkdata/youtube-gygyu/dataset/6
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 20, 2024
    Dataset authored and provided by
    THESISSidewalkData
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Variables measured
    Sidewalk Objects Bounding Boxes
    Description

    Youtube

    ## Overview
    
    Youtube is a dataset for object detection tasks - it contains Sidewalk Objects annotations for 507 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  18. h

    my-cool-dataset

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Li, my-cool-dataset [Dataset]. https://huggingface.co/datasets/Jonathan916/my-cool-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Authors
    Li
    Description

    We only provide video URLs. Text/Captions are generated by BLIP-2. Please follow open-source agreement for any usage. DATA PREPARATION

    Download youtube videos to the folder '$workdir/download_videos' with the urls provided in metafiles, and name the videos with their video_id. e.g. url: https://www.youtube.com/watch?v=--4M68p_Loc

    $workdir download_videos --4M68p_Loc.mp4

    We do not provide an official script for downloading YouTube videos. You may consider using the open-source… See the full description on the dataset page: https://huggingface.co/datasets/Jonathan916/my-cool-dataset.

  19. R

    Avas Youtube Dataset

    • universe.roboflow.com
    zip
    Updated Nov 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sa (2025). Avas Youtube Dataset [Dataset]. https://universe.roboflow.com/sa-re0eg/avas-youtube-6x4ga
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 11, 2025
    Dataset authored and provided by
    Sa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Variables measured
    Avas GoSg Bounding Boxes
    Description

    AVAS YouTube

    ## Overview
    
    AVAS YouTube is a dataset for object detection tasks - it contains Avas GoSg annotations for 5,072 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  20. h

    indonesian-youtube

    • huggingface.co
    Updated Apr 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Malaysia AI (2024). indonesian-youtube [Dataset]. https://huggingface.co/datasets/malaysia-ai/indonesian-youtube
    Explore at:
    Dataset updated
    Apr 18, 2024
    Dataset authored and provided by
    Malaysia AI
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    Indonesian Youtube

    Source code at https://github.com/mesolitica/malaysian-dataset/tree/master/speech/indonesian-youtube

      how to download
    

    huggingface-cli download --repo-type dataset
    --include '*.z*'
    --local-dir './'
    malaysia-ai/indonesian-youtube

    wget https://www.7-zip.org/a/7z2301-linux-x64.tar.xz tar -xf 7z2301-linux-x64.tar.xz ~/7zz x mp3-16k.zip -y -mmt40

      Licensing
    

    All the videos, songs, images, and graphics used in the video belong to their… See the full description on the dataset page: https://huggingface.co/datasets/malaysia-ai/indonesian-youtube.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
The Devastator (2023). YouTube Trending Videos Dataset [Dataset]. https://www.kaggle.com/datasets/thedevastator/youtube-trending-videos-dataset
Organization logo

YouTube Trending Videos Dataset

Exploring YouTube Trending Videos

Explore at:
84 scholarly articles cite this dataset (View in Google Scholar)
zip(29769637 bytes)Available download formats
Dataset updated
Dec 19, 2023
Authors
The Devastator
Area covered
YouTube
Description

YouTube Trending Videos Dataset

Exploring YouTube Trending Videos

By dskl [source]

About this dataset

Moreover it also reveals various engagement metrics such as the number of views the video has received, likes and dislikes it has garnered from viewership. Additionally information related to comment count on particular videos enables analysis regarding viewer interaction and response. Furthermore this dataset describes whether comments or ratings are disabled for a particular video allowing examination into how these factors impact engagement.

By exploring this dataset in-depth marketers can gain valuable insights into identifying trends in content popularity across different countries while taking into account timing considerations based on published day of week. It also opens up avenues for analyzing public sentiment towards specific videos based on likes vs dislikes ratios and comment count which further aids in devising suitable marketing strategies.

Overall,this informative dataset serves as an invaluable asset for researchers,data analysts,and marketers alike who strive to gain deeper understanding about trending video patterns,relevant metrics influencing content virality,factors dictating viewer sentiments,and exploring new possibilities within digital marketing space leveraging YouTube's wide reach

How to use the dataset

How to Use This Dataset: A Guide

In this guide, we will walk you through the different columns in the dataset and provide insights on how you can explore the popularity and engagement of these trending videos. Let's dive in!

Column Descriptions:

  • title: The title of the video.
  • channel_title: The title of the YouTube channel that published the video.
  • publish_date: The date when the video was published on YouTube.
  • time_frame: The duration of time (e.g., 1 day, 6 hours) that the video has been trending on YouTube.
  • published_day_of_week: The day of week (e.g., Monday) when the video was published.
  • publish_country: The country where the video was published.
  • tags: The tags or keywords associated with the video.
  • views: The number of views received by a particular video
  • likes: Number o likes received per each videos
  • dislike: Number dislikes receives per an individual vidoe 11.comment_count: number of comments

Popular Video Insights:

To gain insights into popular videos based on this dataset, you can focus your analysis using these columns:

title, channel_title, publish_date, time_frame, and** publish_country**.

By analyzing these attributes together with other engagement metrics such as views ,likes,**dislikes,**comments),comment_count you can identify trends in what type content is most popular both globally or within specific countries.

For instance: - You could analyze which channels are consistently publishing trending videos - Explore whether certain types of titles or tags are more likely to attract views and engagement. - Determine if certain days of the week or time frames have a higher likelihood of trending videos being published.

Engagement Insights:

To explore user engagement with the trending videos, you can focus your analysis on these columns:

likes, dislikes, comment_count

By analyzing these attributes you can get insights into how users are interacting with the content. For example: - You could compare the like and dislike ratios to identify positively received videos versus those that are more controversial. - Analyze comment counts to understand how users are engaging with the content and whether comments being disabled affects overall

Research Ideas

  • Analyzing the popularity and engagement of trending videos: By analyzing the number of views, likes, dislikes, and comments, we can understand which types of videos are popular among YouTube users. We can also examine factors such as comment count and ratings disabled to see how viewers engage with trending videos.
  • Understanding video trends across different countries: By examining the publish country column, we can compare the popularity of trending videos in different countries. This can help content creators or marketers understand regional preferences and tailor their content strategy accordingly.
  • Studying the impact of video attributes on engagement: By exploring the relationship between video attributes (such as title, tags, publish day) and engagement metrics (views, likes), we can identify patterns or trends that influence a video's success on YouTube. This information can be...
Search
Clear search
Close search
Google apps
Main menu