Facebook
TwitterI've been creating videos on YouTube since November of 2017 (https://www.youtube.com/c/KenJee1) with the mission of making data science accessible to more people. One of the best ways to do this is to tell stories and working on projects. This is my attempt at my first community project. I am making my YouTube data available for everyone to help better understand the growth of my YouTube community and think about ways that it could be improved! I would love for everyone in the community feel like they had some hand in contributing to the channel.
Announcement Video: https://youtu.be/YPph59-rTxA
I will be sharing my favorite projects in a few of my videos (with permission of course), and would also like to give away a few small prizes to the top featured notebooks. I hope you have fun with the analysis, I'm interested in seeing what you find in the data!
For those looking for a place to start, some things I'm thinking about are: - What are the themes of the comment data? - What types of video titles and thumbnails drive the most traffic? - Who is my core audience and what are they interested in? - What types of videos have lead to the most growth? - What type of content are people engaging with the most or watching the longest?
Some advanced projects could be: - Creating a chat bot to respond to common comments with videos where I have addressed a topic - Pulling sentiment from thumbnails and titles and comparing that with performance
Data I would like to add over time - Video descriptions - Video subtitles - Actual video data
There are four files in this repo. The relevant data included in most of them is from Nov 2017 - Jan 2022. I gathered some of this data via the YouTube API and the rest from my specific analytics.
1) Aggregated Metrics By Video - This has all the topline metrics from my channel from its start (around 2015 to Jan 22 2022). I didn't post my first video until around 2) Aggregated Metrics By Video with Country and Subscriber Status - This has the same data as aggregated metrics by video, but it includes dimensions for which country people are viewing from and if the viewers are subscribed to the channel or not. 3) Video Performance Over Time - This has the daily data from each of my videos. 4) All Comments - This is all of my comment data gathered from the YouTube API. I have anonymized the users so don't worry about your name showing up!
This obviously wouldn't be possible without all of the wonderful people who watch and interact with my videos! I'm incredibly grateful for you all and I'm so happy I can share this project with you!
I collected this data from the YouTube API and through my own google analytics. Thus use of it must uphold the YouTube API's terms of service: https://developers.google.com/youtube/terms/api-services-terms-of-service
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
Use our YouTube profiles dataset to extract both business and non-business information from public channels and filter by channel name, views, creation date, or subscribers. Datapoints include URL, handle, banner image, profile image, name, subscribers, description, video count, create date, views, details, and more. You may purchase the entire dataset or a customized subset, depending on your needs. Popular use cases for this dataset include sentiment analysis, brand monitoring, influencer marketing, and more.
Facebook
TwitterBy dskl [source]
Moreover it also reveals various engagement metrics such as the number of views the video has received, likes and dislikes it has garnered from viewership. Additionally information related to comment count on particular videos enables analysis regarding viewer interaction and response. Furthermore this dataset describes whether comments or ratings are disabled for a particular video allowing examination into how these factors impact engagement.
By exploring this dataset in-depth marketers can gain valuable insights into identifying trends in content popularity across different countries while taking into account timing considerations based on published day of week. It also opens up avenues for analyzing public sentiment towards specific videos based on likes vs dislikes ratios and comment count which further aids in devising suitable marketing strategies.
Overall,this informative dataset serves as an invaluable asset for researchers,data analysts,and marketers alike who strive to gain deeper understanding about trending video patterns,relevant metrics influencing content virality,factors dictating viewer sentiments,and exploring new possibilities within digital marketing space leveraging YouTube's wide reach
How to Use This Dataset: A Guide
In this guide, we will walk you through the different columns in the dataset and provide insights on how you can explore the popularity and engagement of these trending videos. Let's dive in!
Column Descriptions:
- title: The title of the video.
- channel_title: The title of the YouTube channel that published the video.
- publish_date: The date when the video was published on YouTube.
- time_frame: The duration of time (e.g., 1 day, 6 hours) that the video has been trending on YouTube.
- published_day_of_week: The day of week (e.g., Monday) when the video was published.
- publish_country: The country where the video was published.
- tags: The tags or keywords associated with the video.
- views: The number of views received by a particular video
- likes: Number o likes received per each videos
- dislike: Number dislikes receives per an individual vidoe 11.comment_count: number of comments
Popular Video Insights:
To gain insights into popular videos based on this dataset, you can focus your analysis using these columns:
title, channel_title, publish_date, time_frame, and** publish_country**.
By analyzing these attributes together with other engagement metrics such as views ,likes,**dislikes,**comments),comment_count you can identify trends in what type content is most popular both globally or within specific countries.
For instance: - You could analyze which channels are consistently publishing trending videos - Explore whether certain types of titles or tags are more likely to attract views and engagement. - Determine if certain days of the week or time frames have a higher likelihood of trending videos being published.
Engagement Insights:
To explore user engagement with the trending videos, you can focus your analysis on these columns:
likes, dislikes, comment_count
By analyzing these attributes you can get insights into how users are interacting with the content. For example: - You could compare the like and dislike ratios to identify positively received videos versus those that are more controversial. - Analyze comment counts to understand how users are engaging with the content and whether comments being disabled affects overall
- Analyzing the popularity and engagement of trending videos: By analyzing the number of views, likes, dislikes, and comments, we can understand which types of videos are popular among YouTube users. We can also examine factors such as comment count and ratings disabled to see how viewers engage with trending videos.
- Understanding video trends across different countries: By examining the publish country column, we can compare the popularity of trending videos in different countries. This can help content creators or marketers understand regional preferences and tailor their content strategy accordingly.
- Studying the impact of video attributes on engagement: By exploring the relationship between video attributes (such as title, tags, publish day) and engagement metrics (views, likes), we can identify patterns or trends that influence a video's success on YouTube. This information can be...
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This dataset provides an in-depth look at YouTube video analytics, capturing key metrics related to video performance, audience engagement, revenue generation, and viewer behavior. Sourced from real video data, it highlights how variables like video duration, upload time, and ad impressions contribute to monetization and audience retention. This dataset is ideal for data analysts, content creators, and marketers aiming to uncover trends in viewer engagement, optimize content strategies, and maximize ad revenue. Inspired by the evolving landscape of digital content, it serves as a resource for understanding the impact of YouTube metrics on channel growth and content reach.
Video Details: Columns like Video Duration, Video Publish Time, Days Since Publish, Day of Week.
Revenue Metrics: Includes Revenue per 1000 Views (USD), Estimated Revenue (USD), Ad Impressions, and various ad revenue sources (e.g., AdSense, DoubleClick).
Engagement Metrics: Metrics such as Views, Likes, Dislikes, Shares, Comments, Average View Duration, Average View Percentage (%), and Video Thumbnail CTR (%).
Audience Data: Data on New Subscribers, Unsubscribes, Unique Viewers, Returning Viewers, and New Viewers.
Monetization & Transaction Metrics: Details on Monetized Playbacks, Playback-Based CPM, YouTube Premium Revenue, and transactions like Orders and Total Sales Volume (USD).
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
Use our YouTube Videos dataset to extract detailed information from public videos and filter by video title, views, upload date, or likes. Data points include video URL, title, description, thumbnail, upload date, view count, like count, comment count, tags, and more. You can purchase the entire dataset or a customized subset, tailored to your needs. Popular use cases for this dataset include trend analysis, content performance tracking, brand monitoring, and influencer campaign optimization.
Facebook
TwitterComprehensive YouTube channel statistics for Genuine Data, featuring 260,000 subscribers and 127,563,181 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Entertainment category and is based in IN. Track 1,399 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We present a three-year archival, longitudinal dataset of YouTube Trending videos, collected from July 1, 2022, to June 30, 2025, four retrieval per day. This collection, a unique historical record of digital culture in transition, includes 446,971 snapshots from 104 countries, encompassing 726,627 unique videos and their associated metadata. Each record includes collection timestamp, geographic region, video ranking, core identifiers (video ID, channel ID, category), content metadata (title, description, tags, localization), language information, live status, view and comment counts. Full documentation: https://arxiv.org/abs/2510.23645 Unlike previous datasets with limited geographic scope or short timeframes, our data offers exceptional coverage for cross-national and longitudinal analyses of digital culture. This non-personalized data corpus provides an irreplaceable baseline for understanding crisis communication, platform governance or temporal shifts in content popularity.
Facebook
TwitterDescription: This dataset contains detailed information about videos from various YouTube channels that specialize in data science and analytics. It includes metrics such as views, likes, comments, and publication dates. The dataset consists of 22862 rows, providing a robust sample for analyzing trends in content engagement, popularity of topics over time, and comparison of channels' performance.
Column Descriptors:
Channel_Name: The name of the YouTube channel. Title: The title of the video. Published_date: The date when the video was published. Views: The number of views the video has received. Like_count: The number of likes the video has received. Comment_Count: The number of comments on the video.
This dataset contains information from the following YouTube channels:
['sentdex', 'freeCodeCamp.org' ,'CampusX', 'Darshil Parmar',' Keith Galli' ,'Alex The Analyst', 'Socratica' , Krish Naik', 'StatQuest with Josh Starmer', 'Nicholas Renotte', 'Leila Gharani', 'Rob Mulla' ,'Ryan Nolan Data', 'techTFQ', 'Dataquest' ,'WsCube Tech', 'Chandoo', 'Luke Barousse', 'Andrej Karpathy', 'Thu Vu data analytics', 'Guy in a Cube', 'Tableau Tim', 'codebasics', 'DeepLearningAI', 'Rishabh Mishra' 'ExcelIsFun', 'Kevin Stratvert' ' Ken Jee','Kaggle' , 'Tina Huang']
This dataset can be used for various analyses, including but not limited to:
Identifying the most popular videos and channels in the data science field.
Understanding viewer engagement trends over time.
Comparing the performance of different types of content across multiple channels.
Performing a comparison between different channels to find the best-performing ones.
Identifying the best videos to watch for specific topics in data science and analytics.
Conducting a detailed analysis of your favorite YouTube channel to understand its content strategy and performance.
Note: The data is current as of the date of extraction and may not reflect real-time changes on YouTube. For any analyses, ensure to consider the date when the data was last updated to maintain accuracy and relevance.
Facebook
TwitterThe Department of Labor's YouTube tutoral videos.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
About Dataset UPDATE: Source code used for collecting this data released here
Context YouTube (the world-famous video sharing website) maintains a list of the top trending videos on the platform. According to Variety magazine, “To determine the year’s top-trending videos, YouTube uses a combination of factors including measuring users interactions (number of views, shares, comments and likes). Note that they’re not the most-viewed videos overall for the calendar year”. Top performers on the YouTube trending list are music videos (such as the famously virile “Gangam Style”), celebrity and/or reality TV performances, and the random dude-with-a-camera viral videos that YouTube is well-known for.
This dataset is a daily record of the top trending YouTube videos.
Note that this dataset is a structurally improved version of this dataset.
Content This dataset includes several months (and counting) of data on daily trending YouTube videos. Data is included for the US, GB, DE, CA, and FR regions (USA, Great Britain, Germany, Canada, and France, respectively), with up to 200 listed trending videos per day.
EDIT: Now includes data from RU, MX, KR, JP and IN regions (Russia, Mexico, South Korea, Japan and India respectively) over the same time period.
Each region’s data is in a separate file. Data includes the video title, channel title, publish time, tags, views, likes and dislikes, description, and comment count.
The data also includes a category_id field, which varies between regions. To retrieve the categories for a specific video, find it in the associated JSON. One such file is included for each of the five regions in the dataset.
For more information on specific columns in the dataset refer to the column metadata.
Acknowledgements This dataset was collected using the YouTube API.
Inspiration Possible uses for this dataset could include:
Sentiment analysis in a variety of forms Categorising YouTube videos based on their comments and statistics. Training ML algorithms like RNNs to generate their own YouTube comments. Analysing what factors affect how popular a YouTube video will be. Statistical analysis over time. For further inspiration, see the kernels on this dataset!
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
YouTube was launched in 2005. It was founded by three PayPal employees: Chad Hurley, Steve Chen, and Jawed Karim, who ran the company from an office above a small restaurant in San Mateo. The first...
Facebook
TwitterComprehensive YouTube channel statistics for Global Data, featuring 699,000 subscribers and 266,756,805 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Gaming category and is based in US. Track 209 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.
Facebook
Twitterlikes
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Abstract: We present a method for estimating the ideology of political YouTube videos. The subfield of estimating ideology as a latent variable has often focused on traditional actors such as legislators while more recent work has used social media data to estimate the ideology of ordinary users, political elites, and media sources. We build on this work to estimate the ideology of a political YouTube video. First, we start with a matrix of political Reddit posts linking to YouTube videos and apply correspondence analysis to place those videos in an ideological space. Second, we train a language model with those estimated ideologies as training labels, enabling us to estimate the ideologies of videos not posted on Reddit. These predicted ideologies are then validated against human labels. We demonstrate the utility of this method by applying it to the watch histories of survey respondents to evaluate the prevalence of echo chambers on YouTube in addition to the association between video ideology and viewer engagement. Our approach gives video-level scores based only on supplied text metadata, is scalable, and can be easily adjusted to account for changes in the ideological landscape. Keywords: Ideology estimation, YouTube, latent variable This folder contains the replication materials for "Estimating the Ideology of Political YouTube Videos."
Facebook
TwitterComprehensive YouTube channel statistics for World Data 3D, featuring 728,000 subscribers and 284,141,438 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Lifestyle category and is based in US. Track 342 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
🎙️ YouTube Transcripts Dataset
Created by SN13(, powered by Bittensor
This dataset provides high-quality transcripts from 26,241 public YouTube videos, meticulously collected and formatted for research and development in natural language processing (NLP), summarization, and language modeling.
📦 Dataset Summary
Total Videos: 26,241 Total Hours: ~6,886.84 Language Coverage: Multilingual with translation to English(details below) Storage Format:… See the full description on the dataset page: https://huggingface.co/datasets/arrmlet/data-universe-youtube-1.
Facebook
Twitterhttps://networkrepository.com/policy.phphttps://networkrepository.com/policy.php
Youtube online social network - Youtube is a video-sharing web site that includes a social network. The dataset contains a list of all of the user-to-user links.
Facebook
TwitterComprehensive YouTube channel statistics for WatchData, featuring 1,630,000 subscribers and 1,047,113,232 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Health category and is based in US. Track 1,677 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ABSTRACT This research aims to analyze the use of Youtube as a useful platform for the activities of library and information science professionals in brazilian academic libraries. Related audiovisual practices of the university libraries to encourage activities and focus on the importance of the librarian as a content producer in the digital enviroment. The survey results serve as reference material for information scientists and managers of information units interested in sharing audiovisual information as a new way of relationship with their users. Finally, based on the results, it is recommended to plan the communication strategy on social media platforms as YouTube, and prepare relevant content to engage with their subscribers and users.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The file contains the names of all YouTube videos that were evaluated in the survey, along with their corresponding access links (for the paper - accessed occurred in March 2020), and the number of shares up to that date.
Facebook
TwitterI've been creating videos on YouTube since November of 2017 (https://www.youtube.com/c/KenJee1) with the mission of making data science accessible to more people. One of the best ways to do this is to tell stories and working on projects. This is my attempt at my first community project. I am making my YouTube data available for everyone to help better understand the growth of my YouTube community and think about ways that it could be improved! I would love for everyone in the community feel like they had some hand in contributing to the channel.
Announcement Video: https://youtu.be/YPph59-rTxA
I will be sharing my favorite projects in a few of my videos (with permission of course), and would also like to give away a few small prizes to the top featured notebooks. I hope you have fun with the analysis, I'm interested in seeing what you find in the data!
For those looking for a place to start, some things I'm thinking about are: - What are the themes of the comment data? - What types of video titles and thumbnails drive the most traffic? - Who is my core audience and what are they interested in? - What types of videos have lead to the most growth? - What type of content are people engaging with the most or watching the longest?
Some advanced projects could be: - Creating a chat bot to respond to common comments with videos where I have addressed a topic - Pulling sentiment from thumbnails and titles and comparing that with performance
Data I would like to add over time - Video descriptions - Video subtitles - Actual video data
There are four files in this repo. The relevant data included in most of them is from Nov 2017 - Jan 2022. I gathered some of this data via the YouTube API and the rest from my specific analytics.
1) Aggregated Metrics By Video - This has all the topline metrics from my channel from its start (around 2015 to Jan 22 2022). I didn't post my first video until around 2) Aggregated Metrics By Video with Country and Subscriber Status - This has the same data as aggregated metrics by video, but it includes dimensions for which country people are viewing from and if the viewers are subscribed to the channel or not. 3) Video Performance Over Time - This has the daily data from each of my videos. 4) All Comments - This is all of my comment data gathered from the YouTube API. I have anonymized the users so don't worry about your name showing up!
This obviously wouldn't be possible without all of the wonderful people who watch and interact with my videos! I'm incredibly grateful for you all and I'm so happy I can share this project with you!
I collected this data from the YouTube API and through my own google analytics. Thus use of it must uphold the YouTube API's terms of service: https://developers.google.com/youtube/terms/api-services-terms-of-service