https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
YouTube maintains a list of the top trending videos on the platform. According to Variety magazine, “To determine the year’s top-trending videos, YouTube uses a combination of factors including measuring users interactions (number of views, shares, comments and likes). Note that they’re not the most-viewed videos overall for the calendar year”.
Note that this dataset is a structurally improved version of this dataset.
This dataset includes several months (and counting) of data on daily trending YouTube videos. Data is included for the IN, US, GB, DE, CA, FR, RU, BR, MX, KR, and JP regions (India, USA, Great Britain, Germany, Canada, France, Russia, Brazil, Mexico, South Korea, and, Japan respectively), with up to 200 listed trending videos per day.
Each region’s data is in a separate file. Data includes the video title, channel title, publish time, tags, views, likes and dislikes, description, and comment count.
The data also includes a category_id field, which varies between regions. To retrieve the categories for a specific video, find it in the associated JSON. One such file is included for each of the 11 regions in the dataset.
For more information on specific columns in the dataset refer to the column metadata.
This dataset was collected using the YouTube API. This dataset is the updated version of Trending YouTube Video Statistics.
Possible uses for this dataset could include: - Sentiment analysis in a variety of forms - Categorizing YouTube videos based on their comments and statistics. - Training ML algorithms like RNNs to generate their own YouTube comments. - Analyzing what factors affect how popular a YouTube video will be. - Statistical analysis over time.
For further inspiration, see the kernels on this dataset!
This dataset contains statistics for a selection of YouTube videos, capturing metrics such as views, comments, likes, dislikes, and the timestamp when the data was recorded. The dataset provides insights into the popularity and engagement levels of these videos as of April 15, 2019. This data can be useful for analyzing trends in video performance, user engagement, and the impact of content over time.
File Description: This CSV file contains detailed statistics for a set of YouTube videos, including unique video identifiers and various engagement metrics. Each row represents a different video, and the columns provide specific data points related to the video's performance.
videostatsid: Unique identifier for each video statistics entry. ytvideoid: Unique YouTube video identifier. views: The total number of views the video has received. comments: The total number of comments posted on the video. likes: The total number of likes the video has received. dislikes: The total number of dislikes the video has received. timestamp:The date and time when the statistics were recorded, in the format YYYY-MM-DD HH:MM
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We present a three-year archival, longitudinal dataset of YouTube Trending videos, collected from July 1, 2022, to June 30, 2025, four retrieval per day. This collection, a unique historical record of digital culture in transition, includes 446,971 snapshots from 104 countries, encompassing 726,627 unique videos and their associated metadata. Each record includes collection timestamp, geographic region, video ranking, core identifiers (video ID, channel ID, category), content metadata (title, description, tags, localization), language information, live status, view and comment counts. Unlike previous datasets with limited geographic scope or short timeframes, our data offers exceptional coverage for cross-national and longitudinal analyses of digital culture. This non-personalized data corpus provides an irreplaceable baseline for understanding crisis communication, platform governance or temporal shifts in content popularity.
This dataset contains information about trending YouTube videos from multiple countries, providing valuable insights for predicting video popularity based on various attributes. The dataset includes both numerical and categorical features that are essential for analyzing viewer behavior, engagement, and trends in content creation. The original source of this dataset can be found at : https://www.kaggle.com/datasets/datasnaek/youtube-new/data
title: The title of the YouTube video.
channel_title: Name of the channel that published the video.
trending_date: The date the video started trending.
publish_date: The original upload date of the video.
publish_time: The exact time the video was published.
views: The total number of views the video received.
likes: The number of likes the video received.
dislikes: The number of dislikes the video received.
comment_count: The total number of comments on the video.
tags: Keywords or tags associated with the video, helping discoverability.
description: A detailed text description provided by the uploader.
category_id: The category assigned to the video (e.g., Music, Gaming, News).
Predicting the number of views on youtube videos based on video attributes. The goal is to develop a model that can accurately predict the number of views a video will receive, using various video attributes such as likes, shares, comments, video duration, and more.
RMSE (Root Mean Squared Error) RMSE is a metric that measures the magnitude of the error between the values predicted by the model (Predicted Views) and the actual values (Actual Views). The lower the RMSE value, the more accurate the model's predictions.
R² (Coefficient of Determination) R² measures the extent to which the model can explain the variation in the data. R² values range from 0 to 1, where 1 means the model can explain all the variation in the number of views based on the given attributes, and 0 means the model cannot explain the variation. The higher the R², the better the model is at predicting views and the more relevant the features used in the model.
The machine learning model was evaluated using several approaches, including different pre-processing techniques and multiple ML models. Ultimately, the chosen model for this analysis is the Random Forest Regressor. The final evaluation results show an RMSE of 630.741, indicating an average prediction error of approximately 630.741 units. Additionally, the R² score is 0.9623, meaning that the model explains 96.23% of the variance in the data (number of views). These results were deemed satisfactory and were selected as the final modeling approach for the system and its potential future applications.
https://brightdata.com/licensehttps://brightdata.com/license
Use our YouTube Videos dataset to extract detailed information from public videos and filter by video title, views, upload date, or likes. Data points include video URL, title, description, thumbnail, upload date, view count, like count, comment count, tags, and more. You can purchase the entire dataset or a customized subset, tailored to your needs. Popular use cases for this dataset include trend analysis, content performance tracking, brand monitoring, and influencer campaign optimization.
https://brightdata.com/licensehttps://brightdata.com/license
Use our YouTube profiles dataset to extract both business and non-business information from public channels and filter by channel name, views, creation date, or subscribers. Datapoints include URL, handle, banner image, profile image, name, subscribers, description, video count, create date, views, details, and more. You may purchase the entire dataset or a customized subset, depending on your needs. Popular use cases for this dataset include sentiment analysis, brand monitoring, influencer marketing, and more.
YouTube is an American online video-sharing platform headquartered in San Bruno, California. The service, created in February 2005 by three former PayPal employees—Chad Hurley, Steve Chen, and Jawed Karim—was bought by Google in November 2006 for US$1.65 billion and now operates as one of the company's subsidiaries. YouTube is the second most-visited website after Google Search, according to Alexa Internet rankings.
YouTube allows users to upload, view, rate, share, add to playlists, report, comment on videos, and subscribe to other users. Available content includes video clips, TV show clips, music videos, short and documentary films, audio recordings, movie trailers, live streams, video blogging, short original videos, and educational videos.
YouTube (the world-famous video sharing website) maintains a list of the top trending videos on the platform. According to Variety magazine, “To determine the year’s top-trending videos, YouTube uses a combination of factors including measuring users interactions (number of views, shares, comments, and likes). Note that they’re not the most-viewed videos overall for the calendar year”. Top performers on the YouTube trending list are music videos (such as the famously virile “Gangam Style”), celebrity and/or reality TV performances, and the random dude-with-a-camera viral videos that YouTube is well-known for.
This dataset is a daily record of the top trending YouTube videos.
Note that this dataset is a structurally improved version of this dataset.
This dataset was collected using the YouTube API. This Description is cited in Wikipedia.
In recent years, video has become one of the most popular online formats, spanning from educational content to product reviews. During the third quarter of 2024, music videos recorded the highest category reach, with almost half of internet users worldwide reporting to watch music videos online each week. Social video engagement In recent years, YouTube and TikTok have become two of the most important social media platforms for global users, as video content commands high levels of engagement. In 2024, users worldwide spent approximately 28.4 hours using the YouTube mobile app per month. Additionally, the leading hashtags used by content creators on TikTok have amassed billions of views: as of January 2024, the TikTok hashtag “fyp” or “for you page” had reached 55 and 35 billion post views, respectively. Watching content: what device do users prefer? In 2023, televisions were the most used devices for global viewers to watch video-on-demand (VOD), with 55 percent of respondents reporting using these devices. In comparison, 13 percent of respondents reported using smartphones. Age group and generation are factors impacting viewership habits and device preferences, as younger users appear to prefer using their smartphones to consume content. According to a March 2024 survey, U.S. users aged 18-34 years were more likely to watch video content on smartphones than any other devices. By comparison, connected TVs were particularly popular for the online video audience aged 35 and older.
By VISHWANATH SESHAGIRI [source]
This dataset contains YouTube video and channel metadata to analyze the statistical relation between videos and form a topic tree. With 9 direct features, 13 more indirect features, it has all that you need to build a deep understanding of how videos are related – including information like total views per unit time, channel views, likes/subscribers ratio, comments/views ratio, dislikes/subscribers ratio etc. This data provides us with a unique opportunity to gain insights on topics such as subscriber count trends over time or calculating the impact of trends on subscriber engagement. We can develop powerful models that show us how different types of content drive viewership and identify the most popular styles or topics within YouTube's vast catalogue. Additionally this data offers an intriguing look into consumer behaviour as we can explore what drives people to watch specific videos at certain times or appreciate certain channels more than others - by analyzing things like likes per subscribers and dislikes per views ratios for example! Finally this dataset is completely open source with an easy-to-understand Github repo making it an invaluable resource for anyone looking to gain better insights into how their audience interacts with their content and how they might improve it in the future
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
How to Use This Dataset
In general, it is important to understand each parameter in the data set before proceeding with analysis. The parameters included are totalviews/channelelapsedtime, channelViewCount, likes/subscriber, views/subscribers, subscriberCounts, dislikes/views comments/subscriberchannelCommentCounts,, likes/dislikes comments/views dislikes/ subscribers totviewes /totsubsvews /elapsedtime.
To use this dataset for your own analysis:1) Review each parameter’s meaning and purpose in our dataset; 2) Get familiar with basic descriptive statistics such as mean median mode range; 3) Create visualizations or tables based on subsets of our data; 4) Understand correlations between different sets of variables or parameters; 5) Generate meaningful conclusions about specific channels or topics based on organized graph hierarchies or tables.; 6) Analyze trends over time for individual parameters as well as an aggregate reaction from all users when videos are released
Predicting the Relative Popularity of Videos: This dataset can be used to build a statistical model that can predict the relative popularity of videos based on various factors such as total views, channel viewers, likes/dislikes ratio, and comments/views ratio. This model could then be used to make recommendations and predict which videos are likely to become popular or go viral.
Creating Topic Trees: The dataset can also be used to create topic trees or taxonomies by analyzing the content of videos and looking at what topics they cover. For example, one could analyze the most popular YouTube channels in a specific subject area, group together those that discuss similar topics, and then build an organized tree structure around those topics in order to better understand viewer interests in that area.
Viewer Engagement Analysis: This dataset could also be used for viewer engagement analysis purposes by analyzing factors such as subscriber count, average time spent watching a video per user (elapsed time), comments made per view etc., so as to gain insights into how engaged viewers are with specific content or channels on YouTube. From this information it would be possible to optimize content strategy accordingly in order improve overall engagement rates across various types of video content and channel types
If you use this dataset in your research, please credit the original authors.
License
Unknown License - Please check the dataset description for more information.
File: YouTubeDataset_withChannelElapsed.csv | Column name | Description | |:----------------------------------|:-------------------------------------------------------| | totalviews/channelelapsedtime | Ratio of total views to channel elapsed time. (Ratio) | | channelViewCount | Total number of views for the channel. (Integer) | | likes/subscriber ...
https://www.enterpriseappstoday.com/privacy-policyhttps://www.enterpriseappstoday.com/privacy-policy
Key YouTube Statistics (Editor’s Choice) YouTube recorded 70 billion monthly active users in March 2023, which includes 55.10% of worldwide active social media users. There have been more than 14 million daily active users currently on YouTube, in the United States of America this platform is accessed by 62% of users. YouTube is touted as the second largest search engine and the second most visited website after Google. Revenue earned by YouTube in the first two quarters of 2023 is around $14.358 billion. In 2023, YouTube Premium and YouTube Music have recorded 80 million subscribers collectively worldwide. YouTube consumers view more than a billion hours of video per day. YouTube has more than 38 million active channels. In the fourth quarter of 2021, YouTube ad revenue has been $8.6 billion. Around 3 million paid subscribers to access YouTube TV. YouTube Premium has around 1 billion paid users. In 2023, YouTube was banned in countries such as China excluding Macau and Hong Kong, Eritrea, Iran, North Korea, Turkmenistan, and South Sudan. With 166 million downloads, the YouTube app has become the second most downloaded entertainment application across the world after Netflix. With 91 million downloads, YouTube Kids has become the sixth most downloaded entertainment app in the world. Nearly 90% of digital consumers access YouTube in the US, making it the most popular social network for watching video content. Over 70% of YouTube viewership takes place on its mobile application. More than 70% of YouTube video content watched by people is suggested by its algorithm. The average duration of a video on YouTube is 12 minutes. An average YouTube user spends 20 minutes and 23 seconds on the platform daily. Around 28% of YouTube videos that are published by popular channels are in the English language. 77% of YouTube users watch comedy content on the platform. With 247 million subscribers, T-Series has become the most subscribed channel on YouTube. Around 50 million users log on to YouTube every day. YouTube's biggest concurrent views record has been at 2.3 billion from when SpaceX has gone live on the platform to unveil Falcon Heavy Rocket. The majority of YouTube users are in the age group of 15 to 35 years in the US. The male-female ratio of YouTube users is 11:9. Apple INC. has been touted as the biggest advertiser on YouTube in 2020 spending $237.15 million. YouTube produced total revenue of $19.7 billion in 2020. As of 2021, the majority of YouTube users (467 million) are from India. It is the most popular platform in the United States with 74 percent of adult users. YouTube contributes to nearly 25% of mobile traffic worldwide. Daily live streaming on YouTube has increased by 45% in total in 2020. In India, around 225 million people are active on the platform each hour as per the 2021 statistics. YouTube Usage and Viewership Statistics #1. YouTube accounts for more than 2 billion monthly active users Around 2.7 billion users log on to YouTube each month. The number of monthly active users of YouTube is expected to grow even further. #2. Around 14.3 billion people visit the platform every month The number of YouTube visitors is far higher compared to Facebook, Amazon, and Instagram. #3. YouTube is accessible across 100 countries in 80 languages. The platform is widely available across different communities and nations. #4. 53.9% of YouTube users are men and 46.1% of women use the platform As of 2023 statistics, 53.9% of men use the platform and 46.1% of women over 18 years are on YouTube. The share in the number of males and females is 1.38 billion and 1.18 billion respectively. Age Group Male Female 18 to 24 8.5% 6% 25 to 34 11.6% 8.6% 35 to 44 9% 7.5% 45 to 54 6.2% 5.7% 55 to 64 4.4% 4.5% Above 65 4.3% 5.4% #5. 99% of YouTube users are active on other social media networks as well. Fewer than 1% of YouTube users are solely dependent on the platform. #6. Users spend around 20 minutes and 23 seconds per day on YouTube on average It is quite a generous amount of time spent on any social network platform. #7. YouTube is the second most visited site worldwide With more than 14 billion visits per month, YouTube has become the second most visited site in the world. However, its parent company Google is the most visited site across the globe. As per the statistics, YouTube is the third most popular searched word on Google. #8. 694000 hours of video content are streamed on YouTube per minute YouTube has outweighed Netflix as well in terms of streaming video content. #9. Over 81% of total internet users have accessed YouTube #10. Nearly 450 million hours of video content are uploaded on YouTube each hour More than 5 billion videos are watched on YouTube per day. #11. India has the maximum numb
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
I created this dataset as part of a data analysis project and concluded that it might be relevant for others who are interested in examining in analyzing content on YouTube. This dataset is a collection of over 6000 videos having the columns:
Comments: comments count for the video
Through the YouTube API and using Python, I collect data about some of these popular channels' videos that provide educational content about Machine Learning and Data Science in order to extract insights about which topics had been popular within the last couple of years. Featured in the dataset are the following creators:
Krish Naik
Nicholas Renotte
Sentdex
DeepLearningAI
Artificial Intelligence — All in One
Siraj Raval
Jeremy Howard
Applied AI Course
Daniel Bourke
Jeff Heaton
DeepLearning.TV
Arxiv Insights
These channels are features in multiple top AI channels to subscribe to lists and have seen a big growth in the last couple of years on YouTube. They all have a creation date since or before 2018.
According to the average number of YouTube video comments as of September 2022, the most popular video games channel in Poland was HRejterzy with *** average comments on uploaded videos.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains engagement analytics from two prominent tech YouTube channels:
The purpose of this dataset is to analyze and compare the performance, engagement, and growth trends of both channels using metrics such as:
VideoID
Title
UploadDate
Views
Likes
Dislikes
(Note: Not available via API since 2021)Comments
Data collected using the YouTube Data API v3 between July 25–28, 2025. Only public video data is included.
Column | Description |
---|---|
VideoID | Unique ID of the video |
Title | Title of the video |
UploadDate | ISO format date of upload |
Views | Total views (at time of extraction) |
Likes | Number of likes |
Dislikes | Not available (deprecated in YouTube API) |
Comments | Number of comments |
Data is collected from publicly available sources (YouTube API). No copyrighted content is included.
During the first quarter of 2024, Huge YouTube accounts, which had over 50,000 followers, reported an engagement rate of approximately 6.2 percent on their short-format content. In comparison, engagement was sensibly lower on long-format videos, which reported an engagement rate of 1.72 percent for Huge accounts. Medium YouTube accounts, which had a following between 2,001 and 10,000 users, reported engagement ratings of almost three percent on their Shorts, while long videos had an engagement of around 0.15 percent.
This dataset was complied as a resource for analyzing viewer engagement, sentiment, and discussion trends on the Ben Shapiro YouTube channel over the specified period. It comprises user-generated comments extracted from the Ben Shapiro YouTube channel. The collection process involved first cataloging a comprehensive list of all videos published on the channel. Subsequently, these videos were categorized into three distinct time frames. From each time frame, the ten videos that garnered the highest number of comments were identified for detailed comment extraction. The extraction of videos and their associated comments was conducted utilizing YouTube Data Tools (Rieder, 2015). The dataset was finalized on September 12, 2022, and encompasses 711,909 comments ranging from September 1, 2020, to September 12, 2022. This dataset was uploaded and analyzed in the 4CAT: Capture & Anlysis Toolkit (Peeters & Hagen, 2022).
References:
Peeters, S., & Hagen, S. (2022). The 4CAT Capture and Analysis Toolkit: A Modular Tool for Transparent and Traceable Social Media Research. Computational Communication Research, 4(2), 571–589. https://doi.org/10.5117/CCR2022.2.007.HAGE
Rieder, B. (2015). YouTube Data Tools (1.11) [Computer software].
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Gum bleeding is a common dental problem, and numerous patients seek health-related information on this topic online. The YouTube website is a popular resource for people searching for medical information. To our knowledge, no recent study has evaluated content related to bleeding gums on YouTube™. Therefore, this study aimed to conduct a quantitative and qualitative analysis of YouTube videos related to bleeding gums. A search was performed on YouTube using the keyword "bleeding gums" from Google Trends. Of the first 200 results, 107 videos met the inclusion criteria. The descriptive statistics for the videos included the time since upload, the video length, and the number of likes, views, comments, subscribers, and viewing rates. The global quality score (GQS), usefulness score, and DISCERN were used to evaluate the video quality. Statistical analysis was performed using the Kruskal–Wallis test, Mann–Whitney test, and Spearman correlation analysis. The majority (n = 69, 64.48%) of the videos observed were uploaded by hospitals/clinics and dentists/specialists. The highest coverage was for symptoms (95.33%). Only 14.02% of the videos were classified as "good". The average video length of the videos rated as "good" was significantly longer than the other groups (p
As of February 2025, approximately 54 percent of YouTube users were male. By comparison, female users on the popular social video platform were approximately 46 percent of the total. In the last examined period, the United Arab Emirates and Israel were among the country with the highest YouTube penetration worldwide.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Sexually Abusive Comments and specific words collection from popular youtube videos such as music videos and cartoons (Peppa Pig)
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Background. YouTube is an increasingly important medium for consumer health information – with content provided by healthcare professionals, government and non-government organizations, industry, and consumers themselves. It is a rapidly developing area of study for healthcare researchers. We examine the methods used in reviews of YouTube consumer health videos to identify trends and best practices. Methods and Materials. Published reviews of consumer-oriented health-related YouTube videos were identified through PubMed. Data extracted from these studies included type of journal, topic, characteristics of the search, methods of review including number of reviewers and method to achieve consensus between reviewers, inclusion and exclusion criteria, characteristics of the videos reported, ethical oversight, and follow-up. Results. Thirty-three studies were identified. Most were recent and published in specialty journals. Typically, these included more than 100 videos, and were examined by multiple reviewers. Most studies described characteristics of the videos, number of views, and sometime characteristics of the viewers. Accuracy of portrayal of the health issue under consideration was a common focus. Conclusion. Optimal transparency and reproducibility of studies of YouTube health-related videos can be achieved by following guidance designed for systematic review reporting, with attention to several elements specific to the video medium. Particularly when seeking to replicate consumer viewing behavior, investigators should consider the method used to select search terms, and use a snowballing rather than a sequential screening approach. Discontinuation protocols for online screening of relevance ranked search results is an area identified for further development.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Gum bleeding is a common dental problem, and numerous patients seek health-related information on this topic online. The YouTube website is a popular resource for people searching for medical information. To our knowledge, no recent study has evaluated content related to bleeding gums on YouTube™. Therefore, this study aimed to conduct a quantitative and qualitative analysis of YouTube videos related to bleeding gums. A search was performed on YouTube using the keyword "bleeding gums" from Google Trends. Of the first 200 results, 107 videos met the inclusion criteria. The descriptive statistics for the videos included the time since upload, the video length, and the number of likes, views, comments, subscribers, and viewing rates. The global quality score (GQS), usefulness score, and DISCERN were used to evaluate the video quality. Statistical analysis was performed using the Kruskal–Wallis test, Mann–Whitney test, and Spearman correlation analysis. The majority (n = 69, 64.48%) of the videos observed were uploaded by hospitals/clinics and dentists/specialists. The highest coverage was for symptoms (95.33%). Only 14.02% of the videos were classified as "good". The average video length of the videos rated as "good" was significantly longer than the other groups (p
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
YouTube maintains a list of the top trending videos on the platform. According to Variety magazine, “To determine the year’s top-trending videos, YouTube uses a combination of factors including measuring users interactions (number of views, shares, comments and likes). Note that they’re not the most-viewed videos overall for the calendar year”.
Note that this dataset is a structurally improved version of this dataset.
This dataset includes several months (and counting) of data on daily trending YouTube videos. Data is included for the IN, US, GB, DE, CA, FR, RU, BR, MX, KR, and JP regions (India, USA, Great Britain, Germany, Canada, France, Russia, Brazil, Mexico, South Korea, and, Japan respectively), with up to 200 listed trending videos per day.
Each region’s data is in a separate file. Data includes the video title, channel title, publish time, tags, views, likes and dislikes, description, and comment count.
The data also includes a category_id field, which varies between regions. To retrieve the categories for a specific video, find it in the associated JSON. One such file is included for each of the 11 regions in the dataset.
For more information on specific columns in the dataset refer to the column metadata.
This dataset was collected using the YouTube API. This dataset is the updated version of Trending YouTube Video Statistics.
Possible uses for this dataset could include: - Sentiment analysis in a variety of forms - Categorizing YouTube videos based on their comments and statistics. - Training ML algorithms like RNNs to generate their own YouTube comments. - Analyzing what factors affect how popular a YouTube video will be. - Statistical analysis over time.
For further inspiration, see the kernels on this dataset!