Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Netflix subscribers and revenue by country’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/pariaagharabi/netflix2020 on 28 January 2022.
--- Dataset description provided by original source is as follows ---
I prepare this dataset for one of my courses to show how Netflix’s subscription figures and Netflix's revenue($) have grown in four different regions: - the United States and Canada, - Europe, the Middle East, and Africa, - Latin America, - Asia-Pacific over the last 2.5 years. According to the final month of the quarter 2020(March) was being the start of the global coronavirus pandemic in many countries, Netflix noted that it added 26 million paid new subscribers in the first two quarters of 2020 alone; in 2019, the company added 28 million subscribers in total.
Dataset Description: This dataset contains four CSV files. 1. DataNetflixRevenue2020_V2.csv: three columns Area, Years, Revenue.
DataNetflixSubscriber2020_V2.csv: three columns Area, Years, Subscribers.
NetflixSubscribersbyCountryfrom2018toQ2_2020.csv: eleven columns Area, Q1 - 2018, Q2 - 2018, Q3 - 2018, Q4 - 2018, Q1 - 2019, Q2 - 2019, Q3 - 2019, Q4 - 2019, Q1 - 2020, Q2 - 2020
Netflix'sRevenue2018toQ2_2020.csv: eleven columns Area, Q1 - 2018, Q2 - 2018, Q3 - 2018, Q4 - 2018, Q1 - 2019, Q2 - 2019, Q3 - 2019, Q4 - 2019, Q1 - 2020, Q2 - 2020
--- Original source retains full ownership of the source dataset ---
Netflix's global subscriber base has reached an impressive milestone, surpassing *** million paid subscribers worldwide in the fourth quarter of 2024. This marks a significant increase of nearly ** million subscribers compared to the previous quarter, solidifying Netflix's position as a dominant force in the streaming industry. Adapting to customer losses Netflix's growth has not always been consistent. During the first half of 2022, the streaming giant lost over *** million customers. In response to these losses, Netflix introduced an ad-supported tier in November of that same year. This strategic move has paid off, with the lower-cost plan attracting ** million monthly active users globally by November 2024, demonstrating Netflix's ability to adapt to changing market conditions and consumer preferences. Global expansion Netflix continues to focus on international markets, with a forecast suggesting that the Asia Pacific region is expected to see the most substantial growth in the upcoming years, potentially reaching around **** million subscribers by 2029. To correspond to the needs of the non-American target group, the company has heavily invested in international content in recent years, with Korean, Spanish, and Japanese being the most watched non-English content languages on the platform.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here is the full breakdown of Netflix subscribers by region.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here is the full breakdown of Netflix global subscribers by year since 2013.
In 2024, Netflix revealed that it had 89.63 million paying streaming subscribers in the United States and Canada. North America had long been Netflix's biggest market, though subscriber numbers in the EMEA region surpassed that in the U.S. and Canada for the first time during 2022. The number of paid streaming memberships in Asia Pacific grew the most, by 13 percent compared with the previous year.
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
I extracted this data to find the unpopular movies on Netflix. The dataset I used here comes directly from Netflix movies data, which consists of 4 text data files, each file contains over 20M rows, over 4K movies, and 400K, customers. Altogether over are 17K movies and 500K+ customers!
I made some modifications and I extracted the e df_avgRating_with_usersCount.csv
from the original data after applying some mathematical operations to get the average ratings and the count of users who made the ratings for each movie in movie_id
below. Feel free to browse and use the data within your notebooks.
Here you could find my previous notebook on Kaggle to extract the dataset
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In this post, you'll see how the Netflix platform is evolving, how many users Netflix has and how they perform against the growing competition.
Netflix held the Netflix Prize open competition for the best algorithm to predict user ratings for films. The grand prize was $1,000,000 and was won by BellKor's Pragmatic Chaos team. This is the dataset that was used in that competition.
This comes directly from the README:
The file "training_set.tar" is a tar of a directory containing 17770 files, one per movie. The first line of each file contains the movie id followed by a colon. Each subsequent line in the file corresponds to a rating from a customer and its date in the following format:
CustomerID,Rating,Date
Movie information in "movie_titles.txt" is in the following format:
MovieID,YearOfRelease,Title
The qualifying dataset for the Netflix Prize is contained in the text file "qualifying.txt". It consists of lines indicating a movie id, followed by a colon, and then customer ids and rating dates, one per line for that movie id. The movie and customer ids are contained in the training set. Of course the ratings are withheld. There are no empty lines in the file.
MovieID1:
CustomerID11,Date11
CustomerID12,Date12
...
MovieID2:
CustomerID21,Date21
CustomerID22,Date22
For the Netflix Prize, your program must predict the all ratings the customers gave the movies in the qualifying dataset based on the information in the training dataset.
The format of your submitted prediction file follows the movie and customer id, date order of the qualifying dataset. However, your predicted rating takes the place of the corresponding customer id (and date), one per line.
For example, if the qualifying dataset looked like:
111:
3245,2005-12-19
5666,2005-12-23
6789,2005-03-14
225:
1234,2005-05-26
3456,2005-11-07
then a prediction file should look something like:
111:
3.0
3.4
4.0
225:
1.0
2.0
which predicts that customer 3245 would have rated movie 111 3.0 stars on the 19th of Decemeber, 2005, that customer 5666 would have rated it slightly higher at 3.4 stars on the 23rd of Decemeber, 2005, etc.
You must make predictions for all customers for all movies in the qualifying dataset.
To allow you to test your system before you submit a prediction set based on the qualifying dataset, we have provided a probe dataset in the file "probe.txt". This text file contains lines indicating a movie id, followed by a colon, and then customer ids, one per line for that movie id.
MovieID1:
CustomerID11
CustomerID12
...
MovieID2:
CustomerID21
CustomerID22
Like the qualifying dataset, the movie and customer id pairs are contained in the training set. However, unlike the qualifying dataset, the ratings (and dates) for each pair are contained in the training dataset.
If you wish, you may calculate the RMSE of your predictions against those ratings and compare your RMSE against the Cinematch RMSE on the same data. See http://www.netflixprize.com/faq#probe for that value.
The training data came in 17,000+ files. In the interest of keeping files together and file sizes as low as possible, I combined them into four text files: combined_data_(1,2,3,4).txt
The contest was originally hosted at http://netflixprize.com/index.html
The dataset was downloaded from https://archive.org/download/nf_prize_dataset.tar
This is a fun dataset to work with. You can read about the winning algorithm by BellKor's Pragmatic Chaos here
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset offers a comprehensive historical record of Netflix’s stock price movements, capturing the company’s financial journey from its early days to its position as a global streaming giant.
From its IPO in May 2002, Netflix (Ticker: NFLX) has transformed from a DVD rental service to a powerhouse in on-demand digital content. With its disruptive innovation, strategic shifts, and global expansion, Netflix has seen dramatic shifts in stock prices, reflecting not just market trends but also cultural impact. This dataset provides a window into that evolution.
Each row in this dataset represents daily trading activity on the stock market and includes the following columns:
The data is structured in CSV format and is clean, easy to use, and ready for immediate analysis.
Whether you're learning data science, building a financial model, or exploring machine learning in the real world, this dataset is a goldmine of insights. Netflix's market history includes:
This makes the dataset ideal for:
This dataset is designed for:
The dataset is derived from publicly available historical stock price data, such as Yahoo Finance, and has been cleaned and organized for educational and research purposes. It is continuously maintained to ensure accuracy.
Netflix’s rise is more than just a business story — it’s a data-driven journey. With this dataset, you can analyze the company’s stock behavior, train models to predict future trends, or simply visualize how tech reshapes the market.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here is the breakdown of Netflix’s revenue earnings year over year from 2011.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Netflix produced more than 2,769 hours of original content in 2019. This was a huge 80.15% increase compared to 2018. Netflix had over 2,000 originals at the beginning of 2021.
https://scoop.market.us/privacy-policyhttps://scoop.market.us/privacy-policy
Streaming Services Statistics: Streaming services have transformed the entertainment landscape, revolutionizing how people consume content.
The advent of high-speed internet and the proliferation of smart devices have fueled the growth of these platforms, offering a wide array of movies, TV shows, music, and more, at the viewers' convenience.
This introduction provides an overview of key statistics that shed light on the impact, trends, and challenges within the streaming industry.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Netflix has been met with tons of competition from major multinational companies. These are the key Netflix Statistics you need to know.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘📺 Hulu TV Shows and Movies’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/top-1000-most-popular-hulu-showse on 13 February 2022.
--- Dataset description provided by original source is as follows ---
Background
In the market today, there are only a few competitors for Netflix. One of the top leaders in the streaming shows market is Hulu. Recently my family switched from Netflix to Hulu to try out the options and our experience wasn’t as wonderful as we would have expected. However, instead of giving my opinion, lets pull some data for Hulu shows and analyze the results so that you can make your own decision. For those interested in viewing my micro-research study that I did on Netflix, you can read my article Netflix Show Analysis.
About the Data
The data from Hulu’s shows were in a well-structured format. An example of the JSON object can be found at HuluShows. The 1,000 shows were sorted by “popular of all time” on Hulu. The raw data can be found at raw data.
Sources
Thanks to the micro-research study at The Concept Center for performing this study.
This dataset was created by Chase Willden and contains around 1000 samples along with Show/is Subscriber Only, Show/key Art Url, technical information and other features such as: - Show/description - Show/show Rollups/auth On Web/games Count - and more.
- Analyze Show/annotations/0 in relation to Show/show Rollups/current/feature Films Count
- Study the influence of Show/show Rollups/free On Web/games Count on Show/show Rollups/showtime/html5 Videos Count
- More datasets
If you use this dataset in your research, please credit Chase Willden
--- Original source retains full ownership of the source dataset ---
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Netflix Prize was a competition devised by Netflix to improve the accuracy of its recommendation system. To facilitate this Netflix released real ratings about movies from the users (voters) of the system. Any set of movies can be transformed into an election via a process outlined by Mattei, Forshee, and Goldsmith.This data set includes all 5 candidate elections with at least 350 voters generated by this process from 300 randomly chosen movies. Extending beyond prior work by Mattei et al. we allow for weak preferences, i.e., a voter is indifferent between a set of movies if he assigns the same rating to each of them. Thus, there are 541 possibilities to rank a given set of five movies.The archive is gzip compressed and includes 165,672 elections in PrefLib.org's TOC file format (Orders with Ties - Complete List).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These are the top 10 countries for Netflix in terms of penetration rate.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The average Netflix user spends 3.2 hours per day streaming content on Netflix.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The company reported that its users are 49% women and 51% men.
Attribution-NoDerivs 4.0 (CC BY-ND 4.0)https://creativecommons.org/licenses/by-nd/4.0/
License information was derived automatically
Network monitoring and analysis of consumption behavior represents an important aspect for network operators allowing to obtain vital information about consumption trends in order to offer new data plans aimed at specific users and obtain an adequate perspective of the network. Over-the-top (OTT) media and communications services and applications are shifting the Internet consumption by increasing the traffic generation over the different available networks. OTT refers to applications that deliver audio, video, and other media over the Internet by leveraging the infrastructure deployed by network operators but without their involvement in the control or distribution of the content and are known by their large consumption of network resources.
This dataset contains 1581 instances and 131 attributes on a single file. Each instance represents a user’s consumption profile which holds summarized information about the consumption behavior of the user related to the 29 OTT applications identified in the different IP flows captured in order to create the dataset
The OTT applications that the users interacted with during the capture experiment and were stored on the dataset are: Amazon, Apple store, Apple Icloud, Apple Itunes, Deezer, Dropbox, EasyTaxi, Ebay, Facebook, Gmail, Google suite, Google Maps, Browsing (HTTP, HTTP_Connect, HTTP_Download, HTTP_Proxy), Instagram, LastFM, Microsoft One Drive (MS_One_Drive), Facebook Messenger (MSN), Netflix, Skype, Spotify, Teamspeak, Teamviewer, Twitch, Twitter, Waze, Whatsapp, Wikipedia, Yahoo and Youtube.
Each application has 4 different types of attributes (quantity of generated flows, mean duration of the flows, average size of the packets exchanged on the flows and the mean bytes per second on the flows). These attributes summarizes the interaction that the user had with the respective OTT application in terms of consumption. Furthermore, the dataset contains the user’s IP address in network and decimal format which are used as user identifiers. Finally the User Group attribute represents the objective class (high consumption, medium consumption and low consumption) in which a user is classified considering his/her OTT consumption behavior. All of this information gives a total of 131 attributes.
For further information you can read and please cite the following papers:
Springer: https://link.springer.com/chapter/10.1007/978-3-319-95168-3_37
IEEExplore: https://ieeexplore.ieee.org/document/8845576
The structure of the attributes and its definition is presented below:
Source.Decimal: This attribute holds the user’s IP address in decimal format and it is mainly used as a user identifier.
Source.IP: This attribute holds the user’s IP address in network format (e.g., 192.168.14.35) and as in the previous case its main function is to work as a user identifier.
Application-Name.Flows: This type of attributes hold the information about the quantity of IP flows that a user generated toward an OTT application. As was mentioned before each application has a group of 4 attributes that describe the interaction of the user with a specific OTT application (an example for this case would be Netflix.Flows or Facebook.Flows).
Application-Name.Flow.Duration.Mean: This type of attributes hold the information related to the mean duration (time) of the flows generated by the user towards a specific OTT application, measured in microseconds. Examples of how this attributes are stored in the dataset are: Amazon.Flow.Duration.Mean or Instagram.Flow.Duration.Mean.
Application-Name.AVG.Packet.Size: This type of attributes hold the average size of the IP packets that were exchanged in all the flows generated by the user towards a specific OTT application, measured in bytes. It is important to notice that this size is focused on the packet’s header only. Examples of how this attribute are presented on the dataset are: Google_Maps.AVG.Packet.Size or Spotify.AVG.Packet.Size.
Application-Name.Flow.Bytes.Per.Sec: This type of attributes hold the mean number of bytes per second that were exchanged in the flows generated by the user towards a specific OTT application. Examples of this kind of attributes in the dataset are: Deezer.Flow.Bytes.Per.Sec or Skype.Flow.Bytes.Per.Sec.
User.Group: This type of attribute represents the objective class of the dataset i.e., the different groups that the users are classified in according to their OTT consumption behavior...
In the fourth quarter of 2024, Netflix generated total revenue of over **** billion U.S. dollars, up from about *** billion dollars in the corresponding quarter of 2023. The company's annual revenue in 2024 amounted to around ** billion U.S. dollars, continuing the impressive year-on-year growth Netflix has enjoyed over the last decade. Netflix’s global position Netflix’s revenue has been heavily impacted by its ever-growing global subscriber base. The leading Netflix market is Europe, Middle East, and Africa, surpassing the U.S. and Canada in terms of subscriber count. Netflix has also significantly increased its licensed and produced content assets since 2016. Despite concerns among investors that the company’s content spend was negatively affecting cash flow, Netflix’s plans to amortize its content assets long-term along with generating revenue from other sources such as licensing and merchandise should ensure the company’s future profitability. Netflix’s original content Netflix is also fortunate in that many of its original shows have been a hit with consumers across the globe. Shows such as “Orange is the New Black,” “Black Mirror,” and “House of Cards” won the hearts of subscribers long ago, but newer content such as English-language shows “Bridgerton,” “Wednesday,” and “Stranger Things,” as well as local TV shows such as “Squid Game” have also been favorably reviewed and proved popular among users.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Netflix subscribers and revenue by country’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/pariaagharabi/netflix2020 on 28 January 2022.
--- Dataset description provided by original source is as follows ---
I prepare this dataset for one of my courses to show how Netflix’s subscription figures and Netflix's revenue($) have grown in four different regions: - the United States and Canada, - Europe, the Middle East, and Africa, - Latin America, - Asia-Pacific over the last 2.5 years. According to the final month of the quarter 2020(March) was being the start of the global coronavirus pandemic in many countries, Netflix noted that it added 26 million paid new subscribers in the first two quarters of 2020 alone; in 2019, the company added 28 million subscribers in total.
Dataset Description: This dataset contains four CSV files. 1. DataNetflixRevenue2020_V2.csv: three columns Area, Years, Revenue.
DataNetflixSubscriber2020_V2.csv: three columns Area, Years, Subscribers.
NetflixSubscribersbyCountryfrom2018toQ2_2020.csv: eleven columns Area, Q1 - 2018, Q2 - 2018, Q3 - 2018, Q4 - 2018, Q1 - 2019, Q2 - 2019, Q3 - 2019, Q4 - 2019, Q1 - 2020, Q2 - 2020
Netflix'sRevenue2018toQ2_2020.csv: eleven columns Area, Q1 - 2018, Q2 - 2018, Q3 - 2018, Q4 - 2018, Q1 - 2019, Q2 - 2019, Q3 - 2019, Q4 - 2019, Q1 - 2020, Q2 - 2020
--- Original source retains full ownership of the source dataset ---