By Emma Culwell [source]
This dataset offers an extensive look at some of the most popular movie franchises in history, shedding light on their financial success and public reception. It includes data on the lifetime gross sales, budgets, ratings, and release dates of each featured movie. Furthermore, this dataset provides invaluable insights into how different elements such as ratings and runtime can affect the performance of a film at the box office. Whether you are an aspiring or established filmmaker looking for inspiration to craft your own successful blockbuster or simply a fan curious about these films’ inner workings, this dataset offers an unprecedented level of detail regarding many beloved franchises
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides comprehensive information on movie franchises released worldwide between 2000 and 2020. It includes data such as lifetime gross, budget, rating, runtime, release date and vote count/average. This dataset can be used to gain insights on the global movie industry trends over this time period.
The data can be explored in various ways to identify patterns of success or failure among movie franchises across countries, genres or decades. For example, you may want to examine the average budget for movies released each year or calculate the average number of votes received by movies of a particular genre. Additionally, you could use this dataset to compare different types of media (e.g., cable vs streaming) and understand how they impact box-office performance.
To get the most out of this data set it is essential that you first familiarize yourself with all the columns provided: Title: The title of the movie; Lifetime Gross: Total amount money earned by a franchise in all territories; Year: The year in which it was first made available publicly; Studio: The production company behind the production; Rating: Classification given by MPAA/BBFC; Runtime: Length in minutes/hours; Budget: Amount spent producing it ; Release Date : Date when publically announced Availability ; Vote Average : Average ratings based on user reviews ; Vote Count : Number people who rated franchise).
Once you have become comfortable with these variables then feel free to try out some larger analysis techniques such as predictive analytics (predicting future success based on existing trends) or clustering (grouping similar outcomes together). No matter which methods you decide to utilize it is important that you remember – always validate your assumptions! Good luck exploring!
- A comparison of movie budget to box office returns, to identify over/underperforming movies.
- A study of the correlation between movie rating and viewership.
- An analysis of what types of movies tend to become franchise success stories (big budget, PG-13 rating, etc.)
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: MovieFranchises.csv | Column name | Description | |:-------------------|:------------------------------------------------------------------------| | Title | The title of the movie. (String) | | Lifetime Gross | The total amount of money the movie has made in its lifetime. (Integer) | | Year | The year the movie was released. (Integer) | | Studio | The studio that produced the movie. (String) | | Rating | The rating of the movie (e.g. PG-13, R, etc). (String) | | Runtime | The length of the movie in minutes. (Integer) | | Budget | The budget of the movie in USD. (Integer) | | ReleaseDate | The date the movie was released. (Date) | | VoteAvg | The average rating of the movie from users. (Float) | | VoteCount | The total number of votes the movie has received from users. (Integer) |
If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Emma Culwell.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The baseball industry is one of the most lucrative sports in the world, with players and teams earning substantial amounts of money each year. The salaries of baseball players are determined by a variety of factors, including their performance statistics, years of experience, and the financial resources of their respective teams. To gain a better understanding of how these factors impact player salaries, a comprehensive dataset has been compiled that contains information on baseball player statistics and team financials.
The dataset includes information on player salaries, performance metrics such as batting average, home runs, and RBI, as well as team data such as win-loss records and payroll. With this information, researchers and analysts can explore the relationship between player performance and compensation, as well as the spending habits of individual teams.
This dataset has the potential to provide valuable insights into the inner workings of the baseball industry and could be used to inform decisions related to player contracts, team management, and league policies. Additionally, the dataset may be of interest to fans of the sport who want to better understand how their favorite players and teams are compensated.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in Iran was worth 436.91 billion US dollars in 2024, according to official data from the World Bank. The GDP value of Iran represents 0.41 percent of the world economy. This dataset provides - Iran GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset was created to simulate a market basket dataset, providing insights into customer purchasing behavior and store operations. The dataset facilitates market basket analysis, customer segmentation, and other retail analytics tasks. Here's more information about the context and inspiration behind this dataset:
Context:
Retail businesses, from supermarkets to convenience stores, are constantly seeking ways to better understand their customers and improve their operations. Market basket analysis, a technique used in retail analytics, explores customer purchase patterns to uncover associations between products, identify trends, and optimize pricing and promotions. Customer segmentation allows businesses to tailor their offerings to specific groups, enhancing the customer experience.
Inspiration:
The inspiration for this dataset comes from the need for accessible and customizable market basket datasets. While real-world retail data is sensitive and often restricted, synthetic datasets offer a safe and versatile alternative. Researchers, data scientists, and analysts can use this dataset to develop and test algorithms, models, and analytical tools.
Dataset Information:
The columns provide information about the transactions, customers, products, and purchasing behavior, making the dataset suitable for various analyses, including market basket analysis and customer segmentation. Here's a brief explanation of each column in the Dataset:
Use Cases:
Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in Australia was worth 1752.19 billion US dollars in 2024, according to official data from the World Bank. The GDP value of Australia represents 1.65 percent of the world economy. This dataset provides - Australia GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This comprehensive dataset offers insights into the 2025 construction industry, highlighting topics like global market size trends, employment growth in construction, technological innovations in building, sustainable development practices, and future outlook of the construction sector.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Apple is one of the most influential and recognisable brands in the world, responsible for the rise of the smartphone with the iPhone. Valued at over $2 trillion in 2021, it is also the most valuable...
Oracle’s cloud services and license support division is the company’s most profitable business segment, bringing in over ** billion U.S. dollars in its 2024 fiscal year. In that year, Oracle brought in annual revenue of close to ** billion U.S. dollars, its highest revenue figure to date. Oracle Corporation Oracle was founded by Larry Ellison in 1977 as a tech company primarily focused on relational databases. Today, Oracle ranks among the largest companies in the world in terms of market value and serves as the world’s most popular database management system provider. Oracle’s success is not only reflected in its booming sales figures, but also in its growing number of employees: between fiscal year 2008 and 2021, Oracle’s total employee number has grown substantially, increasing from around ****** to *******. Database market The global database market reached a size of ** billion U.S. dollars in 2020. Database Management Systems (DBMSs) provide a platform through which developers can organize, update, and control large databases, with products like Oracle, MySQL, and Microsoft SQL Server being the most widely used in the market.
In July 2024, global industrial production, excluding the United States, increased by 1.5 percent compared to the same time in the previous year, based on three month moving averages. This is compared to an increase of 0.2 percent in advanced economies (excluding the United States) for the same time period. The global industrial production collapsed after the outbreak of COVID-19, but increased steadily in the months after, peaking at 23 percent in June 2021. Industrial growth rate tracks the output production in the industrial sector.
In 2024, global retail e-commerce sales reached an estimated ************ U.S. dollars. Projections indicate a ** percent growth in this figure over the coming years, with expectations to come close to ************** dollars by 2028. World players Among the key players on the world stage, the American marketplace giant Amazon holds the title of the largest e-commerce player globally, with a gross merchandise value of nearly *********** U.S. dollars in 2024. Amazon was also the most valuable retail brand globally, followed by mostly American competitors such as Walmart and the Home Depot. Leading e-tailing regions E-commerce is a dormant channel globally, but nowhere has it been as successful as in Asia. In 2024, the e-commerce revenue in that continent alone was measured at nearly ************ U.S. dollars, outperforming the Americas and Europe. That year, the up-and-coming e-commerce markets also centered around Asia. The Philippines and India stood out as the swiftest-growing e-commerce markets based on online sales, anticipating a growth rate surpassing ** percent.
The revenue in the 'Sports Equipment' segment of the toys & hobby market in the United States was forecast to continuously increase between 2025 and 2029 by in total 3.5 billion U.S. dollars (+17.2 percent). After the ninth consecutive increasing year, the revenue is estimated to reach 23.86 billion U.S. dollars and therefore a new peak in 2029. Find further information regarding revenue in Mexico and average revenue per user (ARPU) in Mexico. The Statista Market Insights cover a broad range of additional markets.
In July 2024, the merchandise exports index worldwide, excluding the U.S., stood at 204.8. This is compared to an index value of 143 for the United States in the same month. The index was highest in emerging economies, reaching an index score of 353. Moreover, the merchandise imports index was also highest in emerging economies. The merchandise exports index is the U.S. dollar value of goods sold to the rest of the world, deflated by the U.S. Consumer Price Index (CPI).
In 2023, Meta Platforms had a total annual revenue of over 134 billion U.S. dollars, up from 116 billion in 2022. LinkedIn reported its highest annual revenue to date, generating over 15 billion USD, whilst Snapchat reported an annual revenue of 4.6 billion USD.
As of April 2024, Facebook had an addressable ad audience reach 131.1 percent in Libya, followed by the United Arab Emirates with 120.5 percent and Mongolia with 116 percent. Additionally, the Philippines and Qatar had addressable ad audiences of 114.5 percent and 111.7 percent.
As of April 2024, almost 32 percent of global Instagram audiences were aged between 18 and 24 years, and 30.6 percent of users were aged between 25 and 34 years. Overall, 16 percent of users belonged to the 35 to 44 year age group.
Instagram users
With roughly one billion monthly active users, Instagram belongs to the most popular social networks worldwide. The social photo sharing app is especially popular in India and in the United States, which have respectively 362.9 million and 169.7 million Instagram users each.
Instagram features
One of the most popular features of Instagram is Stories. Users can post photos and videos to their Stories stream and the content is live for others to view for 24 hours before it disappears. In January 2019, the company reported that there were 500 million daily active Instagram Stories users. Instagram Stories directly competes with Snapchat, another photo sharing app that initially became famous due to it’s “vanishing photos” feature.
As of the second quarter of 2021, Snapchat had 293 million daily active users.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
By Emma Culwell [source]
This dataset offers an extensive look at some of the most popular movie franchises in history, shedding light on their financial success and public reception. It includes data on the lifetime gross sales, budgets, ratings, and release dates of each featured movie. Furthermore, this dataset provides invaluable insights into how different elements such as ratings and runtime can affect the performance of a film at the box office. Whether you are an aspiring or established filmmaker looking for inspiration to craft your own successful blockbuster or simply a fan curious about these films’ inner workings, this dataset offers an unprecedented level of detail regarding many beloved franchises
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides comprehensive information on movie franchises released worldwide between 2000 and 2020. It includes data such as lifetime gross, budget, rating, runtime, release date and vote count/average. This dataset can be used to gain insights on the global movie industry trends over this time period.
The data can be explored in various ways to identify patterns of success or failure among movie franchises across countries, genres or decades. For example, you may want to examine the average budget for movies released each year or calculate the average number of votes received by movies of a particular genre. Additionally, you could use this dataset to compare different types of media (e.g., cable vs streaming) and understand how they impact box-office performance.
To get the most out of this data set it is essential that you first familiarize yourself with all the columns provided: Title: The title of the movie; Lifetime Gross: Total amount money earned by a franchise in all territories; Year: The year in which it was first made available publicly; Studio: The production company behind the production; Rating: Classification given by MPAA/BBFC; Runtime: Length in minutes/hours; Budget: Amount spent producing it ; Release Date : Date when publically announced Availability ; Vote Average : Average ratings based on user reviews ; Vote Count : Number people who rated franchise).
Once you have become comfortable with these variables then feel free to try out some larger analysis techniques such as predictive analytics (predicting future success based on existing trends) or clustering (grouping similar outcomes together). No matter which methods you decide to utilize it is important that you remember – always validate your assumptions! Good luck exploring!
- A comparison of movie budget to box office returns, to identify over/underperforming movies.
- A study of the correlation between movie rating and viewership.
- An analysis of what types of movies tend to become franchise success stories (big budget, PG-13 rating, etc.)
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: MovieFranchises.csv | Column name | Description | |:-------------------|:------------------------------------------------------------------------| | Title | The title of the movie. (String) | | Lifetime Gross | The total amount of money the movie has made in its lifetime. (Integer) | | Year | The year the movie was released. (Integer) | | Studio | The studio that produced the movie. (String) | | Rating | The rating of the movie (e.g. PG-13, R, etc). (String) | | Runtime | The length of the movie in minutes. (Integer) | | Budget | The budget of the movie in USD. (Integer) | | ReleaseDate | The date the movie was released. (Date) | | VoteAvg | The average rating of the movie from users. (Float) | | VoteCount | The total number of votes the movie has received from users. (Integer) |
If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit Emma Culwell.