https://data.gov.tw/license
This dataset provides national theater box office statistics for films distributed by the Administrative Institution National Film and Audiovisual Culture Center. The data is current through the last Sunday before the announcement date and excludes films that have been screened for fewer than 7 calendar days. The earliest CSV format data in this dataset begins on July 30, 2018, and the earliest JSON format data begins on March 1, 2020. JSON format queries require a start date and an end date (each given as year, month, and day) and return data for a maximum of 90 days at a time.
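
Because a single JSON query covers at most 90 days, a longer period has to be split into several requests. The following is a minimal sketch of that splitting only, not of the API itself. It assumes the limit counts both the start and end date as inclusive days and that dates are written as `YYYY/MM/DD`; neither detail is stated in the description, and `split_date_range` and `format_date` are hypothetical helper names.

```python
from datetime import date, timedelta

MAX_DAYS = 90  # maximum span per JSON query, per the dataset description


def split_date_range(start, end, max_days=MAX_DAYS):
    """Yield (window_start, window_end) pairs covering start..end inclusive,
    each spanning at most max_days calendar days."""
    if end < start:
        raise ValueError("end date must not precede start date")
    current = start
    while current <= end:
        window_end = min(current + timedelta(days=max_days - 1), end)
        yield current, window_end
        current = window_end + timedelta(days=1)


def format_date(d):
    # Assumed format (year/month/day); adjust to whatever the service expects.
    return d.strftime("%Y/%m/%d")


# Example: everything from the first JSON data date to the end of 2020.
windows = list(split_date_range(date(2020, 3, 1), date(2020, 12, 31)))
for window_start, window_end in windows:
    print(format_date(window_start), format_date(window_end))
```

Each printed pair could then be used as the start and end date of one query.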
In 2021, the global box office revenue added up to approximately 21.3 billion U.S. dollars, up from 11.8 billion dollars a year earlier – an annual increase of 80.5 percent. Still, the 2021 result amounted to only a little more than half of the 42.3-billion-dollar box office revenue recorded in 2019, before the COVID-19 outbreak. Furthermore, the share of 3D films in the global revenue went from six percent in 2020 to 6.6 percent in 2021.
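
The growth and recovery figures quoted above can be checked with a short calculation. This sketch uses only the three revenue figures from the paragraph; the function and variable names are illustrative.

```python
def percent_change(old, new):
    """Relative change from old to new, in percent."""
    return (new - old) / old * 100


# Figures quoted in the paragraph above, in billions of U.S. dollars.
revenue_2019 = 42.3
revenue_2020 = 11.8
revenue_2021 = 21.3

growth_2021 = percent_change(revenue_2020, revenue_2021)
share_of_2019 = revenue_2021 / revenue_2019 * 100

print(f"2020 to 2021 growth: {growth_2021:.1f}%")        # 80.5%
print(f"2021 as a share of 2019: {share_of_2019:.1f}%")  # 50.4%
```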
Cinema market: a challenging comeback
The pandemic changed the film industry by emptying movie theaters and accelerating the increase in video streaming penetration. In the so-called North American movie market – which consists of Canada and the United States (including the unincorporated territories of Guam and Puerto Rico) – the box office revenue more than doubled between 2020 and 2021. But the latter figure amounted to less than 40 percent of the pre-COVID-19 result. Meanwhile, subscription video-on-demand (SVoD) platforms went even further. Netflix kept the top spot while new competitors such as Disney+ diversified the offering.
Big players on the big screen
The global cinema segment extends well beyond North America, though. China alone sold more than 1.1 billion movie tickets throughout 2021, making it the leading market worldwide, right above the U.S. India ranked third with almost 380 million tickets sold that same year. With a vast film culture – even larger than its iconic Bollywood industry – Indian cinema features a myriad of languages and offers advertisers access to a gargantuan audience.
In 2021, the global box office revenue amounted to approximately **** billion U.S. dollars, out of which more than half – **** billion dollars – came from the Asia Pacific region. In Europe, the Middle East, and Africa – collectively known as EMEA – the box office revenue added up to **** billion dollars or almost one-fourth of the total. Despite the increase when compared to 2020, the 2021 figures remained distant from the ****-billion-dollar global revenue recorded in 2019, before the COVID-19 outbreak.
What is box office data used for?
The term box office revenue refers to the total revenue generated through movie ticket sales. It is primarily used to measure and compare the commercial success of a film. Ticket sales may account for a large portion of the film industry’s total revenue – especially before the pandemic. They are also the main source of income for movie theaters.
Leading box office markets
The United States and Canada – known as the North American movie market – were the leading box office market worldwide for several decades. But China, alongside other Asian markets, has also begun to make its mark on the global movie industry in recent years. Bollywood movies, in particular, are gaining popularity outside of India. While the Indian film industry released far more movies than China and the U.S. until the coronavirus outbreak, its box office revenues remained comparatively small.
In 2024, Mexican movies generated close to four percent of Mexico's overall box office revenue – representing a decrease from the previous year. None of the top ten most-watched films at movie theaters in Mexico in 2024 were made in that North American country.
In 2023, the highest-grossing film worldwide was "Barbie", with a box office revenue of around **** billion U.S. dollars. This corresponds to about half of the revenue generated by "Avengers: Endgame" in 2019: about *** billion dollars. This makes "Avengers: Endgame" one of the most commercially successful movies of all time.
This data set was scraped from the site https://www.the-numbers.com/ using Python 3. It has data on more than 13k movies and contains monetary data (Domestic Box Office, Infl. Adj. Dom. BO, Opening Weekend, and more) as well as "creative" cinema data (Comparisons, Creative Type, Genre, and more). The complete scraping code I wrote to create the data set is available in my profile: https://www.kaggle.com/code/mayasoffer/movies-data-scraper
Please note that the data was scraped entirely from the "The-numbers" website, therefore:
- There is some missing data, in accordance with the missing data on the site.
- The scraping was performed on 01.03.22 (March 2022), so all the data is accurate as of that time.
- For more information on how the columns were created and where the site got that data initially, please look into the site itself.
- Lastly, note that I scraped the data and saved it as CSV. However, all the columns were scraped in their original form, as they were written on the website, so some "cleaning" of the columns is necessary before any analysis can take place.
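
As an illustration of the kind of cleaning mentioned above, here is a minimal sketch that turns a scraped money string into an integer. It assumes the monetary columns hold strings such as `$1,234,567` and that missing values appear as empty or non-numeric text; the actual formats should be checked against the CSV. `parse_money` is a hypothetical helper, and the sample values are made up.

```python
import re


def parse_money(raw):
    """Convert a string such as "$1,234,567" to an int.

    Returns None when the value is missing or contains no digits.
    """
    if raw is None:
        return None
    digits = re.sub(r"[^\d]", "", raw)  # drop "$", commas, spaces, etc.
    return int(digits) if digits else None


# Made-up sample values in the assumed scraped format.
samples = ["$1,234,567", "", "N/A", "$95"]
cleaned = [parse_money(value) for value in samples]
print(cleaned)  # [1234567, None, None, 95]
```

Note that this sketch ignores signs and decimals, so it only suits columns holding whole, non-negative dollar amounts.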
The data is very diverse, contains a lot of different columns, and goes back to 1995, so the analysis options are many. Here are a few analysis leads I thought about:
- How have genres changed throughout the years? What genres are the most popular throughout the years (revenue-wise, legs, opening week...)? Which new genres gained popularity (animation, for example)?
- Does MPAA rating impact revenue?
And much more...
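
The first lead (most popular genre per year, revenue-wise) could be approached along these lines. This is only a sketch on made-up, already-cleaned rows: the keys `year`, `genre`, and `domestic_box_office` are hypothetical stand-ins for the cleaned CSV columns, and the numbers are not real data.

```python
from collections import defaultdict

# Made-up rows standing in for the cleaned data set.
rows = [
    {"year": 1995, "genre": "Adventure", "domestic_box_office": 100},
    {"year": 1995, "genre": "Comedy", "domestic_box_office": 150},
    {"year": 1996, "genre": "Adventure", "domestic_box_office": 300},
    {"year": 1996, "genre": "Comedy", "domestic_box_office": 120},
    {"year": 1996, "genre": "Adventure", "domestic_box_office": 50},
]

# Sum revenue per (year, genre) pair.
totals = defaultdict(int)
for row in rows:
    totals[(row["year"], row["genre"])] += row["domestic_box_office"]

# Keep the genre with the highest total revenue for each year.
top_genre_by_year = {}
for (year, genre), total in totals.items():
    best = top_genre_by_year.get(year)
    if best is None or total > best[1]:
        top_genre_by_year[year] = (genre, total)

for year in sorted(top_genre_by_year):
    print(year, *top_genre_by_year[year])
```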
Thank you for using my dataset!
Between 1995 and 2024, PG-13-rated movies grossed approximately 126.64 billion U.S. dollars at the North American box office – a term that excludes Mexico and includes Canada and the United States. R-rated and PG-rated films grossed around 69.28 billion and 56.04 billion dollars, respectively.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We collected a movie dataset from the Internet Movie Database (IMDB) website for our experiments, using an IMDbPy script to extract all the movie metadata. We obtained the box office revenues from The Movies Dataset, Box-office Mojo and The Movie Database (TMDB). These databases predominantly consisted of movies from 2006 to 2020 in various countries, and we also collected movie posters. We also used the Open Images dataset V6 for object detection of movie posters.
Between 1995 and 2025, a movie based on comics or graphic novels grossed, on average, about 88.36 million U.S. dollars across the United States and Canada – collectively known as the North American box office. Spin-offs followed as the second-most commercially successful film source material, with average box office revenue of around 86.32 million dollars.
The impact of the COVID pandemic on worldwide box office revenues has reduced the estimated figure from **** billion U.S. dollars to **** billion for the year 2020. The July estimates indicate a more severe impact than was predicted in March 2020, when revenue was expected to drop to only **** billion U.S. dollars. Across the globe, cinemas were shut down for Q2 2020, and the damage to revenue is projected to last for the next 5 years, although small annual growth is still expected as of 2021.
In 2024, the annual box office revenue of domestic movies in China decreased to about 33 billion yuan. Domestically produced movies have enjoyed increasing popularity among Chinese moviegoers, taking up nearly 80 percent of the market.
In 2021, the global box office revenue added up to **** billion U.S. dollars, up from **** billion dollars a year earlier – an annual growth of **** percent. Still, the 2021 figure amounted to little more than half of the ****-billion-dollar revenue recorded in 2019, before the COVID-19 outbreak.
Will the U.S. movie market recover?
Altogether, Canada and the United States (including the unincorporated territories of Guam and Puerto Rico) form what is known as the North American film market. This region is recovering at a slower pace than the global average. In 2021, the North American box office revenue amounted to **** billion dollars. This represents less than ** percent of the *****-billion-dollar result reported two years before.
The Asian film industry
In 2021, China consolidated its upper hand in the global movie market. The country had already placed first in the worldwide ranking of box office revenues in 2020 when it first surpassed the U.S. and Canada. India is also on the rise. As Bollywood movies become more popular outside the country, their performances at box offices across the world reach tens of billions of Indian rupees.
During the weekend ending on June 15, 2025, the movie "How to Train Your Dragon" was the highest-grossing movie worldwide, with global box office revenue of almost 200 million U.S. dollars. "Lilo & Stitch" came in second with less than 50 million U.S. dollars.
The global box office
In 2021, worldwide box office revenue grew by over 80 percent, standing at around 21.3 billion dollars. Despite the increase, the figure amounted to little more than half of the 42.3-billion-dollar result recorded in 2019, before the pandemic. But the slow recovery is not homogeneous. Box office gross in the Asia-Pacific (APAC) region, for instance, skyrocketed by about 88 percent in 2021. APAC alone accounted for more than half of the global value that year.
3D movies worldwide
The COVID-19 outbreak impacted the 3D segment in particular since the use of this technology often aims at attracting viewers to theaters. In 2021, 3D films' share in global box office revenue stood below seven percent. Two years earlier, it surpassed 15 percent. A decrease in premieres in the most influential cinema market on the planet is both cause and consequence of that reduced percentage. In 2020 and 2021, the number of 3D movies released in the U.S. and Canada amounted to 12. Between 2012 and 2019, that figure had never stood below 33.
The annual global revenue of the ten highest-grossing independent movies has fluctuated over the past decade. After reaching its lowest value yet in 2020, it hit an all-time high three years later, with a box office revenue of 1.63 billion U.S. dollars. In 2024, the industry recorded a revenue loss of 18 percent in comparison to the previous year.
Movie theaters in Japan recorded a total box office gross of about 207 billion Japanese yen in 2024, which was a decrease of more than 14 billion yen compared to the previous year. The box office gross strongly declined in 2020 due to the impact of the COVID-19 pandemic, but managed to reach pre-pandemic levels by 2022.
In 2024, total earnings at the box office across the United States and Canada amounted to around 8.56 billion U.S. dollars, down from 8.91 billion dollars in the previous year. The 2024 figure also remained below the revenue recorded in 2019.
Lights, camera, action – literally
The initial recovery in the box office was followed by a return in market concentration. As of February 2023, the "Big Five" major film studios – Disney, Paramount, Sony/Columbia, Universal Pictures, and Warner Bros. – collectively held a market share of over 80 percent in the U.S. and Canada. Meanwhile, the action genre remained the most popular movie genre of the year.
Diversity attracts moviegoers
Over 60 percent of Gen Zers surveyed in the U.S. in May 2022 mentioned the movie offerings as the main reason to watch motion pictures in theaters. This suggests that new generations of moviegoers may be losing interest in some of the themes abundant in Hollywood productions. Between April 2018 and November 2021, the share of internet users in the U.S. who said they enjoyed superhero movies but were getting tired of so many of them went from 17 percent to 23 percent.
Between 1995 and 2024, an adventure movie grossed, on average, 57.34 million U.S. dollars at the North American box office – a term that excludes Mexico and includes Canada and the United States. The box office revenue of a documentary stood at an average of a little less than a million dollars.
In 2023, the box office revenue in North America totaled *** billion U.S. dollars, or ** percent of the total global box office revenue that year. In terms of regional results, Latin America recorded **** billion dollars at the box office that year. When it comes to single markets, China had the highest revenue, with **** billion dollars in 2023.
In 2024, the global box office revenue amounted to approximately ** billion U.S. dollars, out of which more than half – **** billion dollars – came from the international market (without China). Despite being on the rise in comparison to 2022, box office revenues have yet to fully recover to pre-pandemic levels.
In 2024, imported movies accounted for around **** percent of the total box office revenue in China, after hitting the lowest point in 2022. Compared to a decade ago, slightly less than half of the cinema ticket sales revenue in the country was generated from domestically produced movies.