Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Using a Python script to scrape data from the web, we collected data pertaining to all 1698 Hindi language movies that released in India across a 13 year period (2005-2017) from the website of Box Office India.
Dataset Description: Movies and TV Series Metadata
The "Movies and TV Series Metadata" dataset is a comprehensive collection of entertainment information, comprising 10,000 records for both movies and TV series, obtained using the TMDb (The Movie Database) API. This dataset is designed to facilitate research and analysis within the realm of media content, offering a rich resource of 20,000 rows and 8 columns.
Columns:
Researchers, analysts, and enthusiasts interested in media and entertainment can leverage this dataset to conduct various analyses, such as exploring the distribution of content across different languages and genres, identifying trends in popular keywords, understanding the involvement of specific cast or crew members in successful productions, and much more. With its well-structured and diverse content, the "Movies and TV Series Metadata" dataset presents a valuable resource for advancing knowledge and insights in the realm of media content analysis.
Details about IMFDB: Indian Movie Face database (IMFDB) is a large unconstrained face database consisting of 34512 images of 100 Indian actors collected from more than 100 videos. All the images are manually selected and cropped from the video frames resulting in a high degree of variability interms of scale, pose, expression, illumination, age, resolution, occlusion, and makeup. IMFDB is the first face database that provides a detailed annotation of every image in terms of age, pose, gender, expression and type of occlusion that may help other face related applications.
This dataset is modified in such a way that it is ready for training a Face Recognition model. For dataset with annotations as mentioned above, you can download from here(official): https://cvit.iiit.ac.in/projects/IMFDB/
Acknowledgements: https://cvit.iiit.ac.in/projects/IMFDB/ Shankar Setty, Moula Husain, Parisa Beham, Jyothi Gudavalli, Menaka Kandasamy, Radhesyam Vaddi, Vidyagouri Hemadri, J C Karure, Raja Raju, Rajan, Vijay Kumar and C V Jawahar. "Indian Movie Face Database: A Benchmark for Face Recognition Under Wide Variations" National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), 2013.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
6319 Global import shipment records of Hindi Films with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.
This dataset provides a detailed catalogue of television shows and movies available on Disney+ Hotstar, a leading Indian subscription video on-demand service [2]. Disney+ Hotstar, owned by Novi Digital Entertainment of Disney Star, integrates content from Disney Star's local networks, including films, TV series, live sports, and original programming [2]. It also features content licensed from third-parties such as HBO and Showtime [2]. Following Disney's acquisition of 21st Century Fox in 2019, the platform expanded in April 2020 to include original programming, films, and television series from major Disney brands like Walt Disney Studios, Pixar, Marvel Studios, Lucasfilm, and National Geographic [2]. The service quickly became a dominant streaming platform in India and also operates in Indonesia, Malaysia, and Thailand, combining local, third-party entertainment with the broader Disney+ library [2]. This dataset offers insights into the platform's content offerings and media consumption trends [2, 3].
This dataset comprises 6,245 unique TV shows and movies, each described by 10 distinct attributes [4]. The content spans release years from 1928 to 2023 [6]. Analysis of the content reveals that Drama accounts for 30% of the entries, Comedy for 12%, with other genres making up the remaining 59% [6]. The age ratings are predominantly U/A 13+ (43%) and U (18%) [6]. For movies, running times range from 1 to 229 minutes, with a notable concentration between 115.00 and 137.80 minutes [7]. The dataset is composed of 66% movies and 34% TV shows [7]. The typical data file format is CSV [8].
This dataset is ideal for a variety of applications and use cases [1]: * Analysing content trends and genre popularity on streaming platforms [3]. * Developing and evaluating recommender systems for media content [3]. * Conducting market research on entertainment and media consumption [2, 3]. * Performing Natural Language Processing (NLP) tasks using content descriptions and titles [3]. * Studying content distribution across different age ratings and release years. * Understanding the content catalogue of a major over-the-top streaming service [2].
The geographic scope of the content primarily pertains to India, given Disney+ Hotstar's origins and primary market [2]. However, the service also operates in Indonesia, Malaysia, and Thailand [2]. Hotstar additionally targets overseas Indian audiences in markets such as Singapore, Canada, and the United Kingdom, although it operates as a service distinct from Disney+ in these regions [2]. The dataset includes content released between 1928 and 2023 [6]. Demographic scope is addressed through various age ratings assigned to the content, such as U/A 13+, U, U/A 16+, A, and U/A 7+ [6, 7]. The listed region for the dataset is GLOBAL [9].
CC-BY-SA
This dataset is suitable for a wide range of users and their specific needs [1]: * Data Scientists and Analysts: To perform statistical analysis, identify content trends, and build predictive models within the entertainment sector. * Academics and Researchers: To study media consumption patterns, content strategies, and the cultural impacts of streaming services. * Developers: For building and enhancing content recommendation engines or AI models aimed at media understanding and content generation [3, 9]. * Content Creators and Producers: To gain insights into popular genres, themes, and audience preferences for new productions. * Marketplace Users: Seeking data related to entertainment and media consumption, specifically movies and TV shows [3].
Original Data Source: Disney+ Hotstar Tv and Movie Catalog
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
12638 Global exporters importers export import shipment records of Hindi films with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
1698 Global exporters importers export import shipment records of Indian cinema with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
849 Global export shipment records of Indian Cinema with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Using a Python script to scrape data from the web, we collected data pertaining to all 1698 Hindi language movies that released in India across a 13 year period (2005-2017) from the website of Box Office India.