Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
If you use this dataset anywhere in your work, kindly cite as the below: L. Gupta, "Google Play Store Apps," Feb 2019. [Online]. Available: https://www.kaggle.com/lava18/google-play-store-apps
While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix-like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern-day techniques (like dynamic page load) using JQuery making scraping more challenging.
Each app (row) has values for catergory, rating, size, and more.
This information is scraped from the Google Play Store. This app information would not be available without it.
The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!
https://brightdata.com/licensehttps://brightdata.com/license
This dataset encompasses a wide-ranging collection of Google Play applications, providing a holistic view of the diverse ecosystem within the platform. It includes information on various attributes such as the title, developer, monetization features, images, app descriptions, data safety measures, user ratings, number of reviews, star rating distributions, user feedback, recent updates, related applications by the same developer, content ratings, estimated downloads, and timestamps. By aggregating this data, the dataset offers researchers, developers, and analysts an extensive resource to explore and analyze trends, patterns, and dynamics within the Google Play Store. Researchers can utilize this dataset to conduct comprehensive studies on user behavior, market trends, and the impact of various factors on app success. Developers can leverage the insights derived from this dataset to inform their app development strategies, improve user engagement, and optimize monetization techniques. Analysts can employ the dataset to identify emerging trends, assess the performance of different categories of applications, and gain valuable insights into consumer preferences. Overall, this dataset serves as a valuable tool for understanding the broader landscape of the Google Play Store and unlocking actionable insights for various stakeholders in the mobile app industry.
Google Play Store dataset to explore detailed information about apps, including ratings, descriptions, updates, and developer details. Popular use cases include app performance analysis, market research, and consumer behavior insights.
Use our Google Play Store dataset to explore detailed information about apps available on the platform, including app titles, developers, monetization features, user ratings, reviews, and more. This dataset also includes data on app descriptions, safety measures, download counts, recent updates, and compatibility, providing a complete overview of app performance and features.
Tailored for app developers, marketers, and researchers, this dataset offers valuable insights into user preferences, app trends, and market dynamics. Whether you're optimizing app development, conducting competitive analysis, or tracking app performance, the Google Play Store dataset is an essential resource for making data-driven decisions in the mobile app ecosystem.
This dataset is ideal for a variety of applications:
CUSTOM Please review the respective licenses below: 1. Data Provider's License - Bright Data Master Service Agreement
~Up to $0.0025 per record. Min order $250
Approximately 10M new records are added each month. Approximately 13.8M records are updated each month. Get the complete dataset each delivery, including all records. Retrieve only the data you need with the flexibility to set Smart Updates.
New snapshot each month, 12 snapshots/year Paid monthly
New snapshot each quarter, 4 snapshots/year Paid quarterly
New snapshot every 6 months, 2 snapshots/year Paid twice-a-year
New snapshot one-time delivery Paid once
Use the OpenWeb Ninja Google Play App Store Data API to access comprehensive data on Google Play Store, including Android Apps / Games, reviews, top charts, search, and more. Our extensive dataset provides over 40 app store data points, enabling you to gain deep insights into the market.
The App Store Data dataset includes all key app details:
App Name, Description, Rating, Photos, Downloads, Version Information, App Size, Permissions, Developer and Contact Information, Consumer Review Data.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset is a collection of user reviews for various Google Apps available on the Play Store. It provides detailed insights into user feedback, ratings, and engagement with different applications. The dataset's primary purpose is to offer a rich resource for understanding user sentiment, identifying app performance issues, and tracking user satisfaction over time. It is a valuable asset for analytics and natural language processing tasks related to app reviews.
The dataset contains over 90,000 app reviews. The score
column shows a distribution across ratings, with substantial counts for scores like 1.00-1.20, 2.00-2.20, 3.00-3.20, 4.00-4.20, and 4.80-5.00. For thumbsUpCount
, the majority of reviews have a relatively low number of likes (0-720), but there are instances with significantly higher counts, reaching up to over 14,000 likes. The reviewCreatedVersion
column shows a variety of app versions, with some being more frequently reviewed than others. Review creation dates span a period from April 2014 to February 2021, with a notable increase in review volume towards the later years, particularly between May 2020 and February 2021.
This dataset is ideal for: * Sentiment analysis of app reviews. * Natural Language Processing (NLP) tasks, such as topic modelling, text classification, and entity recognition. * App performance monitoring and identifying user pain points. * Market research on user satisfaction and trends in app usage. * Developing AI and Machine Learning models for predicting app ratings or automatically classifying feedback.
The dataset offers global coverage for app reviews. The time range for review creation spans from 10th April 2014 to 4th February 2021. While developer replies are included, the data on repliedAt
primarily indicates a single latest date (4th February 2021) with the majority being null, suggesting that developer reply timestamps are not as broadly distributed across the dataset as review creation times.
CC0
Original Data Source: Google Apps Playstore Reviews
Get access to information about all apps in the Google Playstore to understand your competitors, market to app developers etc. This dataset includes all the fields available in the play store such as:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
App Permission data of 2.2 million android applications from Google Play store. Backup repo: https://github.com/gauthamp10/android-permissions-dataset
I've collected the data with the help of Python and Scrapy running on a cloud virtual machine with the United States as geolocation. The data was collected on June 2021.
Also checkout:
I couldn't have build this dateset without the help of Digitalocean and github. Switched to facundoolano/google-play-scraper for sane reasons.
Took inspiration from: https://www.kaggle.com/gauthamp10/google-playstore-apps to build a big database for students and researchers who are interested to analyze and find insights on mobile application privacy.
Gautham Prakash
My other projects: github.com/gauthamp10
Website: gauthamp10.github.io
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Playstore Analysis’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/madhav000/playstore-analysis on 30 September 2021.
--- Dataset description provided by original source is as follows ---
Google Play Store team had launched a new feature wherein, certain apps that are promising, are boosted in visibility. The boost will manifest in multiple ways including higher priority in recommendations sections (“Similar apps”, “You might also like”, “New and updated games”). These will also get a boost in search results visibility. This feature will help bring more attention to newer apps that have the potential.
The problem is to identify the apps that are going to be good for Google to promote. App ratings, which are provided by the customers, is always a great indicator of the goodness of the app. The problem reduces to: predict which apps will have high ratings.
Google Play Store team is about to launch a new feature wherein, certain apps that are promising, are boosted in visibility. The boost will manifest in multiple ways including higher priority in recommendations sections (“Similar apps”, “You might also like”, “New and updated games”). These will also get a boost in search results visibility. This feature will help bring more attention to newer apps that have the potential.
Dataset: Google Play Store data (“googleplaystore.csv”)
Fields in the data: App: Application name Category: Category to which the app belongs Rating: Overall user rating of the app Reviews: Number of user reviews for the app Size: Size of the app Installs: Number of user downloads/installs for the app Type: Paid or Free Price: Price of the app Content Rating: Age group the app is targeted at - Children / Mature 21+ / Adult Genres: An app can belong to multiple genres (apart from its main category). For example, a musical family game will belong to Music, Game, Family genres. Last Updated: Date when the app was last updated on Play Store Current Ver: Current version of the app available on Play Store Android Ver: Minimum required Android version
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This paper introduces GLARE an Arabic Apps Reviews dataset collected from Saudi Google PlayStore. It consists of 76M reviews, 69M of which are Arabic reviews of 9,980 Android Applications. We present the data collection methodology, along with a detailed Exploratory Data Analysis (EDA) and Feature Engineering on the gathered reviews. We also highlight possible use cases and benefits of the dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Google Play Store Category wise Top 500 Apps’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/shakthidhar/google-play-store-category-wise-top-500-apps on 13 February 2022.
--- Dataset description provided by original source is as follows ---
Google Play stores top 500 app data based on their rankings on January 2022 for all the available categories. Link to scraping code: https://github.com/Shakthi-Dhar/AppPin Link to backup datafiles: github data files
The dataset contains the top 500 android apps available on the google play store for the following categories: All Categories, Art & Design, Auto & Vehicles, Beauty, Books & Reference, Business, Comics, Communication, Education, Entertainment, Events, Finance, Food & Drink, Health & Fitness, House & Home, Libraries & Demo, Lifestyle, Maps & Navigation, Medical, Music & Audio, News & Magazines, Parenting, Personalization, Photography, Productivity, Shopping, Social, Sports, Tools, Travel & Local, and Video Players & Editors.
The app rankings are based on google play store app rankings for January 2022.
In Review and Downloads, the alphabet T, L, Cr represents Thousands, Lakhs, Crores as per the google play store naming convention. They are similar to M, B which represent millions, billions. 1L (1 Lakh) = 100T (100 Thousand) 10L (10 Lakhs) = 1M (1 Million) 1Cr( 1 Crore) = 10M (10 Million)
This data is not provided directly by Google, so I used Appium an automation tool with python to scrape the data from the google play store app.
Inspired by Fortune500. Fortune500 provides data on top companies in the world, so why not have a data source for top apps in the world.
--- Original source retains full ownership of the source dataset ---
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Reviews on Messengers Dataset - Review dataset
The Reviews on Messengers Dataset is a comprehensive collection of 200 the most recent customer reviews on 6 messengers obtained from the popular app store, Google Play. See the list of the apps below. This dataset encompasses reviews written in 5 different languages: English, French, German, Italian, Japanese.
💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/messengers-reviews-google-play.
This dataset was constructed from an analysis of about 1.5 million apps from Google Play to identify a set of common libraries, to facilitate Android app analysis. It contains 1,113 libraries supporting common functionalities and 240 libraries for advertisement.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository contains the dataset for our study "A Large-Scale Empirical Study of Android Sports Apps in the Google Play Store" and this will help to replicate our study, also the replication package to direct you to help replicate it for your dataset too.
Note: The dataset given are protected with password, and the password is available in our published paper
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The amount of Android apps available for download is constantly increasing, exerting a continuous pressure on developers to publish outstanding apps. Google Play (GP) is the default distribution channel for Android apps, which provides mobile app users with metrics to identify and report apps quality such as rating, amount of downloads, previous users comments, etc. In addition to those metrics, GP presents a set of top charts that highlight the outstanding apps in different categories. Both metrics and top app charts help developers to identify whether their development decisions are well valued by the community. Therefore, app presence in these top charts is a valuable information when understanding the features of top-apps. In this paper we present Hall-of-Apps, a dataset containing top charts' apps metadata extracted (weekly) from GP, for 4 different countries, during 30 weeks. The data is presented as (i) raw HTML files, (ii) a MongoDB database with all the information contained in app's HTML files (e.g., app description, category, general rating, etc.), and (iii) data visualizations built with the D3.js framework. A first characterization of the data along with the urls to retrieve it can be found in our online appendix: https://thesoftwaredesignlab.github.io/hall-of-apps-tools/
There's a story behind every dataset and here's your opportunity to share yours. Based on installs, reviews you can sort out the apps. A clear picture can be drawn of apps, you can find out apps of what category are the most expensive, most popular, have most installs. Also various comparison can be done based on the data given in the dataset.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Apple App Store Key StatisticsApps & Games in the Apple App StoreApps in the Apple App StoreGames in the Apple App StoreMost Popular Apple App Store CategoriesPaid vs Free Apps in Apple App...
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
App Download Key StatisticsApp and Game DownloadsiOS App and Game DownloadsGoogle Play App and Game DownloadsGame DownloadsiOS Game DownloadsGoogle Play Game DownloadsApp DownloadsiOS App...
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This dataset contains 76,535 real user reviews collected from the Google Play Store across seven popular music streaming applications: Spotify, Apple Music, SoundCloud, TIDAL, Deezer, Shazam, and Google Play Music.
Each review includes: * 🌐 The app name * 📝 The review content * ⭐ A star rating from 1 to 5 * 📱 The app version (if available) * 📅 The date the review was written
This dataset is cleaned (empty or invalid entries removed) but intentionally unaltered in tone, preserving user expressions (including slang, emojis, and punctuation). Total entries: 76,535 Language: Primarily English Date range: Varied (depending on app)
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This data set contains meta data about more than a million third-party Android application that were collected from the Google Play store between 2017 and 2019. Two privacy scores were calculated for each application based on: permission requests, broadcast receivers, and user's privacy preferences. The scores also depend on other applications in the app's category. The scores were calculated based on two published formulas. The first fomulas was proposed by Mohsel et al. and published in TrustCom '18, "Countering intrusiveness using new security-centric ranking algorithm built on top of elasticsearch". The second formula was published in SPSM '16 Taylor, and Martinovic, "SecuRank: Starving Permission-Hungry Apps Using Contextual Permission Analysis"s
This dataset was created by Ashish Patel
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
If you use this dataset anywhere in your work, kindly cite as the below: L. Gupta, "Google Play Store Apps," Feb 2019. [Online]. Available: https://www.kaggle.com/lava18/google-play-store-apps
While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix-like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern-day techniques (like dynamic page load) using JQuery making scraping more challenging.
Each app (row) has values for catergory, rating, size, and more.
This information is scraped from the Google Play Store. This app information would not be available without it.
The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!