Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Apple App Store Key StatisticsApps & Games in the Apple App StoreApps in the Apple App StoreGames in the Apple App StoreMost Popular Apple App Store CategoriesPaid vs Free Apps in Apple App...
https://brightdata.com/licensehttps://brightdata.com/license
This dataset encompasses a wide-ranging collection of Google Play applications, providing a holistic view of the diverse ecosystem within the platform. It includes information on various attributes such as the title, developer, monetization features, images, app descriptions, data safety measures, user ratings, number of reviews, star rating distributions, user feedback, recent updates, related applications by the same developer, content ratings, estimated downloads, and timestamps. By aggregating this data, the dataset offers researchers, developers, and analysts an extensive resource to explore and analyze trends, patterns, and dynamics within the Google Play Store. Researchers can utilize this dataset to conduct comprehensive studies on user behavior, market trends, and the impact of various factors on app success. Developers can leverage the insights derived from this dataset to inform their app development strategies, improve user engagement, and optimize monetization techniques. Analysts can employ the dataset to identify emerging trends, assess the performance of different categories of applications, and gain valuable insights into consumer preferences. Overall, this dataset serves as a valuable tool for understanding the broader landscape of the Google Play Store and unlocking actionable insights for various stakeholders in the mobile app industry.
Use the OpenWeb Ninja Google Play App Store Data API to access comprehensive data on Google Play Store, including Android Apps / Games, reviews, top charts, search, and more. Our extensive dataset provides over 40 app store data points, enabling you to gain deep insights into the market.
The App Store Data dataset includes all key app details:
App Name, Description, Rating, Photos, Downloads, Version Information, App Size, Permissions, Developer and Contact Information, Consumer Review Data.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Google Play App Reviews dataset contains valuable feedback from users who have reviewed apps on the Google Play Store. This dataset includes both user ratings and detailed comments, making it ideal for sentiment analysis, user experience evaluation, and app performance research.
Column Name | Description |
---|---|
review_id | Unique identifier for each review. 🆔 |
user_name | Name of the user who submitted the review. 👤 |
review_title | Title of the review (may be empty in some cases). 📝 |
review_description | The content or feedback given by the user about the app. 💬 |
rating | Rating given by the user, ranging from 1 (low) to 5 (high). ⭐ |
thumbs_up | Number of thumbs up the review received. 👍 |
review_date | Date and time the review was submitted. 📅 |
developer_response | Response from the app developer (if provided). 💬👨💻 |
developer_response_date | Date when the developer responded to the review. 📅💻 |
appVersion | The version of the app when the review was submitted. 📱🔢 |
language_code | The language in which the review was written (e.g., 'en' for English). 🗣️ |
country_code | The country of the user based on their review (e.g., 'us' for United States). 🌍 |
Ready to dive into the world of app feedback and sentiment analysis? Explore the dataset, build models to understand user sentiments, and enhance app experiences based on real feedback.
Happy coding! ✨
While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix-like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern-day techniques (like dynamic page load) using JQuery making scraping more challenging.
Each app (row) has values for catergory, rating, size, and more.
This information is scraped from the Google Play Store. This app information would not be available without it.
The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!
We built a crawler to collect data from the Google Play store including the application's metadata and APK files. The manifest files were extracted from the APK files and then processed to extract the features. The data set is composed of 870,515 records/apps, and for each app we produced 48 features. The data set was used to built and test two bootstrap aggregating of multiple XGBoost machine learning classifiers. The dataset were collected between April 2017 and November 2018. We then checked the status of these applications on three different occasions; December 2018, February 2019, and May-June 2019.
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Explore the Google Play Store Reviews Database, a comprehensive collection of user reviews for various apps available on the Google Play Store.
This dataset includes millions of reviews across a wide range of categories such as games, productivity, social media, finance, health, and more. Each review entry provides essential details, including app names, user ratings, review texts, review dates, and user feedback, offering valuable insights for developers, data analysts, and market researchers.
Key Features:
Whether you're analyzing user feedback, researching market trends, or developing new app strategies, the Google Play Store Reviews Database is an invaluable resource that provides detailed insights and extensive coverage of app reviews on the Google Play Store.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the dataset used for paper: "A Recommender System of Buggy App Checkers for App Store Moderators", published on the International Conference on Mobile Software Engineering and Systems (MOBILESoft) in 2015.
Dataset Collection We built a dataset that consists of a random sample of Android app metadata and user reviews available on the Google Play Store on January and March 2014. Since the Google Play Store is continuously evolving (adding, removing and/or updating apps), we updated the dataset twice. The dataset D1 contains available apps in the Google Play Store in January 2014. Then, we created a new snapshot (D2) of the Google Play Store in March 2014.
The apps belong to the 27 different categories defined by Google (at the time of writing the paper), and the 4 predefined subcategories (free, paid, new_free, and new_paid). For each category-subcategory pair (e.g. tools-free, tools-paid, sports-new_free, etc.), we collected a maximum of 500 samples, resulting in a median number of 1.978 apps per category.
For each app, we retrieved the following metadata: name, package, creator, version code, version name, number of downloads, size, upload date, star rating, star counting, and the set of permission requests.
In addition, for each app, we collected up to a maximum of the latest 500 reviews posted by users in the Google Play Store. For each review, we retrieved its metadata: title, description, device, and version of the app. None of these fields were mandatory, thus several reviews lack some of these details. From all the reviews attached to an app, we only considered the reviews associated with the latest version of the app —i.e., we discarded unversioned and old-versioned reviews. Thus, resulting in a corpus of 1,402,717 reviews (2014 Jan.).
Dataset Stats Some stats about the datasets:
D1 (Jan. 2014) contains 38,781 apps requesting 7,826 different permissions, and 1,402,717 user reviews.
D2 (Mar. 2014) contains 46,644 apps and 9,319 different permission requests, and 1,361,319 user reviews.
Additional stats about the datasets are available here.
Dataset Description To store the dataset, we created a graph database with Neo4j. This dataset therefore consists of a graph describing the apps as nodes and edges. We chose a graph database because the graph visualization helps to identify connections among data (e.g., clusters of apps sharing similar sets of permission requests).
In particular, our dataset graph contains six types of nodes: - APP nodes containing metadata of each app, - PERMISSION nodes describing permission types, - CATEGORY nodes describing app categories, - SUBCATEGORY nodes describing app subcategories, - USER_REVIEW nodes storing user reviews. - TOPIC topics mined from user reviews (using LDA).
Furthermore, there are five types of relationships between APP nodes and each of the remaining nodes:
Dataset Files Info
Neo4j 2.0 Databases
googlePlayDB1-Jan2014_neo4j_2_0.rar
googlePlayDB2-Mar2014_neo4j_2_0.rar We provide two Neo4j databases containing the 2 snapshots of the Google Play Store (January and March 2014). These are the original databases created for the paper. The databases were created with Neo4j 2.0. In particular with the tool version 'Neo4j 2.0.0-M06 Community Edition' (latest version available at the time of implementing the paper in 2014).
Neo4j 3.5 Databases
googlePlayDB1-Jan2014_neo4j_3_5_28.rar
googlePlayDB2-Mar2014_neo4j_3_5_28.rar Currently, the version Neo4j 2.0 is deprecated and it is not available for download in the official Neo4j Download Center. We have migrated the original databases (Neo4j 2.0) to Neo4j 3.5.28. The databases can be opened with the tool version: 'Neo4j Community Edition 3.5.28'. The tool can be downloaded from the official Neo4j Donwload page.
In order to open the databases with more recent versions of Neo4j, the databases must be first migrated to the corresponding version. Instructions about the migration process can be found in the Neo4j Migration Guide.
First time the Neo4j database is connected, it could request credentials. The username and pasword are: neo4j/neo4j
There's a story behind every dataset and here's your opportunity to share yours. Based on installs, reviews you can sort out the apps. A clear picture can be drawn of apps, you can find out apps of what category are the most expensive, most popular, have most installs. Also various comparison can be done based on the data given in the dataset.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Google Playstore App data of 600K+ applications with all public details. Last updated on July 2020.
Markets
Mobile Application,App Store
603047
$399.00
To date (April 2020), Android is still the most popular mobile operating system in the world. Taking into account billion of Android users worldwide, mining this data has the potential to reveal user behaviors and trends in the whole global scope.
There are 2 CSV files: - app.csv with 53,732 rows and 18 columns. - comment.csv with 1,468,173 rows and 4 columns.
The scraping was done in April 2020.
This dataset is obtained from scraping Google Play Store. Without Google and Android, this dataset wouldn’t have existed.
The dataset is first published in this blog.
Business trends on mobile can be explored by examining this dataset.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Reviews on Messengers Dataset - Review dataset
The Reviews on Messengers Dataset is a comprehensive collection of 200 the most recent customer reviews on 6 messengers obtained from the popular app store, Google Play. See the list of the apps below. This dataset encompasses reviews written in 5 different languages: English, French, German, Italian, Japanese.
💴 For Commercial Usage: To discuss your requirements, learn about the price and buy the dataset, leave a request… See the full description on the dataset page: https://huggingface.co/datasets/UniqueData/messengers-reviews-google-play.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Automated Insights Dataset (AID) brings metadata from the 200 most downloaded free apps from each of the 32 categories on the Google Play Store, totaling 6400 apps, with information that goes beyond that presented by app stores, also bringing metadata from AppBrain. The User Interface Depth Dataset (UID) brings a high-quality sampling of the AID, and delves into the identification of 7540 components of 50 component types and the capture of 1948 screenshots of the interface of 400 apps. The component set was based on components of Google Material Design and Android Studio.
The datasets can be viewed in the spreadsheets named "Automated Insights Dataset (AID).xlsx" and "User Interface Depth Dataset (UID).xlsx".
The "UID - Screenshots.zip" file contains screenshots of the apps present in the UID, organized in folders by app IDs.
The "Source code of the developed tools.zip" file contains Python codes and complementary files used to collect the datasets.
The "Discarded apps.zip" file contains the apps discarded in the analysis, it presents screenshots of some apps, collected elements and the reasons that led to these apps being discarded.
The "Data explanation.zip" file contains graphical representations of the UID components and textual representations of each data present in the UID and AID, allowing a better understanding of the criteria used.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Introduction: It has been 4 months since the discovery of COVID-19, and there have been many measures introduced to curb movements of individuals to stem the spread. There has been an increase in the utilization of web-based technologies for counseling, and for supervision and training, and this has been carefully described in China. Several telehealth initiatives have been highlighted for Australian residents. Smartphone applications have previously been shown to be helpful in times of a crisis. Whilst there have been some examples of how web-based technologies have been used to support individuals who are concerned about or living with COVID-19, we know of no studies or review that have specifically looked at how M-Health technologies have been utilized for COVID-19.Objectives: There might be existing commercially available applications on the commercial stores, or in the published literature. There remains a lack of understanding of the resources that are available, the functionality of these applications, and the evidence base of these applications. Given this, the objective of this content analytical review is in identifying the commercial applications that are available currently for COVID-19, and in exploring their functionalities.Methods: A mobile application search application was used. The search terminologies used were “COVID” and “COVID-19.” Keyword search was performed based on the titles of the commercial applications. The search through the database was conducted from the 27th March through to the 18th of April 2020 by two independent authors.Results: A total of 103 applications were identified from the Apple iTunes and Google Play store, respectively; 32 were available on both Apple and Google Play stores. The majority appeared on the commercial stores between March and April 2020, more than 2 months after the first discovery of COVID-19. Some of the common functionalities include the provision of news and information, contact tracking, and self-assessment or diagnosis.Conclusions: This is the first review that has characterized the smartphone applications 4 months after the first discovery of COVID-19.
Google Play Store team had launched a new feature wherein, certain apps that are promising, are boosted in visibility. The boost will manifest in multiple ways including higher priority in recommendations sections (“Similar apps”, “You might also like”, “New and updated games”). These will also get a boost in search results visibility. This feature will help bring more attention to newer apps that have the potential.
The problem is to identify the apps that are going to be good for Google to promote. App ratings, which are provided by the customers, is always a great indicator of the goodness of the app. The problem reduces to: predict which apps will have high ratings.
Google Play Store team is about to launch a new feature wherein, certain apps that are promising, are boosted in visibility. The boost will manifest in multiple ways including higher priority in recommendations sections (“Similar apps”, “You might also like”, “New and updated games”). These will also get a boost in search results visibility. This feature will help bring more attention to newer apps that have the potential.
Dataset: Google Play Store data (“googleplaystore.csv”)
Fields in the data: App: Application name Category: Category to which the app belongs Rating: Overall user rating of the app Reviews: Number of user reviews for the app Size: Size of the app Installs: Number of user downloads/installs for the app Type: Paid or Free Price: Price of the app Content Rating: Age group the app is targeted at - Children / Mature 21+ / Adult Genres: An app can belong to multiple genres (apart from its main category). For example, a musical family game will belong to Music, Game, Family genres. Last Updated: Date when the app was last updated on Play Store Current Ver: Current version of the app available on Play Store Android Ver: Minimum required Android version
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
GoogleDataSafety
Data used in the papers:
Unpacking Privacy Labels: A Measurement and Developer Perspective on Google's Data Safety Section The Overview of Privacy Labels and their Compatibility with Privacy Policies Comparing Privacy Labels of Applications in Android and iOS
Getting Started
In this dataset you have the following data:
App Privacy Policies Data Safety Sections Labels
The data is collected from the apps in Google Play Store over the period of… See the full description on the dataset page: https://huggingface.co/datasets/WIPI/GoogleDataSafety.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We crawled 90,000 app reviews from both Google Play Store and Apple App Store, including reviews from both free and paid apps. These reviews were filtered for explainability needs, and after this process, 4,495 reviews remained. Among them, 2,185 reviews indicated an explanation need, while 2,310 did not. This resulting gold standard dataset was used to train and evaluate several machine learning models and rule-based approaches for detecting explanation needs in app reviews.
The dataset includes both balanced and unbalanced evaluation sets, as well as the original crawled data from October 2023. In addition to machine learning approaches, rule-based methods optimized for F1 score, precision, and recall are also included.
We provide several pre-trained machine learning models (including BERT, SetFit, AdaBoost, K-Nearest Neighbor, Logistic Regression, Naive Bayes, Random Forest, and SVM) along with training scripts and evaluation notebooks. These models can be applied directly or retrained using the included datasets.
For further details on the structure and usage of the dataset, please refer to the README.md file within the provided ZIP archive.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mobile App Stores such as Google, Apple have wide range of applications to suffice every need of customers in the digital platform. Customer feedback and ratings has always been one of the major metrics that can be used to review the performance and accordingly provide suitable recommendations to enhance the functionality. The Given dataset contain the feedback of the customer regarding the app used in app store.
Data Set Column Details are as given below:
Column name:
Description:
Column Name in Working Sheet
Datatype
Please read the Readme.docs file
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mental health screening and diagnostic apps can provide an opportunity to reduce strain on mental health services, improve patient well-being, and increase access for underrepresented groups. Despite promise of their acceptability, many mental health apps on the market suffer from high dropout due to a multitude of issues. Understanding user opinions of currently available mental health apps beyond star ratings can provide knowledge which can inform the development of future mental health apps. This study aimed to conduct a review of current apps which offer screening and/or aid diagnosis of mental health conditions on the Apple app store (iOS), Google Play app store (Android), and using the m-health Index and Navigation Database (MIND). In addition, the study aimed to evaluate user experiences of the apps, identify common app features and determine which features are associated with app use discontinuation. The Apple app store, Google Play app store, and MIND were searched. User reviews and associated metadata were then extracted to perform a sentiment and thematic analysis. The final sample included 92 apps. 45.65% (n = 42) of these apps only screened for or diagnosed a single mental health condition and the most commonly assessed mental health condition was depression (38.04%, n = 35). 73.91% (n = 68) of the apps offered additional in-app features to the mental health assessment (e.g., mood tracking). The average user rating for the included apps was 3.70 (SD = 1.63) and just under two-thirds had a rating of four stars or above (65.09%, n = 442). Sentiment analysis revealed that 65.24%, n = 441 of the reviews had a positive sentiment. Ten themes were identified in the thematic analysis, with the most frequently occurring being performance (41.32%, n = 231) and functionality (39.18%, n = 219). In reviews which commented on app use discontinuation, functionality and accessibility in combination were the most frequent barriers to sustained app use (25.33%, n = 19). Despite the majority of user reviews demonstrating a positive sentiment, there are several areas of improvement to be addressed. User reviews can reveal ways to increase performance and functionality. App user reviews are a valuable resource for the development and future improvements of apps designed for mental health diagnosis and screening.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Apple App Store Key StatisticsApps & Games in the Apple App StoreApps in the Apple App StoreGames in the Apple App StoreMost Popular Apple App Store CategoriesPaid vs Free Apps in Apple App...