67 datasets found

App Store Data (2025)
businessofapps.com
Updated Aug 1, 2025
Cite
Business of Apps (2025). App Store Data (2025) [Dataset]. https://www.businessofapps.com/data/app-stores/
Explore at:
Dataset updated
Aug 1, 2025
Dataset authored and provided by
Business of Apps
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0) https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Apple App Store Key Statistics; Apps & Games in the Apple App Store; Apps in the Apple App Store; Games in the Apple App Store; Most Popular Apple App Store Categories; Paid vs Free Apps in Apple App...
Apple Appstore & Google Play Store data
datarade.ai
.json, .xml, .csv
Updated Oct 15, 2021
Cite
Datandard (2021). Apple Appstore & Google Play Store data [Dataset]. https://datarade.ai/data-products/apple-appstore-google-play-store-data-cleardata
Explore at:
Available download formats: .json, .xml, .csv
Dataset updated
Oct 15, 2021
Dataset authored and provided by
Datandard
Area covered
Libya, Belize, Rwanda, South Georgia and the South Sandwich Islands, Iran (Islamic Republic of), Zambia, Tonga, Andorra, Spain, Lao People's Democratic Republic
Description
Get access to information about all apps in the Google Playstore to understand your competitors, market to app developers etc. This dataset includes all the fields available in the play store such as:

Name, description, rating information etc.

Technical information such as size, app version etc.

Permissions.

Developer information.

Contact information.

Parsed app-ads.txt information for publisher domains.

Reviews (more than 100 million reviews available)
IOS App Store reviews dataset
crawlfeeds.com
csv, zip
Updated Jul 7, 2025
Cite
Crawl Feeds (2025). IOS App Store reviews dataset [Dataset]. https://crawlfeeds.com/datasets/ios-app-store-reviews-dataset
Explore at:
Available download formats: zip, csv
Dataset updated
Jul 7, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description
Unlock the power of user feedback with our iOS App Store Reviews Dataset, a comprehensive collection of reviews from thousands of apps across various categories. This robust App Store dataset includes essential details such as app names, ratings, user comments, timestamps, and more, offering valuable insights into user experiences and preferences.

Perfect for app developers, marketers, and data analysts, this dataset allows you to conduct sentiment analysis, monitor app performance, and identify trends in user behavior. By leveraging the iOS App Store Reviews Dataset, you can refine app features, optimize marketing strategies, and elevate user satisfaction.

Whether you’re tracking mobile app trends, analyzing specific app categories, or developing data-driven strategies, this App Store dataset is an indispensable tool. Download the iOS App Store Reviews Dataset today or contact us for custom datasets tailored to your unique project requirements.

Ready to take your app insights to the next level? Get the iOS App Store Reviews Dataset now or explore our custom data solutions to meet your needs.
Unlocking User Sentiment: The App Store Reviews Dataset
crawlfeeds.com
json, zip
Updated Jun 20, 2025
Cite
Crawl Feeds (2025). Unlocking User Sentiment: The App Store Reviews Dataset [Dataset]. https://crawlfeeds.com/datasets/app-store-reviews-dataset
Explore at:
Available download formats: json, zip
Dataset updated
Jun 20, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description
This dataset offers a focused and invaluable window into user perceptions and experiences with applications listed on the Apple App Store. It is a vital resource for app developers, product managers, market analysts, and anyone seeking to understand the direct voice of the customer in the dynamic mobile app ecosystem.

Dataset Specifications:

Investment: $45.0

Status: Published and immediately available.

Category: Ratings and Reviews Data

Format: Compressed ZIP archive containing JSON files, ensuring easy integration into your analytical tools and platforms.

Volume: Comprises 10,000 unique app reviews, providing a robust sample for qualitative and quantitative analysis of user feedback.

Timeliness: Last crawled: not specified.

Richness of Detail (11 Comprehensive Fields):

Each record in this dataset provides a detailed breakdown of a single App Store review, enabling multi-dimensional analysis:

Review Content:

review: The full text of the user's written feedback, crucial for Natural Language Processing (NLP) to extract themes, sentiment, and common keywords.

title: The title given to the review by the user, often summarizing their main point.

isEdited: A boolean flag indicating whether the review has been edited by the user since its initial submission. This can be important for tracking evolving sentiment or understanding user behavior.

Reviewer & Rating Information:

username: The public username of the reviewer, allowing for analysis of engagement patterns from specific users (though not personally identifiable).

rating: The star rating (typically 1-5) given by the user, providing a quantifiable measure of satisfaction.

App & Origin Context:

app_name: The name of the application being reviewed.

app_id: A unique identifier for the application within the App Store, enabling direct linking to app details or other datasets.

country: The country of the App Store storefront where the review was left, allowing for geographic segmentation of feedback.

Metadata & Timestamps:

_id: A unique identifier for the specific review record in the dataset.

crawled_at: The timestamp indicating when this particular review record was collected by the data provider (Crawl Feeds).

date: The original date the review was posted by the user on the App Store.
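As a minimal sketch of how one record with the 11 fields above might be loaded, the following parses a single JSON object into a typed structure. The field names come from the listing; the Python types, the sample values, and the `AppStoreReview` and `parse_review` names are assumptions for illustration, not the provider's documented schema.

```python
import json
from dataclasses import dataclass


@dataclass
class AppStoreReview:
    """One review record; attribute types are assumed, not documented."""
    record_id: str   # "_id" in the dataset
    review: str
    title: str
    is_edited: bool  # "isEdited" in the dataset
    username: str
    rating: int
    app_name: str
    app_id: str
    country: str
    crawled_at: str  # kept as text because the timestamp format is unknown
    date: str


def parse_review(raw: dict) -> AppStoreReview:
    """Map the dataset's field names onto the dataclass attributes."""
    return AppStoreReview(
        record_id=str(raw["_id"]),
        review=raw.get("review", ""),
        title=raw.get("title", ""),
        is_edited=bool(raw.get("isEdited", False)),
        username=raw.get("username", ""),
        rating=int(raw["rating"]),
        app_name=raw.get("app_name", ""),
        app_id=str(raw.get("app_id", "")),
        country=raw.get("country", ""),
        crawled_at=raw.get("crawled_at", ""),
        date=raw.get("date", ""),
    )


# Hypothetical sample record, invented for this example.
SAMPLE = (
    '{"_id": "abc123", "review": "Crashes on launch.", "title": "Broken", '
    '"isEdited": false, "username": "user1", "rating": 1, '
    '"app_name": "ExampleApp", "app_id": "0000000000", "country": "us", '
    '"crawled_at": "2025-01-01 00:00:00", "date": "2024-12-31"}'
)

record = parse_review(json.loads(SAMPLE))
print(record.app_name, record.rating, record.is_edited)
```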

Expanded Use Cases & Analytical Applications:

This dataset is a goldmine for understanding what users truly think and feel about mobile applications. Here's how it can be leveraged:

Product Development & Improvement:

Bug Detection & Prioritization: Analyze negative review text to identify recurring technical issues, crashes, or bugs, allowing developers to prioritize fixes based on user impact.

Feature Requests & Roadmap Prioritization: Extract feature suggestions from positive and neutral review text to inform future product roadmap decisions and develop features users actively desire.

User Experience (UX) Enhancement: Understand pain points related to app design, navigation, and overall usability by analyzing common complaints in the review field.

Version Impact Analysis: If integrated with app version data, track changes in rating and sentiment after new app updates to assess the effectiveness of bug fixes or new features.

Market Research & Competitive Intelligence:

Competitor Benchmarking: Analyze reviews of competitor apps (if included or combined with similar datasets) to identify their strengths, weaknesses, and user expectations within a specific app category.

Market Gap Identification: Discover unmet user needs or features that users desire but are not adequately provided by existing apps.

Niche Opportunities: Identify specific use cases or user segments that are underserved based on recurring feedback.

Marketing & App Store Optimization (ASO):

Sentiment Analysis: Perform sentiment analysis on the review and title fields to gauge overall user satisfaction, pinpoint specific positive and negative aspects, and track sentiment shifts over time.

Keyword Optimization: Identify frequently used keywords and phrases in reviews to optimize app store listings, improving discoverability and search ranking.

Messaging Refinement: Understand how users describe and use the app in their own words, which can inform marketing copy and advertising campaigns.

Reputation Management: Monitor rating trends and identify critical reviews quickly to facilitate timely responses and proactive customer engagement.

Academic & Data Science Research:

Natural Language Processing (NLP): The review and title fields are excellent for training and testing NLP models for sentiment analysis, topic modeling, named entity recognition, and text summarization.

User Behavior Analysis: Study patterns in rating distribution, isEdited status, and date to understand user engagement and feedback cycles.

Cross-Country Comparisons: Analyze country-specific reviews to understand regional differences in app perception, feature preferences, or cultural nuances in feedback.
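The rating-distribution and cross-country analyses mentioned above could start from something like the sketch below. It only assumes that each record exposes the `country` and `rating` fields; the sample records and the helper names are made up for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical records carrying only the two fields this sketch needs.
reviews = [
    {"country": "us", "rating": 5},
    {"country": "us", "rating": 1},
    {"country": "us", "rating": 5},
    {"country": "de", "rating": 4},
    {"country": "de", "rating": 2},
]


def rating_distribution(records):
    """Count how often each star rating occurs per storefront country."""
    dist = defaultdict(Counter)
    for r in records:
        dist[r["country"]][int(r["rating"])] += 1
    return dist


def mean_rating(counter):
    """Average star rating from a {stars: count} counter."""
    total = sum(counter.values())
    return sum(star * n for star, n in counter.items()) / total


dist = rating_distribution(reviews)
for country, counts in sorted(dist.items()):
    print(country, dict(counts), round(mean_rating(counts), 2))
```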

This App Store Reviews dataset provides a direct, unfiltered conduit to understanding user needs and ultimately driving better app performance and greater user satisfaction. Its structured format and granular detail make it an indispensable asset for data-driven decision-making in the mobile app industry.
Google Play Store Datasets
brightdata.com
.json, .csv, .xlsx
Updated Aug 22, 2025
Cite
Bright Data, Google Play Store Datasets [Dataset]. https://brightdata.com/products/datasets/google-play-store
Explore at:
Available download formats: .json, .csv, .xlsx
Dataset updated
Aug 22, 2025
Dataset authored and provided by
Bright Data
License
https://brightdata.com/license
Area covered
Worldwide
Description
This dataset encompasses a wide-ranging collection of Google Play applications, providing a holistic view of the diverse ecosystem within the platform. It includes information on various attributes such as the title, developer, monetization features, images, app descriptions, data safety measures, user ratings, number of reviews, star rating distributions, user feedback, recent updates, related applications by the same developer, content ratings, estimated downloads, and timestamps. By aggregating this data, the dataset offers researchers, developers, and analysts an extensive resource to explore and analyze trends, patterns, and dynamics within the Google Play Store. Researchers can utilize this dataset to conduct comprehensive studies on user behavior, market trends, and the impact of various factors on app success. Developers can leverage the insights derived from this dataset to inform their app development strategies, improve user engagement, and optimize monetization techniques. Analysts can employ the dataset to identify emerging trends, assess the performance of different categories of applications, and gain valuable insights into consumer preferences. Overall, this dataset serves as a valuable tool for understanding the broader landscape of the Google Play Store and unlocking actionable insights for various stakeholders in the mobile app industry.
Google Play Store Apps / Games Data, Android Apps Data, Consumer Review...
datarade.ai
.json, .csv
Cite
OpenWeb Ninja, Google Play Store Apps / Games Data, Android Apps Data, Consumer Review Data, Top Charts | Real-Time API [Dataset]. https://datarade.ai/data-products/openweb-ninja-google-play-store-data-android-apps-games-openweb-ninja
Explore at:
Available download formats: .json, .csv
Dataset authored and provided by
OpenWeb Ninja
Area covered
Macedonia (the former Yugoslav Republic of), Bermuda, Guam, Mali, Nicaragua, Finland, Korea (Republic of), Christmas Island, Netherlands, Azerbaijan
Description
Use the OpenWeb Ninja Google Play App Store Data API to access comprehensive data on Google Play Store, including Android Apps / Games, reviews, top charts, search, and more. Our extensive dataset provides over 40 app store data points, enabling you to gain deep insights into the market.

The App Store Data dataset includes all key app details:

App Name, Description, Rating, Photos, Downloads, Version Information, App Size, Permissions, Developer and Contact Information, Consumer Review Data.
mac-app-store-apps-metadata
huggingface.co
Updated Feb 29, 2024
Cite
MacPaw Way Ltd. (2024). mac-app-store-apps-metadata [Dataset]. https://huggingface.co/datasets/MacPaw/mac-app-store-apps-metadata
Explore at:
Dataset updated
Feb 29, 2024
Dataset provided by
MacPaw
Authors
MacPaw Way Ltd.
License
MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically
Description
Dataset Card for Macappstore Applications Metadata

Mac App Store Applications Metadata sourced by the public API.

Curated by: MacPaw Way Ltd.

Language(s) (NLP): Mostly EN, DE

License: MIT

Dataset Details

This data aims to cover our internal company research needs and start collecting and sharing the macOS app dataset since we have yet to find a suitable existing one. Full application metadata was sourced by the public iTunes search API for the US, Germany, and Ukraine… See the full description on the dataset page: https://huggingface.co/datasets/MacPaw/mac-app-store-apps-metadata.
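For readers who want to see what a query against the public iTunes Search API looks like, here is a small sketch that only builds request URLs and does not send them. The parameter choices (`media=software`, `entity=macSoftware`, the country codes `us`, `de`, `ua`, the search term, and the limit) are assumptions for illustration; the listing does not say how the dataset authors actually queried the API.

```python
from urllib.parse import urlencode

BASE_URL = "https://itunes.apple.com/search"


def build_search_url(term, country):
    """Build a Search API URL for Mac software in one storefront."""
    params = {
        "term": term,
        "country": country,       # two-letter storefront code
        "media": "software",
        "entity": "macSoftware",  # restrict results to Mac apps
        "limit": 50,              # arbitrary page size for this sketch
    }
    return f"{BASE_URL}?{urlencode(params)}"


# Storefronts named in the description: US, Germany, Ukraine.
urls = [build_search_url("photo editor", c) for c in ("us", "de", "ua")]
for url in urls:
    print(url)
```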
Data from: Google Play Store App Analysis Dataset
kaggle.com
zip
Updated Mar 31, 2021
Cite
Cabinet Shah (2021). Google Play Store App Analysis Dataset [Dataset]. https://www.kaggle.com/datasets/cabinetshah1999/google-play-store-app-analysis-dataset/discussion
Explore at:
Available download formats: zip (318068 bytes)
Dataset updated
Mar 31, 2021
Authors
Cabinet Shah
Description
Dataset

This dataset was created by Cabinet Shah

Released under Data files © Original Authors

Contents
RICO dataset
kaggle.com
Updated Dec 2, 2021
Cite
Onur Gunes (2021). RICO dataset [Dataset]. https://www.kaggle.com/datasets/onurgunes1993/rico-dataset
Explore at:
Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 2, 2021
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
Onur Gunes
Description
Context

Data-driven models help mobile app designers understand best practices and trends, and can be used to make predictions about design performance and support the creation of adaptive UIs. This paper presents Rico, the largest repository of mobile app designs to date, created to support five classes of data-driven applications: design search, UI layout generation, UI code generation, user interaction modeling, and user perception prediction. To create Rico, we built a system that combines crowdsourcing and automation to scalably mine design and interaction data from Android apps at runtime. The Rico dataset contains design data from more than 9.3k Android apps spanning 27 categories. It exposes visual, textual, structural, and interactive design properties of more than 66k unique UI screens. To demonstrate the kinds of applications that Rico enables, we present results from training an autoencoder for UI layout similarity, which supports query-by-example search over UIs.

Content

Rico was built by mining Android apps at runtime via human-powered and programmatic exploration. Like its predecessor ERICA, Rico’s app mining infrastructure requires no access to — or modification of — an app’s source code. Apps are downloaded from the Google Play Store and served to crowd workers through a web interface. When crowd workers use an app, the system records a user interaction trace that captures the UIs visited and the interactions performed on them. Then, an automated agent replays the trace to warm up a new copy of the app and continues the exploration programmatically, leveraging a content-agnostic similarity heuristic to efficiently discover new UI states. By combining crowdsourcing and automation, Rico can achieve higher coverage over an app’s UI states than either crawling strategy alone. In total, 13 workers recruited on UpWork spent 2,450 hours using apps on the platform over five months, producing 10,811 user interaction traces. After collecting a user trace for an app, we ran the automated crawler on the app for one hour.

Acknowledgements

UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN https://interactionmining.org/rico

Inspiration

The Rico dataset is large enough to support deep learning applications. We trained an autoencoder to learn an embedding for UI layouts, and used it to annotate each UI with a 64-dimensional vector representation encoding visual layout. This vector representation can be used to compute structurally — and often semantically — similar UIs, supporting example-based search over the dataset. To create training inputs for the autoencoder that embed layout information, we constructed a new image for each UI capturing the bounding box regions of all leaf elements in its view hierarchy, differentiating between text and non-text elements. Rico’s view hierarchies obviate the need for noisy image processing or OCR techniques to create these inputs.
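To illustrate the query-by-example search described above, the sketch below ranks UI screens by distance between 64-dimensional layout vectors. The vectors here are random stand-ins rather than real Rico embeddings, and Euclidean distance is an assumed metric; the description does not say which distance the authors used.

```python
import math
import random

DIM = 64  # dimensionality of the layout embedding mentioned above


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest(query, index, k=3):
    """Return the names of the k screens whose vectors are closest to query."""
    ranked = sorted(index.items(), key=lambda item: euclidean(query, item[1]))
    return [name for name, _ in ranked[:k]]


# Random placeholder embeddings keyed by hypothetical screen names.
rng = random.Random(0)
index = {f"screen_{i}": [rng.random() for _ in range(DIM)] for i in range(100)}

# A query that is a slightly perturbed copy of one indexed screen.
query = [v + 0.001 for v in index["screen_42"]]
top = nearest(query, index, k=3)
print(top)
```

A linear scan like this is fine for a small sample; an approximate nearest-neighbor index would be the usual choice at the scale of tens of thousands of screens.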
IOS application reviews dataset in English
crawlfeeds.com
csv, zip
Updated Jul 8, 2025
Cite
Crawl Feeds (2025). IOS application reviews dataset in English [Dataset]. https://crawlfeeds.com/datasets/ios-application-reviews-dataset-in-english
Explore at:
Available download formats: zip, csv
Dataset updated
Jul 8, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policy
Description
This comprehensive iOS application reviews dataset contains thousands of authentic user reviews from the Apple App Store in English. The dataset provides valuable insights for app developers, marketers, and researchers studying mobile application performance and user sentiment.

Key Features:

Real user reviews from popular iOS apps

Star ratings from 1 to 5 stars

Review dates and timestamps

App store URLs and metadata

User demographics and location data

App version information

Review titles and detailed feedback

Applications: Perfect for sentiment analysis, app store optimization, mobile app development research, user experience studies, and competitive analysis. This dataset enables businesses to understand user preferences, identify app improvement opportunities, and develop better mobile applications.

Data Quality: All reviews are genuine user feedback collected from the official Apple App Store, ensuring authenticity and reliability for research and business intelligence purposes. The dataset covers various app categories including fitness, shopping, education, entertainment, and productivity applications.
Dataset used for "A Recommender System of Buggy App Checkers for App Store...
data.niaid.nih.gov
Updated Jun 28, 2021
Cite
Lionel Seinturier (2021). Dataset used for "A Recommender System of Buggy App Checkers for App Store Moderators" [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5034291
Explore at:
Dataset updated
Jun 28, 2021
Dataset provided by
Lionel Seinturier
Romain Rouvoy
Martin Monperrus
Maria Gomez
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is the dataset used for the paper "A Recommender System of Buggy App Checkers for App Store Moderators", published at the International Conference on Mobile Software Engineering and Systems (MOBILESoft) in 2015.

Dataset Collection

We built a dataset that consists of a random sample of Android app metadata and user reviews available on the Google Play Store in January and March 2014. Since the Google Play Store is continuously evolving (adding, removing and/or updating apps), we updated the dataset twice. The dataset D1 contains the apps available in the Google Play Store in January 2014. Then, we created a new snapshot (D2) of the Google Play Store in March 2014.

The apps belong to the 27 different categories defined by Google (at the time of writing the paper), and the 4 predefined subcategories (free, paid, new_free, and new_paid). For each category-subcategory pair (e.g. tools-free, tools-paid, sports-new_free, etc.), we collected a maximum of 500 samples, resulting in a median number of 1,978 apps per category.
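The sampling limits above imply simple upper bounds, worked out in the sketch below. It assumes the per-pair cap of 500 is the only limit and that no app is counted in more than one subcategory, neither of which is stated in the description, so treat the results as ceilings rather than expected counts.

```python
CATEGORIES = 27
SUBCATEGORIES = ["free", "paid", "new_free", "new_paid"]
MAX_PER_PAIR = 500  # cap per category-subcategory pair

pair_count = CATEGORIES * len(SUBCATEGORIES)          # number of pairs
max_per_category = len(SUBCATEGORIES) * MAX_PER_PAIR  # ceiling for one category
max_total = pair_count * MAX_PER_PAIR                 # ceiling for a snapshot

print(pair_count, max_per_category, max_total)  # 108 2000 54000
```

A per-category ceiling of 2,000 is consistent with the reported median of just under 2,000 apps per category (reading the figure with a thousands separator), and both snapshot sizes reported below fall under the 54,000 ceiling.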

For each app, we retrieved the following metadata: name, package, creator, version code, version name, number of downloads, size, upload date, star rating, star counting, and the set of permission requests.

In addition, for each app, we collected up to a maximum of the latest 500 reviews posted by users in the Google Play Store. For each review, we retrieved its metadata: title, description, device, and version of the app. None of these fields were mandatory, so several reviews lack some of these details. From all the reviews attached to an app, we only considered the reviews associated with the latest version of the app (i.e., we discarded unversioned and old-versioned reviews). This resulted in a corpus of 1,402,717 reviews (Jan. 2014).

Dataset Stats

Some stats about the datasets:

D1 (Jan. 2014) contains 38,781 apps requesting 7,826 different permissions, and 1,402,717 user reviews.

D2 (Mar. 2014) contains 46,644 apps and 9,319 different permission requests, and 1,361,319 user reviews.

Additional stats about the datasets are available here.

Dataset Description

To store the dataset, we created a graph database with Neo4j. This dataset therefore consists of a graph describing the apps as nodes and edges. We chose a graph database because the graph visualization helps to identify connections among data (e.g., clusters of apps sharing similar sets of permission requests).

In particular, our dataset graph contains six types of nodes:

APP nodes containing metadata of each app

PERMISSION nodes describing permission types

CATEGORY nodes describing app categories

SUBCATEGORY nodes describing app subcategories

USER_REVIEW nodes storing user reviews

TOPIC nodes containing topics mined from user reviews (using LDA)

Furthermore, there are five types of relationships connecting these nodes:

USES_PERMISSION relationships between APP and PERMISSION nodes

HAS_REVIEW between APP and USER_REVIEW nodes

HAS_TOPIC between USER_REVIEW and TOPIC nodes

BELONGS_TO_CATEGORY between APP and CATEGORY nodes

BELONGS_TO_SUBCATEGORY between APP and SUBCATEGORY nodes
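As an illustration of the kind of question this graph supports (finding apps that share similar sets of permission requests), the sketch below models only the USES_PERMISSION edges as an in-memory mapping and scores app pairs by Jaccard similarity. The package names, permissions, threshold, and the choice of Jaccard are all assumptions for the example; the actual databases are queried through Neo4j.

```python
from itertools import combinations

# Hypothetical USES_PERMISSION edges: APP node -> set of PERMISSION nodes.
uses_permission = {
    "com.example.flashlight": {"CAMERA", "INTERNET"},
    "com.example.scanner": {"CAMERA", "INTERNET", "WRITE_EXTERNAL_STORAGE"},
    "com.example.notes": {"WRITE_EXTERNAL_STORAGE"},
}


def jaccard(a, b):
    """Size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similar_pairs(edges, threshold=0.5):
    """Return app pairs whose permission sets overlap at least `threshold`."""
    result = []
    for (app_a, perms_a), (app_b, perms_b) in combinations(sorted(edges.items()), 2):
        score = jaccard(perms_a, perms_b)
        if score >= threshold:
            result.append((app_a, app_b, round(score, 2)))
    return result


pairs = similar_pairs(uses_permission)
print(pairs)
```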

Dataset Files Info

Neo4j 2.0 Databases

googlePlayDB1-Jan2014_neo4j_2_0.rar

googlePlayDB2-Mar2014_neo4j_2_0.rar

We provide two Neo4j databases containing the two snapshots of the Google Play Store (January and March 2014). These are the original databases created for the paper. The databases were created with Neo4j 2.0, specifically with 'Neo4j 2.0.0-M06 Community Edition' (the latest version available when the paper was implemented in 2014).

Neo4j 3.5 Databases

googlePlayDB1-Jan2014_neo4j_3_5_28.rar

googlePlayDB2-Mar2014_neo4j_3_5_28.rar

Neo4j 2.0 is now deprecated and is not available for download in the official Neo4j Download Center. We have migrated the original databases (Neo4j 2.0) to Neo4j 3.5.28. The databases can be opened with 'Neo4j Community Edition 3.5.28', which can be downloaded from the official Neo4j Download page.

To open the databases with more recent versions of Neo4j, the databases must first be migrated to the corresponding version. Instructions for the migration process can be found in the Neo4j Migration Guide. The first time you connect to the Neo4j database, it may request credentials. The username and password are: neo4j/neo4j
The manifest and store data of 870,515 Android mobile applications - Dataset...
b2find.eudat.eu
Updated Oct 23, 2023
Cite
(2023). The manifest and store data of 870,515 Android mobile applications - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/b25ee20e-5268-50ae-9914-4bc70bd4ff1c
Explore at:
Dataset updated
Oct 23, 2023
Description
We built a crawler to collect data from the Google Play store, including each application's metadata and APK files. The manifest files were extracted from the APK files and then processed to extract the features. The data set is composed of 870,515 records/apps, and for each app we produced 48 features. The data set was used to build and test two bootstrap-aggregated ensembles of multiple XGBoost machine learning classifiers. The dataset was collected between April 2017 and November 2018. We then checked the status of these applications on three different occasions: December 2018, February 2019, and May-June 2019.
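Bootstrap aggregating (bagging), as mentioned above, trains several models on resampled copies of the data and combines their votes. The sketch below shows the idea with a one-feature threshold classifier standing in for XGBoost, so it is not the authors' pipeline; the toy data, model count, and function names are assumptions.

```python
import random
from collections import Counter


def train_stump(samples):
    """Fit a threshold classifier that predicts 1 when x >= threshold."""
    best_threshold, best_correct = 0.0, -1
    for threshold, _ in samples:
        correct = sum((x >= threshold) == bool(y) for x, y in samples)
        if correct > best_correct:
            best_threshold, best_correct = threshold, correct
    return best_threshold


def train_bagged(samples, n_models=15, seed=0):
    """Train each model on a bootstrap sample drawn with replacement."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        bootstrap = [rng.choice(samples) for _ in samples]
        models.append(train_stump(bootstrap))
    return models


def predict(models, x):
    """Majority vote over the individual stump predictions."""
    votes = Counter(int(x >= threshold) for threshold in models)
    return votes.most_common(1)[0][0]


# Toy data: a single feature in [0, 10) with label 1 when the feature is >= 5.
data = [(x / 10, int(x >= 50)) for x in range(100)]
models = train_bagged(data)
print(predict(models, 1.0), predict(models, 9.0))
```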
Google Play Store Apps
kaggle.com
zip
Updated Feb 3, 2019
Cite
Lavanya (2019). Google Play Store Apps [Dataset]. https://www.kaggle.com/lava18/google-play-store-apps
Explore at:
Available download formats: zip (2037893 bytes)
Dataset updated
Feb 3, 2019
Authors
Lavanya
Description
Context

While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found that the iTunes App Store page uses a nicely indexed, appendix-like structure that allows for simple and easy web scraping. The Google Play Store, on the other hand, uses sophisticated modern-day techniques (like dynamic page loading) with jQuery, which makes scraping more challenging.

Content

Each app (row) has values for category, rating, size, and more.

Acknowledgements

This information is scraped from the Google Play Store. This app information would not be available without it.

Inspiration

The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!
Google Playstore App Data
dataandsons.com
csv, zip
Updated Jul 28, 2020
Cite
Gautham Prakash (2020). Google Playstore App Data [Dataset]. https://www.dataandsons.com/categories/markets/google-playstore-app-data
Explore at:
Available download formats: zip, csv
Dataset updated
Jul 28, 2020
Dataset provided by
Authors
Gautham Prakash
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
About this Dataset

Google Playstore App data of 600K+ applications with all public details. Last updated in July 2020.

Category

Markets

Keywords

Mobile Application, App Store

Row Count

603047

Price

$399.00
‘Playstore Analysis’ analyzed by Analyst-2
analyst-2.ai
Updated Nov 12, 2021
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Playstore Analysis’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-playstore-analysis-2b2d/41638844/?iid=022-994&v=presentation
Explore at:
Dataset updated
Nov 12, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Playstore Analysis’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/madhav000/playstore-analysis on 30 September 2021.

--- Dataset description provided by original source is as follows ---

Google Play Store team had launched a new feature wherein, certain apps that are promising, are boosted in visibility. The boost will manifest in multiple ways including higher priority in recommendations sections (“Similar apps”, “You might also like”, “New and updated games”). These will also get a boost in search results visibility. This feature will help bring more attention to newer apps that have the potential.

Analysis to be done:

The problem is to identify the apps that are going to be good for Google to promote. App ratings, which are provided by the customers, are always a great indicator of the goodness of the app. The problem reduces to: predict which apps will have high ratings.

Problem Statement:

Google Play Store team is about to launch a new feature wherein, certain apps that are promising, are boosted in visibility. The boost will manifest in multiple ways including higher priority in recommendations sections (“Similar apps”, “You might also like”, “New and updated games”). These will also get a boost in search results visibility. This feature will help bring more attention to newer apps that have the potential.

Content:

Dataset: Google Play Store data (“googleplaystore.csv”)

Fields in the data:

App: Application name

Category: Category to which the app belongs

Rating: Overall user rating of the app

Reviews: Number of user reviews for the app

Size: Size of the app

Installs: Number of user downloads/installs for the app

Type: Paid or Free

Price: Price of the app

Content Rating: Age group the app is targeted at - Children / Mature 21+ / Adult

Genres: An app can belong to multiple genres (apart from its main category). For example, a musical family game will belong to Music, Game, Family genres.

Last Updated: Date when the app was last updated on Play Store

Current Ver: Current version of the app available on Play Store

Android Ver: Minimum required Android version
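A first step toward the "predict which apps will have high ratings" task is loading these fields into typed values. The sketch below does that for a subset of the columns using an inline sample. The value formats (for example `10,000+` installs and `$2.99` prices), the sample rows, and the 4.5 cutoff for a "high" rating are assumptions for illustration; the description does not specify them.

```python
import csv
import io

# Hypothetical rows using a subset of the column names listed above.
SAMPLE_CSV = """App,Category,Rating,Reviews,Size,Installs,Type,Price
Example Notes,PRODUCTIVITY,4.6,1200,19M,"10,000+",Free,0
Example Game,GAME,3.9,85,Varies with device,"1,000+",Paid,$2.99
"""

HIGH_RATING = 4.5  # assumed cutoff for the prediction target


def parse_installs(text):
    """Turn an install bucket such as '10,000+' into an integer."""
    return int(text.replace(",", "").replace("+", ""))


def parse_price(text):
    """Turn a price such as '$2.99' or '0' into a float."""
    return float(text.lstrip("$"))


def load_apps(handle):
    """Read rows and derive a binary high-rating label for each app."""
    apps = []
    for row in csv.DictReader(handle):
        rating = float(row["Rating"])
        apps.append({
            "app": row["App"],
            "category": row["Category"],
            "rating": rating,
            "installs": parse_installs(row["Installs"]),
            "price": parse_price(row["Price"]),
            "high_rating": rating >= HIGH_RATING,
        })
    return apps


apps = load_apps(io.StringIO(SAMPLE_CSV))
for app in apps:
    print(app)
```

With the real file, `io.StringIO(SAMPLE_CSV)` would be replaced by an open handle to googleplaystore.csv, and rows with missing or malformed ratings would need handling first.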

--- Original source retains full ownership of the source dataset ---
TikTok global quarterly downloads 2018-2024
statista.com
es.statista.com
Updated Feb 5, 2025
Cite
Statista Research Department (2025). TikTok global quarterly downloads 2018-2024 [Dataset]. https://www.statista.com/topics/1002/mobile-app-usage/
Explore at:
Dataset updated
Feb 5, 2025
Dataset provided by
Statista (http://statista.com/)
Authors
Statista Research Department
Description
In the fourth quarter of 2024, TikTok generated around 186 million downloads from users worldwide. First launched in China by ByteDance as Douyin, the short-video format was popularized by TikTok and took over the global social media environment in 2020. In the first quarter of 2020, TikTok downloads peaked at over 313.5 million worldwide, up by 62.3 percent compared to the first quarter of 2019.

TikTok interactions: is there a magic formula for content success?

In 2024, TikTok registered an engagement rate of approximately 4.64 percent on video content hosted on its platform. During the same year, the social video app recorded over 1,100 interactions on average. These interactions were primarily likes, with fewer than 20 comments per piece of content on average in 2024. The platform has been actively monitoring the issue of fake interactions, and it removed around 236 million fake likes during the first quarter of 2024. Although there is no secret formula for maximizing these metrics, recommended video length may contribute to the success of content on TikTok. As of the first quarter of 2024, it was recommended that tiny TikTok accounts with up to 500 followers post videos around 2.6 minutes long, while the ideal video duration for huge TikTok accounts with over 50,000 followers was 7.28 minutes. The average length of TikTok videos posted by creators in 2024 was around 43 seconds.

What's trending on TikTok Shop?

Since its launch in September 2023, TikTok Shop has become one of the most popular online shopping platforms, offering consumers a wide variety of products. In 2023, TikTok shops featuring beauty and personal care items sold over 370 million products worldwide. TikTok shops featuring womenswear and underwear, as well as food and beverages, followed with 285 and 138 million products sold, respectively.

Similarly, in the United States market, health and beauty products were the best-selling items, accounting for 85 percent of sales made via the TikTok Shop feature during the first month of its launch. In 2023, Indonesia was the market with the largest number of TikTok Shops, hosting over 20 percent of all TikTok Shops. Thailand and Vietnam followed with 18.29 and 17.54 percent of the total shops listed on the short-video platform, respectively.
Data from: A Longitudinal Study of Removed Apps in iOS App Store
zenodo.org
Updated Mar 8, 2021
Cite
Fuqi Lin; Haoyu Wang; Liu Wang; Xuanzhe Liu (2021). A Longitudinal Study of Removed Apps in iOS App Store [Dataset]. http://doi.org/10.5281/zenodo.4588266
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.4588266
Dataset updated
Mar 8, 2021
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Fuqi Lin; Haoyu Wang; Liu Wang; Xuanzhe Liu
Description
Dataset for the paper A Longitudinal Study of Removed Apps in iOS App Store (WWW 2021)
Google-Play-App-Rating-Analysis
kaggle.com
Updated Dec 24, 2020
Cite
Moin Uddin Maruf (2020). Google-Play-App-Rating-Analysis [Dataset]. https://www.kaggle.com/moinuddinmaruf/google-play-app-rating-analysis
Explore at:
Croissant: Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 24, 2020
Dataset provided by
Kaggle (http://kaggle.com/)
Authors
Moin Uddin Maruf
Description
This dataset contains some statistics about Google Play Store apps.

Apps can be sorted by installs and reviews. The data gives a clear picture of the apps: you can find out which categories are the most expensive, which are the most popular, and which have the most installs. Various other comparisons can also be made based on the data in the dataset.
Dataset on Transit Agency Open Data Provision and Uptake for and by App...
figshare.com
xlsx
Updated Jun 21, 2025
Cite
Mahtot Gebresselassie; Melanie Baljko (2025). Dataset on Transit Agency Open Data Provision and Uptake for and by App Developers.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.26771650.v1
Explore at:
xlsx (Available download formats)
Unique identifier
https://doi.org/10.6084/m9.figshare.26771650.v1
Dataset updated
Jun 21, 2025
Dataset provided by
Figshare (http://figshare.com/)
Authors
Mahtot Gebresselassie; Melanie Baljko
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The research examines transit agencies' open data provision, transit agencies' relationship with developers of transit apps as open data users, and transit apps as open data products in the context of legislated disability regulations in public transportation. Our investigation focused on the transit agencies of 50 of the most populous cities in the United States. We used data collected from transit agencies' websites, open data portals, smartphone app distribution platforms such as Google Play and the App Store, and the open web. A description of each dataset is available in the document titled "Data Description".

Multilingual Mobile App Review Dataset August 2025

kaggle.com

Updated Jul 31, 2025

Cite

Pratyush Puri (2025). Multilingual Mobile App Review Dataset August 2025 [Dataset]. https://www.kaggle.com/datasets/pratyushpuri/multilingual-mobile-app-reviews-dataset-2025

Explore at:

Croissant: Croissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jul 31, 2025

Dataset provided by

Kaggle (http://kaggle.com/)

Authors

Pratyush Puri

License

Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Multilingual Mobile App Reviews Dataset 2025

Overview

This comprehensive synthetic dataset contains 2,514 authentic mobile app reviews spanning 40+ popular applications across 24 different languages, making it ideal for multilingual NLP, sentiment analysis, and cross-cultural user behavior research.

Dataset Statistics

Total Records: 2,514 reviews
Columns: 15 features
Languages Covered: 24 international languages
Apps Included: 40+ popular mobile applications
Time Range: 2023-2025 (2-year span)
File Format: CSV
Data Quality: Intentionally includes missing values and mixed data types for data cleaning practice

Column Specifications

Column Name	Data Type	Description	Sample Values	Null Count
`review_id`	Integer	Unique identifier for each review	1, 2, 3, ...	0
`user_id`	String*	User identifier (should be integer)	"1967825", "9242600"	0
`app_name`	String	Name of the mobile application	WhatsApp, Instagram, TikTok	0
`app_category`	String	Application category	Social Networking, Entertainment	0
`review_text`	String	Multilingual review content	"This app is amazing!"	63
`review_language`	String	ISO language code	en, es, fr, zh, hi, ar	0
`rating`	Mixed*	App rating (1.0-5.0, some as strings)	4.5, "3.2", 1.1	38
`review_date`	DateTime	Timestamp of review submission	2024-10-09 19:26:40	0
`verified_purchase`	Boolean	Purchase verification status	True, False	0
`device_type`	String	Device platform	Android, iOS, iPad, Windows Phone	0
`num_helpful_votes`	Mixed*	Helpfulness votes (some as strings)	65, "209", 163	0
`user_age`	Float*	User age (should be integer)	14.0, 18.0, 67.0	0
`user_country`	String	User's country	China, Germany, Nigeria	50
`user_gender`	String	User gender	Male, Female, Non-binary, Prefer not to say	88
`app_version`	String	Application version number	1.4, v8.9, 2.8.37.5926	25

Note: Data types marked with asterisk require cleaning/conversion
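
The conversions flagged by the asterisks could be sketched as follows. This is a minimal illustration using only the Python standard library, not the dataset author's own cleaning code; the helper names `to_int`, `to_float`, and `clean_row` are hypothetical, and the sample row reuses values from the table above.

```python
import math
from typing import Optional


def to_int(value) -> Optional[int]:
    """Convert ints, floats such as 14.0, or numeric strings such as "209" to int.

    Returns None for missing or unparseable values.
    """
    if value is None or value == "":
        return None
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return None


def to_float(value) -> Optional[float]:
    """Convert numbers or numeric strings such as "3.2" to float.

    Returns None for missing, unparseable, or NaN values.
    """
    if value is None or value == "":
        return None
    try:
        result = float(value)
    except (TypeError, ValueError):
        return None
    return None if math.isnan(result) else result


def clean_row(row: dict) -> dict:
    """Return a copy of one review record with the asterisked columns coerced."""
    cleaned = dict(row)
    cleaned["user_id"] = to_int(row.get("user_id"))
    cleaned["rating"] = to_float(row.get("rating"))
    cleaned["num_helpful_votes"] = to_int(row.get("num_helpful_votes"))
    cleaned["user_age"] = to_int(row.get("user_age"))
    return cleaned


# Hypothetical record assembled from the sample values in the column table.
sample = {
    "review_id": 1,
    "user_id": "1967825",
    "rating": "3.2",
    "num_helpful_votes": "209",
    "user_age": 14.0,
}
print(clean_row(sample))
```

Returning None rather than raising keeps the missing ratings visible as nulls, so they can be counted or imputed in a later step.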

Language Distribution

The dataset includes reviews in 24 languages:
- European: English (en), Spanish (es), French (fr), German (de), Italian (it), Russian (ru), Polish (pl), Dutch (nl), Swedish (sv), Danish (da), Norwegian (no), Finnish (fi)
- Asian: Chinese (zh), Hindi (hi), Japanese (ja), Korean (ko), Thai (th), Vietnamese (vi), Indonesian (id), Malay (ms)
- Other: Arabic (ar), Turkish (tr), Filipino (tl)

Application Categories

Reviews cover 18 distinct categories:
- Social Networking
- Entertainment
- Productivity
- Travel & Local
- Music & Audio
- Video Players & Editors
- Shopping
- Navigation
- Finance
- Communication
- Education
- Photography
- Dating
- Business
- Utilities
- Health & Fitness
- Games
- News & Magazines

Popular Apps Included

40+ applications including:
- Social: WhatsApp, Instagram, Facebook, Snapchat, TikTok, LinkedIn, Twitter, Reddit, Pinterest
- Entertainment: YouTube, Netflix, Spotify
- Productivity: Microsoft Office, Google Drive, Dropbox, OneDrive, Zoom, Discord
- Travel: Uber, Lyft, Airbnb, Booking.com, Google Maps, Waze
- Finance: PayPal, Venmo
- Education: Duolingo, Khan Academy, Coursera, Udemy
- Tools: Grammarly, Canva, Adobe Photoshop, VLC, MX Player

Geographic Distribution

Reviews from 24 countries across all continents:
- Asia: China, India, Japan, South Korea, Thailand, Vietnam, Indonesia, Malaysia, Philippines, Pakistan, Bangladesh
- Europe: Germany, United Kingdom, France, Italy, Spain, Russia, Turkey, Poland
- Americas: United States, Canada, Brazil, Mexico
- Oceania: Australia
- Africa: Nigeria

Data Quality Features

Intentional data challenges for learning:
- Missing Values: Strategic nulls in review_text (63), rating (38), user_country (50), user_gender (88), app_version (25)
- Data Type Issues:
  - user_id stored as strings (should be integers)
  - user_age as floats (should be integers)
  - Some ratings as strings (should be floats)
  - Some num_helpful_votes as strings (should be integers)
- Mixed Version Formats: "1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"
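
One way the mixed version formats listed above might be normalized is sketched below. The regular expression and the `parse_version` helper are assumptions made for illustration, covering only the four formats quoted in the description; the actual file may contain other variants.

```python
import re
from typing import Optional, Tuple

# Assumed pattern: optional leading "v", dot-separated integers,
# and an optional "-suffix" such as "-beta".
VERSION_RE = re.compile(r"^v?(\d+(?:\.\d+)*)(?:-([A-Za-z0-9.]+))?$")

ParsedVersion = Tuple[Tuple[int, ...], Optional[str]]


def parse_version(raw: Optional[str]) -> Optional[ParsedVersion]:
    """Split a version string into (numeric parts, suffix).

    Returns None for missing values or strings that do not match the pattern.
    """
    if raw is None:
        return None
    match = VERSION_RE.match(raw.strip())
    if match is None:
        return None
    numbers = tuple(int(part) for part in match.group(1).split("."))
    return numbers, match.group(2)


# The four formats quoted in the dataset description.
for raw in ["1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"]:
    print(raw, "->", parse_version(raw))
```

Keeping the numeric parts as a tuple of integers means versions compare numerically, so (1, 10) sorts after (1, 4), which plain string comparison would get wrong.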

Use Cases

This dataset is perfect for:
- Multilingual NLP projects and sentiment analysis
- Cross-cultural user behavior analysis
- App store analytics and rating prediction
- Data cleaning and preprocessing practice
- Text classification across multiple languages
- Time series analysis of app reviews
- Geographic sentiment analysis
- Data engineering pipeline development

Data Cleaning Opportunities

Convert string IDs to integers
Standardize rating values to float
Han...

Cite

Business of Apps (2025). App Store Data (2025) [Dataset]. https://www.businessofapps.com/data/app-stores/

App Store Data (2025)

Explore at:

33 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Aug 1, 2025

Dataset authored and provided by

Business of Apps

License

Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0): https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically

Description

Apple App Store Key Statistics; Apps & Games in the Apple App Store; Apps in the Apple App Store; Games in the Apple App Store; Most Popular Apple App Store Categories; Paid vs Free Apps in Apple App...
