Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
During the study period
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
We surveyed 10,208 people from more than 15 countries on their mobile app usage behavior. The countries include USA, China, Japan, Germany, France, Brazil, UK, Italy, Russia, India, Canada, Spain, Australia, Mexico, and South Korea. We asked respondents about: (1) their mobile app user behavior in terms of mobile app usage, including the app stores they use, what triggers them to look for apps, why they download apps, why they abandon apps, and the types of apps they download. (2) their demographics including gender, age, marital status, nationality, country of residence, first language, ethnicity, education level, occupation, and household income (3) their personality using the Big-Five personality traits This dataset contains the results of the survey.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset provides detailed, event-level records of mobile app feature usage, including user interactions, device context, session information, and user segmentation. It enables product teams and UX researchers to analyze feature adoption rates, engagement patterns, and user cohorts, supporting data-driven decisions for app improvement and user experience optimization.
This dataset encompasses mobile smartphone application (app) usage, collected from over 150,000 triple-opt-in first-party US Daily Active Users (DAU). Use it for measurement, attribution or surveying to understand the why. iOS and Android operating system coverage.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
App Download Key StatisticsApp and Game DownloadsiOS App and Game DownloadsGoogle Play App and Game DownloadsGame DownloadsiOS Game DownloadsGoogle Play Game DownloadsApp DownloadsiOS App...
https://www.paradoxintelligence.com/termshttps://www.paradoxintelligence.com/terms
App download rankings, usage metrics, and user engagement data (iOS/Android)
Do you know how much time you spend on an app? Do you know the total use time of a day or average use time of an app?
App usage in Playstore provides that data.
It lists the usage time of apps for each day.
Use the test data to find the Total Minutes that we can use the given app in a day.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Apple App Store Key StatisticsApps & Games in the Apple App StoreApps in the Apple App StoreGames in the Apple App StoreMost Popular Apple App Store CategoriesPaid vs Free Apps in Apple App...
https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy
The app analytics market, valued at $7.29 billion in 2025, is experiencing robust growth, projected to expand at a compound annual growth rate (CAGR) of 21.09% from 2025 to 2033. This surge is driven by several key factors. The increasing adoption of mobile applications across diverse industries, coupled with the rising need for businesses to understand user behavior and optimize app performance, fuels the demand for sophisticated analytics solutions. Furthermore, advancements in data analytics technologies, including artificial intelligence (AI) and machine learning (ML), are enabling more insightful and actionable data analysis, further propelling market expansion. The diverse application of app analytics across marketing/advertising, revenue generation, and in-app performance monitoring across various sectors like BFSI, e-commerce, media, travel and tourism, and IT and telecom significantly contributes to this growth. The market is segmented by deployment (mobile apps and website/desktop apps) and end-user industry, with mobile app analytics currently dominating due to the widespread adoption of smartphones. The competitive landscape is characterized by a mix of established technology giants like Google and Amazon alongside specialized app analytics providers like AppsFlyer and Mixpanel. These companies are continuously innovating, integrating new technologies, and expanding their product offerings to cater to the evolving needs of businesses. While the North American market currently holds a significant share, the Asia-Pacific region is expected to witness substantial growth in the coming years driven by increasing smartphone penetration and digitalization initiatives. However, factors like data privacy concerns and the rising complexity of integrating various analytics tools could pose challenges to market growth. Nonetheless, the overall outlook for the app analytics market remains positive, indicating substantial opportunities for players across the value chain. Recent developments include: June 2024 - Comscore and Kochava unveiled an innovative performance media measurement solution, providing marketers with enhanced insights. This cutting-edge cross-screen solution empowers marketers to understand better how linear TV ad campaigns impact both online and offline actions. By integrating Comscore’s Exact Commercial Ratings (ECR) data with Kochava’s sophisticated marketing mix modeling, the solution facilitates the measurement of crucial metrics, including mobile app activities (such as installs and in-app purchases) and website interactions., June 2024 - AppsFlyer announced its integration of the Data Collaboration Platform with Start.io, an omnichannel advertising platform that focuses on real-time mobile audiences for publishers. Through this collaboration, businesses leveraging the AppsFlyer Data Collaboration Platform can merge their Start.io data with campaign metrics and audience insights, creating a more comprehensive dataset for precise audience targeting.. Key drivers for this market are: Increasing Usage of Mobile/Web Apps Across Various End-user Industries, Increasing Adoption of Technologies like 5G Technology and Deeper Penetration of Smartphones; Increase in the Amount of Time Spent on Mobile Devices Coupled With the Increasing Focus on Enhancing Customer Experience. Potential restraints include: Increasing Usage of Mobile/Web Apps Across Various End-user Industries, Increasing Adoption of Technologies like 5G Technology and Deeper Penetration of Smartphones; Increase in the Amount of Time Spent on Mobile Devices Coupled With the Increasing Focus on Enhancing Customer Experience. Notable trends are: Media and Entertainment Industry Expected to Capture Significant Share.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ABSTRACT The increasing use of mobile applications have been escalating with the increasing use of smartphones. In the present study, we examine (a) the adoption behavior of mobile apps using the extended TAM framework, and (b) whether adoption leads to subsequent use behavior and switching intentions. Based on data collected from two surveys in India we test the conceptual model of extended TAM and the effects of behavior on switching intentions using factor analysis and structural equation modeling. The major findings indicate a significant effect of most predictor variables on the perceived usefulness and perceived ease of use of apps. Further, we found a significant effect of behavioral intention on use behavior and subsequent switching intentions to apps from computers/laptops.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is the dataset used for paper: "A Recommender System of Buggy App Checkers for App Store Moderators", published on the International Conference on Mobile Software Engineering and Systems (MOBILESoft) in 2015.
Dataset Collection We built a dataset that consists of a random sample of Android app metadata and user reviews available on the Google Play Store on January and March 2014. Since the Google Play Store is continuously evolving (adding, removing and/or updating apps), we updated the dataset twice. The dataset D1 contains available apps in the Google Play Store in January 2014. Then, we created a new snapshot (D2) of the Google Play Store in March 2014.
The apps belong to the 27 different categories defined by Google (at the time of writing the paper), and the 4 predefined subcategories (free, paid, new_free, and new_paid). For each category-subcategory pair (e.g. tools-free, tools-paid, sports-new_free, etc.), we collected a maximum of 500 samples, resulting in a median number of 1.978 apps per category.
For each app, we retrieved the following metadata: name, package, creator, version code, version name, number of downloads, size, upload date, star rating, star counting, and the set of permission requests.
In addition, for each app, we collected up to a maximum of the latest 500 reviews posted by users in the Google Play Store. For each review, we retrieved its metadata: title, description, device, and version of the app. None of these fields were mandatory, thus several reviews lack some of these details. From all the reviews attached to an app, we only considered the reviews associated with the latest version of the app —i.e., we discarded unversioned and old-versioned reviews. Thus, resulting in a corpus of 1,402,717 reviews (2014 Jan.).
Dataset Stats Some stats about the datasets:
D1 (Jan. 2014) contains 38,781 apps requesting 7,826 different permissions, and 1,402,717 user reviews.
D2 (Mar. 2014) contains 46,644 apps and 9,319 different permission requests, and 1,361,319 user reviews.
Additional stats about the datasets are available here.
Dataset Description To store the dataset, we created a graph database with Neo4j. This dataset therefore consists of a graph describing the apps as nodes and edges. We chose a graph database because the graph visualization helps to identify connections among data (e.g., clusters of apps sharing similar sets of permission requests).
In particular, our dataset graph contains six types of nodes: - APP nodes containing metadata of each app, - PERMISSION nodes describing permission types, - CATEGORY nodes describing app categories, - SUBCATEGORY nodes describing app subcategories, - USER_REVIEW nodes storing user reviews. - TOPIC topics mined from user reviews (using LDA).
Furthermore, there are five types of relationships between APP nodes and each of the remaining nodes:
Dataset Files Info
Neo4j 2.0 Databases
googlePlayDB1-Jan2014_neo4j_2_0.rar
googlePlayDB2-Mar2014_neo4j_2_0.rar We provide two Neo4j databases containing the 2 snapshots of the Google Play Store (January and March 2014). These are the original databases created for the paper. The databases were created with Neo4j 2.0. In particular with the tool version 'Neo4j 2.0.0-M06 Community Edition' (latest version available at the time of implementing the paper in 2014).
Neo4j 3.5 Databases
googlePlayDB1-Jan2014_neo4j_3_5_28.rar
googlePlayDB2-Mar2014_neo4j_3_5_28.rar Currently, the version Neo4j 2.0 is deprecated and it is not available for download in the official Neo4j Download Center. We have migrated the original databases (Neo4j 2.0) to Neo4j 3.5.28. The databases can be opened with the tool version: 'Neo4j Community Edition 3.5.28'. The tool can be downloaded from the official Neo4j Donwload page.
In order to open the databases with more recent versions of Neo4j, the databases must be first migrated to the corresponding version. Instructions about the migration process can be found in the Neo4j Migration Guide.
First time the Neo4j database is connected, it could request credentials. The username and pasword are: neo4j/neo4j
The number of Instagram users in Saudi Arabia was forecast to continuously increase between 2024 and 2028 by in total 1.6 million users (+10.64 percent). According to this forecast, in 2028, the Instagram user base will have increased for the fifth consecutive year to 16.64 million users. User figures, shown here with regards to the platform instagram, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of Instagram users in countries like Bahrain and Oman.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This comprehensive synthetic dataset contains 2,514 authentic mobile app reviews spanning 40+ popular applications across 24 different languages, making it ideal for multilingual NLP, sentiment analysis, and cross-cultural user behavior research.
Column Name | Data Type | Description | Sample Values | Null Count |
---|---|---|---|---|
review_id | Integer | Unique identifier for each review | 1, 2, 3, ... | 0 |
user_id | String* | User identifier (should be integer) | "1967825", "9242600" | 0 |
app_name | String | Name of the mobile application | WhatsApp, Instagram, TikTok | 0 |
app_category | String | Application category | Social Networking, Entertainment | 0 |
review_text | String | Multilingual review content | "This app is amazing!" | 63 |
review_language | String | ISO language code | en, es, fr, zh, hi, ar | 0 |
rating | Mixed* | App rating (1.0-5.0, some as strings) | 4.5, "3.2", 1.1 | 38 |
review_date | DateTime | Timestamp of review submission | 2024-10-09 19:26:40 | 0 |
verified_purchase | Boolean | Purchase verification status | True, False | 0 |
device_type | String | Device platform | Android, iOS, iPad, Windows Phone | 0 |
num_helpful_votes | Mixed* | Helpfulness votes (some as strings) | 65, "209", 163 | 0 |
user_age | Float* | User age (should be integer) | 14.0, 18.0, 67.0 | 0 |
user_country | String | User's country | China, Germany, Nigeria | 50 |
user_gender | String | User gender | Male, Female, Non-binary, Prefer not to say | 88 |
app_version | String | Application version number | 1.4, v8.9, 2.8.37.5926 | 25 |
Note: Data types marked with asterisk require cleaning/conversion
The dataset includes reviews in 24 languages: - European: English (en), Spanish (es), French (fr), German (de), Italian (it), Russian (ru), Polish (pl), Dutch (nl), Swedish (sv), Danish (da), Norwegian (no), Finnish (fi) - Asian: Chinese (zh), Hindi (hi), Japanese (ja), Korean (ko), Thai (th), Vietnamese (vi), Indonesian (id), Malay (ms) - Other: Arabic (ar), Turkish (tr), Filipino (tl)
Reviews cover 18 distinct categories:
- Social Networking
- Entertainment
- Productivity
- Travel & Local
- Music & Audio
- Video Players & Editors
- Shopping
- Navigation
- Finance
- Communication
- Education
- Photography
- Dating
- Business
- Utilities
- Health & Fitness
- Games
- News & Magazines
40+ applications including: - Social: WhatsApp, Instagram, Facebook, Snapchat, TikTok, LinkedIn, Twitter, Reddit, Pinterest - Entertainment: YouTube, Netflix, Spotify - Productivity: Microsoft Office, Google Drive, Dropbox, OneDrive, Zoom, Discord - Travel: Uber, Lyft, Airbnb, Booking.com, Google Maps, Waze - Finance: PayPal, Venmo - Education: Duolingo, Khan Academy, Coursera, Udemy - Tools: Grammarly, Canva, Adobe Photoshop, VLC, MX Player
Reviews from 24 countries across all continents: - Asia: China, India, Japan, South Korea, Thailand, Vietnam, Indonesia, Malaysia, Philippines, Pakistan, Bangladesh - Europe: Germany, United Kingdom, France, Italy, Spain, Russia, Turkey, Poland - Americas: United States, Canada, Brazil, Mexico - Oceania: Australia - Africa: Nigeria
Intentional data challenges for learning:
- Missing Values: Strategic nulls in review_text (63), rating (38), user_country (50), user_gender (88), app_version (25)
- Data Type Issues:
- user_id stored as strings (should be integers)
- user_age as floats (should be integers)
- Some ratings as strings (should be floats)
- Some helpful_votes as strings (should be integers)
- Mixed Version Formats: "1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"
This dataset is perfect for: - Multilingual NLP projects and sentiment analysis - Cross-cultural user behavior analysis - App store analytics and rating prediction - Data cleaning and preprocessing practice - Text classification across multiple languages - Time series analysis of app reviews - Geographic sentiment analysis - Data engineering pipeline development
The S3 dataset contains the behavior (sensors, statistics of applications, and voice) of 21 volunteers interacting with their smartphones for more than 60 days. The type of users is diverse, males and females in the age range from 18 until 70 have been considered in the dataset generation. The wide range of age is a key aspect, due to the impact of age in terms of smartphone usage. To generate the dataset the volunteers installed a prototype of the smartphone application in on their Android mobile phones.
All attributes of the different kinds of data are writed in a vector. The dataset contains the fellow vectors:
Sensors:
This type of vector contains data belonging to smartphone sensors (accelerometer and gyroscope) that has been acquired in a given windows of time. Each vector is obtained every 20 seconds, and the monitored features are:- Average of accelerometer and gyroscope values.- Maximum and minimum of accelerometer and gyroscope values.- Variance of accelerometer and gyroscope values.- Peak-to-peak (max-min) of X, Y, Z coordinates.- Magnitude for gyroscope and accelerometer.
Statistics:
These vectors contain data about the different applications used by the user recently. Each vector of statistics is calculated every 60 seconds and contains : - Foreground application counters (number of different and total apps) for the last minute and the last day.- Most common app ID and the number of usages in the last minute and the last day. - ID of the currently active app. - ID of the last active app prior to the current one.- ID of the application most frequently utilized prior to the current application. - Bytes transmitted and received through the network interfaces.
Voice:
This kind of vector is generated when the microphone is active in a call o voice note. The speaker vector is an embedding, extracted from the audio, and it contains information about the user's identity. This vector, is usually named "x-vector" in the Speaker Recognition field, and it is calculated following the steps detailed in "egs/sitw/v2" for the Kaldi library, with the models available for the extraction of the embedding.
A summary of the details of the collected database.
- Users: 21 - Sensors vectors: 417.128 - Statistics app's usage vectors: 151.034 - Speaker vectors: 2.720 - Call recordings: 629 - Voice messages: 2.091
This dataset encompasses mobile app usage, web clickstream and location visitation behavior, collected from over 150,000 triple-opt-in first-party US Daily Active Users (DAU). The only omnichannel meter at scale representing iOS and Android platforms.
Includes ties to consumer demographics.
In-app audio, media and social ad exposure data included. Can be commissioned to build other in-app and account level visibility.
The number of social media users in Saudi Arabia was forecast to continuously increase between 2024 and 2029 by in total six million users (+28.05 percent). After the ninth consecutive increasing year, the social media user base is estimated to reach 27.42 million users and therefore a new peak in 2029. Notably, the number of social media users of was continuously increasing over the past years.The shown figures regarding social media users have been derived from survey data that has been processed to estimate missing demographics.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of social media users in countries like Israel and Kuwait.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
By [source]
This dataset contains comprehensive performance data of National Basketball Association (NBA) players during the 2019-20 season. It includes all the crucial performance metrics crucial to assess a player’s quality of play. Here, you can compare players across teams, positions and categories and gain deeper insight into their overall performance. This dataset includes useful statistics such as GP (Games Played), Player name, Position, Assists Turnovers Ratio, Blocks per Game, Fouls per Minutes Played, Rebounds per Game and more. Dive in to this detailed overview of NBA player performance and take your understanding of athletes within the organization to another level!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides an in-depth look into the performance of NBA Players throughout the 2019-20 season, allowing an informed analysis of various important statistics. There are a number of ways to use this dataset to both observe and compare players, teams and positions.
By looking at the data you can get an idea of how players are performing across all metrics. The “Points Per Game” metric is particularly useful as it allows quick comparison between different players and teams on their offensive ability. Additionally, exploratory analysis can be conducted by looking at metrics like rebounds or assists per game which allows one to make interesting observations within the game itself such as ball movement being a significant factor for team success.
This dataset also enables further comparison between players from different positions on particular metrics that might be position orientated or generic across all positions such as points per game (ppg). This includes adjusting for positional skill sets; For example guard’s field goal attempts might include more three point shots because it would benefit them more than larger forwards or centres who rely more heavily on in close shot attempts due to their size advantage over their opponents.
This dataset also allows for simple visualisation of player performance with respect to each other; For example one can view points scored against assists ratio when comparing multiple point guards etc., providing further insight into individual performances on certain metrics which otherwise could not be analysed quickly with traditional methods like statistical analysis only within similarly situated groups (e.g.: same position). Furthermore this data set could aid further research in emerging areas such as targeted marketing analytics where identify potential customers based off publically available data regarding factors like ppg et cetera which may highly affect team success orotemode profitability dynamicsincreasedancefficiencyoftheirownopponentteams etcet
- Develop an AI-powered recommendation system that can suggest optimal players to fill out a team based on their performances in the past season.
- Examine trends in player performance across teams and positions, allowing coaches and scouts to make informed decisions when evaluating talent.
- Create a web or mobile app that can compare the performances of multiple players, allowing users to explore different performance metrics head-to-head
If you use this dataset in your research, please credit the original authors. Data Source
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: assists-turnovers.csv | Column name | Description | |:--------------|:----------------------------------| | GP | Number of games played. (Integer) | | Player | Player name. (String) | | Position | Player position. (String) |
File: blocks.csv | Column name | Description | |:--------------|:----------------------------------| | GP | Number of games played. (Integer) | | Player | Player name. (String) | | Position | Player position. (String) |
File: fouls-minutes.csv | Column name | Description | |:--------------|:----------------------...
https://github.com/MIT-LCP/license-and-dua/tree/master/draftshttps://github.com/MIT-LCP/license-and-dua/tree/master/drafts
We present the first multi-year mobile sensing datasets. Our multi-year data collection studies span four years (10 weeks each year, from 2018 to 2021). The four datasets contain data collected from 705 person-years (497 unique participants) with diverse racial, ability, and immigrant backgrounds. Each year, participants would install a mobile app on their phones and wear a fitness tracker. The app and wearable device passively track multiple sensor streams in the background 24×7, including location, phone usage, calls, Bluetooth, physical activity, and sleep behavior. In addition, participants completed weekly short surveys and two comprehensive surveys on health behaviors and symptoms, social well-being, emotional states, mental health, and other metrics. Our dataset analysis indicates that our datasets capture a wide range of daily human routines, and reveal insights between daily behaviors and important well-being metrics (e.g., depression status). We envision our multi-year datasets can support the ML community in developing generalizable longitudinal behavior modeling algorithms.
Intending to cover the existing gap regarding behavioral datasets modelling interactions of users with individual a multiple devices in Smart Office to later authenticate them continuously, we publish the following collection of datasets, which has been generated after having five users interacting for 60 days with their personal computer and mobile devices. Below you can find a brief description of each dataset.Dataset 1 (2.3 GB). This dataset contains 92975 vectors of features (8096 per vector) that model the interactions of the five users with their personal computers. Each vector contains aggregated data about keyboard and mouse activity, as well as application usage statistics. More info about features meaning can be found in the readme file. Originally, the number of features of this dataset was 24 065 but after filtering the constant features, this number was reduced to 8096. There was a high number of constant features to 0 since each possible digraph (two keys combination) was considered when collecting the data. However, there are many unusual digraphs that the users never introduced in their computers, so these features were deleted in the uploaded dataset.Dataset 2 (8.9 MB). This dataset contains 61918 vectors of features (15 per vector)that model the interactions of the five users with their mobile devices. Each vector contains aggregated data about application usage statistics. More info about features meaning can be found in the readme file.Dataset 3 (28.9 MB). This dataset contains 133590vectors of features (42 per vector)that model the interactions of the five users with their mobile devices. Each vector contains aggregated data about the gyroscope and Accelerometer sensors.More info about features meaning can be found in the readme file.Dataset 4 (162.4 MB). This dataset contains 145465vectors of features (241 per vector)that model the interactions of the five users with both personal computers and mobile devices. Each vector contains the aggregation of the most relevant features of both devices. More info about features meaning can be found in the readme file.Dataset 5 (878.7 KB). This dataset is composed of 7 datasets. Each one of them contains an aggregation of feature vectors generated from the active/inactive intervals of personal computers and mobile devices by considering different time windows ranging from 1h to 24h.1h: 4074 vectors2h: 2149 vectors3h: 1470 vectors4h: 1133 vectors6h: 770 vectors12h: 440 vectors24h: 229 vectors
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
During the study period