100+ datasets found

RICO dataset
kaggle.com
Updated Dec 2, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Onur Gunes (2021). RICO dataset [Dataset]. https://www.kaggle.com/datasets/onurgunes1993/rico-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 2, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Onur Gunes
Description
Context

Data-driven models help mobile app designers understand best practices and trends, and can be used to make predictions about design performance and support the creation of adaptive UIs. This paper presents Rico, the largest repository of mobile app designs to date, created to support five classes of data-driven applications: design search, UI layout generation, UI code generation, user interaction modeling, and user perception prediction. To create Rico, we built a system that combines crowdsourcing and automation to scalably mine design and interaction data from Android apps at runtime. The Rico dataset contains design data from more than 9.3k Android apps spanning 27 categories. It exposes visual, textual, structural, and interactive design properties of more than 66k unique UI screens. To demonstrate the kinds of applications that Rico enables, we present results from training an autoencoder for UI layout similarity, which supports query-by-example search over UIs.

Content

Rico was built by mining Android apps at runtime via human-powered and programmatic exploration. Like its predecessor ERICA, Rico’s app mining infrastructure requires no access to — or modification of — an app’s source code. Apps are downloaded from the Google Play Store and served to crowd workers through a web interface. When crowd workers use an app, the system records a user interaction trace that captures the UIs visited and the interactions performed on them. Then, an automated agent replays the trace to warm up a new copy of the app and continues the exploration programmatically, leveraging a content-agnostic similarity heuristic to efficiently discover new UI states. By combining crowdsourcing and automation, Rico can achieve higher coverage over an app’s UI states than either crawling strategy alone. In total, 13 workers recruited on UpWork spent 2,450 hours using apps on the platform over five months, producing 10,811 user interaction traces. After collecting a user trace for an app, we ran the automated crawler on the app for one hour.

Acknowledgements

UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN https://interactionmining.org/rico

Inspiration

The Rico dataset is large enough to support deep learning applications. We trained an autoencoder to learn an embedding for UI layouts, and used it to annotate each UI with a 64-dimensional vector representation encoding visual layout. This vector representation can be used to compute structurally — and often semantically — similar UIs, supporting example-based search over the dataset. To create training inputs for the autoencoder that embed layout information, we constructed a new image for each UI capturing the bounding box regions of all leaf elements in its view hierarchy, differentiating between text and non-text elements. Rico’s view hierarchies obviate the need for noisy image processing or OCR techniques to create these inputs.
Google Play Store Apps
kaggle.com
zip
Updated Feb 3, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lavanya (2019). Google Play Store Apps [Dataset]. https://www.kaggle.com/lava18/google-play-store-apps
Explore at:
zip(2037893 bytes)Available download formats
Dataset updated
Feb 3, 2019
Authors
Lavanya
Description
Context

While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix-like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern-day techniques (like dynamic page load) using JQuery making scraping more challenging.

Content

Each app (row) has values for catergory, rating, size, and more.

Acknowledgements

This information is scraped from the Google Play Store. This app information would not be available without it.

Inspiration

The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!
Unlocking User Sentiment: The App Store Reviews Dataset
crawlfeeds.com
json, zip
Updated Jun 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Crawl Feeds (2025). Unlocking User Sentiment: The App Store Reviews Dataset [Dataset]. https://crawlfeeds.com/datasets/app-store-reviews-dataset
Explore at:
json, zipAvailable download formats
Dataset updated
Jun 20, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Description
This dataset offers a focused and invaluable window into user perceptions and experiences with applications listed on the Apple App Store. It is a vital resource for app developers, product managers, market analysts, and anyone seeking to understand the direct voice of the customer in the dynamic mobile app ecosystem.

Dataset Specifications:

Investment: $45.0

Status: Published and immediately available.

Category: Ratings and Reviews Data

Format: Compressed ZIP archive containing JSON files, ensuring easy integration into your analytical tools and platforms.

Volume: Comprises 10,000 unique app reviews, providing a robust sample for qualitative and quantitative analysis of user feedback.

Timeliness: Last crawled: (This field is blank in your provided info, which means its recency is currently unknown. If this were a real product, specifying this would be critical for its value proposition.)

Richness of Detail (11 Comprehensive Fields):

Each record in this dataset provides a detailed breakdown of a single App Store review, enabling multi-dimensional analysis:

Review Content:

review: The full text of the user's written feedback, crucial for Natural Language Processing (NLP) to extract themes, sentiment, and common keywords.

title: The title given to the review by the user, often summarizing their main point.

isEdited: A boolean flag indicating whether the review has been edited by the user since its initial submission. This can be important for tracking evolving sentiment or understanding user behavior.

Reviewer & Rating Information:

username: The public username of the reviewer, allowing for analysis of engagement patterns from specific users (though not personally identifiable).

rating: The star rating (typically 1-5) given by the user, providing a quantifiable measure of satisfaction.

App & Origin Context:

app_name: The name of the application being reviewed.

app_id: A unique identifier for the application within the App Store, enabling direct linking to app details or other datasets.

country: The country of the App Store storefront where the review was left, allowing for geographic segmentation of feedback.

Metadata & Timestamps:

_id: A unique identifier for the specific review record in the dataset.

crawled_at: The timestamp indicating when this particular review record was collected by the data provider (Crawl Feeds).

date: The original date the review was posted by the user on the App Store.

Expanded Use Cases & Analytical Applications:

This dataset is a goldmine for understanding what users truly think and feel about mobile applications. Here's how it can be leveraged:

Product Development & Improvement:

Bug Detection & Prioritization: Analyze negative review text to identify recurring technical issues, crashes, or bugs, allowing developers to prioritize fixes based on user impact.

Feature Requests & Roadmap Prioritization: Extract feature suggestions from positive and neutral review text to inform future product roadmap decisions and develop features users actively desire.

User Experience (UX) Enhancement: Understand pain points related to app design, navigation, and overall usability by analyzing common complaints in the review field.

Version Impact Analysis: If integrated with app version data, track changes in rating and sentiment after new app updates to assess the effectiveness of bug fixes or new features.

Market Research & Competitive Intelligence:

Competitor Benchmarking: Analyze reviews of competitor apps (if included or combined with similar datasets) to identify their strengths, weaknesses, and user expectations within a specific app category.

Market Gap Identification: Discover unmet user needs or features that users desire but are not adequately provided by existing apps.

Niche Opportunities: Identify specific use cases or user segments that are underserved based on recurring feedback.

Marketing & App Store Optimization (ASO):

Sentiment Analysis: Perform sentiment analysis on the review and title fields to gauge overall user satisfaction, pinpoint specific positive and negative aspects, and track sentiment shifts over time.

Keyword Optimization: Identify frequently used keywords and phrases in reviews to optimize app store listings, improving discoverability and search ranking.

Messaging Refinement: Understand how users describe and use the app in their own words, which can inform marketing copy and advertising campaigns.

Reputation Management: Monitor rating trends and identify critical reviews quickly to facilitate timely responses and proactive customer engagement.

Academic & Data Science Research:

Natural Language Processing (NLP): The review and title fields are excellent for training and testing NLP models for sentiment analysis, topic modeling, named entity recognition, and text summarization.

User Behavior Analysis: Study patterns in rating distribution, isEdited status, and date to understand user engagement and feedback cycles.

Cross-Country Comparisons: Analyze country-specific reviews to understand regional differences in app perception, feature preferences, or cultural nuances in feedback.

This App Store Reviews dataset provides a direct, unfiltered conduit to understanding user needs and ultimately driving better app performance and greater user satisfaction. Its structured format and granular detail make it an indispensable asset for data-driven decision-making in the mobile app industry.

Multilingual Mobile App Review Dataset August 2025

kaggle.com

Updated Jul 31, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Pratyush Puri (2025). Multilingual Mobile App Review Dataset August 2025 [Dataset]. https://www.kaggle.com/datasets/pratyushpuri/multilingual-mobile-app-reviews-dataset-2025

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jul 31, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Pratyush Puri

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Multilingual Mobile App Reviews Dataset 2025

Overview

This comprehensive synthetic dataset contains 2,514 authentic mobile app reviews spanning 40+ popular applications across 24 different languages, making it ideal for multilingual NLP, sentiment analysis, and cross-cultural user behavior research.

Dataset Statistics

Total Records: 2,514 reviews
Columns: 15 features
Languages Covered: 24 international languages
Apps Included: 40+ popular mobile applications
Time Range: 2023-2025 (2-year span)
File Format: CSV
Data Quality: Intentionally includes missing values and mixed data types for data cleaning practice

Column Specifications

Column Name	Data Type	Description	Sample Values	Null Count
`review_id`	Integer	Unique identifier for each review	1, 2, 3, ...	0
`user_id`	String*	User identifier (should be integer)	"1967825", "9242600"	0
`app_name`	String	Name of the mobile application	WhatsApp, Instagram, TikTok	0
`app_category`	String	Application category	Social Networking, Entertainment	0
`review_text`	String	Multilingual review content	"This app is amazing!"	63
`review_language`	String	ISO language code	en, es, fr, zh, hi, ar	0
`rating`	Mixed*	App rating (1.0-5.0, some as strings)	4.5, "3.2", 1.1	38
`review_date`	DateTime	Timestamp of review submission	2024-10-09 19:26:40	0
`verified_purchase`	Boolean	Purchase verification status	True, False	0
`device_type`	String	Device platform	Android, iOS, iPad, Windows Phone	0
`num_helpful_votes`	Mixed*	Helpfulness votes (some as strings)	65, "209", 163	0
`user_age`	Float*	User age (should be integer)	14.0, 18.0, 67.0	0
`user_country`	String	User's country	China, Germany, Nigeria	50
`user_gender`	String	User gender	Male, Female, Non-binary, Prefer not to say	88
`app_version`	String	Application version number	1.4, v8.9, 2.8.37.5926	25

Note: Data types marked with asterisk require cleaning/conversion

Language Distribution

The dataset includes reviews in 24 languages: - European: English (en), Spanish (es), French (fr), German (de), Italian (it), Russian (ru), Polish (pl), Dutch (nl), Swedish (sv), Danish (da), Norwegian (no), Finnish (fi) - Asian: Chinese (zh), Hindi (hi), Japanese (ja), Korean (ko), Thai (th), Vietnamese (vi), Indonesian (id), Malay (ms) - Other: Arabic (ar), Turkish (tr), Filipino (tl)

Application Categories

Reviews cover 18 distinct categories: - Social Networking - Entertainment
- Productivity - Travel & Local - Music & Audio - Video Players & Editors - Shopping - Navigation - Finance - Communication - Education - Photography - Dating - Business - Utilities - Health & Fitness - Games - News & Magazines

Popular Apps Included

40+ applications including: - Social: WhatsApp, Instagram, Facebook, Snapchat, TikTok, LinkedIn, Twitter, Reddit, Pinterest - Entertainment: YouTube, Netflix, Spotify - Productivity: Microsoft Office, Google Drive, Dropbox, OneDrive, Zoom, Discord - Travel: Uber, Lyft, Airbnb, Booking.com, Google Maps, Waze - Finance: PayPal, Venmo - Education: Duolingo, Khan Academy, Coursera, Udemy - Tools: Grammarly, Canva, Adobe Photoshop, VLC, MX Player

Geographic Distribution

Reviews from 24 countries across all continents: - Asia: China, India, Japan, South Korea, Thailand, Vietnam, Indonesia, Malaysia, Philippines, Pakistan, Bangladesh - Europe: Germany, United Kingdom, France, Italy, Spain, Russia, Turkey, Poland - Americas: United States, Canada, Brazil, Mexico - Oceania: Australia - Africa: Nigeria

Data Quality Features

Intentional data challenges for learning: - Missing Values: Strategic nulls in review_text (63), rating (38), user_country (50), user_gender (88), app_version (25) - Data Type Issues: - user_id stored as strings (should be integers) - user_age as floats (should be integers)
- Some ratings as strings (should be floats) - Some helpful_votes as strings (should be integers) - Mixed Version Formats: "1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"

Use Cases

This dataset is perfect for: - Multilingual NLP projects and sentiment analysis - Cross-cultural user behavior analysis - App store analytics and rating prediction - Data cleaning and preprocessing practice - Text classification across multiple languages - Time series analysis of app reviews - Geographic sentiment analysis - Data engineering pipeline development

Data Cleaning Opportunities

Convert string IDs to integers
Standardize rating values to float
Han...

h
Data from: MobileViews
huggingface.co
Updated Nov 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
mllm (2024). MobileViews [Dataset]. https://huggingface.co/datasets/mllmTeam/MobileViews
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 14, 2024
Authors
mllm
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
🚀 MobileViews: A Large-Scale Mobile GUI Dataset

MobileViews is a large-scale dataset designed to support research on mobile agents and mobile user interface (UI) analysis. The first release, MobileViews-600K, includes over 600,000 mobile UI screenshot-view hierarchy (VH) pairs collected from over 20,000 apps on the Google Play Store. This dataset is based on the DroidBot, which we have optimized for large-scale data collection, capturing more comprehensive interaction details while… See the full description on the dataset page: https://huggingface.co/datasets/mllmTeam/MobileViews.
b
App Store Data (2025)
businessofapps.com
Updated Aug 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Business of Apps (2025). App Store Data (2025) [Dataset]. https://www.businessofapps.com/data/app-stores/
Explore at:
Dataset updated
Aug 1, 2025
Dataset authored and provided by
Business of Apps
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Apple App Store Key StatisticsApps & Games in the Apple App StoreApps in the Apple App StoreGames in the Apple App StoreMost Popular Apple App Store CategoriesPaid vs Free Apps in Apple App...
IOS App Store reviews dataset
crawlfeeds.com
csv, zip
Updated Jul 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Crawl Feeds (2025). IOS App Store reviews dataset [Dataset]. https://crawlfeeds.com/datasets/ios-app-store-reviews-dataset
Explore at:
zip, csvAvailable download formats
Dataset updated
Jul 7, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Description
Unlock the power of user feedback with our iOS App Store Reviews Dataset, a comprehensive collection of reviews from thousands of apps across various categories. This robust App Store dataset includes essential details such as app names, ratings, user comments, timestamps, and more, offering valuable insights into user experiences and preferences.

Perfect for app developers, marketers, and data analysts, this dataset allows you to conduct sentiment analysis, monitor app performance, and identify trends in user behavior. By leveraging the iOS App Store Reviews Dataset, you can refine app features, optimize marketing strategies, and elevate user satisfaction.

Whether you’re tracking mobile app trends, analyzing specific app categories, or developing data-driven strategies, this App Store dataset is an indispensable tool. Download the iOS App Store Reviews Dataset today or contact us for custom datasets tailored to your unique project requirements.

Ready to take your app insights to the next level? Get the iOS App Store Reviews Dataset now or explore our custom data solutions to meet your needs.
f
Data from: Testing of Mobile Applications in the Wild: A Large-Scale...
figshare.com
txt
Updated Mar 25, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fabiano Pecorelli (2020). Testing of Mobile Applications in the Wild: A Large-Scale Empirical Study on Android Apps [Dataset]. http://doi.org/10.6084/m9.figshare.9980672.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.9980672.v1
Dataset updated
Mar 25, 2020
Dataset provided by
figshare
Authors
Fabiano Pecorelli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Nowadays, mobile applications (a.k.a., apps) are used by over two billion users for every type of need, including social and emergency connectivity. Their pervasiveness in today world has inspired the software testing research community in devising approaches to allow developers to better test their apps and improve the quality of the tests being developed. In spite of this research effort, we still notice a lack of empirical analyses aiming at assessing the actual quality of test cases manually developed by mobile developers: this perspective could provide evidence-based findings on the future research directions in the field as well as on the current status of testing in the wild. As such, we performed a large-scale empirical study targeting 1,780 open-source Android apps and aiming at assessing (1) the extent to which these apps are actually tested, (2) how well-designed are the available tests, and (3) what is their effectiveness. The key results of our study show that mobile developers still tend not to properly test their apps, possibly because of time to market requirements. Furthermore, we discovered that the test cases of the considered apps have a low (i) design quality, both in terms of test code metrics and test smells, and (ii) effectiveness when considering code coverage as well as assertion density.
H
Worldwide Mobile App User Behavior Dataset
dataverse.harvard.edu
doc, xlsx
Updated Sep 28, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Dataverse (2014). Worldwide Mobile App User Behavior Dataset [Dataset]. http://doi.org/10.7910/DVN/27459
Explore at:
doc(56320), xlsx(7037534)Available download formats
Unique identifier
https://doi.org/10.7910/DVN/27459
Dataset updated
Sep 28, 2014
Dataset provided by
Harvard Dataverse
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Time period covered
2012
Area covered
Worldwide
Description
We surveyed 10,208 people from more than 15 countries on their mobile app usage behavior. The countries include USA, China, Japan, Germany, France, Brazil, UK, Italy, Russia, India, Canada, Spain, Australia, Mexico, and South Korea. We asked respondents about: (1) their mobile app user behavior in terms of mobile app usage, including the app stores they use, what triggers them to look for apps, why they download apps, why they abandon apps, and the types of apps they download. (2) their demographics including gender, age, marital status, nationality, country of residence, first language, ethnicity, education level, occupation, and household income (3) their personality using the Big-Five personality traits This dataset contains the results of the survey.
f
Dataset.
plos.figshare.com
figshare.com
xlsx
Updated Oct 25, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jennifer J. Lee; Mavra Ahmed; Rim Mouhaffel; Mary R. L’Abbé (2023). Dataset. [Dataset]. http://doi.org/10.1371/journal.pdig.0000360.s005
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pdig.0000360.s005
Dataset updated
Oct 25, 2023
Dataset provided by
PLOS Digital Health
Authors
Jennifer J. Lee; Mavra Ahmed; Rim Mouhaffel; Mary R. L’Abbé
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
There has been an increased emphasis on plant-based foods and diets. Although mobile technology has the potential to be a convenient and innovative tool to help consumers adhere to dietary guidelines, little is known about the content and quality of free, popular mobile health (mHealth) plant-based diet apps. The objective of the study was to assess the content and quality of free, popular mHealth apps supporting plant-based diets for Canadians. Free mHealth apps with high user ratings, a high number of user ratings, available on both Apple App and GooglePlay stores, and primarily marketed to help users follow plant-based diet were included. Using pre-defined search terms, Apple App and GooglePlay App stores were searched on December 22, 2020; the top 100 returns for each search term were screened for eligibility. Included apps were downloaded and assessed for quality by three dietitians/nutrition research assistants using the Mobile App Rating Scale (MARS) and the App Quality Evaluation (AQEL) scale. Of the 998 apps screened, 16 apps (mean user ratings±SEM: 4.6±0.1) met the eligibility criteria, comprising 10 recipe managers and meal planners, 2 food scanners, 2 community builders, 1 restaurant identifier, and 1 sustainability assessor. All included apps targeted the general population and focused on changing behaviors using education (15 apps), skills training (9 apps), and/or goal setting (4 apps). Although MARS (scale: 1–5) revealed overall adequate app quality scores (3.8±0.1), domain-specific assessments revealed high functionality (4.0±0.1) and aesthetic (4.0±0.2), but low credibility scores (2.4±0.1). The AQEL (scale: 0–10) revealed overall low score in support of knowledge acquisition (4.5±0.4) and adequate scores in other nutrition-focused domains (6.1–7.6). Despite a variety of free plant-based apps available with different focuses to help Canadians follow plant-based diets, our findings suggest a need for increased credibility and additional resources to complement the low support of knowledge acquisition among currently available plant-based apps. This research received no specific grant from any funding agency.
b
App Downloads Data (2025)
businessofapps.com
Updated Aug 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Business of Apps (2025). App Downloads Data (2025) [Dataset]. https://www.businessofapps.com/data/app-statistics/
Explore at:
Dataset updated
Aug 1, 2025
Dataset authored and provided by
Business of Apps
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
App Download Key StatisticsApp and Game DownloadsiOS App and Game DownloadsGoogle Play App and Game DownloadsGame DownloadsiOS Game DownloadsGoogle Play Game DownloadsApp DownloadsiOS App...
IOS application reviews dataset in English
crawlfeeds.com
csv, zip
Updated Jul 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Crawl Feeds (2025). IOS application reviews dataset in English [Dataset]. https://crawlfeeds.com/datasets/ios-application-reviews-dataset-in-english
Explore at:
zip, csvAvailable download formats
Dataset updated
Jul 8, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Description
This comprehensive iOS application reviews dataset contains thousands of authentic user reviews from the Apple App Store in English. The dataset provides valuable insights for app developers, marketers, and researchers studying mobile application performance and user sentiment.

Key Features:

Real user reviews from popular iOS apps

Star ratings from 1 to 5 stars

Review dates and timestamps

App store URLs and metadata

User demographics and location data

App version information

Review titles and detailed feedback

Applications: Perfect for sentiment analysis, app store optimization, mobile app development research, user experience studies, and competitive analysis. This dataset enables businesses to understand user preferences, identify app improvement opportunities, and develop better mobile applications.

Data Quality: All reviews are genuine user feedback collected from the official Apple App Store, ensuring authenticity and reliability for research and business intelligence purposes. The dataset covers various app categories including fitness, shopping, education, entertainment, and productivity applications.
Dataset of the paper titled "Strategies to Embed Human Values in Mobile...
zenodo.org
Updated Jan 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rifat Ara Shams; Rifat Ara Shams; Mojtaba Shahin; Mojtaba Shahin; Gillian Oliver; Gillian Oliver; Jon Whittle; Waqar Hussain; Harsha Perera; Arif Nurwidyantoro; Jon Whittle; Waqar Hussain; Harsha Perera; Arif Nurwidyantoro (2025). Dataset of the paper titled "Strategies to Embed Human Values in Mobile Apps: What do End-Users and Practitioners Think?" [Dataset]. http://doi.org/10.5281/zenodo.14627672
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.14627672
Dataset updated
Jan 10, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Rifat Ara Shams; Rifat Ara Shams; Mojtaba Shahin; Mojtaba Shahin; Gillian Oliver; Gillian Oliver; Jon Whittle; Waqar Hussain; Harsha Perera; Arif Nurwidyantoro; Jon Whittle; Waqar Hussain; Harsha Perera; Arif Nurwidyantoro
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is a dataset of the paper titled "Strategies to Embed Human Values in Mobile Apps: What do End-Users and Practitioners Think?". In this study, we conducted a mixed-methods empirical study, which collected data through 13 semi-structured interviews with Bangladeshi agriculture mobile app practitioners and 4 focus groups with 20 Bangladeshi female farmers. Our aim is to identify the extent to which the existing agriculture mobile apps reflect Bangladeshi female farmers' values and to propose potential strategies to address their values in agriculture apps. There are five documents in this dataset.

Questionnaire for focus groups

Questionnaire for interviews

Examples of open coding process

Summary of member checking outcomes

Illustrative quotations for values and strategies
Dataset of the paper titled "Strategies to Embed Human Values in Mobile...
zenodo.org
Updated Jan 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anonymous; Anonymous (2025). Dataset of the paper titled "Strategies to Embed Human Values in Mobile Apps: What do End-Users and Practitioners Think?" [Dataset]. http://doi.org/10.5281/zenodo.14592224
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.14592224
Dataset updated
Jan 3, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Anonymous; Anonymous
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 3, 2025
Description
This is a dataset of the paper titled "Strategies to Embed Human Values in Mobile Apps: What do End-Users and Practitioners Think?". In this study, we conducted a mixed-methods empirical study, which collected data through 13 semi-structured interviews with Bangladeshi agriculture mobile app practitioners and 4 focus groups with 20 Bangladeshi female farmers. Our aim is to identify the extent to which the existing agriculture mobile apps reflect Bangladeshi female farmers' values and to propose potential strategies to address their values in agriculture apps. There are four documents in this dataset.

Questionnaire for focus groups

Questionnaire for interviews

Examples of open coding process

Summary of member checking outcomes

Illustrative quotations for values and strategies

Dataset about user privacy treatment by mobile applications

zenodo.org

zip

Updated Nov 9, 2020

Facebook

Twitter

Click to copy link

Link copied

Cite

Molina L.M.; Molina L.M. (2020). Dataset about user privacy treatment by mobile applications [Dataset]. http://doi.org/10.5281/zenodo.4261664

Explore at:

zipAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.4261664

Dataset updated

Nov 9, 2020

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Molina L.M.; Molina L.M.

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

With academical purposes for the Master in Data Science at UOC, this data extraction project is carried out using Web Scraping techniques on the Exodus-Privacy website, which is dedicated to analyze security and privacy aspects in Android applications. The dataset about user privacy treatment by mobile applications, provides information on trackers that have been included in the application and the device permissions that the user must accept at the time of installation. In addition, it provides more interesting application features for analytical processing of mobile applications.

Dataframe files:
 · exodus.zip: Contains de icon attribute within the dataset file exodus.json (3G) in a [RGBA] 32x32 list format.
 · exodusNoIcon.zip: Contains de dataset file exodusNoIcon.json (100M) with 153.373 png files. Each file is named with the Id attribute within the dataset file.

Dataframe attributes:

{
  "id": {
    "Id": id,
    "Name": "name",
    "Tracker_count": trackersCount,
    "Permissions_count": permissionsCount,
    "Version": "version",
    "Downloads": "downloads",
    "Analysis_date": "analysisDate",
    "Trackers": [
      {
        "Tracker Name": [
          "trackerPurpose"
        ]
      }
    ],
    "Permissions": [
      "permission",
    ],
    "Permissions_warning_count": permissionWarningCount,
    "Developer": "developer",
    "Country": "country",
    "Icon": [
      [
        R,
        G,
        B,
        A
      ]
    ]
  }
}

Mobile internet users worldwide 2020-2029
statista.com
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista Research Department (2025). Mobile internet users worldwide 2020-2029 [Dataset]. https://www.statista.com/topics/779/mobile-internet/
Explore at:
Dataset updated
Feb 5, 2025
Dataset provided by
Statistahttp://statista.com/
Authors
Statista Research Department
Description
The global number of smartphone users in was forecast to continuously increase between 2024 and 2029 by in total 1.8 billion users (+42.62 percent). After the ninth consecutive increasing year, the smartphone user base is estimated to reach 6.1 billion users and therefore a new peak in 2029. Notably, the number of smartphone users of was continuously increasing over the past years.Smartphone users here are limited to internet users of any age using a smartphone. The shown figures have been derived from survey data that has been processed to estimate missing demographics.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of smartphone users in countries like Australia & Oceania and Asia.
Automated Insights Dataset (AID) and User Interface Depth Dataset (UID)
zenodo.org
data.niaid.nih.gov
Updated Mar 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jonathan Cesar Kuspil; Jonathan Cesar Kuspil; João Vitor Souza Ribeiro; João Vitor Souza Ribeiro; Gislaine Camila Lapasini Leal; Gislaine Camila Lapasini Leal; Guilherme Corredato Guerino; Guilherme Corredato Guerino; Renato Balancieri; Renato Balancieri (2024). Automated Insights Dataset (AID) and User Interface Depth Dataset (UID) [Dataset]. http://doi.org/10.5281/zenodo.10676845
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.10676845
Dataset updated
Mar 19, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Jonathan Cesar Kuspil; Jonathan Cesar Kuspil; João Vitor Souza Ribeiro; João Vitor Souza Ribeiro; Gislaine Camila Lapasini Leal; Gislaine Camila Lapasini Leal; Guilherme Corredato Guerino; Guilherme Corredato Guerino; Renato Balancieri; Renato Balancieri
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Nov 26, 2023
Description
The Automated Insights Dataset (AID) brings metadata from the 200 most downloaded free apps from each of the 32 categories on the Google Play Store, totaling 6400 apps, with information that goes beyond that presented by app stores, also bringing metadata from AppBrain. The User Interface Depth Dataset (UID) brings a high-quality sampling of the AID, and delves into the identification of 7540 components of 50 component types and the capture of 1948 screenshots of the interface of 400 apps. The component set was based on components of Google Material Design and Android Studio.

The datasets can be viewed in the spreadsheets named "Automated Insights Dataset (AID).xlsx" and "User Interface Depth Dataset (UID).xlsx".

The "UID - Screenshots.zip" file contains screenshots of the apps present in the UID, organized in folders by app IDs.

The "Source code of the developed tools.zip" file contains Python codes and complementary files used to collect the datasets.

The "Discarded apps.zip" file contains the apps discarded in the analysis, it presents screenshots of some apps, collected elements and the reasons that led to these apps being discarded.

The "Data explanation.zip" file contains graphical representations of the UID components and textual representations of each data present in the UID and AID, allowing a better understanding of the criteria used.
Mobile internet usage reach in North America 2020-2029
statista.com
Updated Feb 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista Research Department (2025). Mobile internet usage reach in North America 2020-2029 [Dataset]. https://www.statista.com/topics/779/mobile-internet/
Explore at:
Dataset updated
Feb 5, 2025
Dataset provided by
Statistahttp://statista.com/
Authors
Statista Research Department
Description
The population share with mobile internet access in North America was forecast to increase between 2024 and 2029 by in total 2.9 percentage points. This overall increase does not happen continuously, notably not in 2028 and 2029. The mobile internet penetration is estimated to amount to 84.21 percent in 2029. Notably, the population share with mobile internet access of was continuously increasing over the past years.The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the population share with mobile internet access in countries like Caribbean and Europe.
e
The manifest and store data of 870,515 Android mobile applications - Dataset...
b2find.eudat.eu
Updated Oct 23, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). The manifest and store data of 870,515 Android mobile applications - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/b25ee20e-5268-50ae-9914-4bc70bd4ff1c
Explore at:
Dataset updated
Oct 23, 2023
Description
We built a crawler to collect data from the Google Play store including the application's metadata and APK files. The manifest files were extracted from the APK files and then processed to extract the features. The data set is composed of 870,515 records/apps, and for each app we produced 48 features. The data set was used to built and test two bootstrap aggregating of multiple XGBoost machine learning classifiers. The dataset were collected between April 2017 and November 2018. We then checked the status of these applications on three different occasions; December 2018, February 2019, and May-June 2019.
Coronavirus-themed Mobile Apps (Malware) Dataset
zenodo.org
explore.openaire.eu
Updated Apr 21, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
covid19apps; covid19apps (2021). Coronavirus-themed Mobile Apps (Malware) Dataset [Dataset]. http://doi.org/10.5281/zenodo.3875976
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.3875976
Dataset updated
Apr 21, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
covid19apps; covid19apps
Description
As COVID-19 continues to spread across the world, a growing number of malicious campaigns are exploiting the pandemic. It is reported that COVID-19 is being used in a variety of online malicious activities, including Email scam, ransomware and malicious domains. As the number of the afflicted cases continue to surge, malicious campaigns that use coronavirus as a lure are increasing. Malicious developers take advantage of this opportunity to lure mobile users to download and install malicious apps.

However, besides a few media reports, the coronavirus-themed mobile malware has not been well studied. Our community lacks of the comprehensive understanding of the landscape of the coronavirus-themed mobile malware, and no accessible dataset could be used by our researchers to boost COVID-19 related cybersecurity studies.

We make efforts to create a daily growing COVID-19 related mobile app dataset. By the time of mid-November, we have curated a dataset of 4,322 COVID-19 themed apps, and 611 of them are considered to be malicious. The number is growing daily and our dataset will update weekly. For more details, please visit https://covid19apps.github.io

This dataset includes the following files:

(1) covid19apps.xlsx

In this file, we list all the COVID-19 themed apps information, including apk file hashes, released date, package name, AV-Rank, etc.

(2)covid19apps.zip

We put the COVID-19 themed apps Apk samples in zip files . In order to reduce the size of a single file, we divide the sample into multiple zip files for storage. And the APK file name after the file SHA256.

If your papers or articles use our dataset, please use the following bibtex reference to cite our paper: https://arxiv.org/abs/2005.14619

(Accepted to Empirical Software Engineering)

@misc{wang2021virus, title={Beyond the Virus: A First Look at Coronavirus-themed Mobile Malware}, author={Liu Wang and Ren He and Haoyu Wang and Pengcheng Xia and Yuanchun Li and Lei Wu and Yajin Zhou and Xiapu Luo and Yulei Sui and Yao Guo and Guoai Xu}, year={2021}, eprint={2005.14619}, archivePrefix={arXiv}, primaryClass={cs.CR} }

Facebook

Twitter

Click to copy link

Link copied

Cite

Onur Gunes (2021). RICO dataset [Dataset]. https://www.kaggle.com/datasets/onurgunes1993/rico-dataset

RICO dataset

A Mobile App Dataset for Building Data-Driven Design Applications

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Dec 2, 2021

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Onur Gunes

Description

Context

Data-driven models help mobile app designers understand best practices and trends, and can be used to make predictions about design performance and support the creation of adaptive UIs. This paper presents Rico, the largest repository of mobile app designs to date, created to support five classes of data-driven applications: design search, UI layout generation, UI code generation, user interaction modeling, and user perception prediction. To create Rico, we built a system that combines crowdsourcing and automation to scalably mine design and interaction data from Android apps at runtime. The Rico dataset contains design data from more than 9.3k Android apps spanning 27 categories. It exposes visual, textual, structural, and interactive design properties of more than 66k unique UI screens. To demonstrate the kinds of applications that Rico enables, we present results from training an autoencoder for UI layout similarity, which supports query-by-example search over UIs.

Content

Rico was built by mining Android apps at runtime via human-powered and programmatic exploration. Like its predecessor ERICA, Rico’s app mining infrastructure requires no access to — or modification of — an app’s source code. Apps are downloaded from the Google Play Store and served to crowd workers through a web interface. When crowd workers use an app, the system records a user interaction trace that captures the UIs visited and the interactions performed on them. Then, an automated agent replays the trace to warm up a new copy of the app and continues the exploration programmatically, leveraging a content-agnostic similarity heuristic to efficiently discover new UI states. By combining crowdsourcing and automation, Rico can achieve higher coverage over an app’s UI states than either crawling strategy alone. In total, 13 workers recruited on UpWork spent 2,450 hours using apps on the platform over five months, producing 10,811 user interaction traces. After collecting a user trace for an app, we ran the automated crawler on the app for one hour.

Acknowledgements

UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN https://interactionmining.org/rico

Inspiration

The Rico dataset is large enough to support deep learning applications. We trained an autoencoder to learn an embedding for UI layouts, and used it to annotate each UI with a 64-dimensional vector representation encoding visual layout. This vector representation can be used to compute structurally — and often semantically — similar UIs, supporting example-based search over the dataset. To create training inputs for the autoencoder that embed layout information, we constructed a new image for each UI capturing the bounding box regions of all leaf elements in its view hierarchy, differentiating between text and non-text elements. Rico’s view hierarchies obviate the need for noisy image processing or OCR techniques to create these inputs.

Clear search

Close search

Google apps

Main menu

RICO dataset

Context

Content

Acknowledgements

Inspiration

Google Play Store Apps

Context

Content

Acknowledgements

Inspiration

Unlocking User Sentiment: The App Store Reviews Dataset

Multilingual Mobile App Review Dataset August 2025

Multilingual Mobile App Reviews Dataset 2025

Overview

Dataset Statistics

Column Specifications

Language Distribution

Application Categories

Popular Apps Included

Geographic Distribution

Data Quality Features

Use Cases

Data Cleaning Opportunities

Data from: MobileViews

App Store Data (2025)

IOS App Store reviews dataset

Data from: Testing of Mobile Applications in the Wild: A Large-Scale...

Worldwide Mobile App User Behavior Dataset

Dataset.

App Downloads Data (2025)

IOS application reviews dataset in English

Dataset of the paper titled "Strategies to Embed Human Values in Mobile...

Dataset of the paper titled "Strategies to Embed Human Values in Mobile...

Dataset about user privacy treatment by mobile applications

Mobile internet users worldwide 2020-2029

Automated Insights Dataset (AID) and User Interface Depth Dataset (UID)

Mobile internet usage reach in North America 2020-2029

The manifest and store data of 870,515 Android mobile applications - Dataset...

Coronavirus-themed Mobile Apps (Malware) Dataset

RICO dataset

A Mobile App Dataset for Building Data-Driven Design Applications

Context

Content

Acknowledgements

Inspiration