100+ datasets found
  1. User mobile app interaction data

    • kaggle.com
    zip
    Updated Jan 15, 2025
  2. Worldwide Mobile App User Behavior Dataset

    • kaggle.com
    • dataverse.harvard.edu
    zip
    Updated Dec 6, 2023
  3. Mobile App Usage Pattern Analysis by Category

    • kaggle.com
    zip
    Updated May 17, 2025
  4. Data from: MobileViews

    • huggingface.co
    Updated Sep 22, 2024
  5. Screen Time and App Usage Dataset (iOS/Android)

    • kaggle.com
    zip
    Updated Apr 19, 2025
  6. LSApp: Large dataset of Sequential mobile App usage

    • ieee-dataport.org
    Updated Feb 24, 2025
  7. Google Play Store Apps Dataset

    • kaggle.com
    zip
    Updated Oct 30, 2024
  8. Unlocking User Sentiment: The App Store Reviews Dataset

    • crawlfeeds.com
    json, zip
    Updated Jun 20, 2025
  9. Android permissions dataset, Android Malware and benign Application Data set...

    • data.mendeley.com
    Updated Mar 4, 2020
  10. Mobile App Data Alternative Data

    • paradoxintelligence.com
    Updated Sep 5, 2025
  11. Data collection among global most privacy demanding mobile iOS apps 2023, by...

    • statista.com
    Updated Jan 8, 2026
  12. User Feedback Data from the Top 15 Mobile Apps

    • kaggle.com
    zip
    Updated Mar 4, 2024
  13. Mobile Device Usage and User Behavior Dataset

    • gts.ai
    json, csv, excel
    Updated Jan 9, 2025
  14. HUQ aggregated in-app location dataset

    • data.geods.ac.uk
    csv, html
    Updated May 8, 2025
  15. ITC-Net-MingledApp: A comprehensive dataset of mixed mobile application...

    • data.mendeley.com
    Updated Oct 7, 2024
  16. Frappe-mobile-app-usage

    • huggingface.co
    Updated May 12, 2015
  17. mobilerec

    • huggingface.co
    Updated Feb 21, 2023
  18. Android Hybrid Apps Dataset

    • data.mendeley.com
    Updated Jul 19, 2021
  19. IOS application reviews dataset in English

    • crawlfeeds.com
    csv, zip
    Updated Jul 8, 2025
  20. Data collection among global least privacy demanding mobile iOS apps 2023,...

    • statista.com
    Updated Jan 8, 2026
Cite
Mohamed Moslemani (2025). User mobile app interaction data [Dataset]. https://www.kaggle.com/datasets/mohamedmoslemani/user-mobile-app-interaction-data

User mobile app interaction data

Generated data on user interactions with a mobile application

Available download formats: zip (6809111 bytes)
Dataset updated
Jan 15, 2025
Authors
Mohamed Moslemani
License

MIT License (https://opensource.org/licenses/MIT)
License information was derived automatically

Description

This dataset has been artificially generated to mimic real-world user interactions within a mobile application. It contains 100,000 rows, each representing a single event or action performed by a synthetic user. The dataset was designed to capture many of the attributes commonly tracked by app analytics platforms, such as device details, network information, user demographics, session data, and event-level interactions.

Key Features Included

User & Session Metadata

• User ID: A unique integer identifier for each synthetic user.
• Session ID: Randomly generated session identifiers (e.g., S-123456), capturing the concept of user sessions.
• IP Address: Fake IP addresses generated via Faker to simulate different network origins.
• Timestamp: Randomized timestamps (within the last 30 days) indicating when each interaction occurred.
• Session Duration: An approximate measure (in seconds) of how long a user remained active.

Device & Technical Details

• Device OS & OS Version: Simulated operating systems (Android/iOS) with plausible version numbers.
• Device Model: Common phone models (e.g., “Samsung Galaxy S22,” “iPhone 14 Pro,” etc.).
• Screen Resolution: Typical screen resolutions found in smartphones (e.g., “1080x1920”).
• Network Type: Indicates whether the user was on Wi-Fi, 5G, 4G, or 3G.

Location & Locale

• Location Country & City: Random global locations generated using Faker.
• App Language: Represents the user’s app language setting (e.g., “en,” “es,” “fr,” etc.).

User Properties

• Battery Level: The phone’s battery level as a percentage (0–100).
• Memory Usage (MB): Approximate memory consumption at the time of the event.
• Subscription Status: Boolean flag indicating if the user is subscribed to a premium service.
• User Age: Random integer ranging from teenagers to seniors (13–80).
• Phone Number: Fake phone numbers generated via Faker.
• Push Enabled: Boolean flag indicating if the user has push notifications turned on.

Event-Level Interactions

• Event Type: The action taken by the user (e.g., “click,” “view,” “scroll,” “like,” “share,” etc.).
• Event Target: The UI element or screen component interacted with (e.g., “home_page_banner,” “search_bar,” “notification_popup”).
• Event Value: A numeric field indicating additional context for the event (e.g., intensity, count, rating).
• App Version: Simulated version identifier for the mobile application (e.g., “4.2.8”).

Data Quality & “Noise”

To better approximate real-world data, 1% of all fields have been intentionally “corrupted” or altered:

• Typos and Misspellings: Random single-character edits, e.g., “Andro1d” instead of “Android.”
• Missing Values: Some cells might be blank (None) to reflect dropped or unrecorded data.
• Random String Injections: Occasional random alphanumeric strings inserted where they don’t belong.

These intentional discrepancies can help data scientists practice data cleaning, outlier detection, and data wrangling techniques.
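
As a rough illustration of how the three kinds of noise described above might be handled for a categorical field, the sketch below maps each raw value to its closest known category with Python's standard `difflib` module. The list `KNOWN_OS`, the function `normalize_category`, and the 0.8 similarity cutoff are assumptions made for this example and are not part of the dataset or its documentation.

```python
import difflib
from typing import Optional

# Assumed canonical values; the description names only Android and iOS.
KNOWN_OS = ["Android", "iOS"]

def normalize_category(value: Optional[str], known: list[str],
                       cutoff: float = 0.8) -> Optional[str]:
    """Map a possibly corrupted string to its closest known value, or None."""
    if value is None or not value.strip():
        return None  # missing value: leave it for a later imputation step
    matches = difflib.get_close_matches(value.strip(), known, n=1, cutoff=cutoff)
    # No sufficiently close match: treat the value as an injected random string.
    return matches[0] if matches else None

# Hypothetical raw values covering a typo, missing cells, and a random string.
raw = ["Android", "Andro1d", None, "iOS", "x9Qz7Lk2", ""]
cleaned = [normalize_category(v, KNOWN_OS) for v in raw]
print(cleaned)
```

A fixed cutoff is a simplification: short category names such as “iOS” leave little room for a single-character edit before the similarity drops below it, so the threshold may need tuning per column.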

Usage & Applications

• Data Cleaning & Preprocessing: Ideal for practicing how to handle missing values, inconsistent data, and noise in a realistic scenario.
• Analytics & Visualization: Demonstrate user interaction funnels, session durations, usage by device/OS, etc.
• Machine Learning & Modeling: Suitable for building classification or clustering models (e.g., user segmentation, event classification).
• Simulation for Feature Engineering: Experiment with deriving new features (e.g., session frequency, average battery drain, etc.).
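
To make the feature-engineering use case more concrete, here is a minimal sketch that derives two per-user features from event rows: the number of distinct sessions and the mean session duration. The field names `user_id`, `session_id`, and `session_duration` are hypothetical stand-ins for the dataset's actual column names, and the sample rows are invented for the example.

```python
from collections import defaultdict

def session_features(events):
    """Return {user_id: (distinct session count, mean session duration in seconds)}."""
    durations = defaultdict(dict)  # user_id -> {session_id: duration}
    for e in events:
        uid = e.get("user_id")
        sid = e.get("session_id")
        dur = e.get("session_duration")
        # Skip rows whose key fields are missing or whose duration is not numeric.
        if uid is None or sid is None or not isinstance(dur, (int, float)):
            continue
        durations[uid][sid] = dur  # one duration per session (last value seen)
    return {uid: (len(s), sum(s.values()) / len(s)) for uid, s in durations.items()}

# Invented sample rows; the last one mimics a corrupted (missing) duration.
events = [
    {"user_id": 1, "session_id": "S-100001", "session_duration": 120},
    {"user_id": 1, "session_id": "S-100001", "session_duration": 120},
    {"user_id": 1, "session_id": "S-100002", "session_duration": 60},
    {"user_id": 2, "session_id": "S-100003", "session_duration": 30},
    {"user_id": 2, "session_id": "S-100004", "session_duration": None},
]
features = session_features(events)
print(features)
```

Rows with unusable values are dropped here rather than imputed, which keeps the sketch short; on the real file, with roughly 1% of fields corrupted, imputing or repairing values first may preserve more sessions.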

Important Notes & Disclaimer

• Synthetic Data: All entries (users, device info, IPs, phone numbers, etc.) are artificially generated and do not correspond to real individuals.
• Privacy & Compliance: Since no real personal data is present, there are no direct privacy concerns. However, always handle synthetic data ethically.
