41 datasets found
  1. i

    Mobile vs Desktop Usage Statistics 2025

    • innersparkcreative.com
    html
    Updated Sep 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Inner Spark Creative (2025). Mobile vs Desktop Usage Statistics 2025 [Dataset]. https://www.innersparkcreative.com/news/mobile-vs-desktop-usage-statistics-2025-verified
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Sep 3, 2025
    Dataset authored and provided by
    Inner Spark Creative
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Verified dataset of 2025 device usage: share of global web traffic, mobile commerce share of transactions, US daily time spent, app vs web breakdown, and tablet decline.

  2. Mobile internet users worldwide 2020-2029

    • statista.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Mobile internet users worldwide 2020-2029 [Dataset]. https://www.statista.com/topics/779/mobile-internet/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    The global number of smartphone users in was forecast to continuously increase between 2024 and 2029 by in total 1.8 billion users (+42.62 percent). After the ninth consecutive increasing year, the smartphone user base is estimated to reach 6.1 billion users and therefore a new peak in 2029. Notably, the number of smartphone users of was continuously increasing over the past years.Smartphone users here are limited to internet users of any age using a smartphone. The shown figures have been derived from survey data that has been processed to estimate missing demographics.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of smartphone users in countries like Australia & Oceania and Asia.

  3. Mobile internet usage reach in North America 2020-2029

    • statista.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Mobile internet usage reach in North America 2020-2029 [Dataset]. https://www.statista.com/topics/779/mobile-internet/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    The population share with mobile internet access in North America was forecast to increase between 2024 and 2029 by in total 2.9 percentage points. This overall increase does not happen continuously, notably not in 2028 and 2029. The mobile internet penetration is estimated to amount to 84.21 percent in 2029. Notably, the population share with mobile internet access of was continuously increasing over the past years.The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the population share with mobile internet access in countries like Caribbean and Europe.

  4. m

    ITC-Net-Blend-60: A Comprehensive Dataset for Robust Mobile App...

    • data.mendeley.com
    Updated Nov 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marziyeh Bayat (2023). ITC-Net-Blend-60: A Comprehensive Dataset for Robust Mobile App Identification in Real-World Network Environment - Scenario A [Dataset]. http://doi.org/10.17632/ssv23kfcgs.1
    Explore at:
    Dataset updated
    Nov 15, 2023
    Authors
    Marziyeh Bayat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    World
    Description

    This dataset includes network traffic data from more than 50 Android applications across 5 different scenarios. The applications are consistent in all scenarios, but other factors like location, device, and user vary (see Table 2 in the paper). The current repository pertains to Scenario A. Within the repository, for each application, there is a compressed file containing the relevant PCAP files. The PCAP files follow the naming convention: {Application Name}{Scenario ID}{#Trace}_Final.pcap.

  5. d

    Swash Shopping & eCommerce Data - 1.5M Worldwide Users - GDPR Compliant

    • datarade.ai
    .csv, .xls
    Updated Jun 27, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Swash (2023). Swash Shopping & eCommerce Data - 1.5M Worldwide Users - GDPR Compliant [Dataset]. https://datarade.ai/data-products/swash-consumer-loyalty-rewards-audience-cryptocurrency-swash
    Explore at:
    .csv, .xlsAvailable download formats
    Dataset updated
    Jun 27, 2023
    Dataset authored and provided by
    Swash
    Area covered
    Costa Rica, Armenia, Bangladesh, Dominica, Bhutan, Hungary, Montserrat, Indonesia, France, Austria
    Description

    Unlock the Power of Behavioural Data with GDPR-Compliant Clickstream Insights.

    Swash clickstream data offers a comprehensive and GDPR-compliant dataset sourced from users worldwide, encompassing both desktop and mobile browsing behaviour. Here's an in-depth look at what sets us apart and how our data can benefit your organisation.

    User-Centric Approach: Unlike traditional data collection methods, we take a user-centric approach by rewarding users for the data they willingly provide. This unique methodology ensures transparent data collection practices, encourages user participation, and establishes trust between data providers and consumers.

    Wide Coverage and Varied Categories: Our clickstream data covers diverse categories, including search, shopping, and URL visits. Whether you are interested in understanding user preferences in e-commerce, analysing search behaviour across different industries, or tracking website visits, our data provides a rich and multi-dimensional view of user activities.

    GDPR Compliance and Privacy: We prioritise data privacy and strictly adhere to GDPR guidelines. Our data collection methods are fully compliant, ensuring the protection of user identities and personal information. You can confidently leverage our clickstream data without compromising privacy or facing regulatory challenges.

    Market Intelligence and Consumer Behaviuor: Gain deep insights into market intelligence and consumer behaviour using our clickstream data. Understand trends, preferences, and user behaviour patterns by analysing the comprehensive user-level, time-stamped raw or processed data feed. Uncover valuable information about user journeys, search funnels, and paths to purchase to enhance your marketing strategies and drive business growth.

    High-Frequency Updates and Consistency: We provide high-frequency updates and consistent user participation, offering both historical data and ongoing daily delivery. This ensures you have access to up-to-date insights and a continuous data feed for comprehensive analysis. Our reliable and consistent data empowers you to make accurate and timely decisions.

    Custom Reporting and Analysis: We understand that every organisation has unique requirements. That's why we offer customisable reporting options, allowing you to tailor the analysis and reporting of clickstream data to your specific needs. Whether you need detailed metrics, visualisations, or in-depth analytics, we provide the flexibility to meet your reporting requirements.

    Data Quality and Credibility: We take data quality seriously. Our data sourcing practices are designed to ensure responsible and reliable data collection. We implement rigorous data cleaning, validation, and verification processes, guaranteeing the accuracy and reliability of our clickstream data. You can confidently rely on our data to drive your decision-making processes.

  6. e

    Mobile Data Collection - Incentive Experiment - Dataset - B2FIND

    • b2find.eudat.eu
    Updated May 12, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Mobile Data Collection - Incentive Experiment - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/b68a3e41-6c2c-52df-a0fe-c7c25edc3305
    Explore at:
    Dataset updated
    May 12, 2019
    Description

    Ziel dieser Studie war es, den Einfluss verschiedener Anreizsysteme auf die Bereitschaft zur Teilnahme an der passiven mobilen Datenerfassung unter deutschen Smartphone-Besitzern experimentell zu messen. Die Daten stammen aus einer Webumfrage unter deutschen Smartphone-Nutzern ab 18 Jahren, die aus einem deutschen, nicht wahrscheinlichen Online-Panel rekrutiert wurden. Im Dezember 2017 beantworteten 1.214 Teilnehmer einen Fragebogen zu den Themen Smartphone-Nutzung und -Fähigkeiten, Datenschutz und Sicherheit, allgemeine Einstellungen gegenüber der Umfrageforschung und Forschungseinrichtungen. Darüber hinaus enthielt der Fragebogen ein Experiment zur Bereitschaft, an der mobilen Datenerhebung unter verschiedenen Anreizbedingungen teilzunehmen. Themen: Besitz von Smartphone, Handy, Desktop- oder Laptop-Computer, Tablet-Computer und/oder E-Book-Reader; Art des Smartphones; Bereitschaft zur Teilnahme an der mobilen Datenerfassung unter verschiedenen Anreizbedingungen; Wahrscheinlichkeit des Herunterladens der App zur Teilnahme an dieser Forschungsstudie; Befragter möchte lieber an der Studie teilnehmen, wenn er 100 Euro erhalten könnte; Gesamtbetrag, den der Befragte für die Teilnahme an der Studie verdienen müsste (offene Antwort); Grund, warum der Befragte nicht an der Forschungsstudie teilnehmen würde; Bereitschaft zur Teilnahme an der Studie für einen Anreiz von insgesamt 60 Euro; Bereitschaft zur Aktivierung verschiedener Funktionen beim Herunterladen der App (Interaktionshistorie, Smartphone-Nutzung, Merkmale des sozialen Netzwerks, Netzqualitäts- und Standortinformationen, Aktivitätsdaten); vorherige Einladung zum Herunterladen der Forschungs-App; Herunterladen der Forschungs-App; Häufigkeit der Nutzung des Smartphones; Smartphone-Aktivitäten (Browsen, E-Mails, Fotografieren, Anzeigen/Post-Social-Media-Inhalte, Einkaufen, Online-Banking, Installieren von Apps, Verwenden von GPS-fähigen Apps, Verbinden über Bluethooth, Spielen, Streaming von Musik/Videos); Selbsteinschätzung der Kompetenz im Umgang mit dem Smartphone; Einstellung zu Umfragen und Teilnahme an Forschungsstudien (persönliches Interesse, Zeitverlust, Verkaufsgespräch, interessante Erfahrung, nützlich); Vertrauen in Institutionen zum Datenschutz (Marktforschungsunternehmen, Universitätsforscher, Regierungsbehörden wie das Statistische Bundesamt, Mobilfunkanbieter, App-Unternehmen, Kreditkartenunternehmen, Online-Händler und Social-Media-Plattformen); allgemeine Datenschutzbedenken; Gefühl der Datenschutzverletzung durch Banken und Kreditkartenunternehmen, Steuerbehörden, Regierungsbehörden, Marktforschung, soziale Netzwerke, Apps und Internetbrowser; Bedenken zur Datensicherheit bei Smartphone-Aktivitäten für Forschungszwecke (Online-Umfrage, Umfrage-Apps, Forschungs-Apps, SMS-Umfrage, Kamera, Aktivitätsdaten, GPS-Ortung, Bluetooth). Demographie: Geschlecht, Alter; Bundesland; höchster Schulabschluss; höchstes berufliches Bildungsniveau. Zusätzlich verkodet wurden: laufende Nummer; Dauer (Reaktionszeit in Sekunden); Gerätetyp, mit dem der Fragebogen ausgefüllt wurde. The goal of this study was to experimentally measure the influence of different incentive schemes on the willingness to participate in passive mobile data collection among German smartphone owners. The data come from a web survey among German smartphone users 18 years and older who were recruited from a German nonprobability online panel. In December 2017, 1,214 respondents completed a questionnaire on smartphone use and skills, privacy and security concerns, general attitudes towards survey research and research institutions. In addition, the questionnaire included an experiment on the willingness to participate in mobile data collection under different incentive conditions. Topics: Ownership of smartphone, cell phone, desktop or laptop computer, tablet computer, and/or e-book reader; type of smartphone; willingness to participate in mobile data collection under different incentive conditions; likelihood of downloading the app to particiapte in this research study; respondent would rather participate in the study if he could receive 100 euros; total amount to be earned for the respondent ot participate in the study (open answer); reason why the respondent wouldn´t participate in the research study; willlingness to participate in the study for an incentive of 60 euros in total; willingness to activate different functions when downloading the app (interaction history, smartphone usage, charateristics of the social network, network quality and location information, activity data); previous invitation for research app download; research app download; frequency of smartphone use; smartphone activities (browsing, e-mails, taking pictures, view/ post social media content, shopping, online banking, installing apps, using GPS-enabled apps, connecting via Bluethooth, playing games, stream music/ videos); self-assessment of smartphone skills; attitude towards surveys and participaton at research studies (personal interest, waste of time, sales pitch, interesting experience, useful); trust in institutions regarding data privacy (market research companies, university researchers, government authorities such as the Federal Statistical Office, mobile service provider, app companies, credit card companies, online retailer, and social media platforms); general privacy concern; feeling of privacy violation by banks and credit card companies, tax authorities, government agencies, market research, social networks, apps, and internet browsers; concern regarding data security with smartphone activities for research purposes (online survey, survey apps, research apps, SMS survey, camera, activity data, GPS location, Bluetooth). Demography: sex, age; federal state; highest level of school education; highest level of vocational education. Additionally coded was: running number; duration (response time in seconds); device type used to fill out the questionnaire.

  7. 201 Hours - English(North America) Scripted Monologue Smartphone and PC...

    • nexdata.ai
    • m.nexdata.ai
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2023). 201 Hours - English(North America) Scripted Monologue Smartphone and PC speech dataset [Dataset]. https://www.nexdata.ai/datasets/speechrecog/33
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset authored and provided by
    Nexdata
    Area covered
    North America
    Variables measured
    Format, Country, Speaker, Language, Accuracy Rate, Content category, Recording device, Recording condition, Language(Region) Code, Features of annotation
    Description

    English(North America) Scripted Monologue Smartphone and PC speech dataset, collected from monologue based on given scripts, covering common expressions. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(302 North American), geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

  8. Mobile internet penetration in Europe 2024, by country

    • statista.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Mobile internet penetration in Europe 2024, by country [Dataset]. https://www.statista.com/topics/779/mobile-internet/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    Switzerland is leading the ranking by population share with mobile internet access, recording 95.06 percent. Following closely behind is Ukraine with 95.06 percent, while Moldova is trailing the ranking with 46.83 percent, resulting in a difference of 48.23 percentage points to the ranking leader, Switzerland. The penetration rate refers to the share of the total population having access to the internet via a mobile broadband connection. The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).

  9. f

    Table 1_When the phone’s away, people use their computer to play: distance...

    • datasetcatalog.nlm.nih.gov
    Updated Mar 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Heitmayer, Maxi (2025). Table 1_When the phone’s away, people use their computer to play: distance to the smartphone reduces device usage but not overall distraction and task fragmentation during work.docx [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002068443
    Explore at:
    Dataset updated
    Mar 28, 2025
    Authors
    Heitmayer, Maxi
    Description

    The smartphone helps workers balance the demands of their professional and personal lives but can also be a distraction, affecting productivity, wellbeing, and work-life balance. Drawing from insights on the impact of physical environments on object engagement, this study examines how the distance between the smartphone and the user influences interactions in work contexts. Participants (N = 22) engaged in two 5h knowledge work sessions on the computer, with the smartphone placed outside their immediate reach during one session. Results show that limited smartphone accessibility led to reduced smartphone use, but participants shifted non-work activities to the computer and the time they spent on work and leisure activities overall remained unchanged. These findings suggest that discussions on smartphone disruptiveness in work contexts should consider the specific activities performed, challenging narratives of ‘smartphone addiction’ and ‘smartphone overuse’ as the cause of increased disruptions and lowered work productivity.

  10. K-EmoPhone, A Mobile and Wearable Dataset with In-Situ Emotion, Stress, and...

    • zenodo.org
    Updated Jun 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Soowon Kang; Soowon Kang; Woohyeok Choi; Cheul Young Park; Narae Cha; Auk Kim; Ahsan Habib Khandoker; Leontios Hadjileontiadis; Heepyung Kim; Yong Jeong; Uichin Lee; Woohyeok Choi; Cheul Young Park; Narae Cha; Auk Kim; Ahsan Habib Khandoker; Leontios Hadjileontiadis; Heepyung Kim; Yong Jeong; Uichin Lee (2023). K-EmoPhone, A Mobile and Wearable Dataset with In-Situ Emotion, Stress, and Attention Labels [Dataset]. http://doi.org/10.5281/zenodo.6851298
    Explore at:
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Soowon Kang; Soowon Kang; Woohyeok Choi; Cheul Young Park; Narae Cha; Auk Kim; Ahsan Habib Khandoker; Leontios Hadjileontiadis; Heepyung Kim; Yong Jeong; Uichin Lee; Woohyeok Choi; Cheul Young Park; Narae Cha; Auk Kim; Ahsan Habib Khandoker; Leontios Hadjileontiadis; Heepyung Kim; Yong Jeong; Uichin Lee
    Description

    ABSTRACT: With the popularization of low-cost mobile and wearable sensors, many prior studies used such sensors to track and analyze people's mental well-being, productivity, and behavioral patterns. However, there is a lack of open datasets collected in real-world contexts with affective and cognitive state labels such as emotion, stress, and attention. This limits the advances in affective computing and human-computer interaction research. In this work, we present K-EmoPhone, an in-the-wild naturalistic dataset (n=80, 1-week) of smartphone use, wearable sensing, and self-reported affect states from college students. The dataset contains continuous probing of peripheral physiological signals and mobility data measured by off-the-shelf commercial devices in addition to context and interaction data by users' smartphones. Moreover, the dataset includes self-reports of in-situ affect states (n=5,753) such as emotion, stress level, attention level, and disturbance level, acquired by the experience sampling method. The resulting K-EmoPhone dataset helps to advance the research and development of affective computing, emotion intelligence technologies, and attention management based on mobile and wearable sensor data.

    Last update: Aug. 3, 2022

    -----------------------------

    * Version 1.0.0 (Aug. 3, 2022)

    • Added P##.zip files, where each P## means the separate participant.
    • Added SubjData.zip file, which includes individual characteristics information and labels.
  11. f

    UoS Buildings Image Dataset for Computer Vision Algorithms

    • salford.figshare.com
    application/x-gzip
    Updated Jan 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ali Alameer; Mazin Al-Mosawy (2025). UoS Buildings Image Dataset for Computer Vision Algorithms [Dataset]. http://doi.org/10.17866/rd.salford.20383155.v2
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    Jan 23, 2025
    Dataset provided by
    University of Salford
    Authors
    Ali Alameer; Mazin Al-Mosawy
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset for this project is represented by photos, photos for the buildings of the University of Salford, these photos are taken by a mobile phone camera from different angels and different distances , even though this task sounds so easy but it encountered some challenges, these challenges are summarized below: 1. Obstacles. a. Fixed or unremovable objects. When taking several photos for a building or a landscape from different angels and directions ,there are some of these angels blocked by a form of a fixed object such as trees and plants, light poles, signs, statues, cabins, bicycle shades, scooter stands, generators/transformers, construction barriers, construction equipment and any other service equipment so it is unavoidable to represent some photos without these objects included, this will raise 3 questions. - will these objects confuse the model/application we intend to create meaning will that obstacle prevent the model/application from identifying the designated building? - Or will the photos be more precise with these objects and provide the capability for the model/application to identify these building with these obstacles included? - How far is the maximum length for detection? In other words, how far will the mobile device with the application be from the building so it could or could not detect the designated building? b. Removable and moving objects. - Any University is crowded with staff and students especially in the rush hours of the day so it is hard for some photos to be taken without a personnel appearing in that photo in a certain time period of the day. But, due to privacy issues and showing respect to that person, these photos are better excluded. - Parked vehicles, trollies and service equipment can be an obstacle and might appear in these images as well as it can block access to some areas which an image from a certain angel cannot be obtained. - Animals, like dogs, cats, birds or even squirrels cannot be avoided in some photos which are entitled to the same questions above.
    2. Weather. In a deep learning project, more data means more accuracy and less error, at this stage of our project it was agreed to have 50 photos per building but we can increase the number of photos for more accurate results but due to the limitation of time for this project it was agreed for 50 per building only. these photos were taken on cloudy days and to expand our work on this project (as future works and recommendations). Photos on sunny, rainy, foggy, snowy and any other weather condition days can be included. Even photos in different times of the day can be included such as night, dawn, and sunset times. To provide our designated model with all the possibilities to identify these buildings in all available circumstances.

    1. The selected buildings. It was agreed to select 10 buildings only from the University of Salford buildings for this project with at least 50 images per building, these selected building for this project with the number of images taken are:
    2. Chapman: 74 images
    3. Clifford Whitworth Library: 60 images
    4. Cockcroft: 67 images
    5. Maxwell: 80 images
    6. Media City Campus: 92 images
    7. New Adelphi: 93 images
    8. New Science, Engineering & Environment: 78 images
    9. Newton: 92 images
    10. Sports Centre: 55 images
    11. University House: 60 images Peel building is an important figure of the University of Salford due to its distinct and amazing exterior design but unfortunately it was excluded from the selection due to some maintenance activities at the time of collecting the photos for this project as it is partially covered with scaffolding and a lot of movement by personnel and equipment. If the supervisor suggests that this will be another challenge to include in the project then, it is mandatory to collect its photos. There are many other buildings in the University of Salford and again to expand our project in the future, we can include all the buildings of the University of Salford. The full list of buildings of the university can be reviewed by accessing an interactive map on: www.salford.ac.uk/find-us

    12. Expand Further. This project can be improved furthermore with so many capabilities, again due to the limitation of time given to this project , these improvements can be implemented later as future works. In simple words, this project is to create an application that can display the building’s name when pointing a mobile device with a camera to that building. Future featured to be added: a. Address/ location: this will require collection of additional data which is the longitude and latitude of each building included or the post code which will be the same taking under consideration how close these buildings appear on the interactive map application such as Google maps, Google earth or iMaps. b. Description of the building: what is the building for, by which school is this building occupied? and what facilities are included in this building? c. Interior Images: all the photos at this stage were taken for the exterior of the buildings, will interior photos make an impact on the model/application for example, if the user is inside newton or chapman and opens the application, will the building be identified especially the interior of these buildings have a high level of similarity for the corridors, rooms, halls, and labs? Will the furniture and assets will be as obstacles or identification marks? d. Directions to a specific area/floor inside the building: if the interior images succeed with the model/application, it would be a good idea adding a search option to the model/application so it can guide the user to a specific area showing directions to that area, for example if the user is inside newton building and searches for lab 141 it will direct him to the first floor of the building with an interactive arrow that changes while the user is approaching his destination. Or, if the application can identify the building from its interior, a drop down list will be activated with each floor of this building, for example, if the model/application identifies Newton building, the drop down list will be activated and when pressing on that drop down list it will represent interactive tabs for each floor of the building, selecting one of the floors by clicking on its tab will display the facilities on that floor for example if the user presses on floor 1 tab, another screen will appear displaying which facilities are on that floor. Furthermore, if the model/application identifies another building, it should activate a different number of floors as buildings differ in the number of floors from each other. this feature can be improved with a voice assistant that can direct the user after he applies his search (something similar to the voice assistant in Google maps but applied to the interior of the university’s buildings. e. Top View: if a drone with a camera can be afforded, it can provide arial images and top views for the buildings that can be added to the model/application but these images can be similar to the interior images situation , the buildings can be similar to each other from the top with other obstacles included like water tanks and AC units.

    13. Other Questions:

    14. Will the model/application be reproducible? the presumed answer for this question should be YES, IF, the model/application will be fed with the proper data (images) such as images of restaurants, schools, supermarkets, hospitals, government facilities...etc.

  12. Microplastic Dataset for Computer Vision

    • kaggle.com
    Updated Jan 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohamadreza Momeni (2024). Microplastic Dataset for Computer Vision [Dataset]. https://www.kaggle.com/datasets/imtkaggleteam/microplastic-dataset-for-computer-vision
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 16, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mohamadreza Momeni
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    About Dataset:

    Auto-Orient: Applied

    Static Crop: 30-85% Horizontal Region, 15-85% Vertical Region

    Modify Classes: 0 remapped, 3 dropped

    Filter Null: Require all images to contain annotations.

    Use cases of this dataset:

    1- Ocean cleanup efforts: Utilize the "Microplastic Dataset" computer vision model to identify and locate microplastic pollution in ocean water samples, allowing for targeted cleanup efforts and better understanding of microplastic distribution in marine environments.

    2- Recycling facility improvements: Integrate the model into recycling facilities to identify and sort microplastic residues in materials, ensuring proper disposal or treatment to prevent their release into the environment.

    3- Microplastic research: Aid researchers in studying the impact of microplastics on ecosystems and human health by automating the detection and analysis of microplastics in various samples, such as water, soil, or air.

    4- Supply chain monitoring: Help industries monitor and evaluate their supply chain processes to identify and reduce microplastic contamination in their products or packaging materials, promoting greener manufacturing practices.

    5- Consumer education and awareness: Develop a mobile app that uses the "Microplastic Dataset" model to enable users to identify potential microplastic contamination in consumer products such as cosmetics or food packaging, encouraging more informed purchasing decisions and raising public awareness on the issue of microplastic pollution.

    Variables measured:

    MPDS Bounding Boxes

    Dataset authored and provided by:

    Panats MP Project

  13. FireSafetyNet: An Image-Based Dataset with Pretrained Weights for Machine...

    • zenodo.org
    Updated Sep 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Angelina Aziz; Angelina Aziz; Jan Hendrik Heinbach; Jan Hendrik Heinbach; Lukas Trost; Lukas Trost (2024). FireSafetyNet: An Image-Based Dataset with Pretrained Weights for Machine Learning-Driven Fire Safety Inspection [Dataset]. http://doi.org/10.5281/zenodo.13358169
    Explore at:
    Dataset updated
    Sep 20, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Angelina Aziz; Angelina Aziz; Jan Hendrik Heinbach; Jan Hendrik Heinbach; Lukas Trost; Lukas Trost
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset offers a diverse collection of images curated to support the development of computer vision models for detecting and inspecting Fire Safety Equipment (FSE) and related components. Images were collected from a variety of public buildings in Germany, including university buildings, student dormitories, and shopping malls. The dataset consists of self-captured images using mobile cameras, providing a broad range of real-world scenarios for FSE detection.

    In the journal paper associated with these image datasets, the open-source dataset FireNet (Boehm et al. 2019) was additionally utilized for training. However, to comply with licensing and distribution regulations, images from FireNet have been excluded from this dataset. Interested users can visit the FireNet repository directly to access and download those images if additional data is required. The provided weights (.pt), however, are trained on the provided self-made images and FireNet using YOLOv8.

    The dataset is organized into six sub-datasets, each corresponding to a specific FSE-related machine learning service:

    1. Service 1: FSE Detection - This sub-dataset provides the foundation for FSE inspection, focusing on the detection of primary FSE components like fire blankets, fire extinguishers, manual call points, and smoke detectors.

    2. Service 2: FSE Marking Detection - Building on the first service, this sub-dataset includes images and annotations for detecting FSE marking signs.

    3. Service 3: Condition Check - Modal - This sub-dataset addresses the inspection of FSE condition in a modal manner, focusing on instances where fire extinguishers might be blocked or otherwise non-compliant. This dataset includes semantic segmentation annotations of fire extinguishers. For upload reasons, this set is split into 3_1_FSE Condition Check_modal_train_data (containing training images and annotations) and 3_1_FSE Condition Check_modal_val_data_and_weights (containing validation images, annotations and the best weights).

    4. Service 4: Condition Check - Amodal - Extending the modal condition check, this sub-dataset involves amodal detection to identify and infer the state of FSE components even when they are partially obscured. This dataset includes semantic segmentation annotations of fire extinguishers. This dataset includes semantic segmentation annotations of fire extinguishers. For upload reasons, this set is split into 4_1_FSE Condition Check_amodal_train_data (containing training images and annotations) and 4_1_FSE Condition Check_amodal_val_data_and_weights (containing validation images, annotations and the best weights).

    5. Service 5: Details Extraction - Inspection Tags - This sub-dataset provides a detailed examination of the inspection tags on fire extinguishers. It includes annotations for extracting semantic information such as the next maintenance date, contributing to a thorough evaluation of FSE maintenance practices.

    6. Service 6: Details Extraction - Fire Classes Symbols - The final sub-dataset focuses on identifying fire class symbols on fire extinguishers.

    This dataset is intended for researchers and practitioners in the field of computer vision, particularly those engaged in building safety and compliance initiatives.

  14. H

    douyin-data

    • dataverse.harvard.edu
    Updated May 14, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ye Tian (2020). douyin-data [Dataset]. http://doi.org/10.7910/DVN/2FCUKX
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 14, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    Ye Tian
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset is collected from Douyin(Douyin is the most popular mobile instant video app in China). With the smartphone app, a user can shot, edit, and post an instant video. A user can also view, digg, comment, and repost videos on Douyin, and share them to other social medias. In the main interface of douyin app, the system will automatically recommend a series of videos to users.

  15. Z

    Data from: Login Data Set for Risk-Based Authentication

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thunem, Sigurd (2022). Login Data Set for Risk-Based Authentication [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6782155
    Explore at:
    Dataset updated
    Jun 30, 2022
    Dataset provided by
    Wiefling, Stephan
    Thunem, Sigurd
    Lo Iacono, Luigi
    Jørgensen, Paul René
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Login Data Set for Risk-Based Authentication

    Synthesized login feature data of >33M login attempts and >3.3M users on a large-scale online service in Norway. Original data collected between February 2020 and February 2021.

    This data sets aims to foster research and development for Risk-Based Authentication (RBA) systems. The data was synthesized from the real-world login behavior of more than 3.3M users at a large-scale single sign-on (SSO) online service in Norway.

    The users used this SSO to access sensitive data provided by the online service, e.g., a cloud storage and billing information. We used this data set to study how the Freeman et al. (2016) RBA model behaves on a large-scale online service in the real world (see Publication). The synthesized data set can reproduce these results made on the original data set (see Study Reproduction). Beyond that, you can use this data set to evaluate and improve RBA algorithms under real-world conditions.

    WARNING: The feature values are plausible, but still totally artificial. Therefore, you should NOT use this data set in productive systems, e.g., intrusion detection systems.

    Overview

    The data set contains the following features related to each login attempt on the SSO:

        Feature
        Data Type
        Description
        Range or Example
    
    
    
    
        IP Address
        String
        IP address belonging to the login attempt
        0.0.0.0 - 255.255.255.255
    
    
        Country
        String
        Country derived from the IP address
        US
    
    
        Region
        String
        Region derived from the IP address
        New York
    
    
        City
        String
        City derived from the IP address
        Rochester
    
    
        ASN
        Integer
        Autonomous system number derived from the IP address
        0 - 600000
    
    
        User Agent String
        String
        User agent string submitted by the client
        Mozilla/5.0 (Windows NT 10.0; Win64; ...
    
    
        OS Name and Version
        String
        Operating system name and version derived from the user agent string
        Windows 10
    
    
        Browser Name and Version
        String
        Browser name and version derived from the user agent string
        Chrome 70.0.3538
    
    
        Device Type
        String
        Device type derived from the user agent string
        (mobile, desktop, tablet, bot, unknown)1
    
    
        User ID
        Integer
        Idenfication number related to the affected user account
        [Random pseudonym]
    
    
        Login Timestamp
        Integer
        Timestamp related to the login attempt
        [64 Bit timestamp]
    
    
        Round-Trip Time (RTT) [ms]
        Integer
        Server-side measured latency between client and server
        1 - 8600000
    
    
        Login Successful
        Boolean
        True: Login was successful, False: Login failed
        (true, false)
    
    
        Is Attack IP
        Boolean
        IP address was found in known attacker data set
        (true, false)
    
    
        Is Account Takeover
        Boolean
        Login attempt was identified as account takeover by incident response team of the online service
        (true, false)
    

    Data Creation

    As the data set targets RBA systems, especially the Freeman et al. (2016) model, the statistical feature probabilities between all users, globally and locally, are identical for the categorical data. All the other data was randomly generated while maintaining logical relations and timely order between the features.

    The timestamps, however, are not identical and contain randomness. The feature values related to IP address and user agent string were randomly generated by publicly available data, so they were very likely not present in the real data set. The RTTs resemble real values but were randomly assigned among users per geolocation. Therefore, the RTT entries were probably in other positions in the original data set.

    The country was randomly assigned per unique feature value. Based on that, we randomly assigned an ASN related to the country, and generated the IP addresses for this ASN. The cities and regions were derived from the generated IP addresses for privacy reasons and do not reflect the real logical relations from the original data set.

    The device types are identical to the real data set. Based on that, we randomly assigned the OS, and based on the OS the browser information. From this information, we randomly generated the user agent string. Therefore, all the logical relations regarding the user agent are identical as in the real data set.

    The RTT was randomly drawn from the login success status and synthesized geolocation data. We did this to ensure that the RTTs are realistic ones.

    Regarding the Data Values

    Due to unresolvable conflicts during the data creation, we had to assign some unrealistic IP addresses and ASNs that are not present in the real world. Nevertheless, these do not have any effects on the risk scores generated by the Freeman et al. (2016) model.

    You can recognize them by the following values:

    ASNs with values >= 500.000

    IP addresses in the range 10.0.0.0 - 10.255.255.255 (10.0.0.0/8 CIDR range)

    Study Reproduction

    Based on our evaluation, this data set can reproduce our study results regarding the RBA behavior of an RBA model using the IP address (IP address, country, and ASN) and user agent string (Full string, OS name and version, browser name and version, device type) as features.

    The calculated RTT significances for countries and regions inside Norway are not identical using this data set, but have similar tendencies. The same is true for the Median RTTs per country. This is due to the fact that the available number of entries per country, region, and city changed with the data creation procedure. However, the RTTs still reflect the real-world distributions of different geolocations by city.

    See RESULTS.md for more details.

    Ethics

    By using the SSO service, the users agreed in the data collection and evaluation for research purposes. For study reproduction and fostering RBA research, we agreed with the data owner to create a synthesized data set that does not allow re-identification of customers.

    The synthesized data set does not contain any sensitive data values, as the IP addresses, browser identifiers, login timestamps, and RTTs were randomly generated and assigned.

    Publication

    You can find more details on our conducted study in the following journal article:

    Pump Up Password Security! Evaluating and Enhancing Risk-Based Authentication on a Real-World Large-Scale Online Service (2022) Stephan Wiefling, Paul René Jørgensen, Sigurd Thunem, and Luigi Lo Iacono. ACM Transactions on Privacy and Security

    Bibtex

    @article{Wiefling_Pump_2022, author = {Wiefling, Stephan and Jørgensen, Paul René and Thunem, Sigurd and Lo Iacono, Luigi}, title = {Pump {Up} {Password} {Security}! {Evaluating} and {Enhancing} {Risk}-{Based} {Authentication} on a {Real}-{World} {Large}-{Scale} {Online} {Service}}, journal = {{ACM} {Transactions} on {Privacy} and {Security}}, doi = {10.1145/3546069}, publisher = {ACM}, year = {2022} }

    License

    This data set and the contents of this repository are licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. See the LICENSE file for details. If the data set is used within a publication, the following journal article has to be cited as the source of the data set:

    Stephan Wiefling, Paul René Jørgensen, Sigurd Thunem, and Luigi Lo Iacono: Pump Up Password Security! Evaluating and Enhancing Risk-Based Authentication on a Real-World Large-Scale Online Service. In: ACM Transactions on Privacy and Security (2022). doi: 10.1145/3546069

    Few (invalid) user agents strings from the original data set could not be parsed, so their device type is empty. Perhaps this parse error is useful information for your studies, so we kept these 1526 entries.↩︎

  16. d

    Datos Global Activity Feed (~20M Monthly Active Users Worldwide)

    • datarade.ai
    .csv, .txt
    Updated May 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datos, A Semrush Company (2023). Datos Global Activity Feed (~20M Monthly Active Users Worldwide) [Dataset]. https://datarade.ai/data-products/datos-global-activity-feed-20m-monthly-active-users-worldwide-datos
    Explore at:
    .csv, .txtAvailable download formats
    Dataset updated
    May 12, 2023
    Dataset authored and provided by
    Datos, A Semrush Company
    Area covered
    Tokelau, Peru, Cyprus, Costa Rica, Guatemala, Andorra, Svalbard and Jan Mayen, Korea (Republic of), Armenia, Malta
    Description

    Datos brings to market anonymized, at scale, consolidated privacy-secured datasets with a granularity rarely found in the market. Get access to the desktop and mobile browsing behavior for millions of users across the globe, packaged into clean, easy-to-understand data products and reports.

    The Datos Activity Feed is an event-level accounting of all observed URL visits executed by devices which Datos has access to over a given period of time.

    This feed can be delivered on a daily basis, delivering the previous day’s data. It can be filtered by any of the fields, so you can focus on what’s important for you, whether it be specific markets or domains.

    Now available with Datos Low-Latency Feed This add-on ensures delivery of approximately 99% of all devices before markets open in New York (the lowest latency product on the market). Our clickstream data is made up of an array of upstream sources. The DLLF makes the daily output of these sources available as they arrive and are processed, rather than a once-daily batch.

  17. Z

    Robot@Home2, a robotic dataset of home environments

    • data.niaid.nih.gov
    Updated Apr 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ambrosio-Cestero, Gregorio (2024). Robot@Home2, a robotic dataset of home environments [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3901563
    Explore at:
    Dataset updated
    Apr 4, 2024
    Dataset provided by
    Ambrosio-Cestero, Gregorio
    Ruiz-Sarmiento, José Raul
    González-Jiménez, Javier
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Robot-at-Home dataset (Robot@Home, paper here) is a collection of raw and processed data from five domestic settings compiled by a mobile robot equipped with 4 RGB-D cameras and a 2D laser scanner. Its main purpose is to serve as a testbed for semantic mapping algorithms through the categorization of objects and/or rooms.

    This dataset is unique in three aspects:

    The provided data were captured with a rig of 4 RGB-D sensors with an overall field of view of 180°H. and 58°V., and with a 2D laser scanner.

    It comprises diverse and numerous data: sequences of RGB-D images and laser scans from the rooms of five apartments (87,000+ observations were collected), topological information about the connectivity of these rooms, and 3D reconstructions and 2D geometric maps of the visited rooms.

    The provided ground truth is dense, including per-point annotations of the categories of the objects and rooms appearing in the reconstructed scenarios, and per-pixel annotations of each RGB-D image within the recorded sequences

    During the data collection, a total of 36 rooms were completely inspected, so the dataset is rich in contextual information of objects and rooms. This is a valuable feature, missing in most of the state-of-the-art datasets, which can be exploited by, for instance, semantic mapping systems that leverage relationships like pillows are usually on beds or ovens are not in bathrooms.

    Robot@Home2

    Robot@Home2, is an enhanced version aimed at improving usability and functionality for developing and testing mobile robotics and computer vision algorithms. It consists of three main components. Firstly, a relational database that states the contextual information and data links, compatible with Standard Query Language. Secondly,a Python package for managing the database, including downloading, querying, and interfacing functions. Finally, learning resources in the form of Jupyter notebooks, runnable locally or on the Google Colab platform, enabling users to explore the dataset without local installations. These freely available tools are expected to enhance the ease of exploiting the Robot@Home dataset and accelerate research in computer vision and robotics.

    If you use Robot@Home2, please cite the following paper:

    Gregorio Ambrosio-Cestero, Jose-Raul Ruiz-Sarmiento, Javier Gonzalez-Jimenez, The Robot@Home2 dataset: A new release with improved usability tools, in SoftwareX, Volume 23, 2023, 101490, ISSN 2352-7110, https://doi.org/10.1016/j.softx.2023.101490.

    @article{ambrosio2023robotathome2,title = {The Robot@Home2 dataset: A new release with improved usability tools},author = {Gregorio Ambrosio-Cestero and Jose-Raul Ruiz-Sarmiento and Javier Gonzalez-Jimenez},journal = {SoftwareX},volume = {23},pages = {101490},year = {2023},issn = {2352-7110},doi = {https://doi.org/10.1016/j.softx.2023.101490},url = {https://www.sciencedirect.com/science/article/pii/S2352711023001863},keywords = {Dataset, Mobile robotics, Relational database, Python, Jupyter, Google Colab}}

    Version historyv1.0.1 Fixed minor bugs.v1.0.2 Fixed some inconsistencies in some directory names. Fixes were necessary to automate the generation of the next version.v2.0.0 SQL based dataset. Robot@Home v1.0.2 has been packed into a sqlite database along with RGB-D and scene files which have been assembled into a hierarchical structured directory free of redundancies. Path tables are also provided to reference files in both v1.0.2 and v2.0.0 directory hierarchies. This version has been automatically generated from version 1.0.2 through the toolbox.v2.0.1 A forgotten foreign key pair have been added.v.2.0.2 The views have been consolidated as tables which allows a considerable improvement in access time.v.2.0.3 The previous version does not include the database. In this version the database has been uploaded.v.2.1.0 Depth images have been updated to 16-bit. Additionally, both the RGB images and the depth images are oriented in the original camera format, i.e. landscape.

  18. AI Training Data | Annotated Checkout Flows for Retail, Restaurant, and...

    • datarade.ai
    Updated Dec 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MealMe (2024). AI Training Data | Annotated Checkout Flows for Retail, Restaurant, and Marketplace Websites [Dataset]. https://datarade.ai/data-products/ai-training-data-annotated-checkout-flows-for-retail-resta-mealme
    Explore at:
    Dataset updated
    Dec 18, 2024
    Dataset provided by
    MealMe, Inc.
    Authors
    MealMe
    Area covered
    United States of America
    Description

    AI Training Data | Annotated Checkout Flows for Retail, Restaurant, and Marketplace Websites Overview

    Unlock the next generation of agentic commerce and automated shopping experiences with this comprehensive dataset of meticulously annotated checkout flows, sourced directly from leading retail, restaurant, and marketplace websites. Designed for developers, researchers, and AI labs building large language models (LLMs) and agentic systems capable of online purchasing, this dataset captures the real-world complexity of digital transactions—from cart initiation to final payment.

    Key Features

    Breadth of Coverage: Over 10,000 unique checkout journeys across hundreds of top e-commerce, food delivery, and service platforms, including but not limited to Walmart, Target, Kroger, Whole Foods, Uber Eats, Instacart, Shopify-powered sites, and more.

    Actionable Annotation: Every flow is broken down into granular, step-by-step actions, complete with timestamped events, UI context, form field details, validation logic, and response feedback. Each step includes:

    Page state (URL, DOM snapshot, and metadata)

    User actions (clicks, taps, text input, dropdown selection, checkbox/radio interactions)

    System responses (AJAX calls, error/success messages, cart/price updates)

    Authentication and account linking steps where applicable

    Payment entry (card, wallet, alternative methods)

    Order review and confirmation

    Multi-Vertical, Real-World Data: Flows sourced from a wide variety of verticals and real consumer environments, not just demo stores or test accounts. Includes complex cases such as multi-item carts, promo codes, loyalty integration, and split payments.

    Structured for Machine Learning: Delivered in standard formats (JSONL, CSV, or your preferred schema), with every event mapped to action types, page features, and expected outcomes. Optional HAR files and raw network request logs provide an extra layer of technical fidelity for action modeling and RLHF pipelines.

    Rich Context for LLMs and Agents: Every annotation includes both human-readable and model-consumable descriptions:

    “What the user did” (natural language)

    “What the system did in response”

    “What a successful action should look like”

    Error/edge case coverage (invalid forms, OOS, address/payment errors)

    Privacy-Safe & Compliant: All flows are depersonalized and scrubbed of PII. Sensitive fields (like credit card numbers, user addresses, and login credentials) are replaced with realistic but synthetic data, ensuring compliance with privacy regulations.

    Each flow tracks the user journey from cart to payment to confirmation, including:

    Adding/removing items

    Applying coupons or promo codes

    Selecting shipping/delivery options

    Account creation, login, or guest checkout

    Inputting payment details (card, wallet, Buy Now Pay Later)

    Handling validation errors or OOS scenarios

    Order review and final placement

    Confirmation page capture (including order summary details)

    Why This Dataset?

    Building LLMs, agentic shopping bots, or e-commerce automation tools demands more than just page screenshots or API logs. You need deeply contextualized, action-oriented data that reflects how real users interact with the complex, ever-changing UIs of digital commerce. Our dataset uniquely captures:

    The full intent-action-outcome loop

    Dynamic UI changes, modals, validation, and error handling

    Nuances of cart modification, bundle pricing, delivery constraints, and multi-vendor checkouts

    Mobile vs. desktop variations

    Diverse merchant tech stacks (custom, Shopify, Magento, BigCommerce, native apps, etc.)

    Use Cases

    LLM Fine-Tuning: Teach models to reason through step-by-step transaction flows, infer next-best-actions, and generate robust, context-sensitive prompts for real-world ordering.

    Agentic Shopping Bots: Train agents to navigate web/mobile checkouts autonomously, handle edge cases, and complete real purchases on behalf of users.

    Action Model & RLHF Training: Provide reinforcement learning pipelines with ground truth “what happens if I do X?” data across hundreds of real merchants.

    UI/UX Research & Synthetic User Studies: Identify friction points, bottlenecks, and drop-offs in modern checkout design by replaying flows and testing interventions.

    Automated QA & Regression Testing: Use realistic flows as test cases for new features or third-party integrations.

    What’s Included

    10,000+ annotated checkout flows (retail, restaurant, marketplace)

    Step-by-step event logs with metadata, DOM, and network context

    Natural language explanations for each step and transition

    All flows are depersonalized and privacy-compliant

    Example scripts for ingesting, parsing, and analyzing the dataset

    Flexible licensing for research or commercial use

    Sample Categories Covered

    Grocery delivery (Instacart, Walmart, Kroger, Target, etc.)

    Restaurant takeout/delivery (Ub...

  19. d

    IP Address Data | Company Domain to MAIDs | Monthly Feed | 300M+ MAIDs to...

    • datarade.ai
    .json, .csv, .xls
    Updated Apr 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Allforce (2024). IP Address Data | Company Domain to MAIDs | Monthly Feed | 300M+ MAIDs to Business Domain Linkages [Dataset]. https://datarade.ai/data-products/company-ip-data-company-domain-to-maids-monthly-feed-30-solution-publishing
    Explore at:
    .json, .csv, .xlsAvailable download formats
    Dataset updated
    Apr 1, 2024
    Dataset authored and provided by
    Allforce
    Area covered
    United States of America
    Description

    Company Domain to MAID | Company IP Data | Monthly Feed | 200M+ MAIDs to Business Domain Connections Introducing our "Company Domain to MAID" dataset, a groundbreaking resource that links over 200 million Mobile Advertising IDs (MAIDs) to corresponding business domains. Updated monthly, this expansive dataset is engineered for organizations aiming to revolutionize their mobile marketing strategies, enhance customer engagement, and gain deeper insights into mobile user behaviors. As a perfect complement to our "Company Domain to IP Address Linkage Database," this product extends the value of your digital marketing and cybersecurity efforts by integrating mobile data for comprehensive B2B insights. Company IP Data Company Data Ideal Add-On for Domain to IP Product This dataset serves as an ideal add-on to our "Company Domain to IP Address Linkage Database," enabling a multi-dimensional approach to digital strategy by encompassing both traditional and mobile digital landscapes. Together, these products offer a holistic view of digital footprints, ensuring your marketing, security, and analytical capabilities are both broad and deeply integrated. Key Features: • Massive Dataset: Access a robust linkage of over 200 million MAIDs to business domains, providing unparalleled coverage across various industries and markets. • High-Quality Data: Benefit from a dataset characterized by its high accuracy and monthly updates, ensuring you have the most current and reliable information for your mobile marketing and engagement strategies. • Seamless Compatibility: Designed for easy integration with existing marketing, CRM, and cybersecurity platforms, enhancing your digital outreach and security protocols with valuable mobile user insights. Benefits: • Advanced Mobile Marketing: Leverage precise MAID to domain mappings to target and engage mobile users more effectively, driving higher engagement rates and improving campaign ROI. • Enhanced Customer Insights: Gain deeper understanding of customer mobile behaviors and preferences, enabling more personalized and impactful marketing strategies. • Comprehensive Digital Footprint: Combine with our Domain to IP product for a complete overview of corporate digital presence, from desktop to mobile, enhancing all aspects of digital marketing and cybersecurity. • Improved Data-Driven Decisions: Utilize the extensive insights provided by linking MAIDs to business domains to inform strategic decisions, from marketing to security to product development. Applications: • Holistic Marketing Strategies: Employ our dataset to craft comprehensive digital marketing campaigns that effectively reach business audiences across all devices, maximizing coverage and impact. • Enhanced B2B Targeting: Perfectly complement your Domain to IP strategies by including mobile targeting, ensuring that your messages reach the right audience, no matter the device. • Robust Cybersecurity Posture: Enhance your cybersecurity measures by incorporating mobile data, providing a more complete picture of potential vulnerabilities and threat vectors. • Market and Competitor Analysis: Analyze mobile engagement trends and behaviors for insights into market dynamics and competitor strategies, guiding your business decisions with rich, actionable data. Our "Company Domain to MAID" dataset is a must-have for businesses looking to capitalize on the immense potential of mobile marketing and engagement, offering a significant advantage in understanding and reaching B2B audiences. As a standalone product or in conjunction with our "Company Domain to IP Address Linkage Database," it represents the pinnacle of digital insight, enabling businesses to navigate the complexities of the modern digital landscape with confidence and precision.

  20. R

    Data from: Action Classification Dataset

    • universe.roboflow.com
    zip
    Updated Oct 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    volleyballanalysis (2023). Action Classification Dataset [Dataset]. https://universe.roboflow.com/volleyballanalysis/action-classification-rgm9d/model/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 19, 2023
    Dataset authored and provided by
    volleyballanalysis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Action
    Description

    Here are a few use cases for this project:

    1. Sports Analytics: The "action classification" model can be beneficial in analyzing players' performances in volleyball or related sports. It could assess the techniques used by players, thus helping coaches to make strategic decisions.

    2. Broadcasting & Media Coverage: This model could be utilized by sports broadcasting or media companies to provide more in-depth, real-time analysis of volleyball games. Automated identifications of actions could enhance viewer experiences by enriching commentary and enabling advanced visual effects.

    3. Sports Training Apps: Mobile or desktop training apps for aspiring volleyball players can incorporate this computer vision model to provide users with real-time feedback on their action class, helping them improve their skills effectively.

    4. Injury Prevention and Rehabilitation: Physiotherapists and fitness trainers can employ this model to monitor athletes' actions during practice or actual games. It could provide insights into anomalies or wrong techniques that may lead to injury, facilitating proactive preventive measures.

    5. Automated Refereeing: In sports competitions, especially in amateur leagues where expert referees may not always be available, the model can be deployed to act as an automated referee system that ensures all rules are adhered to by identifying all action categories during the game.

    chatgpt wrote.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Inner Spark Creative (2025). Mobile vs Desktop Usage Statistics 2025 [Dataset]. https://www.innersparkcreative.com/news/mobile-vs-desktop-usage-statistics-2025-verified

Mobile vs Desktop Usage Statistics 2025

Explore at:
htmlAvailable download formats
Dataset updated
Sep 3, 2025
Dataset authored and provided by
Inner Spark Creative
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Verified dataset of 2025 device usage: share of global web traffic, mobile commerce share of transactions, US daily time spent, app vs web breakdown, and tablet decline.

Search
Clear search
Close search
Google apps
Main menu