50 datasets found
  1. Multilingual Mobile App Review Dataset August 2025

    • kaggle.com
    Updated Jul 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pratyush Puri (2025). Multilingual Mobile App Review Dataset August 2025 [Dataset]. https://www.kaggle.com/datasets/pratyushpuri/multilingual-mobile-app-reviews-dataset-2025
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 31, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pratyush Puri
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Multilingual Mobile App Reviews Dataset 2025

    Overview

    This comprehensive synthetic dataset contains 2,514 authentic mobile app reviews spanning 40+ popular applications across 24 different languages, making it ideal for multilingual NLP, sentiment analysis, and cross-cultural user behavior research.

    Dataset Statistics

    • Total Records: 2,514 reviews
    • Columns: 15 features
    • Languages Covered: 24 international languages
    • Apps Included: 40+ popular mobile applications
    • Time Range: 2023-2025 (2-year span)
    • File Format: CSV
    • Data Quality: Intentionally includes missing values and mixed data types for data cleaning practice

    Column Specifications

    Column NameData TypeDescriptionSample ValuesNull Count
    review_idIntegerUnique identifier for each review1, 2, 3, ...0
    user_idString*User identifier (should be integer)"1967825", "9242600"0
    app_nameStringName of the mobile applicationWhatsApp, Instagram, TikTok0
    app_categoryStringApplication categorySocial Networking, Entertainment0
    review_textStringMultilingual review content"This app is amazing!"63
    review_languageStringISO language codeen, es, fr, zh, hi, ar0
    ratingMixed*App rating (1.0-5.0, some as strings)4.5, "3.2", 1.138
    review_dateDateTimeTimestamp of review submission2024-10-09 19:26:400
    verified_purchaseBooleanPurchase verification statusTrue, False0
    device_typeStringDevice platformAndroid, iOS, iPad, Windows Phone0
    num_helpful_votesMixed*Helpfulness votes (some as strings)65, "209", 1630
    user_ageFloat*User age (should be integer)14.0, 18.0, 67.00
    user_countryStringUser's countryChina, Germany, Nigeria50
    user_genderStringUser genderMale, Female, Non-binary, Prefer not to say88
    app_versionStringApplication version number1.4, v8.9, 2.8.37.592625

    Note: Data types marked with asterisk require cleaning/conversion

    Language Distribution

    The dataset includes reviews in 24 languages: - European: English (en), Spanish (es), French (fr), German (de), Italian (it), Russian (ru), Polish (pl), Dutch (nl), Swedish (sv), Danish (da), Norwegian (no), Finnish (fi) - Asian: Chinese (zh), Hindi (hi), Japanese (ja), Korean (ko), Thai (th), Vietnamese (vi), Indonesian (id), Malay (ms) - Other: Arabic (ar), Turkish (tr), Filipino (tl)

    Application Categories

    Reviews cover 18 distinct categories: - Social Networking - Entertainment
    - Productivity - Travel & Local - Music & Audio - Video Players & Editors - Shopping - Navigation - Finance - Communication - Education - Photography - Dating - Business - Utilities - Health & Fitness - Games - News & Magazines

    Popular Apps Included

    40+ applications including: - Social: WhatsApp, Instagram, Facebook, Snapchat, TikTok, LinkedIn, Twitter, Reddit, Pinterest - Entertainment: YouTube, Netflix, Spotify - Productivity: Microsoft Office, Google Drive, Dropbox, OneDrive, Zoom, Discord - Travel: Uber, Lyft, Airbnb, Booking.com, Google Maps, Waze - Finance: PayPal, Venmo - Education: Duolingo, Khan Academy, Coursera, Udemy - Tools: Grammarly, Canva, Adobe Photoshop, VLC, MX Player

    Geographic Distribution

    Reviews from 24 countries across all continents: - Asia: China, India, Japan, South Korea, Thailand, Vietnam, Indonesia, Malaysia, Philippines, Pakistan, Bangladesh - Europe: Germany, United Kingdom, France, Italy, Spain, Russia, Turkey, Poland - Americas: United States, Canada, Brazil, Mexico - Oceania: Australia - Africa: Nigeria

    Data Quality Features

    Intentional data challenges for learning: - Missing Values: Strategic nulls in review_text (63), rating (38), user_country (50), user_gender (88), app_version (25) - Data Type Issues: - user_id stored as strings (should be integers) - user_age as floats (should be integers)
    - Some ratings as strings (should be floats) - Some helpful_votes as strings (should be integers) - Mixed Version Formats: "1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"

    Use Cases

    This dataset is perfect for: - Multilingual NLP projects and sentiment analysis - Cross-cultural user behavior analysis - App store analytics and rating prediction - Data cleaning and preprocessing practice - Text classification across multiple languages - Time series analysis of app reviews - Geographic sentiment analysis - Data engineering pipeline development

    Data Cleaning Opportunities

    • Convert string IDs to integers
    • Standardize rating values to float
    • Han...
  2. b

    App Downloads Data (2025)

    • businessofapps.com
    Updated Aug 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Business of Apps (2025). App Downloads Data (2025) [Dataset]. https://www.businessofapps.com/data/app-statistics/
    Explore at:
    Dataset updated
    Aug 1, 2025
    Dataset authored and provided by
    Business of Apps
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    App Download Key StatisticsApp and Game DownloadsiOS App and Game DownloadsGoogle Play App and Game DownloadsGame DownloadsiOS Game DownloadsGoogle Play Game DownloadsApp DownloadsiOS App...

  3. H

    Replication Data for: Driving Mobile App User Engagement Through...

    • dataverse.harvard.edu
    Updated Oct 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jens Paschmann; Hernán A. Bruno; Harald J. van Heerde; Franziska Völckner; Kristina Klein (2024). Replication Data for: Driving Mobile App User Engagement Through Gamification [Dataset]. http://doi.org/10.7910/DVN/4YNNWN
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 7, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Jens Paschmann; Hernán A. Bruno; Harald J. van Heerde; Franziska Völckner; Kristina Klein
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Five files are provided: 01_synthetic_data_model.r 02_app_data.rda 03_mean_center_da.rda 04_Bayesian_Tobit_II.stan 05_fit.RData File 01: This R file loads synthetic data and estimates a Bayesian Tobit II model with four outcomes. File 02: The dataset app_data.rda contains 15,481 observations of 100 users. It contains synthetic observations of all variables needed for the analyses. File 03: This dataset includes the means of the transformed variables in the original data, such that variables can be mean-centered analogously in the synthetic data. File 04: This .stan file provides the Bayesian Tobit II Stan code which is integrated into the R file (see File 01) as a stand-alone file in case it needs to be inspected with syntax highlighting or run on a different dataset. File 05: This file includes the output derived from fitting the Stan model on the synthetic data (i.e., a so-called "stanfit object"). If one does not want to run the model, one can directly import this file in R. The readme.txt describes details of the replication files.

  4. TikTok global quarterly downloads 2018-2024

    • statista.com
    • es.statista.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). TikTok global quarterly downloads 2018-2024 [Dataset]. https://www.statista.com/topics/1002/mobile-app-usage/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    In the fourth quarter of 2024, TikTok generated around 186 million downloads from users worldwide. Initially launched in China first by ByteDance as Douyin, the short-video format was popularized by TikTok and took over the global social media environment in 2020. In the first quarter of 2020, TikTok downloads peaked at over 313.5 million worldwide, up by 62.3 percent compared to the first quarter of 2019. TikTok interactions: is there a magic formula for content success? In 2024, TikTok registered an engagement rate of approximately 4.64 percent on video content hosted on its platform. During the same examined year, the social video app recorded over 1,100 interactions on average. These interactions were primarily composed of likes, while only recording less than 20 comments per piece of content on average in 2024. The platform has been actively monitoring the issue of fake interactions, as it removed around 236 million fake likes during the first quarter of 2024. Though there is no secret formula to get the maximum of these metrics, recommended video length can possibly contribute to the success of content on TikTok. It was recommended that tiny TikTok accounts with up to 500 followers post videos that are around 2.6 minutes long as of the first quarter of 2024. While, the ideal video duration for huge TikTok accounts with over 50,000 followers was 7.28 minutes. The average length of TikTok videos posted by the creators in 2024 was around 43 seconds. What’s trending on TikTok Shop? Since its launch in September 2023, TikTok Shop has become one of the most popular online shopping platforms, offering consumers a wide variety of products. In 2023, TikTok shops featuring beauty and personal care items sold over 370 million products worldwide. TikTok shops featuring womenswear and underwear, as well as food and beverages, followed with 285 and 138 million products sold, respectively. Similarly, in the United States market, health and beauty products were the most-selling items, accounting for 85 percent of sales made via the TikTok Shop feature during the first month of its launch. In 2023, Indonesia was the market with the largest number of TikTok Shops, hosting over 20 percent of all TikTok Shops. Thailand and Vietnam followed with 18.29 and 17.54 percent of the total shops listed on the famous short video platform, respectively. 

  5. b

    App Store Data (2025)

    • businessofapps.com
    Updated Aug 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Business of Apps (2025). App Store Data (2025) [Dataset]. https://www.businessofapps.com/data/app-stores/
    Explore at:
    Dataset updated
    Aug 1, 2025
    Dataset authored and provided by
    Business of Apps
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Apple App Store Key StatisticsApps & Games in the Apple App StoreApps in the Apple App StoreGames in the Apple App StoreMost Popular Apple App Store CategoriesPaid vs Free Apps in Apple App...

  6. e

    Mobile app for customers by region, size class, or industry . Share of...

    • data.europa.eu
    json
    Updated Jul 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistikmyndigheten SCB - Statistiska centralbyrån (2025). Mobile app for customers by region, size class, or industry . Share of enterprises with 10 or more employees. Year 2023 [Dataset]. https://data.europa.eu/data/datasets/https-statistikdatabasen-scb-se-dataset-tab3340?locale=en
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Jul 14, 2025
    Dataset authored and provided by
    Statistikmyndigheten SCB - Statistiska centralbyrån
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Mobile app for customers, share of enterprises with 10 or more employees by study domain, observations and year

  7. f

    Mobility Data | Global | Reach - 90 Billion Records for Consumer Insights &...

    • factori.ai
    Updated Dec 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Mobility Data | Global | Reach - 90 Billion Records for Consumer Insights & Market Intelligence [Dataset]. https://www.factori.ai/datasets/mobility-data/
    Explore at:
    Dataset updated
    Dec 24, 2024
    License

    https://www.factori.ai/privacy-policyhttps://www.factori.ai/privacy-policy

    Area covered
    Global
    Description

    Mobility data is collected through location-aware mobile apps using an SDK-based implementation. Users explicitly consent to share their location data via a clear opt-in process and are provided with clear opt-out options. Factori ingests, cleans, validates, and exports all location data signals to ensure the highest quality data is available for analysis.

    • Record Count: 90 Billion
    • Capturing Frequency: Once per Event
    • Delivering Frequency: Once per Day
    • Updated: Daily

    Mobility Data Reach

    Our data reach encompasses the total counts available across various categories, including attributes such as country location, MAU (Monthly Active Users), DAU (Daily Active Users), and Monthly Location Pings.

    Data Export Methodology

    We collect data dynamically, offering the most updated data and insights at the best-suited intervals (daily, weekly, monthly, or quarterly).

    Business Needs

    Our data supports various business needs, including consumer insight, market intelligence, advertising, and retail analytics.

  8. d

    SMEs Mobile Business Intelligence Application Project - Subsidy List

    • data.gov.tw
    csv
    Updated Jun 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of Digital Affairs (2025). SMEs Mobile Business Intelligence Application Project - Subsidy List [Dataset]. https://data.gov.tw/en/datasets/143152
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jun 1, 2025
    Dataset authored and provided by
    Ministry of Digital Affairs
    License

    https://data.gov.tw/licensehttps://data.gov.tw/license

    Description

    This dataset explains the subsidy status of the SMEs Mobile Smart Application Project over the years, providing relevant information such as subsidy recipients, subsidy amounts, affiliated municipalities or counties, and approval dates, providing a reference for the industry to promote mobile smart application.

  9. s

    Reports of non-emergency problems submitted by users of Get It Done

    • data.sandiego.gov
    Updated May 22, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Reports of non-emergency problems submitted by users of Get It Done [Dataset]. https://data.sandiego.gov/datasets/get-it-done-311/
    Explore at:
    csv csv is tabular data. excel, google docs, libreoffice calc or any plain text editor will open files with this format. learn moreAvailable download formats
    Dataset updated
    May 22, 2019
    Description

    The Get It Done program allows residents and visitors to report certain types of non-emergency problems to the City using the Get It Done mobile app, web app, or by telephone. This dataset contains all Get It Done reports the City has received since the program launched in May 2016. New! We have reorganized the data into a single file of currently open reports and closed reports by year. Users who would prefer to get reports by problem type should refer to the datasets for: 72-hour parking violations Graffiti Illegal Dumping Potholes The scope of this data is limited to information from the reports citizen users submit through Get It Done. The data includes fields for the date and time a report was submitted, what the problem was, the location of the problem, and the date when the user was notified that the City addressed the problem. This data does not include details about any work performed to fix a problem or the date and time work was completed. Reports that are referred outside of the Get It Done system have a status of “Referred”. Please note that this data includes every user-submitted report and should not be considered an official record of City maintenance work. For example, users might submit problems that have already been reported, that are the responsibility of another government agency or private business, that cannot be found or verified, or that are already scheduled to be fixed in a long-term maintenance plan. The details about how the City addressed each report are outside of the scope of this dataset. If you have any questions about this data, please contact pandatech@sandiego.gov. If you have questions about your Get It Done report, please refer to your confirmation email.

  10. a

    Nonemployer Statistics - Counties 2019

    • arc-gis-hub-home-arcgishub.hub.arcgis.com
    • covid19-uscensus.hub.arcgis.com
    • +1more
    Updated Dec 28, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    US Census Bureau (2020). Nonemployer Statistics - Counties 2019 [Dataset]. https://arc-gis-hub-home-arcgishub.hub.arcgis.com/maps/700093dc551d46a5a86186582e42bfaa_0/about
    Explore at:
    Dataset updated
    Dec 28, 2020
    Dataset authored and provided by
    US Census Bureau
    Area covered
    Description

    This layer shows data on the number of establishments and revenue for select 2-digit North American Industry Classification System (NAICS) sectors and for NAICS 00, All Sectors. This is shown by county and state boundaries. The full NES data set (available at census.gov) is updated annually to contain the most currently released NES data, and contains estimates and measure of reliability. There are also additional calculated attributes related to this topic, which can be mapped or used within analysis.Current Vintage: 2019CBP Table: NS1900NESData downloaded from: Census Bureau's API for Nonemployer StatisticsDate of API call: December 19, 2022National Figures: data.census.govThe United States Census Bureau's Nonemployer Statistics Program (NES):About this ProgramDataTechnical DocumentationNews & UpdatesThis ready-to-use layer can be used within ArcGIS Pro, ArcGIS Online, its configurable apps, dashboards, Story Maps, custom apps, and mobile apps. Data can also be exported for offline workflows. Please cite the Census Bureau and NES when using this data.Data Processing Notes:Boundaries come from the US Census Bureau TIGER geodatabases. Boundaries are updated at the same time as the data updates (annually), and the boundary vintage appropriately matches the data vintage as specified by the Census Bureau. These are Census Bureau boundaries with water and/or coastlines clipped for cartographic purposes. For census tracts, the water cutouts are derived from a subset of the 2010 AWATER (Area Water) boundaries offered by TIGER. For state and county boundaries, the water and coastlines are derived from the coastlines of the 500k TIGER Cartographic Boundary Shapefiles. The original AWATER and ALAND fields are still available as attributes within the data table (units are square meters).The States layer contains 51 records - all US states, Washington D.C..Blank values represent industries where there either were no businesses in that industry and that geography OR industries where the data had to be withheld to avoid disclosing data for individual companies. Users should visit data.census.gov or Census Business Builder for more details on these withheld records.Data shown in thousands of dollars are indicated by '($1000)' in the field aliasing. Average and Totals include NAICS 11.

  11. D

    Mobile App Analytics Software Market Report | Global Forecast From 2025 To...

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Mobile App Analytics Software Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/global-mobile-app-analytics-software-market
    Explore at:
    pptx, csv, pdfAvailable download formats
    Dataset updated
    Sep 22, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Mobile App Analytics Software Market Outlook



    The global mobile app analytics software market size was valued at USD 2.5 billion in 2023 and is projected to reach USD 8.4 billion by 2032, growing at a CAGR of 14.3% during the forecast period. This robust growth is driven by increasing smartphone penetration and the growing importance of mobile applications in business strategies. The rising need for real-time data analysis and user insights to optimize app performance and enhance user experience further fuels market expansion.



    One of the primary growth factors for the mobile app analytics software market is the rapid increase in smartphone usage worldwide. With the proliferation of mobile devices, users are spending more time on mobile applications, which has incentivized businesses to invest in mobile app analytics to understand user behavior and improve app functionalities. Moreover, the widespread adoption of mobile devices has provided businesses with rich data sets to analyze, thereby driving the demand for sophisticated analytics tools. This trend is expected to continue as more businesses recognize the value of mobile app analytics in driving customer engagement and retention.



    Another significant growth driver is the increasing demand for personalized user experiences. In today’s competitive market landscape, businesses are striving to deliver personalized content and experiences to their users to gain a competitive edge. Mobile app analytics software enables companies to gather and analyze user data, providing valuable insights that can be used to tailor app experiences to individual users’ preferences and behaviors. This personalization not only enhances user satisfaction but also boosts user retention rates, leading to higher revenue generation for businesses.



    The burgeoning e-commerce sector also plays a crucial role in the growth of the mobile app analytics software market. With the rise of online shopping, e-commerce businesses are increasingly relying on mobile applications to reach their customers. Mobile app analytics software helps e-commerce companies track and analyze user interactions, purchase patterns, and preferences, enabling them to optimize their app performance and marketing strategies. As the e-commerce industry continues to expand, the demand for mobile app analytics software is expected to grow in tandem.



    Regionally, North America holds a dominant position in the mobile app analytics software market, attributed to the high penetration of smartphones and the presence of major technology companies in the region. Additionally, the early adoption of advanced technologies and the increasing focus on digital transformation initiatives further bolster market growth in North America. The Asia Pacific region is also witnessing significant growth, driven by the rapid digitalization of emerging economies and the increasing number of mobile app users. Europe, Latin America, and the Middle East & Africa are also expected to contribute to market growth, supported by the rising adoption of mobile applications and the growing emphasis on user experience optimization.



    Component Analysis



    The mobile app analytics software market is segmented into software and services components. The software segment holds a substantial share of the market, driven by the need for advanced analytical tools to process and interpret vast amounts of user data. Mobile app analytics software offers functionalities such as user behavior analysis, app performance tracking, and marketing campaign effectiveness measurement, which are crucial for businesses aiming to optimize their mobile strategies. As the demand for data-driven decision-making continues to rise, the software segment is expected to maintain its dominance in the market.



    Services, as a component, also play a vital role in the mobile app analytics software market. These services include implementation, consulting, and maintenance, which are essential for ensuring the effective deployment and utilization of mobile app analytics tools. Consulting services, in particular, help businesses understand how to leverage analytics software to achieve their strategic objectives. Additionally, maintenance services ensure that the analytics tools remain up-to-date with the latest technological advancements and market trends, thereby enhancing their effectiveness and reliability.



    Customization services are another critical aspect of the services component. Businesses often require tailored solutions that align with their specific needs and goals. Customization services enable compa

  12. H

    Replication Data for: Capture-Recapture Methods for Data on the Activation...

    • dataverse.harvard.edu
    • search.dataone.org
    txt
    Updated Mar 19, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2018). Replication Data for: Capture-Recapture Methods for Data on the Activation of Applications on Mobile Phones [Dataset]. http://doi.org/10.7910/DVN/H4AOQP
    Explore at:
    txt(9300751), txt(10410228), txt(1850735), txt(1852152), txt(10360607), txt(1848976), txt(10387084), txt(1650319)Available download formats
    Dataset updated
    Mar 19, 2018
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This work is concerned with the analysis of marketing data on the activation of applications (apps) on mobile devices. Each application has a hashed identification number that is specific to the device on which it has been installed. This number can be registered by a platform at each activation of the application. Activations on the same device are linked together using the identification number. By focusing on activations that took place at a business location one can create a capture-recapture data set about devices, that is users, that "visited" the business: the units are owners of mobile devices and the capture occasions are time intervals such as days. A unit is captured when she activates an application, provided that this activation is recorded by the platform providing the data. Statistical capture-recapture techniques can be applied to the app data to estimate the total number of users that visited the business over a time period, thereby providing an indirect estimate of foot traffic. This article argues that the robust design, a method for dealing with a nested mark-recapture experiment, can be used in this context. A new algorithm for estimating the parameters of a robust design with a fairly large number of capture occasions and a simple parametric bootstrap variance estimator are proposed. Moreover, new estimation methods and new theoretical results are introduced for a wider application of the robust design. This is used to analyze a data set about the mobile devices that visited the auto-dealerships of a major auto brand in a US metropolitan area over a period of one year and a half. Technical developments are provided in the Supplementary Material available online.

  13. m

    Allgeier SE - Financial-Leverage-Ratio

    • macro-rankings.com
    csv, excel
    Updated Jul 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). Allgeier SE - Financial-Leverage-Ratio [Dataset]. https://www.macro-rankings.com/Markets/Stocks?Entity=AEIN.XETRA&Item=Financial-Leverage-Ratio
    Explore at:
    excel, csvAvailable download formats
    Dataset updated
    Jul 29, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    germany
    Description

    Financial-Leverage-Ratio Time Series for Allgeier SE. Allgeier SE provides information technology (IT) solutions and software services in Germany. It operates in two segments, Enterprise IT and mgm technology partners. The company provides software lifecycle services, nearshore-/offshore delivery, big data / business intelligence, industry solutions and cloud, managed services & app management, mobile enterprise/apps, process and IT consulting, IT security, enterprise content management, and IT infrastructure services. It is also involved in designing, developing, launching, and operating business software solutions, such as document management; enterprise resource planning; e-commerce, business process management; business digitalization platform and business efficiency solutions; IT services and open-source software development; consultancy, software solutions; cloud transformation and cloudnative application development, as well as provides field service and asset management. The company is engages in Management consultancy and digital consulting, Business analysis and requirements engineering, software modelling and development, design and usability, web and application security, quality assurance, testing automation, SAP integration, process optimization, cloud services. Allgeier SE was founded in 1977 and is headquartered in Munich, Germany.

  14. S

    2019 and 2021 Wuhan Urban and Suburban Mobile-Office Data Set

    • scidb.cn
    Updated May 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Niu Qiang; Zhang Hao (2022). 2019 and 2021 Wuhan Urban and Suburban Mobile-Office Data Set [Dataset]. http://doi.org/10.57760/sciencedb.01736
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 6, 2022
    Dataset provided by
    Science Data Bank
    Authors
    Niu Qiang; Zhang Hao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Wuhan
    Description

    Mobile office is a new production and office activity in the information era. Its development is of great value to the digital transformation of office activities in big cities, the improvement of suburban office location and the optimization and reorganization of urban and suburban office location. Based on the mobile big data of mobile office apps used by urban and suburban unicom users in Wuhan in 2019 and 2021,as well as the distribution data of enterprises, this study summarized and organized the per capita duration of mobile office and the ratio of mobile office workers to employment in Wuhan before and after the epidemic in a grid unit of 1km*1km. This data set includes :(1) basic data sets of the ratio of urban and suburban mobile office workers to employed persons and per capita hours in Wuhan in 2019 and 2021. (2) Basic data set on the distribution of urban and suburban enterprises in Wuhan in 2019 and 2021. (3) Data sets were set for the dependent variable (mobile office hours per capita) and independent variable (product service industry, consumer service industry, information technology industry, and standardized quantity of manufacturing industry) of Wuhan urban-suburb geographical detector in 2019 and 2021. This data set can be used to study the the spatial distribution characteristics of cities’ mobile office activities in mobile-information era, explore the mobile office development regularity of different industries, optimize and recombine the suburban office location.

  15. Number of global social network users 2017-2028

    • statista.com
    • es.statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Number of global social network users 2017-2028 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    How many people use social media?

                  Social media usage is one of the most popular online activities. In 2024, over five billion people were using social media worldwide, a number projected to increase to over six billion in 2028.
    
                  Who uses social media?
                  Social networking is one of the most popular digital activities worldwide and it is no surprise that social networking penetration across all regions is constantly increasing. As of January 2023, the global social media usage rate stood at 59 percent. This figure is anticipated to grow as lesser developed digital markets catch up with other regions
                  when it comes to infrastructure development and the availability of cheap mobile devices. In fact, most of social media’s global growth is driven by the increasing usage of mobile devices. Mobile-first market Eastern Asia topped the global ranking of mobile social networking penetration, followed by established digital powerhouses such as the Americas and Northern Europe.
    
                  How much time do people spend on social media?
                  Social media is an integral part of daily internet usage. On average, internet users spend 151 minutes per day on social media and messaging apps, an increase of 40 minutes since 2015. On average, internet users in Latin America had the highest average time spent per day on social media.
    
                  What are the most popular social media platforms?
                  Market leader Facebook was the first social network to surpass one billion registered accounts and currently boasts approximately 2.9 billion monthly active users, making it the most popular social network worldwide. In June 2023, the top social media apps in the Apple App Store included mobile messaging apps WhatsApp and Telegram Messenger, as well as the ever-popular app version of Facebook.
    
  16. App Developer Data | B2B Contact Data for IT Professionals Worldwide | 170M...

    • datarade.ai
    Updated Oct 27, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2021). App Developer Data | B2B Contact Data for IT Professionals Worldwide | 170M Verified Profiles with Emails & Phone Numbers | Best Price Guaranteed [Dataset]. https://datarade.ai/data-products/app-developer-data-b2b-contact-data-for-it-professionals-wo-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Oct 27, 2021
    Dataset provided by
    Area covered
    Micronesia (Federated States of), Syrian Arab Republic, Eritrea, Anguilla, Italy, Lesotho, Senegal, Vanuatu, Greece, Liechtenstein
    Description

    Success.ai’s B2B Contact Data for IT Professionals Worldwide is an advanced, AI-validated solution designed to help businesses connect with top IT talent and decision-makers globally. With access to over 170 million verified profiles, this dataset includes key contact information such as work emails, phone numbers, and additional professional details, ensuring you can easily engage with IT leaders and specialists across various industries.

    Our comprehensive data is continually updated to ensure accuracy, relevance, and compliance with global standards. Whether you're looking to expand your network, enhance lead generation, or improve recruitment processes, Success.ai’s IT professional database is designed to meet the evolving needs of your business.

    Key Features of Success.ai’s IT Professional Contact Data

    • Global Coverage Across the IT Industry Success.ai offers a diverse range of IT professionals, including but not limited to:

    Software Engineers & Developers: Specialists in coding, programming, and software development. IT Managers & Directors: Decision-makers responsible for IT infrastructure and strategy. Systems Administrators: Experts managing system installations, configurations, and troubleshooting. Cloud Computing Specialists: Professionals focused on cloud storage and infrastructure services. Cybersecurity Experts: IT professionals safeguarding data and systems from cyber threats. IT Consultants & Analysts: Advisers providing strategic recommendations on technology improvements.

    This dataset spans 170M+ verified profiles across more than 250 countries, ensuring you reach the right IT professionals, wherever they are.

    • Verified and Continuously Updated Data

      99% Accuracy: Data is AI-validated to ensure that you are reaching the right contacts with accurate, up-to-date information. Real-Time Updates: Success.ai’s dataset is constantly refreshed, ensuring that the information you receive is always relevant and timely. Global Compliance: Our data collection adheres to GDPR, CCPA, and other data privacy standards, ensuring that your outreach practices are ethical and compliant.

    • Customizable Data Solutions Success.ai provides multiple delivery methods to suit your business needs:

    API Integration: Seamlessly integrate our data into your CRM, marketing automation, or lead-generation systems for real-time updates. Custom Flat Files: Receive highly targeted and segmented datasets, preformatted to your specifications, making integration easy.

    Why Choose Success.ai’s IT Professional Contact Data?

    • Best Price Guarantee We offer the most competitive pricing in the industry, ensuring you get exceptional value for high-quality, verified contact data.

    • Targeted Outreach to IT Professionals Our comprehensive dataset is perfect for precision targeting, making it easier to connect with key IT professionals. With detailed profiles, including work emails and phone numbers, you can engage with decision-makers directly and increase the efficiency of your campaigns.

    • Strategic Use Cases

      Lead Generation: Use our verified contact information to target IT decision-makers and specialists for your lead generation campaigns. Sales Outreach: Reach out to key IT managers, directors, and consultants to promote your product or service and close high-value deals. Recruitment: Source top-tier IT talent with verified contact data for software developers, network administrators, and IT executives. Marketing Campaigns: Run hyper-targeted marketing campaigns for IT professionals globally to promote tech services, job openings, or industry innovations. Business Expansion: Use data-driven insights to expand your global outreach, identifying opportunities and building relationships in untapped markets.

    • Key Data Highlights

      170M+ Verified Profiles of IT professionals worldwide, covering a wide range of roles and industries. 50M Work Emails to help you reach the right IT contacts. 30M Company Profiles with insights on the organizations that these professionals represent. 700M+ LinkedIn Professional Profiles globally, enhancing your ability to access verified IT contacts across various platforms.

    Powerful APIs for Enhanced Functionality

    • Enrichment API Keep your data up to date with our Enrichment API, providing real-time enrichment of your existing contact database. Perfect for businesses that want to maintain accurate and current information about their leads and customers.

    • Lead Generation API Maximize your lead generation campaigns by accessing Success.ai’s vast and verified dataset, which includes work emails and phone numbers for IT professionals worldwide. Our API supports up to 860,000 API calls per day, ensuring scalability for large enterprises.

    • Use Cases for IT Professional Contact Data

    • Lead Generation for IT Solutions Target IT decision-makers, software developers, and cybersecuri...

  17. a

    License Application Points (Business Registrations)

    • hub.arcgis.com
    • data.bendoregon.gov
    • +1more
    Updated Dec 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Bend, Oregon (2024). License Application Points (Business Registrations) [Dataset]. https://hub.arcgis.com/datasets/b7a0cc02c41a4475bbd739ed012632c8
    Explore at:
    Dataset updated
    Dec 9, 2024
    Dataset authored and provided by
    City of Bend, Oregon
    Area covered
    Description

    This dataset represents all current City of Bend business registrations. Registrations are generated using address point data from the time of submission combined with licensing information derived from City of Bend permitting software. Please note data is updated nightly and subject to change as applications are received and reviewed.*Sensitive information has been removed from all publicly shared data. Field Name Description

    OBJECTID For internal use.

    GNMasterProjectID For internal use.

    GNCommonID For internal use.

    LC_RecordID For internal use.

    BusinessNumber The tracking number for this business entity in the City of Bend licensing system.

    BusinessName The name of the business submitted by the applicant

    BusinessTypeCode The business registration type code.

    BusinessTypeDesc The business registration type description.

    BusinessStatusCode The business entity status code.

    BusinessStatusDesc The business entity status description.

    BusinessLocation The business address submitted by the applicant

    LicenseNumber The tracking number used to refer to the current year's business registration in CityView.

    LicenseStatusCode The status code for the current year's business registration.

    LicenseStatusDesc The status description for the current year's business registration.

    LicenseExpirationDate The expiration date for the current year's business registration.

    ClassCode1 NAICS (North American Industry Classification System) classification codes describing the type of good or services provided by the business

    ClassCode2 NAICS (North American Industry Classification System) classification codes describing the type of good or services provided by the business

    ClassCode3 NAICS (North American Industry Classification System) classification codes describing the type of good or services provided by the business

    ClassCode4 NAICS (North American Industry Classification System) classification codes describing the type of good or services provided by the business

    ClassDescription1 NAICS (North American Industry Classification System) classification descriptions describing the type of good or services provided by the business

    ClassDescription2 NAICS (North American Industry Classification System) classification descriptions describing the type of good or services provided by the business

    ClassDescription3 NAICS (North American Industry Classification System) classification descriptions describing the type of good or services provided by the business

    ClassDescription4 NAICS (North American Industry Classification System) classification descriptions describing the type of good or services provided by the business

    BR_MailingAddress* The business contacts mailing address submitted by the applicant

    BR_MailingCity* The business contact mailing city submitted by the applicant

    BR_MailingState* The business contact mailing state submitted by the applicant

    BR_MailingZip* The business contact mailing zip code submitted by the applicant

    BR_PhoneNumber* The business contact phone number submitted by the applicant

    BR_STR_EmergencyPhone* The short term rental business emergency contact phone number submitted by the applicant

    BR_BusinessOpenDate The business opening date submitted by the applicant.

    BR_NonProfit Identifier for non-profit business entities

    BR_HomeBased* Identifier for home based businesses

    BR_BusinessEmail* The business email contact submitted by the applicant

    STR_Type* The short term rental classification type.

    STR_LandUseNumber* The short term rental land use approval number.

    STR_LandUseApprovalDate* The short term rental land use approval date.

    LecacyID For internal use.

    LOCID For internal use.

    SITEADDID For internal use.

    TAXLOT The tax lot for the license (Please note if a license includes multiple tax lots, only one is visible in this field).

    DoingBizInBend For internal use.

    Location_Finaled For internal use.

    GlobalID For internal use.

    CREATEDATE For internal use.

    CREATEBY For internal use.

    LASTUPDATE For internal use.

    UPDATEDBY For internal use.

    InfoStatus For internal use.

    OverallStatus For internal use.

  18. C

    Mobile Food Vendors 032614

    • data.cityofchicago.org
    Updated Aug 31, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Chicago (2025). Mobile Food Vendors 032614 [Dataset]. https://data.cityofchicago.org/Community-Economic-Development/Mobile-Food-Vendors-032614/jzqw-afn2
    Explore at:
    csv, application/geo+json, kml, xlsx, kmz, xmlAvailable download formats
    Dataset updated
    Aug 31, 2025
    Authors
    City of Chicago
    Description

    This dataset contains all current and active business licenses issued by the Department of Business Affairs and Consumer Protection. This dataset contains a large number of records /rows of data and may not be viewed in full in Microsoft Excel. Therefore, when downloading the file, select CSV from the Export menu. Open the file in an ASCII text editor, such as Notepad or Wordpad, to view and search.

    Data fields requiring description are detailed below.

    APPLICATION TYPE: 'ISSUE' is the record associated with the initial license application. 'RENEW' is a subsequent renewal record. All renewal records are created with a term start date and term expiration date. 'C_LOC' is a change of location record. It means the business moved. 'C_CAPA' is a change of capacity record. Only a few license types my file this type of application. 'C_EXPA' only applies to businesses that have liquor licenses. It means the business location expanded.

    LICENSE STATUS: 'AAI' means the license was issued.

    Business license owners may be accessed at: http://data.cityofchicago.org/Community-Economic-Development/Business-Owners/ezma-pppn To identify the owner of a business, you will need the account number or legal name.

    Data Owner: Business Affairs and Consumer Protection

    Time Period: Current

    Frequency: Data is updated daily

  19. B2B Technographic Data in Morocco

    • kaggle.com
    Updated Sep 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Techsalerator (2024). B2B Technographic Data in Morocco [Dataset]. https://www.kaggle.com/datasets/techsalerator/b2b-technographic-data-in-morocco
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 13, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Techsalerator
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Area covered
    Morocco
    Description

    Techsalerator’s Business Technographic Data for Morocco: Unlocking Insights into Morocco's Technology Landscape

    Techsalerator’s Business Technographic Data for Morocco provides a detailed and comprehensive dataset essential for businesses, market analysts, and technology vendors seeking to understand and engage with companies operating within Morocco. This dataset offers in-depth insights into the technological landscape, capturing and organizing data related to technology stacks, digital tools, and IT infrastructure used by businesses in the country.

    Please reach out to us at info@techsalerator.com or visit Techsalerator Contact.

    Top 5 Most Utilized Data Fields

    • Company Name: This field lists the names of companies in Morocco, enabling technology vendors to target potential clients and allowing analysts to assess technology adoption trends within specific businesses.

    • Technology Stack: This field outlines the technologies and software solutions a company uses, such as enterprise resource planning (ERP) systems, customer relationship management (CRM) software, and cloud services. Understanding a company's technology stack is key to evaluating its digital maturity and operational needs.

    • Deployment Status: This field indicates whether the technology is currently deployed, planned for future deployment, or under evaluation. Vendors can use this information to assess the level of technology adoption and interest among companies in Morocco.

    • Industry Sector: This field specifies the industry in which the company operates, such as manufacturing, finance, or telecommunications. Knowing the industry helps vendors tailor their products to sector-specific demands and emerging trends in Morocco.

    • Geographic Location: This field identifies the company's headquarters or primary operations within Morocco. Geographic information aids in regional analysis and understanding localized technology adoption patterns across the country.

    Top 5 Technology Trends in Morocco

    • Fintech Innovations: Morocco's financial sector is experiencing a surge in fintech solutions, including digital banking, mobile payments, and blockchain technologies. These innovations are reshaping the financial landscape and enhancing financial inclusion.

    • Smart Cities and Urban Development: Morocco is investing in smart city projects that incorporate IoT (Internet of Things) technologies, data analytics, and intelligent infrastructure to improve urban living conditions and sustainability.

    • E-commerce Growth: The e-commerce sector in Morocco is expanding rapidly, driven by increased internet penetration and consumer demand for online shopping. Businesses are adopting e-commerce platforms and digital marketing strategies to capture this growing market.

    • Renewable Energy Initiatives: Morocco's commitment to renewable energy is evident in its large-scale solar and wind energy projects. Companies are increasingly adopting green technologies to align with national sustainability goals.

    • Cybersecurity Enhancements: With the rise in digital transactions and online activities, Moroccan companies are focusing on strengthening their cybersecurity measures. Investments in data protection, encryption, and threat detection are becoming more prevalent.

    Top 5 Companies with Notable Technographic Data in Morocco

    • Attijariwafa Bank: A leading bank in Morocco, Attijariwafa is implementing advanced digital banking solutions, including mobile apps and online platforms, to enhance customer experience and operational efficiency.

    • Maroc Telecom: A major telecommunications provider, Maroc Telecom is expanding its digital services portfolio, including high-speed internet and mobile connectivity, to support Morocco’s growing digital economy.

    • OCP Group: A global leader in phosphate production, OCP Group is leveraging advanced technologies in its operations, including automation, data analytics, and IoT solutions, to optimize production and logistics.

    • BMCE Bank: Known for its focus on digital transformation, BMCE Bank is investing in fintech solutions and digital platforms to improve banking services and customer engagement.

    • Hewlett Packard Enterprise Morocco: HPE Morocco is providing cutting-edge IT infrastructure solutions, including cloud services and data center technologies, to support businesses in their digital transformation journeys.

    Accessing Techsalerator’s Business Technographic Data

    For those interested in accessing Techsalerator’s Business Technographic Data for Morocco, please contact info@techsalerator.com with your specific needs. Techsalerator offers customized quotes based on the required number of data fields and records, with datasets available for delivery within 24 hours. Ongoing access options can also be arranged upon request.

    Included Data Fie...

  20. b

    Apple Statistics (2025)

    • businessofapps.com
    Updated Jul 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Business of Apps (2025). Apple Statistics (2025) [Dataset]. https://www.businessofapps.com/data/apple-statistics/
    Explore at:
    Dataset updated
    Jul 20, 2025
    Dataset authored and provided by
    Business of Apps
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Apple is one of the most influential and recognisable brands in the world, responsible for the rise of the smartphone with the iPhone. Valued at over $2 trillion in 2021, it is also the most valuable...

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Pratyush Puri (2025). Multilingual Mobile App Review Dataset August 2025 [Dataset]. https://www.kaggle.com/datasets/pratyushpuri/multilingual-mobile-app-reviews-dataset-2025
Organization logo

Multilingual Mobile App Review Dataset August 2025

2.5K reviews across 40+ apps, 24 languages, perfect for NLP & sentiment analysis

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 31, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Pratyush Puri
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Multilingual Mobile App Reviews Dataset 2025

Overview

This comprehensive synthetic dataset contains 2,514 authentic mobile app reviews spanning 40+ popular applications across 24 different languages, making it ideal for multilingual NLP, sentiment analysis, and cross-cultural user behavior research.

Dataset Statistics

  • Total Records: 2,514 reviews
  • Columns: 15 features
  • Languages Covered: 24 international languages
  • Apps Included: 40+ popular mobile applications
  • Time Range: 2023-2025 (2-year span)
  • File Format: CSV
  • Data Quality: Intentionally includes missing values and mixed data types for data cleaning practice

Column Specifications

Column NameData TypeDescriptionSample ValuesNull Count
review_idIntegerUnique identifier for each review1, 2, 3, ...0
user_idString*User identifier (should be integer)"1967825", "9242600"0
app_nameStringName of the mobile applicationWhatsApp, Instagram, TikTok0
app_categoryStringApplication categorySocial Networking, Entertainment0
review_textStringMultilingual review content"This app is amazing!"63
review_languageStringISO language codeen, es, fr, zh, hi, ar0
ratingMixed*App rating (1.0-5.0, some as strings)4.5, "3.2", 1.138
review_dateDateTimeTimestamp of review submission2024-10-09 19:26:400
verified_purchaseBooleanPurchase verification statusTrue, False0
device_typeStringDevice platformAndroid, iOS, iPad, Windows Phone0
num_helpful_votesMixed*Helpfulness votes (some as strings)65, "209", 1630
user_ageFloat*User age (should be integer)14.0, 18.0, 67.00
user_countryStringUser's countryChina, Germany, Nigeria50
user_genderStringUser genderMale, Female, Non-binary, Prefer not to say88
app_versionStringApplication version number1.4, v8.9, 2.8.37.592625

Note: Data types marked with asterisk require cleaning/conversion

Language Distribution

The dataset includes reviews in 24 languages: - European: English (en), Spanish (es), French (fr), German (de), Italian (it), Russian (ru), Polish (pl), Dutch (nl), Swedish (sv), Danish (da), Norwegian (no), Finnish (fi) - Asian: Chinese (zh), Hindi (hi), Japanese (ja), Korean (ko), Thai (th), Vietnamese (vi), Indonesian (id), Malay (ms) - Other: Arabic (ar), Turkish (tr), Filipino (tl)

Application Categories

Reviews cover 18 distinct categories: - Social Networking - Entertainment
- Productivity - Travel & Local - Music & Audio - Video Players & Editors - Shopping - Navigation - Finance - Communication - Education - Photography - Dating - Business - Utilities - Health & Fitness - Games - News & Magazines

Popular Apps Included

40+ applications including: - Social: WhatsApp, Instagram, Facebook, Snapchat, TikTok, LinkedIn, Twitter, Reddit, Pinterest - Entertainment: YouTube, Netflix, Spotify - Productivity: Microsoft Office, Google Drive, Dropbox, OneDrive, Zoom, Discord - Travel: Uber, Lyft, Airbnb, Booking.com, Google Maps, Waze - Finance: PayPal, Venmo - Education: Duolingo, Khan Academy, Coursera, Udemy - Tools: Grammarly, Canva, Adobe Photoshop, VLC, MX Player

Geographic Distribution

Reviews from 24 countries across all continents: - Asia: China, India, Japan, South Korea, Thailand, Vietnam, Indonesia, Malaysia, Philippines, Pakistan, Bangladesh - Europe: Germany, United Kingdom, France, Italy, Spain, Russia, Turkey, Poland - Americas: United States, Canada, Brazil, Mexico - Oceania: Australia - Africa: Nigeria

Data Quality Features

Intentional data challenges for learning: - Missing Values: Strategic nulls in review_text (63), rating (38), user_country (50), user_gender (88), app_version (25) - Data Type Issues: - user_id stored as strings (should be integers) - user_age as floats (should be integers)
- Some ratings as strings (should be floats) - Some helpful_votes as strings (should be integers) - Mixed Version Formats: "1.4", "v8.9", "2.8.37.5926", "14.1.60.318-beta"

Use Cases

This dataset is perfect for: - Multilingual NLP projects and sentiment analysis - Cross-cultural user behavior analysis - App store analytics and rating prediction - Data cleaning and preprocessing practice - Text classification across multiple languages - Time series analysis of app reviews - Geographic sentiment analysis - Data engineering pipeline development

Data Cleaning Opportunities

  • Convert string IDs to integers
  • Standardize rating values to float
  • Han...
Search
Clear search
Close search
Google apps
Main menu