24 datasets found
  1. Play Store Apps

    • kaggle.com
    Updated Sep 16, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aman Chauhan (2022). Play Store Apps [Dataset]. https://www.kaggle.com/datasets/whenamancodes/play-store-apps
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 16, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aman Chauhan
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix-like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern-day techniques (like dynamic page load) using JQuery making scraping more challenging.

    Each app (row) has values for catergory, rating, size, and more.

    The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!

    googleplaystore.csv

    ColumnsDescription
    AppApplication name
    CategoryCategory the app belongs to
    RatingsOverall user rating of the app (as when scraped)
    ReviewsNumber of user reviews for the app (as when scraped)
    SizeSize of the app (as when scraped)
    InstallsNumber of user downloads/installs for the app (as when scraped)
    TypePaid or Free
    PricePrice of the app (as when scraped)
    Content RatingAge group the app is targeted at - Children / Mature 21+ / Adult
    GenreAn app can belong to multiple genres (apart from its main category). For eg, a musical family game will belong to
    Current VerCurrent version of the app available on Play Store (as when scraped)
    Android VerMin required Android version (as when scraped)

    googleplaystore_user_reviews.csv

    ColumnsDescription
    AppName of app
    Translated ReviewsUser review (Preprocessed and translated to English)
    SentimentPositive/Negative/Neutral (Preprocessed)
    Sentiment_polaritySentiment polarity score
    Sentiment_subjectivitySentiment subjectivity score

    More - Find More Exciting🙀 Datasets Here - An Upvote👍 A Dayᕙ(`▿´)ᕗ , Keeps Aman Hurray Hurray..... ٩(˘◡˘)۶Haha

  2. Average daily time spent on social media worldwide 2012-2024

    • statista.com
    • wwwexpressvpn.online
    • +1more
    Updated Apr 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Average daily time spent on social media worldwide 2012-2024 [Dataset]. https://www.statista.com/statistics/433871/daily-social-media-usage-worldwide/
    Explore at:
    Dataset updated
    Apr 10, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    How much time do people spend on social media? As of 2024, the average daily social media usage of internet users worldwide amounted to 143 minutes per day, down from 151 minutes in the previous year. Currently, the country with the most time spent on social media per day is Brazil, with online users spending an average of three hours and 49 minutes on social media each day. In comparison, the daily time spent with social media in the U.S. was just two hours and 16 minutes. Global social media usageCurrently, the global social network penetration rate is 62.3 percent. Northern Europe had an 81.7 percent social media penetration rate, topping the ranking of global social media usage by region. Eastern and Middle Africa closed the ranking with 10.1 and 9.6 percent usage reach, respectively. People access social media for a variety of reasons. Users like to find funny or entertaining content and enjoy sharing photos and videos with friends, but mainly use social media to stay in touch with current events friends. Global impact of social mediaSocial media has a wide-reaching and significant impact on not only online activities but also offline behavior and life in general. During a global online user survey in February 2019, a significant share of respondents stated that social media had increased their access to information, ease of communication, and freedom of expression. On the flip side, respondents also felt that social media had worsened their personal privacy, increased a polarization in politics and heightened everyday distractions.

  3. Z

    Coronavirus-themed Mobile Apps (Malware) Dataset

    • data.niaid.nih.gov
    • zenodo.org
    Updated Apr 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    covid19apps (2021). Coronavirus-themed Mobile Apps (Malware) Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3875975
    Explore at:
    Dataset updated
    Apr 21, 2021
    Dataset authored and provided by
    covid19apps
    Description

    As COVID-19 continues to spread across the world, a growing number of malicious campaigns are exploiting the pandemic. It is reported that COVID-19 is being used in a variety of online malicious activities, including Email scam, ransomware and malicious domains. As the number of the afflicted cases continue to surge, malicious campaigns that use coronavirus as a lure are increasing. Malicious developers take advantage of this opportunity to lure mobile users to download and install malicious apps.

    However, besides a few media reports, the coronavirus-themed mobile malware has not been well studied. Our community lacks of the comprehensive understanding of the landscape of the coronavirus-themed mobile malware, and no accessible dataset could be used by our researchers to boost COVID-19 related cybersecurity studies.

    We make efforts to create a daily growing COVID-19 related mobile app dataset. By the time of mid-November, we have curated a dataset of 4,322 COVID-19 themed apps, and 611 of them are considered to be malicious. The number is growing daily and our dataset will update weekly. For more details, please visit https://covid19apps.github.io

    This dataset includes the following files:

    (1) covid19apps.xlsx

    In this file, we list all the COVID-19 themed apps information, including apk file hashes, released date, package name, AV-Rank, etc.

    (2)covid19apps.zip

    We put the COVID-19 themed apps Apk samples in zip files . In order to reduce the size of a single file, we divide the sample into multiple zip files for storage. And the APK file name after the file SHA256.

    If your papers or articles use our dataset, please use the following bibtex reference to cite our paper: https://arxiv.org/abs/2005.14619

    (Accepted to Empirical Software Engineering)

    @misc{wang2021virus, title={Beyond the Virus: A First Look at Coronavirus-themed Mobile Malware}, author={Liu Wang and Ren He and Haoyu Wang and Pengcheng Xia and Yuanchun Li and Lei Wu and Yajin Zhou and Xiapu Luo and Yulei Sui and Yao Guo and Guoai Xu}, year={2021}, eprint={2005.14619}, archivePrefix={arXiv}, primaryClass={cs.CR} }

  4. a

    Daily Reservoir Storage and Statistics for Selected Reclamation Reservoirs

    • azgeo-data-hub-agic.hub.arcgis.com
    • rise-usbr.opendata.arcgis.com
    • +2more
    Updated Aug 15, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reclamation_Public (2021). Daily Reservoir Storage and Statistics for Selected Reclamation Reservoirs [Dataset]. https://azgeo-data-hub-agic.hub.arcgis.com/datasets/3ee1b2d5ebc0435583bdb5e30e51f01b
    Explore at:
    Dataset updated
    Aug 15, 2021
    Dataset authored and provided by
    Reclamation_Public
    Area covered
    Description

    This dataset contains daily reservoir storage data and statistics for a selected set of Reclamation reservoirs and reservoir systems. Reservoirs were chosen to include a selection of operationally significant reservoirs or reservoir systems in each Reclamation Region. Reservoir storage values for each included reservoir are updated daily from the water operations database of the Reclamation office that manages the reservoir.This dataset was created for use in the Reservoir Storage Dashboard, located at https://usbr.maps.arcgis.com/apps/dashboards/81aaec3e74024ce6b9a5e50caa20984e.More information about the dashboard and data can be found at https://data.usbr.gov/visualizations/reservoir-conditions/RISE Catalog Item 78633: https://data.usbr.gov/catalog/7964/item/78633To download data, please use the RISE Geospatial Open Data site: https://rise-usbr.opendata.arcgis.com/datasets/3ee1b2d5ebc0435583bdb5e30e51f01b

  5. mac-app-store-apps-release-notes

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MacPaw Inc., mac-app-store-apps-release-notes [Dataset]. https://huggingface.co/datasets/MacPaw/mac-app-store-apps-release-notes
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset provided by
    CleanMyMac
    Authors
    MacPaw Inc.
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for Macappstore Applications Release Notes

    Mac App Store Applications release notes extracted from the metadata from the public API.

    Curated by: MacPaw Inc.

    Language(s) (NLP): Mostly EN, DE License: MIT

      Dataset Details
    

    This dataset is a combined and refined Mac App Store Applications Metadata dataset subset. The main idea behind its creation is to separate the release notes texts of the macOS apps for the convenience of further analysis.… See the full description on the dataset page: https://huggingface.co/datasets/MacPaw/mac-app-store-apps-release-notes.

  6. u

    S3 Dataset

    • portalinvestigacion.um.es
    • figshare.com
    Updated 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    López, Juan Manuel Espín; Celdrán, Alberto Huertas; Marín-Blázquez, Javier G.; Martínez, Francisco Esquembre; Pérez, Gregorio Martínez; López, Juan Manuel Espín; Celdrán, Alberto Huertas; Marín-Blázquez, Javier G.; Martínez, Francisco Esquembre; Pérez, Gregorio Martínez (2021). S3 Dataset [Dataset]. https://portalinvestigacion.um.es/documentos/668fc48db9e7c03b01be0de8?lang=de
    Explore at:
    Dataset updated
    2021
    Authors
    López, Juan Manuel Espín; Celdrán, Alberto Huertas; Marín-Blázquez, Javier G.; Martínez, Francisco Esquembre; Pérez, Gregorio Martínez; López, Juan Manuel Espín; Celdrán, Alberto Huertas; Marín-Blázquez, Javier G.; Martínez, Francisco Esquembre; Pérez, Gregorio Martínez
    Description

    The S3 dataset contains the behavior (sensors, statistics of applications, and voice) of 21 volunteers interacting with their smartphones for more than 60 days. The type of users is diverse, males and females in the age range from 18 until 70 have been considered in the dataset generation. The wide range of age is a key aspect, due to the impact of age in terms of smartphone usage. To generate the dataset the volunteers installed a prototype of the smartphone application in on their Android mobile phones.
    All attributes of the different kinds of data are writed in a vector. The dataset contains the fellow vectors:
    Sensors:
    This type of vector contains data belonging to smartphone sensors (accelerometer and gyroscope) that has been acquired in a given windows of time. Each vector is obtained every 20 seconds, and the monitored features are:- Average of accelerometer and gyroscope values.- Maximum and minimum of accelerometer and gyroscope values.- Variance of accelerometer and gyroscope values.- Peak-to-peak (max-min) of X, Y, Z coordinates.- Magnitude for gyroscope and accelerometer.

    Statistics:
    These vectors contain data about the different applications used by the user recently. Each vector of statistics is calculated every 60 seconds and contains : - Foreground application counters (number of different and total apps) for the last minute and the last day.- Most common app ID and the number of usages in the last minute and the last day. - ID of the currently active app. - ID of the last active app prior to the current one.- ID of the application most frequently utilized prior to the current application. - Bytes transmitted and received through the network interfaces.

    Voice:
    This kind of vector is generated when the microphone is active in a call o voice note. The speaker vector is an embedding, extracted from the audio, and it contains information about the user's identity. This vector, is usually named "x-vector" in the Speaker Recognition field, and it is calculated following the steps detailed in "egs/sitw/v2" for the Kaldi library, with the models available for the extraction of the embedding.


    A summary of the details of the collected database.
    - Users: 21 - Sensors vectors: 417.128 - Statistics app's usage vectors: 151.034 - Speaker vectors: 2.720 - Call recordings: 629 - Voice messages: 2.091

  7. mac-app-store-apps-descriptions

    • huggingface.co
    Updated Sep 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mac-app-store-apps-descriptions [Dataset]. https://huggingface.co/datasets/MacPaw/mac-app-store-apps-descriptions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 25, 2024
    Dataset provided by
    CleanMyMac
    Authors
    MacPaw Inc.
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for Macappstore Applications Descriptions

    Mac App Store Applications descriptions extracted from the metadata from the public API.

    Curated by: MacPaw Inc.

    Language(s) (NLP): Mostly EN, DE License: MIT

      Dataset Details
    

    This dataset is a combined and refined Mac App Store Applications Metadata dataset subset. The main idea behind its creation is to separate the description texts of the macOS apps for the convenience of further analysis.… See the full description on the dataset page: https://huggingface.co/datasets/MacPaw/mac-app-store-apps-descriptions.

  8. S

    documents

    • health.data.ny.gov
    application/rdfxml +5
    Updated Mar 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New York State Department of Health (2025). documents [Dataset]. https://health.data.ny.gov/Health/documents/535z-ycei
    Explore at:
    csv, tsv, xml, json, application/rdfxml, application/rssxmlAvailable download formats
    Dataset updated
    Mar 22, 2025
    Authors
    New York State Department of Health
    Description

    This data includes the name and location of active food service establishments and the violations that were found at the time of the inspection. Active food service establishments include only establishments that are currently operating. This dataset excludes inspections conducted in New York City (https://data.cityofnewyork.us/Health/Restaurant-Inspection-Results/4vkw-7nck), Suffolk County (http://apps.suffolkcountyny.gov/health/Restaurant/intro.html) and Erie County (http://www.healthspace.com/erieny). Inspections are a “snapshot” in time and are not always reflective of the day-to-day operations and overall condition of an establishment. Occasionally, remediation may not appear until the following month due to the timing of the updates. Update frequencies and availability of historical inspection data may vary from county to county. Some counties provide this information on their own websites and information found there may be updated more frequently. This dataset is refreshed on a monthly basis. The inspection data contained in this dataset was not collected in a manner intended for use as a restaurant grading system, and should not be construed or interpreted as such. Any use of this data to develop a restaurant grading system is not supported or endorsed by the New York State Department of Health. For more information, visit http://www.health.ny.gov/regulations/nycrr/title_10/part_14/subpart_14-1.htm or go to the “About” tab.

  9. Data from: Novel Corona Virus 2019 Dataset

    • kaggle.com
    zip
    Updated Jan 30, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SRK (2020). Novel Corona Virus 2019 Dataset [Dataset]. https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset
    Explore at:
    zip(3155 bytes)Available download formats
    Dataset updated
    Jan 30, 2020
    Authors
    SRK
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    From World Health Organization - On 31 December 2019, WHO was alerted to several cases of pneumonia in Wuhan City, Hubei Province of China. The virus did not match any other known virus. This raised concern because when a virus is new, we do not know how it affects people.

    So daily level information on the affected people can give some interesting insights when it is made available to the broader data science community.

    Johns Hopkins University has made an excellent dashboard using the affected cases data. This data is extracted from the same link and made available in csv format.

    Content

    2019 Novel Coronavirus (2019-nCoV) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness first detected in Wuhan, China. Early on, many of the patients in the outbreak in Wuhan, China reportedly had some link to a large seafood and animal market, suggesting animal-to-person spread. However, a growing number of patients reportedly have not had exposure to animal markets, indicating person-to-person spread is occurring. At this time, it’s unclear how easily or sustainably this virus is spreading between people - CDC

    This dataset has daily level information on the number of affected cases, deaths and recovery from 2019 novel coronavirus.

    The data is available from 22 Jan 2020.

    Acknowledgements

    Johns Hopkins university has made the data available in google sheets format here. Sincere thanks to them.

    Thanks to WHO, CDC, NHC and DXY for making the data available in first place.

    Picture courtesy : Johns Hopkins University dashboard

    Inspiration

    Some insights could be

    1. Changes in number of affected cases over time
    2. Change in cases over time at country level
    3. Latest number of affected cases
  10. Workout & Fitness Tracker Dataset

    • kaggle.com
    Updated Feb 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adil Shamim (2025). Workout & Fitness Tracker Dataset [Dataset]. https://www.kaggle.com/datasets/adilshamim8/workout-and-fitness-tracker-data/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 8, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Adil Shamim
    Description

    Workout & Fitness Tracker Dataset

    📌 Overview

    This dataset contains 10,000+ records of workout and fitness-related data collected from various fitness apps and devices. It is designed to help analyze and predict workout efficiency based on user activity, health metrics, and lifestyle factors.

    📊 Dataset Features

    Column NameDescription
    User IDUnique identifier for each user
    AgeUser’s age (18-60 years)
    GenderMale, Female, Other
    Height (cm)User’s height in centimeters
    Weight (kg)User’s weight in kilograms
    Workout TypeType of workout (Cardio, Strength, Yoga, HIIT, Cycling, Running)
    Workout Duration (mins)Total time spent in workout
    Calories BurnedTotal calories burned during workout
    Heart Rate (bpm)Average heart rate during the workout
    Steps TakenNumber of steps recorded (for walking/running workouts)
    Distance (km)Distance covered in kilometers
    Workout IntensityLow, Medium, High
    Sleep HoursHours of sleep before the workout
    Water Intake (liters)Water consumed in liters
    Daily Calories IntakeTotal calories consumed in a day
    Resting Heart Rate (bpm)Heart rate when at rest
    VO2 MaxOxygen consumption capacity (indicator of cardiovascular fitness)
    Body Fat (%)Estimated body fat percentage
    Mood Before WorkoutMood before the workout (Happy, Neutral, Tired, Stressed)
    Mood After WorkoutMood after the workout (Energized, Neutral, Fatigued)

    🏋️ Potential Use Cases

    • Predicting Workout Efficiency based on different metrics
    • Analyzing the impact of sleep and nutrition on workout performance
    • Finding correlations between heart rate, workout type, and calories burned
    • Developing AI/ML models to suggest personalized workout plans
    • Tracking fitness habits and their effect on mood
  11. m

    Decadal time series of spatially enhanced relative humidity for Europe at...

    • data.mundialis.de
    • data.opendatascience.eu
    • +2more
    Updated Mar 19, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Decadal time series of spatially enhanced relative humidity for Europe at 1000 m resolution (2000 - 2021) derived from ERA5-Land data [Dataset]. https://data.mundialis.de/geonetwork/srv/search?keyword=CHELSA
    Explore at:
    Dataset updated
    Mar 19, 2022
    Area covered
    Europe
    Description

    Overview: ERA5-Land is a reanalysis dataset providing a consistent view of the evolution of land variables over several decades at an enhanced resolution compared to ERA5. ERA5-Land has been produced by replaying the land component of the ECMWF ERA5 climate reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. Reanalysis produces data that goes several decades back in time, providing an accurate description of the climate of the past. Processing steps: The original hourly ERA5-Land air temperature 2 m above ground and dewpoint temperature 2 m data has been spatially enhanced from 0.1 degree to 30 arc seconds (approx. 1000 m) spatial resolution by image fusion with CHELSA data (V1.2) (https://chelsa-climate.org/). For each day we used the corresponding monthly long-term average of CHELSA. The aim was to use the fine spatial detail of CHELSA and at the same time preserve the general regional pattern and fine temporal detail of ERA5-Land. The steps included aggregation and enhancement, specifically: 1. spatially aggregate CHELSA to the resolution of ERA5-Land 2. calculate difference of ERA5-Land - aggregated CHELSA 3. interpolate differences with a Gaussian filter to 30 arc seconds. 4. add the interpolated differences to CHELSA Subsequently, the temperature time series have been aggregated on a daily basis. From these, daily relative humidity has been calculated for the time period 01/2000 - 07/2021. Relative humidity (rh2m) has been calculated from air temperature 2 m above ground (Ta) and dewpoint temperature 2 m above ground (Td) using the formula for saturated water pressure from Wright (1997): maximum water pressure = 611.21 * exp(17.502 * Ta / (240.97 + Ta)) actual water pressure = 611.21 * exp(17.502 * Td / (240.97 + Td)) relative humidity = actual water pressure / maximum water pressure The resulting relative humidity has been aggregated to decadal averages. Each month is divided into three decades: the first decade of a month covers days 1-10, the second decade covers days 11-20, and the third decade covers days 21-last day of the month. Resultant values have been converted to represent percent * 10, thus covering a theoretical range of [0, 1000]. The data have been reprojected to EU LAEA. File naming scheme (YYYY = year; MM = month; dD = number of decade): ERA5_land_rh2m_avg_decadal_YYYY_MM_dD.tif Projection + EPSG code: EU LAEA (EPSG: 3035) Spatial extent: north: 6874000 south: -485000 west: 869000 east: 8712000 Spatial resolution: 1000 m Temporal resolution: Decadal Pixel values: Percent * 10 (scaled to Integer; example: value 738 = 73.8 %) Software used: GDAL 3.2.2 and GRASS GIS 8.0.0 Original ERA5-Land dataset license: https://apps.ecmwf.int/datasets/licences/copernicus/ CHELSA climatologies (V1.2): Data used: Karger D.N., Conrad, O., Böhner, J., Kawohl, T., Kreft, H., Soria-Auza, R.W., Zimmermann, N.E, Linder, H.P., Kessler, M. (2018): Data from: Climatologies at high resolution for the earth's land surface areas. Dryad digital repository. http://dx.doi.org/doi:10.5061/dryad.kd1d4 Original peer-reviewed publication: Karger, D.N., Conrad, O., Böhner, J., Kawohl, T., Kreft, H., Soria-Auza, R.W., Zimmermann, N.E., Linder, P., Kessler, M. (2017): Climatologies at high resolution for the Earth land surface areas. Scientific Data. 4 170122. https://doi.org/10.1038/sdata.2017.122 Processed by: mundialis GmbH & Co. KG, Germany (https://www.mundialis.de/) Reference: Wright, J.M. (1997): Federal meteorological handbook no. 3 (FCM-H3-1997). Office of Federal Coordinator for Meteorological Services and Supporting Research. Washington, DC Acknowledgements: This study was partially funded by EU grant 874850 MOOD. The contents of this publication are the sole responsibility of the authors and don't necessarily reflect the views of the European Commission.

  12. S

    Data from: erie county

    • health.data.ny.gov
    Updated Mar 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    New York State Department of Health (2025). erie county [Dataset]. https://health.data.ny.gov/Health/erie-county/2d3i-gtsm
    Explore at:
    csv, xml, application/rdfxml, tsv, application/rssxml, kml, application/geo+json, kmzAvailable download formats
    Dataset updated
    Mar 26, 2025
    Authors
    New York State Department of Health
    Area covered
    Erie County
    Description

    This data includes the name and location of food service establishments and the violations that were found at the time of their last inspection. This dataset excludes inspections conducted in New York City (see: https://nycopendata.socrata.com/), Suffolk County (http://apps.suffolkcountyny.gov/health/Restaurant/intro.html) and Erie County. Inspections are a “snapshot” in time and are not always reflective of the day-to-day operations and overall condition of an establishment. Occasionally, remediation may not appear until the following month due to the timing of the updates. Some counties provide this information on their own websites and information found there may be updated more frequently. This dataset is refreshed on a monthly basis.

    Last inspection data is the most recently submitted and available data.

    For more information, check out http://www.health.ny.gov/regulations/nycrr/title_10/part_14/subpart_14-1.htm, or go to the "About" tab.

  13. a

    Data tables for Public COVID-19 Maps

    • communautaire-esrica-apps.hub.arcgis.com
    • open.ottawa.ca
    • +2more
    Updated Sep 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Ottawa (2020). Data tables for Public COVID-19 Maps [Dataset]. https://communautaire-esrica-apps.hub.arcgis.com/datasets/ae347819064d45489ed732306f959a7e
    Explore at:
    Dataset updated
    Sep 8, 2020
    Dataset authored and provided by
    City of Ottawa
    License

    https://ottawa.ca/en/city-hall/get-know-your-city/open-data#open-data-licence-version-2-0https://ottawa.ca/en/city-hall/get-know-your-city/open-data#open-data-licence-version-2-0

    Description

    Rates of confirmed COVID-19 in Ottawa Wards, excluding LTC and RH cases, and number of cases in LTCH and RH in Ottawa Wards. Data are provided for all cases (i.e. cumulative), cases reported within 30 days of the data pull (i.e. last 30 days), and cases reported within 14 days of the data pull (i.e. last 14 days). Based on the most up to date information available at 2pm from the COVID-19 Ottawa Database (The COD) on the day prior to publication.Rates of confirmed COVID-19 in Ottawa Wards, excluding LTC and RH cases, and number of cases in LTCH and RH in Ottawa Wards. Data are provided for all cases (i.e. cumulative), cases reported within 30 days of the data pull (i.e. last 30 days), and cases reported within 14 days of the data pull (i.e. last 14 days). Based on the most up to date information available at 2pm from the COVID-19 Ottawa Database (The COD) on the day prior to publication. You can see the map on Ottawa Public Health's website.Accuracy: Points of consideration for interpretation of the data:Data extracted by Ottawa Public Health at 2pm from the COVID-19 Ottawa Database (The COD) on May 12th, 2020. The COD is a dynamic disease reporting system that allow for continuous updates of case information. These data are a snapshot in time, reflect the most accurate information that OPH has at the time of reporting, and the numbers may differ from other sources. Cases are assigned to Ward geography based on their postal code and Statistics’ Canada’s enhanced postal code conversion file (PCCF+) released in January 2020. Most postal codes have multiple geographic coordinates linked to them. Thus, when available, postal codes were attributed to a XY coordinates based on the Single Link Identifier provided by Statistics’ Canada’s PCCF+. Otherwise, postal codes that fall within the municipal boundaries but whose SLI doesn’t, were attributed to the first XY coordinates within Ottawa listed in the PCCF+. For this reason, results for rural areas should be interpreted with caution as attribution to XY coordinates is less likely to be based on an SLI and rural postal codes typically encompass a much greater surface area than urban postal codes (e.i. greater variability in geographic attribution, less precision in geographic attribution). Population estimates are based on the 2016 Census. Rates calculated from very low case numbers are unstable and should be interpreted with caution. Low case counts have very wide 95% confidence intervals, which are the lower and upper limit within which the true rate lies 95% of the time. A narrow confidence interval leads to a more precise estimate and a wider confidence interval leads to a less precise estimate. In other words, rates calculated from very low case numbers fluctuate so much that we cannot use them to compare different areas or make predictions over time.Update Frequency: Biweekly Attributes:Ward Number – numberWard Name – textCumulative rate (per 100 000 population), excluding cases linked to outbreaks in LTCH and RH – cumulative number of residents with confirmed COVID-19 in a Ward, excluding those linked to outbreaks in LTCH and RH, divided by the total population of that WardCumulative number of cases, excluding cases linked to outbreaks in LTCH and RH - cumulative number of residents with confirmed COVID-19 in a Ward, excluding cases linked to outbreaks in LTCH and RHCumulative number of cases linked to outbreaks in LTCH and RH - Number of residents with confirmed COVID-19 linked to an outbreak in a long-term care home or retirement home by WardRate (per 100 000 population) in the last 30 days, excluding cases linked to outbreaks in LTCH and RH –number of residents with confirmed COVID-19 in a Ward reported in the 30 days prior to the data pull, excluding those linked to outbreaks in LTCH and RH, divided by the total population of that WardNumber of cases in the last 30 days, excluding cases linked to outbreaks in LTCH and RH - cumulative number of residents with confirmed COVID-19 in a Ward reported in the 30 days prior to the data pull, excluding cases linked to outbreaks in LTCH and RHNumber of cases in the last 30 days linked to outbreaks in LTCH and RH - Number of residents with confirmed COVID-19, reported in the 30 days prior to the data pull, linked to an outbreak in a long-term care home or retirement home by WardRate (per 100 000 population) in the last 14 days, excluding cases linked to outbreaks in LTCH and RH –number of residents with confirmed COVID-19 in a Ward reported in the 30 days prior to the data pull, excluding those linked to outbreaks in LTCH and RH, divided by the total population of that WardNumber of cases in the last 14 days, excluding cases linked to outbreaks in LTCH and RH - cumulative number of residents with confirmed COVID-19 in a Ward reported in the 30 days prior to the data pull, excluding cases linked to outbreaks in LTCH and RHContact: OPH Epidemiology Team

  14. UiPad

    • huggingface.co
    Updated Oct 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MacPaw Inc. (2024). UiPad [Dataset]. https://huggingface.co/datasets/MacPaw/UiPad
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 2, 2024
    Dataset provided by
    CleanMyMac
    MacPaw
    Authors
    MacPaw Inc.
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    UiPad - UI Parsing and Accessibility Dataset

    Curated by: MacPaw Inc. Language(s): Mostly EN, UA License: MIT

    Overview UiPad is a dataset created for the IASA Champ 2024 Challenge, focusing on the accessibility and interface understanding of MacOS applications. With growing interest in AI-driven user interface analysis, the dataset aims to bridge the gap in available resources for desktop app accessibility. While mobile apps and web platforms benefit from datasets like RICO and… See the full description on the dataset page: https://huggingface.co/datasets/MacPaw/UiPad.

  15. f

    Precipitation Flux - AgERA5 (Global - Daily - ~10km)

    • data.apps.fao.org
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Precipitation Flux - AgERA5 (Global - Daily - ~10km) [Dataset]. https://data.apps.fao.org/map/catalog/srv/resources/datasets/0c1da7aa-0775-46e8-985b-979c5b5ce995
    Explore at:
    Dataset updated
    Jul 22, 2024
    Description

    Total volume of liquid water (mm3) precipitated over the period 00h-24h local time per unit of area (mm2), per day. Unit: mm day-1. The Precipitation flux variable is part of the Agrometeorological indicators dataset produced by the European Centre for Medium-Range Weather Forecasts (ECMWF) through the Copernicus Climate Change Service (C3S). The Agrometeorological indicators dataset provides daily surface meteorological data for the period from 1979 to present as input for agriculture and agro-ecological studies. This dataset is based on the hourly ECMWF ERA5 data at surface level and is referred to as AgERA5. References: https://doi.org/10.24381/cds.6c68c9bb The Copernicus Climate Change Service (C3S) aims to combine observations of the climate system with the latest science to develop authoritative, quality-assured information about the past, current and future states of the climate in Europe and worldwide. ECMWF operates the Copernicus Climate Change Service on behalf of the European Union and will bring together expertise from across Europe to deliver the service.

  16. d

    Investigating the Educational Potential of Touchscreen Apps for Children's...

    • b2find.dkrz.de
    Updated Feb 12, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Investigating the Educational Potential of Touchscreen Apps for Children's Early Vocabulary Acquisition, 2018-2021 - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/02b98d68-c5c3-50ea-beaf-66e7d33fa4e5
    Explore at:
    Dataset updated
    Feb 12, 2018
    Description

    Touchscreen apps for preschool age children have the potential to teach children valuable skills including language. However, there is limited empirical data on the educational potential of children's touchscreen apps. Furthermore, selecting an educational app from the app marketplace can also be challenging. This data collection developed tools to assess the educational potential of touchscreen apps for preschool age children and assesses the educational potential and language content of apps available in the app marketplace. In addition, children's behavioural interaction and learning from apps were coded.Children are growing up in an increasingly digital age, surrounded by digital media in the home, nursery and at school. Children's language development is strongly related to the language that they hear in their environment (Hart & Risley, 1995) and research suggests that parental screen media use reduces the quality and amount of language spoken to their children (Christakis, Gilkerson, Richards, Zimmerman, Garrison et al., 2009; Kirkorian, Pempek, Murphy, Schmidt, & Anderson, 2009; Pempek, Kirkorian, & Anderson, 2014; Radesky, Silverstein, Zuckerman, & Christakis, 2014). At the same time, screen media has the potential to provide a valuable source of language input and educational entertainment for young children. The American Academy of Paediatrics (AAP) have recently revised their recommendations for children's screen media use to account for this educational potential (AAP, 2016). The AAP encourage children aged 2-5 years to use screen media but restrict their recommendations to high quality educational screen media which should be used alongside their parents and caregivers and for less than 1 hour per day (AAP, 2016). The question remains however, what constitutes educational screen media for young children? For touchscreen apps, a recent review paper highlighted that apps that promote active, engaged, meaningful and socially interactive learning have the potential to educate young children (Hirsh-Pasek, Zosh, Golinkoff, Gray, Robb & Kaufman, 2015). While 72% of apps aimed at children are classed as "educational" (Shuler, Levine & Ree, 2012), there is little research evidence to back up or contradict these claims. Clearer evidence and guidelines for children's educational apps would provide invaluable knowledge for caregivers, early years practitioners and children's app developers. The aim of this project is to investigate children's language learning apps to develop a scientific understanding of the app marketplace and to apply developmental theories of learning, memory and language acquisition to the development of educational touchscreen apps. The proposed project will combine a systematic review of the children's educational app marketplace with a series of empirical studies to explore how children learn language from touchscreen apps and digital media. The first phase of this project, a systematic review of the app marketplace, will to determine the educational potential for apps currently available to children using theories of language acquisition. These findings will then guide a series of empirical studies investigating children's app interaction and language learning outcomes. Furthermore, we will determine the role of caregiver interaction during children's app use on children's language learning outcomes to provide evidence on the AAP (2016) recommendation for parental co-use during children's screen media use. Across these studies, this project will advance our understanding of educational touchscreen apps designed to teach children language by providing evidence-based guidelines for touchscreen apps and contributing to the development of an evidence based word learning app. This project will also make important theoretical contributions to theories of word learning by incorporating evidence for word learning from digital media and as a result promote the development of evidence-based educational screen media for young children. Apps were selected from the top 10 lists in the app stores and via two websites that rate apps. Educational app content was analysed using two complementary tools: (1) a questionnaire for evaluating the educational potential of apps and (2) coding criteria for quantifying the app features (see Kolak et al., 2020). Language in apps were transcribed and analysed according to five psycholinguistic measures and types of grammatical construction. 2-4 year old children were tested on a number of different experimental word learning apps developed as part of this project. Children's behavioural interaction, word learning and visual attention were recorded.

  17. f

    Reference Evapotranspiration - AgERA5 derived (Global - Daily - ~10km)

    • data.apps.fao.org
    • data.amerigeoss.org
    Updated Jun 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Reference Evapotranspiration - AgERA5 derived (Global - Daily - ~10km) [Dataset]. https://data.apps.fao.org/map/catalog/srv/search?resolution=10%20km
    Explore at:
    Dataset updated
    Jun 11, 2024
    Description

    Reference evapotranspiration per day with a spatial resolution of 0.1 degree. Unit: mm day-1. The dataset contains daily values for global land areas, excluding Antarctica, since 1979. The dataset has been prepared according to the FAO Penman - Monteith method as described in FAO Irrigation and Drainage Paper 56. The input variables are part of the Agrometeorological indicators dataset produced by the European Centre for Medium-Range Weather Forecasts (ECMWF) through the Copernicus Climate Change Service (C3S). The Agrometeorological indicators dataset provides daily surface meteorological data for the period from 1979 to present as input for agriculture and agro-ecological studies. This dataset is based on the hourly ECMWF ERA5 data at surface level and is referred to as AgERA5. References: https://doi.org/10.24381/cds.6c68c9bb The Copernicus Climate Change Service (C3S) aims to combine observations of the climate system with the latest science to develop authoritative, quality-assured information about the past, current and future states of the climate in Europe and worldwide. ECMWF operates the Copernicus Climate Change Service on behalf of the European Union and will bring together expertise from across Europe to deliver the service.

  18. f

    Relative Humidity at 12h local time - AgERA5 (Global - Daily - ~10km)

    • data.apps.fao.org
    Updated Mar 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Relative Humidity at 12h local time - AgERA5 (Global - Daily - ~10km) [Dataset]. https://data.apps.fao.org/map/catalog/srv/resources/datasets/7cb8355c-e935-406b-bd02-f48f0ed4543c
    Explore at:
    Dataset updated
    Mar 28, 2024
    Description

    Relative humidity at 12h (local time) at a height of 2 metres above the surface. This variable describes the amount of water vapour present in air expressed as a percentage of the amount needed for saturation at the same temperature. Unit: %. The Relative humidity variable is part of the Agrometeorological indicators dataset produced by the European Centre for Medium-Range Weather Forecasts (ECMWF) through the Copernicus Climate Change Service (C3S). The Agrometeorological indicators dataset provides daily surface meteorological data for the period from 1979 to present as input for agriculture and agro-ecological studies. This dataset is based on the hourly ECMWF ERA5 data at surface level and is referred to as AgERA5. References: https://doi.org/10.24381/cds.6c68c9bb The Copernicus Climate Change Service (C3S) aims to combine observations of the climate system with the latest science to develop authoritative, quality-assured information about the past, current and future states of the climate in Europe and worldwide. ECMWF operates the Copernicus Climate Change Service on behalf of the European Union and will bring together expertise from across Europe to deliver the service.

  19. f

    Number of Rainy Days (Nura-Sarysu - Kazakhstan - Monthly - 500m)

    • data.apps.fao.org
    Updated Jun 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Number of Rainy Days (Nura-Sarysu - Kazakhstan - Monthly - 500m) [Dataset]. https://data.apps.fao.org/map/catalog/srv/resources/datasets/3ceb3b0d-40f0-49e7-bc91-dd04fbb75e71
    Explore at:
    Dataset updated
    Jun 27, 2024
    Description

    Number of Rainy Days calculated for Nura-Sarysu Water Economic basin with complete drainage area of the Nura and Sarysu rivers and Lake Tengiz with buffer. For every pixel, the number of rainy days in a month is reported. This dataset is created by using daily rainfall data from GPM.

  20. Reddit users in the United States 2019-2028

    • statista.com
    Updated Jun 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2024). Reddit users in the United States 2019-2028 [Dataset]. https://www.statista.com/topics/3196/social-media-usage-in-the-united-states/
    Explore at:
    Dataset updated
    Jun 13, 2024
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    United States
    Description

    The number of Reddit users in the United States was forecast to continuously increase between 2024 and 2028 by in total 10.3 million users (+5.21 percent). After the ninth consecutive increasing year, the Reddit user base is estimated to reach 208.12 million users and therefore a new peak in 2028. Notably, the number of Reddit users of was continuously increasing over the past years.User figures, shown here with regards to the platform reddit, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count multiple accounts by persons only once. Reddit users encompass both users that are logged in and those that are not.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of Reddit users in countries like Mexico and Canada.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Aman Chauhan (2022). Play Store Apps [Dataset]. https://www.kaggle.com/datasets/whenamancodes/play-store-apps
Organization logo

Play Store Apps

Web scraped data of 10k Play Store apps for analysing.

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 16, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aman Chauhan
License

Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
License information was derived automatically

Description

While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix-like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern-day techniques (like dynamic page load) using JQuery making scraping more challenging.

Each app (row) has values for catergory, rating, size, and more.

The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!

googleplaystore.csv

ColumnsDescription
AppApplication name
CategoryCategory the app belongs to
RatingsOverall user rating of the app (as when scraped)
ReviewsNumber of user reviews for the app (as when scraped)
SizeSize of the app (as when scraped)
InstallsNumber of user downloads/installs for the app (as when scraped)
TypePaid or Free
PricePrice of the app (as when scraped)
Content RatingAge group the app is targeted at - Children / Mature 21+ / Adult
GenreAn app can belong to multiple genres (apart from its main category). For eg, a musical family game will belong to
Current VerCurrent version of the app available on Play Store (as when scraped)
Android VerMin required Android version (as when scraped)

googleplaystore_user_reviews.csv

ColumnsDescription
AppName of app
Translated ReviewsUser review (Preprocessed and translated to English)
SentimentPositive/Negative/Neutral (Preprocessed)
Sentiment_polaritySentiment polarity score
Sentiment_subjectivitySentiment subjectivity score

More - Find More Exciting🙀 Datasets Here - An Upvote👍 A Dayᕙ(`▿´)ᕗ , Keeps Aman Hurray Hurray..... ٩(˘◡˘)۶Haha

Search
Clear search
Close search
Google apps
Main menu