6 datasets found
  1. AWARE: Dataset for Aspect-Based Sentiment Analysis of Apps Reviews

    • zenodo.org
    • opendatalab.com
    • +2more
    csv
    Updated Jan 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nouf Alturaief; Nouf Alturaief; Hamoud Aljamaan; Hamoud Aljamaan; Malak Baslyman; Malak Baslyman (2022). AWARE: Dataset for Aspect-Based Sentiment Analysis of Apps Reviews [Dataset]. http://doi.org/10.5281/zenodo.5528481
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 25, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nouf Alturaief; Nouf Alturaief; Hamoud Aljamaan; Hamoud Aljamaan; Malak Baslyman; Malak Baslyman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The peer-reviewed paper of AWARE dataset is published in ASEW 2021, and can be accessed through: http://doi.org/10.1109/ASEW52652.2021.00049. Kindly cite this paper when using AWARE dataset.

    Aspect-Based Sentiment Analysis (ABSA) aims to identify the opinion (sentiment) with respect to a specific aspect. Since there is a lack of smartphone apps reviews dataset that is annotated to support the ABSA task, we present AWARE: ABSA Warehouse of Apps REviews.

    AWARE contains apps reviews from three different domains (Productivity, Social Networking, and Games), as each domain has its distinct functionalities and audience. Each sentence is annotated with three labels, as follows:

    • Aspect Term: a term that exists in the sentence and describes an aspect of the app that is expressed by the sentiment. A term value of “N/A” means that the term is not explicitly mentioned in the sentence.
    • Aspect Category: one of the pre-defined set of domain-specific categories that represent an aspect of the app (e.g., security, usability, etc.).
    • Sentiment: positive or negative.

    Note: games domain does not contain aspect terms.

    We provide a comprehensive dataset of 11323 sentences from the three domains, where each sentence is additionally annotated with a Boolean value indicating whether the sentence expresses a positive/negative opinion. In addition, we provide three separate datasets, one for each domain, containing only sentences that express opinions. The file named “AWARE_metadata.csv” contains a description of the dataset’s columns.

    How AWARE can be used?

    We designed AWARE such that it can be used to serve various tasks. The tasks can be, but are not limited to:

    • Sentiment Analysis.
    • Aspect Term Extraction.
    • Aspect Category Classification.
    • Aspect Sentiment Analysis.
    • Explicit/Implicit Aspect Term Classification.
    • Opinion/Not-Opinion Classification.

    Furthermore, researchers can experiment with and investigate the effects of different domains on users' feedback.

  2. m

    Mobile App Logo and User Reviews Recommendation

    • data.mendeley.com
    Updated Aug 15, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Iconix Sas (2024). Mobile App Logo and User Reviews Recommendation [Dataset]. http://doi.org/10.17632/v4ndw78f9b.1
    Explore at:
    Dataset updated
    Aug 15, 2024
    Authors
    Iconix Sas
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset offers thorough app metadata from the Google Play Store and a sentiment analysis of user reviews for the app. The first dataset (App_Sentiment_Analysis.csv) provides insights into user views and experiences via translated review texts, sentiment classifications, and numerical ratings for sentiment polarity and subjectivity. The second dataset (Review.csv) covers various program parameters, including ratings, review counts, sizes, installation counts, content ratings, genres, and more. When combined, these datasets allow for an in-depth examination of user reviews and app performance, which supports tactics for app suggestion and enhancement. And also used app logo images using recommendations in this dataset.

  3. h

    google-play-review

    • huggingface.co
    Updated Sep 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jakarta AI Research (2022). google-play-review [Dataset]. https://huggingface.co/datasets/jakartaresearch/google-play-review
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 3, 2022
    Dataset authored and provided by
    Jakarta AI Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is built as a playground for beginner to make a use case for creating sentiment analysis model.

  4. Z

    User Feedback Dataset from the Top 15 Downloaded Mobile Applications

    • data.niaid.nih.gov
    • zenodo.org
    Updated Nov 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hendrawati, Triyani (2023). User Feedback Dataset from the Top 15 Downloaded Mobile Applications [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10204231
    Explore at:
    Dataset updated
    Nov 24, 2023
    Dataset provided by
    hendrawati, Triyani
    Pravitasari, Anindya Apriliyanti
    Herawan, Tutut
    Asnawi, Mohammad Hamid
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset comprises user feedback data collected from 15 globally acclaimed mobile applications, spanning diverse categories. The included applications are among the most downloaded worldwide, providing a rich and varied source for analysis. The dataset is particularly suitable for Natural Language Processing (NLP) applications, such as text classification and topic modeling. List of Included Applications:

    TikTok Instagram Facebook WhatsApp Telegram Zoom Snapchat Facebook Messenger Capcut Spotify YouTube HBO Max Cash App Subway Surfers Roblox Data Columns and Descriptions: Data Columns and Descriptions:

    review_id: Unique identifiers for each user feedback/application review. content: User-generated feedback/review in text format. score: Rating or star given by the user. TU_count: Number of likes/thumbs up (TU) received for the review. app_id: Unique identifier for each application. app_name: Name of the application. RC_ver: Version of the app when the review was created (RC). Terms of Use: This dataset is open access for scientific research and non-commercial purposes. Users are required to acknowledge the authors' work and, in the case of scientific publication, cite the most appropriate reference: M. H. Asnawi, A. A. Pravitasari, T. Herawan, and T. Hendrawati, "The Combination of Contextualized Topic Model and MPNet for User Feedback Topic Modeling," in IEEE Access, vol. 11, pp. 130272-130286, 2023, doi: 10.1109/ACCESS.2023.3332644.

    Researchers and analysts are encouraged to explore this dataset for insights into user sentiments, preferences, and trends across these top mobile applications. If you have any questions or need further information, feel free to contact the dataset authors.

  5. Towards a Data-Driven Requirements Engineering Approach: Automatic Analysis...

    • zenodo.org
    • paperswithcode.com
    • +1more
    Updated Jul 10, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jialiang Wei; Anne-Lise Courbis; Thomas Lambolais; Binbin Xu; Pierre Louis Bernard; Gérard Dray; Jialiang Wei; Anne-Lise Courbis; Thomas Lambolais; Binbin Xu; Pierre Louis Bernard; Gérard Dray (2024). Towards a Data-Driven Requirements Engineering Approach: Automatic Analysis of User Reviews [Dataset]. http://doi.org/10.5281/zenodo.7261877
    Explore at:
    Dataset updated
    Jul 10, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jialiang Wei; Anne-Lise Courbis; Thomas Lambolais; Binbin Xu; Pierre Louis Bernard; Gérard Dray; Jialiang Wei; Anne-Lise Courbis; Thomas Lambolais; Binbin Xu; Pierre Louis Bernard; Gérard Dray
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    6000 French user reviews from three applications on Google Play (Garmin Connect, Huawei Health, Samsung Health) are labelled manually. We selected four labels: rating, bug report, feature request and user experience.

    • Ratings are simple text which express the overall evaluation to that app, including praise, criticism, or dissuasion.
    • Bug reports show the problems that users have met while using the app, like loss of data, crash of app, connection error, etc.
    • Feature requests reflect the demande of users on new function, new content, new interface, etc.
    • In user experience, users describe their experience in relation to the functionality of the app, how does certain functions be helpful.

    As we can observe from the following table, that shows examples of labelled user reviews, each review belongs to one or more categories.

    AppTotalRatingBug reportFeature requestUser experience
    Garmin Connect20001260757170493
    Huawei Health20001068819384289
    Samsung Health20001324491486349

    New Dataset

    Based on this dataset, we developed a labeled dataset containing 6,000 English and 6,000 French reviews for classification, as well as 1,200 bilingual reviews for clustering. The new dataset has been made publicly available on Zenodo at the following link: https://zenodo.org/records/11066414

  6. f

    Data_Sheet_4_Measuring trust: a text analysis approach to compare, contrast,...

    • figshare.com
    pdf
    Updated Nov 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Areen Alsaid; Mengyao Li; Erin K. Chiou; John D. Lee (2023). Data_Sheet_4_Measuring trust: a text analysis approach to compare, contrast, and select trust questionnaires.pdf [Dataset]. http://doi.org/10.3389/fpsyg.2023.1192020.s004
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Nov 15, 2023
    Dataset provided by
    Frontiers
    Authors
    Areen Alsaid; Mengyao Li; Erin K. Chiou; John D. Lee
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionTrust has emerged as a prevalent construct to describe relationships between people and between people and technology in myriad domains. Across disciplines, researchers have relied on many different questionnaires to measure trust. The degree to which these questionnaires differ has not been systematically explored. In this paper, we use a word-embedding text analysis technique to identify the differences and common themes across the most used trust questionnaires and provide guidelines for questionnaire selection.MethodsA review was conducted to identify the existing trust questionnaires. In total, we included 46 trust questionnaires from three main domains (i.e., Automation, Humans, and E-commerce) with a total of 626 items measuring different trust layers (i.e., Dispositional, Learned, and Situational). Next, we encoded the words within each questionnaire using GloVe word embeddings and computed the embedding for each questionnaire item, and for each questionnaire. We reduced the dimensionality of the resulting dataset using UMAP to visualize these embeddings in scatterplots and implemented the visualization in a web app for interactive exploration of the questionnaires (https://areen.shinyapps.io/Trust_explorer/).ResultsAt the word level, the semantic space serves to produce a lexicon of trust-related words. At the item and questionnaire level, the analysis provided recommendation on questionnaire selection based on the dispersion of questionnaires’ items and at the domain and layer composition of each questionnaire. Along with the web app, the results help explore the semantic space of trust questionnaires and guide the questionnaire selection process.DiscussionThe results provide a novel means to compare and select trust questionnaires and to glean insights about trust from spoken dialog or written comments.

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Nouf Alturaief; Nouf Alturaief; Hamoud Aljamaan; Hamoud Aljamaan; Malak Baslyman; Malak Baslyman (2022). AWARE: Dataset for Aspect-Based Sentiment Analysis of Apps Reviews [Dataset]. http://doi.org/10.5281/zenodo.5528481
Organization logo

AWARE: Dataset for Aspect-Based Sentiment Analysis of Apps Reviews

Explore at:
csvAvailable download formats
Dataset updated
Jan 25, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Nouf Alturaief; Nouf Alturaief; Hamoud Aljamaan; Hamoud Aljamaan; Malak Baslyman; Malak Baslyman
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The peer-reviewed paper of AWARE dataset is published in ASEW 2021, and can be accessed through: http://doi.org/10.1109/ASEW52652.2021.00049. Kindly cite this paper when using AWARE dataset.

Aspect-Based Sentiment Analysis (ABSA) aims to identify the opinion (sentiment) with respect to a specific aspect. Since there is a lack of smartphone apps reviews dataset that is annotated to support the ABSA task, we present AWARE: ABSA Warehouse of Apps REviews.

AWARE contains apps reviews from three different domains (Productivity, Social Networking, and Games), as each domain has its distinct functionalities and audience. Each sentence is annotated with three labels, as follows:

  • Aspect Term: a term that exists in the sentence and describes an aspect of the app that is expressed by the sentiment. A term value of “N/A” means that the term is not explicitly mentioned in the sentence.
  • Aspect Category: one of the pre-defined set of domain-specific categories that represent an aspect of the app (e.g., security, usability, etc.).
  • Sentiment: positive or negative.

Note: games domain does not contain aspect terms.

We provide a comprehensive dataset of 11323 sentences from the three domains, where each sentence is additionally annotated with a Boolean value indicating whether the sentence expresses a positive/negative opinion. In addition, we provide three separate datasets, one for each domain, containing only sentences that express opinions. The file named “AWARE_metadata.csv” contains a description of the dataset’s columns.

How AWARE can be used?

We designed AWARE such that it can be used to serve various tasks. The tasks can be, but are not limited to:

  • Sentiment Analysis.
  • Aspect Term Extraction.
  • Aspect Category Classification.
  • Aspect Sentiment Analysis.
  • Explicit/Implicit Aspect Term Classification.
  • Opinion/Not-Opinion Classification.

Furthermore, researchers can experiment with and investigate the effects of different domains on users' feedback.

Search
Clear search
Close search
Google apps
Main menu