Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BDFoodReview is a large-scale dataset containing 334,119 restaurant reviews collected from "Foodpanda Bangladesh". The dataset includes customer reviews in mixed languages (Bangla, English, and Banglish), translated into English, along with their corresponding ratings and sentiment labels.
Dataset Statistics Total Reviews: 334,119 Features/Columns: 19
Potential Applications Sentiment Analysis Restaurant Review Classification Customer Satisfaction Analysis Opinion Mining Natural Language Processing Research Food Service Industry Analysis
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The dataset used in Aspect Term Extraction Using Deep Learning-Based Approach on Indonesian Restaurant Reviews paper, by Rachmansyah Adhi Widhianto and Ade Romadhony.
https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
This is a dataset for practicing Sentiment Analysis, Text analytics and classification etc. Dataset will be updated on regular basis by scrapping reviews from websites .
Analyse your NLP skills and make some amazing notebooks and perform some text classification . The reviews can be positive or negative/liked or disliked . Positive Review : 1 Negative Review : 0
The Kaggle sentiment analysis competition dataset contains unlabeled restaurant reviews used to supplement the labeled SemEval dataset for improved performance in sentiment analysis.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Restaurant Reviews CZ ABSA - 2.15k reviews with their related target and category
The work done is described in the paper: https://doi.org/10.13053/CyS-20-3-2469
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description This dataset contains customer reviews related to handyman services. It has been adapted from a restaurant review dataset by transforming food-related terms into handyman service-related terms. The dataset can be used for sentiment analysis, natural language processing (NLP), and customer feedback analysis in the service industry.
Dataset Features Review (String): Customer feedback about the handyman service, detailing their experience with repairs, maintenance, or installations. Liked (Categorical: Yes/No): Indicates whether the customer was satisfied with the service (Yes) or dissatisfied (No). Usage This dataset is ideal for:
Sentiment Analysis: Train models to classify positive and negative reviews. Customer Experience Research: Identify trends in customer satisfaction and complaints. NLP Applications: Test and develop text classification, keyword extraction, and sentiment prediction models. Potential Applications Developing a handyman service recommendation system. Analyzing customer sentiments to improve service quality. Training machine learning models for automated review classification. Acknowledgments This dataset was transformed from an original restaurant review dataset to suit handyman services, making it relevant for research in service industry sentiment analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
rating
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Datasets of Tripadvisor reviews by UK residents of UK hotels and restaurants, together with the user's rating of the hotel.Datasets are split by:Hotel star level (2, 3, 4 or all[mixed]) or Restaurant;Reviewer gender (M=male-authored reviews; F=female-authored reviews; MF=equal numbers of male and female authored reviews for each rating level);Number of texts (1k, 2k, 4k, 8k, 16k, or all available)Each dataset contains equal numbers of reviews at each rating level.The reviews were selected at random from TripAdvisor.This data is from this paper:Thelwall, M. (2018). Gender bias in machine learning for sentiment analysis. Online Information Review, 42(3), 343-354. doi: 10.1108/OIR-05-2017-0152
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The file contains four public datasets, which are taken from user comments on Laptop and Restaurant. Each dataset is composed of review, aspect words composed of one or more words, and the sentiment polarity of the aspect word. The sentiment polarity is composed of Positive, Neutral, and Negative.
We provide you with a data set in CSV format. The data set contains food review for the restaurant
The target variable is labeled Sentiment.
Create a Classification model to predict the target variable Sentiment.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
The restaurant test data for Subtask 1 Phase A evaluation of the SemEval 2016 Task 5: Aspect Based Sentiment Analysis (ABSA) for French (120 reviews, 668 sentences).
https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/
Dataset Card for ArRestReviews
Dataset Summary
Dataset of 8364 restaurant reviews from qaym.com in Arabic for sentiment analysis
Supported Tasks and Leaderboards
[More Information Needed]
Languages
The dataset is based on Arabic.
Dataset Structure
Data Instances
A typical data point comprises of the following:
"polarity": which is a string value of either 0 or 1 indicating the sentiment around the review
"text": is the… See the full description on the dataset page: https://huggingface.co/datasets/hadyelsahar/ar_res_reviews.
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
The Consumer Ratings & Reviews Software market is experiencing robust growth, projected to reach $478 million in 2025 and exhibiting a Compound Annual Growth Rate (CAGR) of 12.7% from 2025 to 2033. This expansion is fueled by the increasing reliance of businesses across diverse sectors—including retail, logistics, media & entertainment, travel & hospitality, and healthcare—on online reviews to enhance brand reputation, drive customer loyalty, and inform business strategies. The shift towards cloud-based solutions simplifies deployment and accessibility, contributing significantly to market growth. Furthermore, the rising adoption of social media and the increasing consumer expectation for transparent and readily available reviews are key drivers. Competitive pressures are driving innovation, with companies constantly refining their offerings to provide comprehensive analytics, sentiment analysis, and automated response features. Segmentation by application and deployment type reflects the market's adaptability to diverse business needs. The North American market currently holds a significant share, driven by early adoption and established e-commerce infrastructure, but growth in regions like Asia-Pacific, fueled by rapid digitalization, presents lucrative opportunities. While the market enjoys considerable momentum, challenges remain. Data security and privacy concerns surrounding sensitive customer information are crucial considerations for businesses. Integration with existing CRM and marketing platforms can also pose complexities for some companies. However, the overall trend points towards continued expansion, with a focus on improving the accuracy and authenticity of reviews and the development of more sophisticated analytics capabilities to leverage review data for strategic decision-making. This includes incorporating AI and machine learning to better understand customer sentiment and identify areas for improvement. The emergence of specialized solutions catering to specific industry needs will further fragment the market, yet simultaneously enhance its reach and overall impact on business operations.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Turkish Sentiment Dataset (Restaurant Reviews)
This dataset contains 15 manually created Turkish sentences labeled for sentiment analysis. The sentences are related to restaurant experiences and are categorized as positive, negative, or neutral.
Labels
olumlu (positive) olumsuz (negative) nötr (neutral)
Format
The dataset uses $ as a delimiter to avoid issues with commas inside the text.
Example
Yemeklerin lezzeti ve sunumu tam bir şölendi, şefi… See the full description on the dataset page: https://huggingface.co/datasets/hanerdem/turkish-sentiment-dataset.
There are many contexts where dyadic data are present. In social networks, users are linked to a variety of items, defining interactions. In the social platform of TripAdvisor, users are linked to restaurants by means of reviews posted by them. Using the information of these interactions, we can get valuable insights for forecasting, proposing tasks related to recommender systems, sentiment analysis, text-based personalisation or text summarisation, among others. Furthermore, in the context of TripAdvisor there is a scarcity of public datasets and lack of well-known benchmarks for model assessment. We present six new TripAdvisor datasets from the restaurants of six different cities: London, New York, New Delhi, Paris, Barcelona and Madrid. If you use this data, please cite the following paper under submission process (preprint - arXiv) We exclusively collected the reviews written in English from the restaurants of each city. The tabular data is comprised of a set of six different CSV files, containing numerical, categorical and text features: parse_count: numerical (integer), corresponding number of extracted review by the web scraper (auto-incremental) author_id: categorical (string), univocal, incremental and anonymous identifier of the user (UID_XXXXXXXXXX) restaurant_name: categorical (string), name of the restaurant matching the review rating_review: numerical (integer), review score in the range 1-5 sample: categorical (string), indicating “positive” sample for scores 4-5 and “negative” for scores 1-3 review_id: categorical (string), univocal and internal identifier of the review (review_XXXXXXXXX) title_review: text, review title review_preview: text, preview of the review, truncated in the website when the text is very long review_full: text, complete review date: timestamp, publication date of the review in the format (day, month, year) city: categorical (string), city of the restaurant which the review was written for url_restaurant: text, restaurant url
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Accuracies for Yelp restaurant dataset with 100.000 reviews.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In recent years, the surge in reviews and comments on newspapers and social media has made sentiment analysis a focal point of interest for researchers. Sentiment analysis is also gaining popularity in the Bengali language. However, Aspect-Based Sentiment Analysis is considered a difficult task in the Bengali language due to the shortage of perfectly labeled datasets and the complex variations in the Bengali language. This study used two open-source benchmark datasets of the Bengali language, Cricket, and Restaurant, for our Aspect-Based Sentiment Analysis task. The original work was based on the Random Forest, Support Vector Machine, K-Nearest Neighbors, and Convolutional Neural Network models. In this work, we used the Bidirectional Encoder Representations from Transformers, the Robustly Optimized BERT Approach, and our proposed hybrid transformative Random Forest and Bidirectional Encoder Representations from Transformers (tRF-BERT) models to compare the results with the existing work. After comparing the results, we can clearly see that all the models used in our work achieved better results than any of the previous works on the same dataset. Amongst them, our proposed transformative Random Forest and Bidirectional Encoder Representations from Transformers achieved the highest F1 score and accuracy. The accuracy and F1 score of aspect detection for the Cricket dataset were 0.89 and 0.85, respectively, and for the Restaurant dataset were 0.92 and 0.89 respectively.
ABSTRACT Going through several reviews could be laborious when this has to be done for multiple restaurants. One could instead read a graphical representation of what is great at the restaurant. Currently on Yelp, the food recommendations are only based on the total number of mentions of the food item in the reviews. Higher mentions, irrespective of the context, get an up-vote toward recommended items. Including context from reviews and tips could greatly improve the list of recommended items. In this project, we combine Named Entity Recognition and Sentiment Analysis of reviews. Based on the sentiment of the reviews we aim to suggest the best dishes of a restaurant or the best restaurant offering a dish. We have leveraged various feature engineering methods to produce state-of-the-art results. We established that if chosen, the appropriate feature vectors can significantly improve the classification performance. Fine-tuning BERT and bi-directional LSTM are producing better results than the machine learning models and if trained for more epochs can eventually prove to be the best classifier models. Keywords: Contextual Recommendation, Named Entity Recognition, BERT, LSTM, Count Vectors, TF-IDF
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘SemEval 2014 Task 4: AspectBasedSentimentAnalysis’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/charitarth/semeval-2014-task-4-aspectbasedsentimentanalysis on 28 January 2022.
--- Dataset description provided by original source is as follows ---
Copied from https://alt.qcri.org/semeval2014/task4/#, all credits to respective authors.
Sentiment analysis is increasingly viewed as a vital task both from an academic and a commercial standpoint. The majority of current approaches, however, attempt to detect the overall polarity of a sentence, paragraph, or text span, regardless of the entities mentioned (e.g., laptops, restaurants) and their aspects (e.g., battery, screen; food, service). By contrast, this task is concerned with aspect based sentiment analysis (ABSA), where the goal is to identify the aspects of given target entities and the sentiment expressed towards each aspect. Datasets consisting of customer reviews with human-authored annotations identifying the mentioned aspects of the target entities and the sentiment polarity of each aspect will be provided.
In particular, the task consists of the following subtasks:
Given a set of sentences with pre-identified entities (e.g., restaurants), identify the aspect terms present in the sentence and return a list containing all the distinct aspect terms. An aspect term names a particular aspect of the target entity.
For example, "I liked the service and the staff, but not the food”, “The food was nothing much, but I loved the staff”. Multi-word aspect terms (e.g., “hard disk”) should be treated as single terms (e.g., in “The hard disk is very noisy” the only aspect term is “hard disk”).
For a given set of aspect terms within a sentence, determine whether the polarity of each aspect term is positive, negative, neutral or conflict (i.e., both positive and negative).
For example:
“I loved their fajitas” → {fajitas: positive} “I hated their fajitas, but their salads were great” → {fajitas: negative, salads: positive} “The fajitas are their first plate” → {fajitas: neutral} “The fajitas were great to taste, but not to see” → {fajitas: conflict}
Given a predefined set of aspect categories (e.g., price, food), identify the aspect categories discussed in a given sentence. Aspect categories are typically coarser than the aspect terms of Subtask 1, and they do not necessarily occur as terms in the given sentence.
For example, given the set of aspect categories {food, service, price, ambience, anecdotes/miscellaneous}:
“The restaurant was too expensive” → {price} “The restaurant was expensive, but the menu was great” → {price, food}
Given a set of pre-identified aspect categories (e.g., {food, price}), determine the polarity (positive, negative, neutral or conflict) of each aspect category.
For example:
“The restaurant was too expensive” → {price: negative} “The restaurant was expensive, but the menu was great” → {price: negative, food: positive}
Two domain-specific datasets for laptops and restaurants, consisting of over 6K sentences with fine-grained aspect-level human annotations have been provided for training.
Restaurant reviews:
This dataset consists of over 3K English sentences from the restaurant reviews of Ganu et al. (2009). The original dataset of Ganu et al. included annotations for coarse aspect categories (Subtask 3) and overall sentence polarities; we modified the dataset to include annotations for aspect terms occurring in the sentences (Subtask 1), aspect term polarities (Subtask 2), and aspect category-specific polarities (Subtask 4). We also corrected some errors (e.g., sentence splitting errors) of the original dataset. Experienced human annotators identified the aspect terms of the sentences and their polarities (Subtasks 1 and 2). Additional restaurant reviews, not in the original dataset of Ganu et al. (2009), are being annotated in the same manner, and they will be used as test data.
Laptop reviews:
This dataset consists of over 3K English sentences extracted from customer reviews of laptops. Experienced human annotators tagged the aspect terms of the sentences (Subtask 1) and their polarities (Subtask 2). This dataset will be used only for Subtasks 1 and 2. Part of this dataset will be reserved as test data.
The sentences in the datasets are annotated using XML tags.
The following example illustrates the format of the annotated sentences of the restaurants dataset. ```xml
The possible values of the polarity field are: “positive”, “negative”, “conflict”, “neutral”. The possible values of the category field are: “food”, “service”, “price”, “ambience”, “anecdotes/miscellaneous”.
The following example illustrates the format of the annotated sentences of the laptops dataset. The format is the same as in the restaurant datasets, with the only exception that there are no annotations for aspect categories. Notice that we annotate only aspect terms naming particular aspects (e.g., “everything about it” does not name a particular aspect).
```xml
In the sentences of both datasets, there is an
--- Original source retains full ownership of the source dataset ---
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset is the first publicly available aspect-based sentiment analysis dataset for the Sorani dialect of Kurdish, addressing a critical gap in natural language processing (NLP) research for low-resource languages. The dataset comprised more than 4000 quadruplet ABSA in the restaurant review domain, written in the Kurdish language (Sorani dialect) using the Perso-Arabic script. The dataset was automatically annotated using a few-shot and prompt based model. This resource is intended for use in machine learning, deep learning, and cross-lingual model adaptation, making it suitable for training, fine-tuning, and benchmarking.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BDFoodReview is a large-scale dataset containing 334,119 restaurant reviews collected from "Foodpanda Bangladesh". The dataset includes customer reviews in mixed languages (Bangla, English, and Banglish), translated into English, along with their corresponding ratings and sentiment labels.
Dataset Statistics Total Reviews: 334,119 Features/Columns: 19
Potential Applications Sentiment Analysis Restaurant Review Classification Customer Satisfaction Analysis Opinion Mining Natural Language Processing Research Food Service Industry Analysis