56 datasets found
  1. Bangla YouTube Sentiment and Emotion datasets

    • kaggle.com
    zip
    Updated Sep 9, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nit003 (2019). Bangla YouTube Sentiment and Emotion datasets [Dataset]. https://www.kaggle.com/datasets/nit003/bangla-youtube-sentiment-and-emotion-datasets
    Explore at:
    zip(439462 bytes)Available download formats
    Dataset updated
    Sep 9, 2019
    Authors
    nit003
    Area covered
    YouTube
    Description

    Context

    This dataset is used in Multilabel sentiment analysis and emotion detection for YouTube comments in different kinds of Bengali videos.

    Content

    There are two files in the folder. There are might be multiple comments with same text. Also it may be noted that, the comments collected here contain abusive and vulgar words, slangs and personal attack. Therefore, we ensure that all annotators are adults.

    Sentiment.csv

    Id - Unique id number for the comment. Text - Text of the data Label - 1 (3 class label) or 2 (5 class label) Score - Denotes the polarity of the comment. In three class labelling : 1(positive), 0 (neutral), -1(negative) In three class labelling : 2 (highly positive), 1(positive), 0 (neutral), -1(negative), -2(highly negative) Lan - Language of the comment. EN (English), BN (Bengali), RN (Romanized Bangla) Domain - Category of the video.

    Emotion.csv

    Id - Unique id number for the comment. Text - Text of the data emotion - Corresponding emotion of the comment. Anger/Joy/Disgust/Fear/Surprise/Sad/None (no emotion found) Lan - Language of the comment. EN (English), BN (Bengali), RN (Romanized Bangla) Domain - Category of the video.

    Acknowledgements

    If you use the dataset in any research work, please cite the following paper as

    N. Irtiza Tripto and M. Eunus Ali, "Detecting Multilabel Sentiment and Emotions from Bangla YouTube Comments," 2018 International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, 2018, pp. 1-6.

    doi: 10.1109/ICBSLP.2018.8554875

    Inspiration

    It will be helpful for researchers specially in analyzing sentiments from social media in non-English language

  2. m

    RevBangla: Bangla Product Sentiment Analysis Dataset

    • data.mendeley.com
    Updated Mar 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saieef Sarower Sunny (2024). RevBangla: Bangla Product Sentiment Analysis Dataset [Dataset]. http://doi.org/10.17632/bnbbcdsf4m.1
    Explore at:
    Dataset updated
    Mar 6, 2024
    Authors
    Saieef Sarower Sunny
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Bangla Product Comments Dataset is a comprehensive collection of product reviews gathered from diverse ecommerce platforms in Bangladesh. This dataset offers a rich source of information reflecting customer opinions and sentiments towards various products available online. This dataset holds significant value for businesses, researchers, and data scientists interested in understanding consumer behavior, product perception, and sentiment analysis within the Bangladeshi ecommerce landscape. By leveraging this dataset, stakeholders can derive actionable insights to enhance product quality, marketing strategies, and overall customer satisfaction.

    Columns:

    1. Product_ID: A unique identifier for each product, facilitating organization and referencing.
    2. Date: The date when the comment was posted, providing temporal context for analysis.
    3. Customer Name: The name or identifier of the customer who submitted the comment, ensuring traceability and potential user segmentation.
    4. Rating: A numerical representation (typically on a scale of 1 to 5) reflecting the customer's overall satisfaction level with the product.
    5. Label Sentiment: A categorical label assigned to each comment indicating the sentiment expressed by the customer (e.g., positive, negative). This classification facilitates sentiment analysis tasks.
    6. Comment: The actual text of the customer's review or comment, conveying specific opinions, feedback, or experiences regarding the product.
  3. Bengali Sentiment Dataset

    • kaggle.com
    zip
    Updated Jul 25, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nuhash Afnan (2020). Bengali Sentiment Dataset [Dataset]. https://www.kaggle.com/nuhashafnan/pseudolabel
    Explore at:
    zip(631287 bytes)Available download formats
    Dataset updated
    Jul 25, 2020
    Authors
    Nuhash Afnan
    Description

    Dataset

    This dataset was created by Nuhash Afnan

    Contents

  4. n

    Data Set For Sentiment Analysis On Bengali News Comments

    • narcis.nl
    • data.mendeley.com
    Updated Sep 15, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chowdhury, M (via Mendeley Data) (2019). Data Set For Sentiment Analysis On Bengali News Comments [Dataset]. http://doi.org/10.17632/n53xt69gnf.2
    Explore at:
    Dataset updated
    Sep 15, 2019
    Dataset provided by
    Data Archiving and Networked Services (DANS)
    Authors
    Chowdhury, M (via Mendeley Data)
    Description

    This is a data set of Sentiment Analysis On Bangla News Comments where every data was annotated by three different individuals to get three different perspectives and based on the majorities decisions the final tag was chosen. This data set contains 13802 data in total.

  5. m

    Bengali YouTube News Opinion Data for Temporal Sentiment Analysis

    • data.mendeley.com
    Updated Oct 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lomat Haider Chowdhury (2023). Bengali YouTube News Opinion Data for Temporal Sentiment Analysis [Dataset]. http://doi.org/10.17632/3c3j3bkxvn.1
    Explore at:
    Dataset updated
    Oct 5, 2023
    Authors
    Lomat Haider Chowdhury
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    The dataset presents the news articles published in a renowned Bengali YouTube news channel along with the public comments, replies, and other corresponding information. There are 7,62,678 samples of data with 15 features. The features include video URL, title of the news, likes in the video, video views, publishing date, hashtags, video description, comments with corresponding likes, and replies with likes. To ensure the privacy of the commentators, their names have been encoded.

  6. Bengali Identity Bias Evaluation Dataset (BIBED)

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Aug 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dipto Das; Dipto Das; Shion Guha; Shion Guha; Bryan Semaan; Bryan Semaan (2023). Bengali Identity Bias Evaluation Dataset (BIBED) [Dataset]. http://doi.org/10.5281/zenodo.7775521
    Explore at:
    binAvailable download formats
    Dataset updated
    Aug 7, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Dipto Das; Dipto Das; Shion Guha; Shion Guha; Bryan Semaan; Bryan Semaan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Critical studies found NLP systems to bias based on gender and racial identities. However, few studies focused on identities defined by cultural factors like religion and nationality. Compared to English, such research efforts are even further limited in major languages like Bengali due to the unavailability of labeled datasets. Our paper (see the reference) describes a process for developing a bias evaluation dataset highlighting cultural influences on identity. We also provide this Bengali dataset as an artifact outcome that can contribute to future critical research.

    If you find this dataset useful, please cite the associated paper:

    Das, D., Guha, S., & Semaan, B. (2023, May). Toward Cultural Bias Evaluation Datasets: The Case of Bengali Gender, Religious, and National Identity. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP) (pp. 68-83).

    BibTeX:

    @inproceedings{das-etal-2023-toward,
      title = "Toward Cultural Bias Evaluation Datasets: The Case of {B}engali Gender, Religious, and National Identity",
      author = "Das, Dipto and
       Guha, Shion and
       Semaan, Bryan",
      booktitle = "Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP)",
      month = may,
      year = "2023",
      address = "Dubrovnik, Croatia",
      publisher = "Association for Computational Linguistics",
      url = "https://aclanthology.org/2023.c3nlp-1.8",
      pages = "68--83",
    }
  7. n

    Bangla Bengali sentiment lexicon dictionary with positive and negative words...

    • narcis.nl
    • data.mendeley.com
    Updated Mar 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sazzed, S (via Mendeley Data) (2021). Bangla Bengali sentiment lexicon dictionary with positive and negative words [Dataset]. http://doi.org/10.17632/zggnjpnmwp.2
    Explore at:
    Dataset updated
    Mar 9, 2021
    Dataset provided by
    Data Archiving and Networked Services (DANS)
    Authors
    Sazzed, S (via Mendeley Data)
    Description

    This dataset contains around 1300 positive and negative Bengal ( Bangla ) sentiment words. This lexicon was created from a Bengali review corpus.

    If you use this lexicon please cite following paper-

    @inproceedings{sazzed2020development, title={Development of Sentiment Lexicon in Bengali utilizing Corpus and Cross-lingual Resources}, author={Sazzed, Salim}, booktitle={2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science (IRI)}, pages={237--244}, year={2020}, organization={IEEE Computer Society} }

    https://www.cs.odu.edu/~ssazzed/IEEE_IRI_2020.pdf

  8. f

    Bangla (Bengali) Drama Review Dataset

    • figshare.com
    txt
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    salim sazzed (2023). Bangla (Bengali) Drama Review Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.13162085.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    salim sazzed
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The repository contains 3307 Negative reviews and 8500 Positive reviews collected and manually annotated from Youtube Bengali drama.If you use this dataset, please cite the following paper-@inproceedings{sazzed2020cross,title={Cross-lingual sentiment classification in low-resource Bengali language},author={Sazzed, Salim},booktitle={Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020)},pages={50--60},year={2020}

    }If you have any questions, please email me- salimsazzad222@gmail.com.

  9. m

    Bangla Sentiment Dataset

    • data.mendeley.com
    Updated Dec 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jahanur Biswas (2024). Bangla Sentiment Dataset [Dataset]. http://doi.org/10.17632/rh67mckhbh.1
    Explore at:
    Dataset updated
    Dec 19, 2024
    Authors
    Jahanur Biswas
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Bangla Sentiment Dataset is a curated collection of sentiment-rich textual data in Bangla, focused on recent and trending topics. This dataset has been compiled from diverse sources, including Bangladeshi online newspapers, social media platforms, and blogs, ensuring a wide spectrum of language styles and sentiment expressions.

    Key Features: Focus on Recent Topics: The dataset emphasizes contemporary issues, trending discussions, and popular topics in Bangladeshi society. This includes sentiments on political developments, social movements, entertainment, cultural events, and other recent happenings.

    Source Variety:

    Online Newspapers: Articles, editorials, headlines, and reader comments provide structured and semi-formal sentiment data. Social Media: Posts, tweets, and comments reflect informal, conversational language with high emotional expressiveness. Blogs: Opinion pieces and discussions offer detailed and context-rich sentiment content. Sentiment Labels: Each entry in the dataset is annotated with one of the following sentiment categories:

    Positive (1): Texts expressing happiness, agreement, or optimism. Negative (0): Texts reflecting criticism, disagreement, or pessimism. Neutral (2): Texts presenting balanced or factual statements with minimal emotional bias. Linguistic and Stylistic Diversity: The dataset captures a range of Bangla language variations, including:

    Formal and informal Bangla usage. Regional dialects. Transliterated Bangla (Banglish) commonly used on social media. Real-World Context: The inclusion of recent topics ensures that the dataset is relevant for analyzing public sentiment around current events and trends. This makes it particularly useful for real-time sentiment analysis applications.

    This dataset provides an invaluable resource for researchers and practitioners aiming to explore sentiment analysis in Bangla, with a special emphasis on modern-day relevance and real-world applicability.

  10. h

    BanglaBook

    • huggingface.co
    Updated Jul 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Syed Rifat Raiyan (2024). BanglaBook [Dataset]. https://huggingface.co/datasets/Starscream-11813/BanglaBook
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 17, 2024
    Authors
    Syed Rifat Raiyan
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    BᴀɴɢʟᴀBᴏᴏᴋ: A Large-scale Bangla Dataset for Sentiment Analysis from Book Reviews

    This repository contains the code, data, and models of the paper titled "BᴀɴɢʟᴀBᴏᴏᴋ: A Large-scale Bangla Dataset for Sentiment Analysis from Book Reviews" published in the Findings of the Association for Computational Linguistics: ACL 2023.

    License: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International

      Data Format
    

    Each row consists of a book review sample.… See the full description on the dataset page: https://huggingface.co/datasets/Starscream-11813/BanglaBook.

  11. BAN-ABSA

    • kaggle.com
    zip
    Updated Oct 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mahfuz Ahmed Masum (2020). BAN-ABSA [Dataset]. https://www.kaggle.com/mahfuzahmed/banabsa
    Explore at:
    zip(300359 bytes)Available download formats
    Dataset updated
    Oct 2, 2020
    Authors
    Mahfuz Ahmed Masum
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Mahfuz Ahmed Masum

    Released under CC0: Public Domain

    Contents

  12. m

    A Dataset for Sentiment Polarity Detection of Bengali Book Reviews

    • data.mendeley.com
    Updated Aug 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eftekhar Hossain (2020). A Dataset for Sentiment Polarity Detection of Bengali Book Reviews [Dataset]. http://doi.org/10.17632/2wcw3sxxr3.1
    Explore at:
    Dataset updated
    Aug 8, 2020
    Authors
    Eftekhar Hossain
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains 1443 Bangla book reviews. Among them 471 reviews are annotated as negative sentiment and 972 reviews are labelled as positive sentiment. All the reviews are collected from different online book shops and social media groups. The reviews are manually annotated by two native Bengali speakers. Though, the dataset is relatively small but it can be used for learning as well as research purpose.

  13. EBLICT Bangla Sentiment Analysis Dataset

    • kaggle.com
    zip
    Updated Nov 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sushmit (2021). EBLICT Bangla Sentiment Analysis Dataset [Dataset]. https://www.kaggle.com/datasets/sushmit0109/eblict-bangla-sentiment-analysis-dataset
    Explore at:
    zip(18422037 bytes)Available download formats
    Dataset updated
    Nov 30, 2021
    Authors
    Sushmit
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Dataset

    This dataset was created by Sushmit

    Released under GPL 2

    Contents

  14. i

    Bangladesh Airlines Sentiment Review Dataset

    • ieee-dataport.org
    Updated Oct 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Khan Md. Hasib (2022). Bangladesh Airlines Sentiment Review Dataset [Dataset]. http://doi.org/10.21227/6fg6-s460
    Explore at:
    Dataset updated
    Oct 25, 2022
    Dataset provided by
    IEEE Dataport
    Authors
    Khan Md. Hasib
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Air travel is one of the most used ways of transit in our daily lives. So it's no wonder that more and more people are sharing their experiences with airlines and airports using web-based online surveys. This dataset aims to do topic modeling and sentiment analysis on Skytrax (airlinequality.com) and Tripadvisor (tripadvisor.com) postings where there is a lot of interest and engagement from people who have used it or want to use it for airlines. The goal of individuals gathering at Skytrax and Tripadvisor is to make better decisions based on the actual experiences of other customers who have flown with airlines. We gathered online reviews submitted by consumers who have flown with Bangladesh airlines in the past. The dataset gives a view of customer online reviews in terms of Bangladesh Airlines that covers all the reviews from September 2018 to July 2021. In this dataset, both 519 online reviews are for domestic routes and 528 online reviews are for international route passengers online review and lastly, work on overall 1047 online reviews of Bangladesh Airlines. The second dataset are combining the Skytrax and Tripadvisor datasets, we arrived with 1095 review records for Bangladesh Airlines.

  15. f

    Sentiment classification for restaurant dataset.

    • figshare.com
    xls
    Updated Sep 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shihab Ahmed; Moythry Manir Samia; Maksuda Haider Sayma; Md. Mohsin Kabir; M. F. Mridha (2024). Sentiment classification for restaurant dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0308050.t010
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Sep 20, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Shihab Ahmed; Moythry Manir Samia; Maksuda Haider Sayma; Md. Mohsin Kabir; M. F. Mridha
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, the surge in reviews and comments on newspapers and social media has made sentiment analysis a focal point of interest for researchers. Sentiment analysis is also gaining popularity in the Bengali language. However, Aspect-Based Sentiment Analysis is considered a difficult task in the Bengali language due to the shortage of perfectly labeled datasets and the complex variations in the Bengali language. This study used two open-source benchmark datasets of the Bengali language, Cricket, and Restaurant, for our Aspect-Based Sentiment Analysis task. The original work was based on the Random Forest, Support Vector Machine, K-Nearest Neighbors, and Convolutional Neural Network models. In this work, we used the Bidirectional Encoder Representations from Transformers, the Robustly Optimized BERT Approach, and our proposed hybrid transformative Random Forest and Bidirectional Encoder Representations from Transformers (tRF-BERT) models to compare the results with the existing work. After comparing the results, we can clearly see that all the models used in our work achieved better results than any of the previous works on the same dataset. Amongst them, our proposed transformative Random Forest and Bidirectional Encoder Representations from Transformers achieved the highest F1 score and accuracy. The accuracy and F1 score of aspect detection for the Cricket dataset were 0.89 and 0.85, respectively, and for the Restaurant dataset were 0.92 and 0.89 respectively.

  16. m

    BANGLA-ABSA: Unique Aspect Based Sentiment Analysis datasets in Bangla...

    • data.mendeley.com
    Updated Jul 9, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mahmudul Hasan (2024). BANGLA-ABSA: Unique Aspect Based Sentiment Analysis datasets in Bangla Language [Dataset]. http://doi.org/10.17632/998m4jy3m9.3
    Explore at:
    Dataset updated
    Jul 9, 2024
    Authors
    Mahmudul Hasan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the Bangla language, sentiment analysis is becoming more and more significant. Aspect-based sentiment analysis (ABSA) predicts the sentiment polarity on an aspect level. The data were collected from numerous individuals with a minimum of two aspects. Every comment is a complex or compound sentence. The datasets are organized in a folder named "BANGLA_ABSA dataset" which has four Excel files, one for each of the datasets: Car_ABSA, Mobile_phone_ABSA, Movie_ABSA, and Restaurant_ABSA. Each Excel file contains three columns namely Id, Comment, and {Aspect category, Sentiment Polarity}. Car_ABSA, Mobile_phone_ABSA, Movie_ABSA, and Restaurant_ABSA datasets have 1149, 975, 800, and 801 rows of data respectively.

  17. P

    Bengali Hate Speech Dataset

    • paperswithcode.com
    • opendatalab.com
    Updated Apr 10, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md. Rezaul Karim; Bharathi Raja Chakravarthi; John P. McCrae; Michael Cochez (2020). Bengali Hate Speech Dataset [Dataset]. https://paperswithcode.com/dataset/bengali-hate-speech
    Explore at:
    Dataset updated
    Apr 10, 2020
    Authors
    Md. Rezaul Karim; Bharathi Raja Chakravarthi; John P. McCrae; Michael Cochez
    Description

    Introduces three datasets of expressing hate, commonly used topics, and opinions for hate speech detection, document classification, and sentiment analysis, respectively.

  18. h

    Bengali_Cyberbullying_Detection_Comments_Dataset

    • huggingface.co
    Updated Oct 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Faisal Ahmed (2024). Bengali_Cyberbullying_Detection_Comments_Dataset [Dataset]. https://huggingface.co/datasets/faisalahmed/Bengali_Cyberbullying_Detection_Comments_Dataset
    Explore at:
    Dataset updated
    Oct 3, 2024
    Authors
    Faisal Ahmed
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains 44,001 Bengali comments, curated to detect cyberbullying using Natural Language Processing (NLP) techniques. Each comment is labeled by experts, categorizing different forms of harassment and offensive behavior. The dataset enables the identification of inappropriate content, ranging from mild to severe harassment, facilitating precise classification and analysis. This resource is designed for researchers and developers working on cyberbullying detection, sentiment… See the full description on the dataset page: https://huggingface.co/datasets/faisalahmed/Bengali_Cyberbullying_Detection_Comments_Dataset.

  19. Bangla Multiclass Sentiment Analysis Dataset

    • kaggle.com
    zip
    Updated May 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Towhidul.Tonmoy (2022). Bangla Multiclass Sentiment Analysis Dataset [Dataset]. https://www.kaggle.com/datasets/towhidultonmoy/bangla-multiclass-sentiment-analysis-dataset
    Explore at:
    zip(551995 bytes)Available download formats
    Dataset updated
    May 4, 2022
    Authors
    Towhidul.Tonmoy
    Description

    Dataset

    This dataset was created by Towhidul.Tonmoy

    Contents

  20. Sentiment Analysis of Food Reviews on Bengali Text

    • kaggle.com
    zip
    Updated May 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohd. Istiaq Hossain Junaid (2022). Sentiment Analysis of Food Reviews on Bengali Text [Dataset]. https://www.kaggle.com/sarcasticpsycho/bangla-food-reviews
    Explore at:
    zip(40478 bytes)Available download formats
    Dataset updated
    May 21, 2022
    Authors
    Mohd. Istiaq Hossain Junaid
    Description

    Dataset

    This dataset was created by Mohd. Istiaq Hossain Junaid

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
nit003 (2019). Bangla YouTube Sentiment and Emotion datasets [Dataset]. https://www.kaggle.com/datasets/nit003/bangla-youtube-sentiment-and-emotion-datasets
Organization logo

Bangla YouTube Sentiment and Emotion datasets

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
zip(439462 bytes)Available download formats
Dataset updated
Sep 9, 2019
Authors
nit003
Area covered
YouTube
Description

Context

This dataset is used in Multilabel sentiment analysis and emotion detection for YouTube comments in different kinds of Bengali videos.

Content

There are two files in the folder. There are might be multiple comments with same text. Also it may be noted that, the comments collected here contain abusive and vulgar words, slangs and personal attack. Therefore, we ensure that all annotators are adults.

Sentiment.csv

Id - Unique id number for the comment. Text - Text of the data Label - 1 (3 class label) or 2 (5 class label) Score - Denotes the polarity of the comment. In three class labelling : 1(positive), 0 (neutral), -1(negative) In three class labelling : 2 (highly positive), 1(positive), 0 (neutral), -1(negative), -2(highly negative) Lan - Language of the comment. EN (English), BN (Bengali), RN (Romanized Bangla) Domain - Category of the video.

Emotion.csv

Id - Unique id number for the comment. Text - Text of the data emotion - Corresponding emotion of the comment. Anger/Joy/Disgust/Fear/Surprise/Sad/None (no emotion found) Lan - Language of the comment. EN (English), BN (Bengali), RN (Romanized Bangla) Domain - Category of the video.

Acknowledgements

If you use the dataset in any research work, please cite the following paper as

N. Irtiza Tripto and M. Eunus Ali, "Detecting Multilabel Sentiment and Emotions from Bangla YouTube Comments," 2018 International Conference on Bangla Speech and Language Processing (ICBSLP), Sylhet, 2018, pp. 1-6.

doi: 10.1109/ICBSLP.2018.8554875

Inspiration

It will be helpful for researchers specially in analyzing sentiments from social media in non-English language

Search
Clear search
Close search
Google apps
Main menu