Facebook
TwitterThis dataset was created by Rawan1652002
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset contains short Reddit posts (≤280 characters) about pop music and pop stars, labeled for sentiment analysis.
We collected ~124k posts using keywords like Taylor Swift, Olivia Rodrigo, Grammy, Billboard, and subreddits like popheads, Music, and Billboard. After cleaning and filtering, we kept only short-form, English posts and combined each post’s title and body into a single text column.
The final data set is about 32,000+ rows
Sentiment labels (positive, neutral, negative) were generated using a BERT-based model fine-tuned for social media (CardiffNLP’s Twitter RoBERTa).
This version is ready for NLP sentiment projects — train your own model, explore pop fandom discourse, or benchmark transformer performance on real-world Reddit data.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset includes articles, includes fake and truth articles.
True Articles:
Fake/Misinformation/Propaganda Articles:
Public dataset from:
The author have drop NaN and duplicate values.
Facebook
TwitterThis dataset was created by KGopichand
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by Nikunj Phutela
Released under MIT
Facebook
TwitterThis dataset was created by Big D Dang
Facebook
TwitterThis file contains some arxiv article titles, subject category and abstracts. One may use NLP technique to analyze the dataset, for instance topics modelling.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Owner
Released under Apache 2.0
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
By Huggingface Hub [source]
This dataset contains conversations between users and experienced psychologists related to mental health topics. Carefully collected and anonymized, the data can be used to further the development of Natural Language Processing (NLP) models which focus on providing mental health advice and guidance. It consists of a variety of questions which will help train NLP models to provide users with appropriate advice in response to their queries. Whether you're an AI developer interested in building the next wave of mental health applications or a therapist looking for insights into how technology is helping people connect; this dataset provides invaluable support for advancing our understanding of human relationships through Artificial Intelligence
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This guide will provide you with the necessary knowledge to effectively use this dataset for Natural Language Processing (NLP)-based applications.
Download and install the dataset: To begin using the dataset, download it from Kaggle onto your system. Once downloaded, unzip and extract the .csv file into a directory of your choice.
Familiarize yourself with the columns: Before working with the data, it’s important to familiarize yourself with all of its components. This dataset contains two columns - Context and Response - which are intentionally structured to produce conversations between users and psychologists related to mental health topics for NLP models dedicated to providing mental health advice and guidance.
Analyze data entries: If possible or desired, take time now to analyze what is included in each entry; this may help you better untangle any challenges that come up during subsequent processes yet won't be required for most steps going forward if you prefer not too jump ahead of yourself at this juncture of your work process just yet! Examine questions asked by users as well as answers provided by experts in order glean an overall picture of what types of conversations are taking place within this pool of data that can help guide further work on NLP models for AI-driven mental health guidance purposes later on down the road!
Cleanse any information not applicable to NLP decisioning relevant application goals: It's important that only meaningful items related towards achieving AI-driven results remain within a clean copy of this Dataset going forward; consider removing all extra many verbatim entries or other pieces uneeded while also otherwise making sure all included content adheres closely enough one particular decisions purpose expected from an end goal perspective before proceeding onwards now until an ultimate end result has been successfully achieved eventually afterwards later on next afterward soon afterwards too following conveniently satisfyingly after accordingly shortly near therefore meaningfully likewise conclusively thoroughly properly productively purposely then eventually effectively finally indeed desirably plus concludingly enjoyably popularly splendidly attractively satisfactorally propitiously outstandingly fluently promisingly opportunely in conclusion efficiently hopefully progressively breathtaking deliciousness ideally genius mayhem invented unique impossibility everlastingly intense qualitative cohesiveness behaviorally affectionately fixed voraciously like alive supportively choicest decisively luckily chaotically co-creatively introducing ageless intricacy voicing auspicious promise enterprisingly preferred mathematically godly happening humorous respective achieve ultra favorability fundamentals essentials speciality grandiose selectively perfectly
- Creating sentence-matching algorithms for natural language processing to accurately match given questions with appropriate advice and guidance.
- Analyzing the psychological conversations to gain insights into topics such as stress, anxiety, and depression.
- Developing personalized natural language processing models tailored to provide users with appropriate advice based on their queries and based on their individual state of mental health
If you use this dataset in your research, please credit the original authors. Data Source
**License: [CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication](https://creativec...
Facebook
TwitterThis dataset was created by Cristiano Battistini
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by hayfay27
Released under Apache 2.0
Facebook
TwitterThis dataset was created by Naman Gautam
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This project aims to develop an NLP model for tasks like sentiment analysis, text classification, or named entity recognition.
For more details, refer to the project guidelines. LinkedIn: https://www.linkedin.com/in/marknature-c/ GitHub: https://github.com/marknature/
Facebook
TwitterThis dataset was created by Aman J
Facebook
TwitterThis dataset was created by bhavya
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by Abylay Zhumagaliyev
Released under Apache 2.0
Facebook
TwitterTokenized data were tokenized by Deepcut of pythainlp word_tokenize. Token folder is the tokens of training data. TF-IDF must be pre-tokenized also by Deepcut.
Facebook
TwitterThis dataset was created by JoyEtike
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by Xiaodong Shi
Released under MIT
Facebook
TwitterThis dataset was created by HIMANSHU_SURYAVANSHI1
Facebook
TwitterThis dataset was created by Rawan1652002