Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
This dataset was originally collected for a data science and machine learning project that aimed at investigating the potential correlation between the amount of time an individual spends on social media and the impact it has on their mental health.
The project involves conducting a survey to collect data, organizing the data, and using machine learning techniques to create a predictive model that can determine whether a person should seek professional help based on their answers to the survey questions.
This project was completed as part of a Statistics course at a university, and the team is currently in the process of writing a report and completing a paper that summarizes and discusses the findings in relation to other research on the topic.
The following is the Google Colab link to the project, done on Jupyter Notebook -
https://colab.research.google.com/drive/1p7P6lL1QUw1TtyUD1odNR4M6TVJK7IYN
The following is the GitHub Repository of the project -
https://github.com/daerkns/social-media-and-mental-health
Libraries used for the Project -
Pandas
Numpy
Matplotlib
Seaborn
Sci-kit Learn
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This comprehensive dataset is a meticulously curated collection of mental health statuses tagged from various statements. The dataset amalgamates raw data from multiple sources, cleaned and compiled to create a robust resource for developing chatbots and performing sentiment analysis.
The dataset integrates information from the following Kaggle datasets:
The dataset consists of statements tagged with one of the following seven mental health statuses: - Normal - Depression - Suicidal - Anxiety - Stress - Bi-Polar - Personality Disorder
The data is sourced from diverse platforms including social media posts, Reddit posts, Twitter posts, and more. Each entry is tagged with a specific mental health status, making it an invaluable asset for:
This dataset is ideal for training machine learning models aimed at understanding and predicting mental health conditions based on textual data. It can be used in various applications such as:
This dataset was created by aggregating and cleaning data from various publicly available datasets on Kaggle. Special thanks to the original dataset creators for their contributions.
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This dataset contains posts from 28 subreddits (15 mental health support groups) from 2018-2020. We used this dataset to understand the impact of COVID-19 on mental health support groups from January to April, 2020 and included older timeframes to obtain baseline posts before COVID-19.
Please cite if you use this dataset:
Low, D. M., Rumker, L., Torous, J., Cecchi, G., Ghosh, S. S., & Talkar, T. (2020). Natural Language Processing Reveals Vulnerable Mental Health Support Groups and Heightened Health Anxiety on Reddit During COVID-19: Observational Study. Journal of medical Internet research, 22(10), e22635.
@article{low2020natural,
title={Natural Language Processing Reveals Vulnerable Mental Health Support Groups and Heightened Health Anxiety on Reddit During COVID-19: Observational Study},
author={Low, Daniel M and Rumker, Laurie and Torous, John and Cecchi, Guillermo and Ghosh, Satrajit S and Talkar, Tanya},
journal={Journal of medical Internet research},
volume={22},
number={10},
pages={e22635},
year={2020},
publisher={JMIR Publications Inc., Toronto, Canada}
}
License
This dataset is made available under the Public Domain Dedication and License v1.0 whose full text can be found at: http://www.opendatacommons.org/licenses/pddl/1.0/
It was downloaded using pushshift API. Re-use of this data is subject to Reddit API terms.
Reddit Mental Health Dataset
Contains posts and text features for the following timeframes from 28 mental health and non-mental health subreddits:
filenames and corresponding timeframes:
post: Jan 1 to April 20, 2020 (called "mid-pandemic" in manuscript; r/COVID19_support appears). Unique users: 320,364. pre: Dec 2018 to Dec 2019. A full year which provides more data for a baseline of Reddit posts. Unique users: 327,289.2019: Jan 1 to April 20, 2019 (r/EDAnonymous appears). A control for seasonal fluctuations to match post data. Unique users: 282,560.2018: Jan 1 to April 20, 2018. A control for seasonal fluctuations to match post data. Unique users: 177,089Unique users across all time windows (pre and 2019 overlap): 826,961.
See manuscript Supplementary Materials (https://doi.org/10.31234/osf.io/xvwcy) for more information.
Note: if subsampling (e.g., to balance subreddits), we recommend bootstrapping analyses for unbiased results.
Facebook
Twitterhttps://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
Amod/mental_health_counseling_conversations
This dataset is a compilation of high-quality, real one-on-one mental health counseling conversations between individuals and licensed professionals. Each exchange is structured as a clear question–answer pair, making it directly suitable for fine-tuning or instruction-tuning language models that need to handle sensitive, empathetic, and contextually aware dialogue. Since its public release in 2023, it has been downloaded over 100,000… See the full description on the dataset page: https://huggingface.co/datasets/Amod/mental_health_counseling_conversations.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mental Health reports the prevalence of the mental illness in the past year by age range.
Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
Comprehensive Mental Health Insights: A Diverse Dataset of 1000 Individuals Across Professions, Countries, and Lifestyles
This dataset provides a rich collection of anonymized mental health data for 1000 individuals, representing a wide range of ages, genders, occupations, and countries. It aims to shed light on the various factors affecting mental health, offering valuable insights into stress levels, sleep patterns, work-life balance, and physical activity.
Key Features: Demographics: The dataset includes individuals from various countries such as the USA, India, the UK, Canada, and Australia. Each entry captures key demographic information such as age, gender, and occupation (e.g., IT, Healthcare, Education, Engineering).
Mental Health Conditions: The dataset contains data on whether the individuals have reported any mental health issues (Yes/No), along with the severity of these conditions categorized into Low, Medium, or High.
Consultation History: For individuals with mental health conditions, the dataset notes whether they have consulted a mental health professional.
Stress Levels: Each individual’s stress level is classified as Low, Medium, or High, providing insights into how different factors such as work hours or sleep may correlate with mental well-being.
Lifestyle Factors: The dataset includes information on sleep duration, work hours per week, and weekly physical activity hours, offering a detailed picture of how lifestyle factors contribute to mental health.
This dataset can be used for research, analysis, or machine learning models to predict mental health trends, uncover correlations between work-life balance and mental well-being, and explore the impact of stress and physical activity on mental health.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A poll released to mark World Mental Health Day reveals that during the 12 months preceding the survey, 15% of respondents across EU Member States sought professional help for psychological or emotional problems and 7% took antidepressants, mostly for depression or anxiety. According to the results, there is still stigma attached to mental disorders, with 22% of those surveyed saying they would find it difficult to speak to a person with a "significant mental disorder". This issue and the other results will be discussed during the next thematic conference under the European Pact for Mental Health and Well-being. The main themes addressed in this report are: • The state of mental well-being – how well people feel mentally and physically, and what impact has this had on their lives• Level of comfort at work – how secure people feel in their current jobs, whether they feel their skills match their current role and whether they feel they receive adequate recognition/respect for what they do • Care and treatment – what help and treatment people have sought to ameliorate any mental health conditions they have experienced • Perceptions of people with mental illness – how comfortable people feel about interacting with those with a mental health problem
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Mental Health Corpus is a collection of texts related to people with anxiety, depression, and other mental health issues. The corpus consists of two columns: one containing the comments, and the other containing labels indicating whether the comments are considered poisonous or not. The corpus can be used for a variety of purposes, such as sentiment analysis, toxic language detection, and mental health language analysis. The data in the corpus may be useful for researchers, mental health professionals, and others interested in understanding the language and sentiment surrounding mental health issues.
Facebook
TwitterThe Katie A. Settlement Agreement requires the Department of Health Care Services (DHCS) to collect and post data used to evaluate utilization of services and timely access to appropriate care. These county datasets show services used by children and youth (under the age of 21) identified as Katie A. Subclass members and/or utilizing Katie A. specialty mental health services (Intensive Care Coordination, Intensive Home Based Services, and Therapeutic Foster Care). This data assists in evaluating each county’s progress with implementing.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset comprises mental health data from 1977 Bangladeshi university students across 15 top universities, collected from November to December 2023 using Google Forms. It includes assessments of academic anxiety, stress, and depression using widely used psychometric scales. The structured questionnaire covers sociodemographic variables and their associations, facilitating comprehensive analysis. Statistical analysis yielded satisfactory internal consistency (Cronbach’s alpha: 0.79), with anonymized participant data valuable for policymakers.
Facebook
TwitterNOTE: This dataset is no longer being updated but is being kept for historical reference. Comprehensive information about the Chicago Department of Public Health (CDPH)'s mental health clinics and community partners. Providers on this site offer free mental health services to Chicagoans regardless of ability to pay, immigration status, or health insurance. Information includes location of site(s), hours of operation, populations served, specific service types, and contact information. The previous version of this dataset, linked below, contained additional data elements that were difficult to maintain over time. That dataset is available for historical reference.
Facebook
TwitterAccording to a March 2024 survey conducted in the United States, 32 percent of adults reported feeling that social media had neither a positive nor negative effect on their own mental health. Only seven percent of social media users said that online platforms had a very positive effect on their mental health, while 12 percent of users said it had a very negative impact. Furthermore, 22 percent of respondents said social media had a somewhat negative effect on their mental health. Is social media addictive? A 2023 survey of individuals between 11 and 59 years old in the United States found that over 73 percent of TikTok users agreed that the platform was addictive. Furthermore, nearly 27 percent of those surveyed reported experiencing negative psychological effects related to TikTok use. Users belonging to Generation Z were the most likely to say that TikTok is addictive, yet millennials felt the negative effects of using the app more so than Gen Z. In the U.S., it is also not uncommon for social media users to take breaks from using online platforms, and as of March 2024, over a third of adults in the country had done so. Following mental health-related content Although online users may be aware of the negative and addictive aspects of social media, it is also a useful tool for finding supportive content. In a global survey conducted in 2023, 32 percent of social media users followed therapists and mental health professionals on social media. Overall, 24 percent of respondents said that they followed people on social media if they had the same condition as they did. Between January 2020 and March 2023, British actress and model Cara Delevingne was the celebrity mental health activist with the highest growth in searches tying her name to the topic.
Facebook
TwitterThese are the detailed tables pertaining to adult mental health from the 2010 National Survey on Drug Use and Health (NSDUH). These detailed tables present totals and prevalence estimates of past year any mental illness (AMI), serious mental illness (SMI), suicidal thoughts and behavior, major depressive episode (MDE), treatment for depression (among adults with MDE), mental health service utilization, and measuers related to the co-occurrence of mental disorders with substance use or with substance use disorders. Results are provided for age group, gender, race/ethnicity, education level, employment status, poverty level, geographic area, insurance status. Comparisons are made between 2011 and 2002 to 2010.
Facebook
TwitterThe U.S. Census Bureau, in collaboration with five federal agencies, launched the Household Pulse Survey to produce data on the social and economic impacts of Covid-19 on American households. The Household Pulse Survey was designed to gauge the impact of the pandemic on employment status, consumer spending, food security, housing, education disruptions, and dimensions of physical and mental wellness. The survey was designed to meet the goal of accurate and timely weekly estimates. It was conducted by an internet questionnaire, with invitations to participate sent by email and text message. The sample frame is the Census Bureau Master Address File Data. Housing units linked to one or more email addresses or cell phone numbers were randomly selected to participate, and one respondent from each housing unit was selected to respond for him or herself. Estimates are weighted to adjust for nonresponse and to match Census Bureau estimates of the population by age, gender, race and ethnicity, and educational attainment. All estimates shown meet the NCHS Data Presentation Standards for Proportions.
Facebook
TwitterGuide for APD Mental Health First Response Dataset
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Dataset Card for "heliosbrahma/mental_health_chatbot_dataset"
Dataset Description
Dataset Summary
This dataset contains conversational pair of questions and answers in a single text related to Mental Health. Dataset was curated from popular healthcare blogs like WebMD, Mayo Clinic and HeatlhLine, online FAQs etc. All questions and answers have been anonymized to remove any PII data and pre-processed to remove any unwanted characters.
Languages
The… See the full description on the dataset page: https://huggingface.co/datasets/heliosbrahma/mental_health_chatbot_dataset.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is created by combining surveys from the last five years (2017-2021) conducted by Open Sourcing Mental Illness (a non-profit corporation) to determine the presence of mental health issues among those employed in the technology sector and to measure attitudes regarding mental health in the workplace.
Survey responses in the dataset are filtered based on having a tech role as primary criteria and descriptive questions are removed besides checking for data consistency and validity of survey responses to make analysis possible.
OSMI has made the data sets from the 2017 to 2021 survey available on Kaggle.
Facebook
TwitterThis dataset is used for the analysis of study results
Facebook
TwitterMeetX/mental-health-dataset-mistral7b dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterThe following datasets are based on the children and youth (under age 21) beneficiary population and consist of aggregate Mental Health Service data derived from Medi-Cal claims, encounter, and eligibility systems. These datasets were developed in accordance with California Welfare and Institutions Code (WIC) § 14707.5 (added as part of Assembly Bill 470 on 10/7/17). Please contact BHData@dhcs.ca.gov for any questions or to request previous years’ versions of these datasets. Note: The Performance Dashboard AB 470 Report Application Excel tool development has been discontinued. Please see the Behavioral Health reporting data hub at https://behavioralhealth-data.dhcs.ca.gov/ for access to dashboards utilizing these datasets and other behavioral health data.
Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
This dataset was originally collected for a data science and machine learning project that aimed at investigating the potential correlation between the amount of time an individual spends on social media and the impact it has on their mental health.
The project involves conducting a survey to collect data, organizing the data, and using machine learning techniques to create a predictive model that can determine whether a person should seek professional help based on their answers to the survey questions.
This project was completed as part of a Statistics course at a university, and the team is currently in the process of writing a report and completing a paper that summarizes and discusses the findings in relation to other research on the topic.
The following is the Google Colab link to the project, done on Jupyter Notebook -
https://colab.research.google.com/drive/1p7P6lL1QUw1TtyUD1odNR4M6TVJK7IYN
The following is the GitHub Repository of the project -
https://github.com/daerkns/social-media-and-mental-health
Libraries used for the Project -
Pandas
Numpy
Matplotlib
Seaborn
Sci-kit Learn