Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by kuncha manjula
Released under Apache 2.0
How much time do people spend on social media? As of 2024, the average daily social media usage of internet users worldwide amounted to 143 minutes per day, down from 151 minutes in the previous year. Currently, the country with the most time spent on social media per day is Brazil, with online users spending an average of three hours and 49 minutes on social media each day. In comparison, the daily time spent on social media in the U.S. was just two hours and 16 minutes.
Global social media usage
Currently, the global social network penetration rate is 62.3 percent. Northern Europe had an 81.7 percent social media penetration rate, topping the ranking of global social media usage by region. Eastern and Middle Africa closed the ranking with 10.1 and 9.6 percent usage reach, respectively. People access social media for a variety of reasons. Users like to find funny or entertaining content and enjoy sharing photos and videos with friends, but mainly use social media to stay in touch with current events and friends.
Global impact of social media
Social media has a wide-reaching and significant impact on not only online activities but also offline behavior and life in general. During a global online user survey in February 2019, a significant share of respondents stated that social media had increased their access to information, ease of communication, and freedom of expression. On the flip side, respondents also felt that social media had worsened their personal privacy, increased polarization in politics, and heightened everyday distractions.
The number of social media users in the United States was forecast to increase continuously between 2024 and 2029 by a total of 26 million users (+8.55 percent). After the ninth consecutive year of growth, the social media user base is estimated to reach 330.07 million users in 2029, a new peak. Notably, the number of social media users has increased continuously over the past years. The figures shown for social media users have been derived from survey data that has been processed to estimate missing demographics. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).
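The forecast figures quoted above can be cross-checked with a short calculation. The sketch below back-computes the implied 2024 base from the 2029 peak and the total increase, then derives the relative growth. The variable names are illustrative and not part of the source, and the 26 million figure is assumed to be rounded.

```python
# Figures quoted in the forecast text, in millions of users.
peak_2029 = 330.07  # estimated user base in 2029
increase = 26.0     # total forecast increase, 2024 to 2029 (assumed rounded)

# Implied 2024 base: the 2029 peak minus the total increase.
base_2024 = peak_2029 - increase

# Relative growth over the forecast period, in percent.
growth_pct = increase / base_2024 * 100

print(f"Implied 2024 base: {base_2024:.2f} million")
print(f"Relative growth: {growth_pct:.2f} %")
```

The same check applies to the other Statista forecasts in this listing: subtract the stated increase from the stated peak to get the base, then divide the increase by that base.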
According to a March 2024 survey conducted in the United States, 32 percent of adults reported feeling that social media had neither a positive nor negative effect on their own mental health. Only seven percent of social media users said that online platforms had a very positive effect on their mental health, while 12 percent of users said it had a very negative impact. Furthermore, 22 percent of respondents said social media had a somewhat negative effect on their mental health.
Is social media addictive?
A 2023 survey of individuals between 11 and 59 years old in the United States found that over 73 percent of TikTok users agreed that the platform was addictive. Furthermore, nearly 27 percent of those surveyed reported experiencing negative psychological effects related to TikTok use. Users belonging to Generation Z were the most likely to say that TikTok is addictive, yet millennials felt the negative effects of using the app more so than Gen Z. In the U.S., it is also not uncommon for social media users to take breaks from using online platforms, and as of March 2024, over a third of adults in the country had done so.
Following mental health-related content
Although online users may be aware of the negative and addictive aspects of social media, it is also a useful tool for finding supportive content. In a global survey conducted in 2023, 32 percent of social media users followed therapists and mental health professionals on social media. Overall, 24 percent of respondents said that they followed people on social media if they had the same condition as they did. Between January 2020 and March 2023, British actress and model Cara Delevingne was the celebrity mental health activist with the highest growth in searches tying her name to the topic.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Please cite the following paper when using this dataset:
N. Thakur, “Mpox narrative on Instagram: A labeled multilingual dataset of Instagram posts on mpox for sentiment, hate speech, and anxiety analysis,” arXiv [cs.LG], 2024, URL: https://arxiv.org/abs/2409.05292
Abstract
The world is currently experiencing an outbreak of mpox, which has been declared a Public Health Emergency of International Concern by WHO. During recent virus outbreaks, social media platforms have played a crucial role in keeping the global population informed and updated regarding various aspects of the outbreaks. As a result, in the last few years, researchers from different disciplines have focused on the development of social media datasets focusing on different virus outbreaks. No prior work in this field has focused on the development of a dataset of Instagram posts about the mpox outbreak. The work presented in this paper (stated above) aims to address this research gap. It presents this multilingual dataset of 60,127 Instagram posts about mpox, published between July 23, 2022, and September 5, 2024. This dataset contains Instagram posts about mpox in 52 languages. For each of these posts, the Post ID, Post Description, Date of publication, language, and translated version of the post (translation to English was performed using the Google Translate API) are presented as separate attributes in the dataset.
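After developing this dataset, sentiment analysis, hate speech detection, and anxiety or stress detection were also performed. This process included classifying each post into (i) one of the sentiment classes, i.e., fear, surprise, joy, sadness, anger, disgust, or neutral, (ii) hate or not hate, and (iii) stress/anxiety detected or no stress/anxiety detected.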
These results are presented as separate attributes in the dataset for the training and testing of machine learning algorithms for sentiment, hate speech, and anxiety or stress detection, as well as for other applications.
The 52 distinct languages in which Instagram posts are present in the dataset are English, Portuguese, Indonesian, Spanish, Korean, French, Hindi, Finnish, Turkish, Italian, German, Tamil, Urdu, Thai, Arabic, Persian, Tagalog, Dutch, Catalan, Bengali, Marathi, Malayalam, Swahili, Afrikaans, Panjabi, Gujarati, Somali, Lithuanian, Norwegian, Estonian, Swedish, Telugu, Russian, Danish, Slovak, Japanese, Kannada, Polish, Vietnamese, Hebrew, Romanian, Nepali, Czech, Modern Greek, Albanian, Croatian, Slovenian, Bulgarian, Ukrainian, Welsh, Hungarian, and Latvian.
The following table describes the attributes in this dataset:
Attribute Name | Attribute Description |
Post ID | Unique ID of each Instagram post |
Post Description | Complete description of each post in the language in which it was originally published |
Date | Date of publication in MM/DD/YYYY format |
Language | Language of the post as detected using the Google Translate API |
Translated Post Description | Translated version of the post description. All posts which were not in English were translated into English using the Google Translate API. No language translation was performed for English posts. |
Sentiment | Results of sentiment analysis (using translated Post Description) where each post was classified into one of the sentiment classes: fear, surprise, joy, sadness, anger, disgust, and neutral |
Hate | Results of hate speech detection (using translated Post Description) where each post was classified as hate or not hate |
Anxiety or Stress | Results of anxiety or stress detection (using translated Post Description) where each post was classified as stress/anxiety detected or no stress/anxiety detected. |
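As a rough illustration of how the attributes described above might be consumed, the sketch below parses a few rows with Python's standard library, converts the MM/DD/YYYY dates, and counts sentiment labels. Everything outside the attribute names is an assumption: the sample rows are made up for illustration, the source does not state that the file is distributed as CSV, and the exact label strings in the real file may differ from the wording in the table.

```python
import csv
import io
from collections import Counter
from datetime import datetime

# Made-up sample rows for illustration only; they are NOT taken from the dataset.
# Column names follow the attribute table; label strings are assumed.
SAMPLE_CSV = """Post ID,Post Description,Date,Language,Translated Post Description,Sentiment,Hate,Anxiety or Stress
p1,Example post one,07/23/2022,English,Example post one,fear,not hate,stress/anxiety detected
p2,Publicación de ejemplo,08/15/2024,Spanish,Example post,neutral,not hate,no stress/anxiety detected
p3,Example post three,09/05/2024,English,Example post three,fear,not hate,no stress/anxiety detected
"""


def load_posts(text):
    """Parse CSV text into a list of dicts, converting Date to a date object."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        # Dates are documented as MM/DD/YYYY.
        row["Date"] = datetime.strptime(row["Date"], "%m/%d/%Y").date()
        rows.append(row)
    return rows


posts = load_posts(SAMPLE_CSV)

# Count how many posts fall into each sentiment class.
sentiment_counts = Counter(row["Sentiment"] for row in posts)
print(sentiment_counts.most_common())

# Earliest and latest publication dates in the sample.
print(min(row["Date"] for row in posts), max(row["Date"] for row in posts))
```

For the real file, the in-memory string would be replaced by an opened file handle, and the column names should be checked against the actual header row first.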
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset comprises 4,038 tweets in Spanish, related to discussions about artificial intelligence (AI), and was created and utilized in the publication "Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights," (10.1109/IE61493.2024.10599899) presented at the 20th International Conference on Intelligent Environments. It is designed to support research on public perception, sentiment, and engagement with AI topics on social media from a Spanish-speaking perspective. Each entry includes detailed annotations covering sentiment analysis, user engagement metrics, and user profile characteristics, among others.
Tweets were gathered through the Twitter API v1.1 by targeting keywords and hashtags associated with artificial intelligence, focusing specifically on content in Spanish. The dataset captures a wide array of discussions, offering a holistic view of the Spanish-speaking public's sentiment towards AI.
Guerrero-Contreras, G., Balderas-Díaz, S., Serrano-Fernández, A., & Muñoz, A. (2024, June). Enhancing Sentiment Analysis on Social Media: Integrating Text and Metadata for Refined Insights. In 2024 International Conference on Intelligent Environments (IE) (pp. 62-69). IEEE.
This dataset is aimed at academic researchers and practitioners with interests in:
The dataset is provided in CSV format, ensuring compatibility with a wide range of data analysis tools and programming environments.
The dataset is available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, permitting sharing, copying, distribution, transmission, and adaptation of the work for any purpose, including commercial, provided proper attribution is given.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Egypt Internet Usage: Social Media Market Share: Desktop: Fark data was reported at 0.000 % on 01 Mar 2025, unchanged from 0.000 % on 28 Feb 2025. The series is updated daily, averaging 0.000 % (median) from Mar 2024 to 01 Mar 2025, with 196 observations. It reached an all-time high of 0.330 % on 18 Dec 2024 and a record low of 0.000 % on 01 Mar 2025. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Egypt – Table EG.SC.IU: Internet Usage: Social Media Market Share.
Attribution-NonCommercial-ShareAlike 2.0 (CC BY-NC-SA 2.0) https://creativecommons.org/licenses/by-nc-sa/2.0/
License information was derived automatically
ABSTRACT
---------------
Online web communities often face bans for violating platform policies, encouraging their migration to alternative platforms. This migration, however, can result in increased toxicity and unforeseen consequences on the new platform. In recent years, researchers have collected data from many alternative platforms, indicating coordinated efforts leading to offline events, conspiracy movements, hate speech propagation, and harassment. Thus, it becomes crucial to characterize and understand these alternative platforms. To advance research in this direction, we collect and release a large-scale dataset from Scored -- an alternative Reddit platform that sheltered banned fringe communities, for example, c/TheDonald (a prominent right-wing community) and c/GreatAwakening (a conspiratorial community). Over four years, we collected approximately 57M posts from Scored, with at least 58 communities identified as migrating from Reddit and over 950 communities created since the platform's inception. Furthermore, we provide sentence embeddings of all posts in our dataset, generated through a state-of-the-art model, to further advance the field in characterizing the discussions within these communities. We aim to provide these resources to facilitate researchers' investigations without the need for extensive data collection and processing efforts.
File-name | Data-points |
comments-2020 | 12,774,203 |
comments-2021 | 16,097,941 |
comments-2022 | 12,730,301 |
comments-2023 | 8,919,159 |
submissions-2020-to-2023 | 6,293,980 |
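The per-file counts in the table can be summed to check them against the "approximately 57M posts" stated in the abstract. The sketch below does only that arithmetic; the dictionary is transcribed from the table, and treating comments plus submissions as "posts" is an assumption about how the abstract counts them.

```python
# Data-point counts transcribed from the file table.
data_points = {
    "comments-2020": 12_774_203,
    "comments-2021": 16_097_941,
    "comments-2022": 12_730_301,
    "comments-2023": 8_919_159,
    "submissions-2020-to-2023": 6_293_980,
}

# Total across all files (comments and submissions together).
total = sum(data_points.values())

# Comments only, across the four yearly files.
comments = sum(n for name, n in data_points.items() if name.startswith("comments-"))

print(f"Comments:    {comments:,}")
print(f"Submissions: {data_points['submissions-2020-to-2023']:,}")
print(f"Total:       {total:,} (about {total / 1e6:.1f}M)")
```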
This dataset was published at AAAI ICWSM 2024 (International AAAI Conference on Web and Social Media), held in Buffalo, NY, USA.
This dataset is free to use under the terms of the non-commercial CC BY-NC-SA 4.0 license.
@inproceedings{patel2024idrama,
title={iDRAMA-Scored-2024: A Dataset of the Scored Social Media Platform from 2020 to 2023},
author={Patel, Jay and Paudel, Pujan and De Cristofaro, Emiliano and Stringhini, Gianluca and Blackburn, Jeremy},
booktitle={Proceedings of the International AAAI Conference on Web and Social Media},
volume={18},
pages={2014--2024},
year={2024},
issn = {2334-0770},
doi = {10.1609/icwsm.v18i1.31444},
}
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Sri Lanka Internet Usage: Social Media Market Share: All Platforms: Youku data was reported at 0.000 % on 24 Apr 2024, a decrease from 0.040 % on 23 Apr 2024. The series is updated daily, averaging 0.000 % (median) from Mar 2024 to 24 Apr 2024, with 22 observations. It reached an all-time high of 0.110 % on 26 Mar 2024 and a record low of 0.000 % on 24 Apr 2024. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Sri Lanka – Table LK.SC.IU: Internet Usage: Social Media Market Share.
The number of Twitter users in the United States was forecast to increase continuously between 2024 and 2028 by a total of 4.3 million users (+5.32 percent). After the ninth consecutive year of growth, the Twitter user base is estimated to reach 85.08 million users in 2028, a new peak. Notably, the number of Twitter users has increased continuously over the past years. User figures for Twitter have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Pakistan Internet Usage: Social Media Market Share: Desktop: Fark data was reported at 0.000 % on 24 Nov 2024, unchanged from 0.000 % on 23 Nov 2024. The series is updated daily, averaging 0.000 % (median) from Apr 2024 to 24 Nov 2024, with 60 observations. It reached an all-time high of 0.200 % on 22 Sep 2024 and a record low of 0.000 % on 24 Nov 2024. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Pakistan – Table PK.SC.IU: Internet Usage: Social Media Market Share.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Zambia Internet Usage: Social Media Market Share: All Platforms: Youku data was reported at 0.000 % on 01 May 2024, unchanged from 0.000 % on 30 Apr 2024. The series is updated daily, averaging 0.000 % (median) from Dec 2023 to 01 May 2024, with 124 observations. It reached an all-time high of 0.700 % on 27 Apr 2024 and a record low of 0.000 % on 01 May 2024. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Zambia – Table ZM.SC.IU: Internet Usage: Social Media Market Share.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The average Twitter user spends 5.1 hours per month on the platform.
The number of Reddit users in the United States was forecast to increase continuously between 2024 and 2028 by a total of 10.3 million users (+5.21 percent). After the ninth consecutive year of growth, the Reddit user base is estimated to reach 208.12 million users in 2028, a new peak. Notably, the number of Reddit users has increased continuously over the past years. User figures for Reddit have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period and count a person with multiple accounts only once. Reddit users encompass both users that are logged in and those that are not. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Bahrain Internet Usage: Social Media Market Share: All Platforms: Sina Weibo data was reported at 0.070 % on 02 May 2024, an increase from 0.000 % on 01 May 2024. The series is updated daily, averaging 0.000 % (median) from Apr 2024 to 02 May 2024, with 5 observations. It reached an all-time high of 0.070 % on 02 May 2024 and a record low of 0.000 % on 01 May 2024. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Bahrain – Table BH.SC.IU: Internet Usage: Social Media Market Share.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Please cite the following paper when using this dataset:
N. Thakur, V. Su, M. Shao, K. Patel, H. Jeong, V. Knieling, and A. Bian, “A labelled dataset for sentiment analysis of videos on YouTube, TikTok, and other sources about the 2024 outbreak of measles,” arXiv [cs.CY], 2024. Available: https://doi.org/10.48550/arXiv.2406.07693
Abstract
This dataset contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder of the websites include Instagram and Facebook as well as the websites of various global and local news organizations. For each of these videos, the URL of the video, title of the post, description of the post, and the date of publication of the video are presented as separate attributes in the dataset. After developing this dataset, sentiment analysis (using VADER), subjectivity analysis (using TextBlob), and fine-grain sentiment analysis (using DistilRoBERTa-base) of the video titles and video descriptions were performed. This included classifying each video title and video description into (i) one of the sentiment classes, i.e., positive, negative, or neutral, (ii) one of the subjectivity classes, i.e., highly opinionated, neutral opinionated, or least opinionated, and (iii) one of the fine-grain sentiment classes, i.e., fear, surprise, joy, sadness, anger, disgust, or neutral. These results are presented as separate attributes in the dataset for the training and testing of machine learning algorithms for performing sentiment analysis or subjectivity analysis in this field as well as for other applications. The paper associated with this dataset (please see the above-mentioned citation) also presents a list of open research questions that may be investigated using this dataset.
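The platform shares quoted in the abstract can be turned into approximate video counts. The sketch below is only an estimate: the percentages are rounded in the source, so the derived counts may differ slightly from the true per-platform totals, and the "Other websites" bucket is a label introduced here for everything that is not YouTube or TikTok.

```python
# Figures quoted in the abstract.
total_videos = 4011
shares_pct = {"YouTube": 48.6, "TikTok": 15.2}

# Everything else (Instagram, Facebook, news sites, ...) as one assumed bucket.
shares_pct["Other websites"] = round(100 - sum(shares_pct.values()), 1)

# Approximate counts; source percentages are rounded, so these are estimates.
approx_counts = {
    site: round(total_videos * pct / 100) for site, pct in shares_pct.items()
}

for site, count in approx_counts.items():
    print(f"{site}: about {count} videos ({shares_pct[site]} %)")
```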
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Pakistan Internet Usage: Social Media Market Share: Mobile: Sina Weibo data was reported at 0.000 % on 17 Mar 2024, unchanged from 0.000 % on 16 Mar 2024. The series is updated daily, averaging 0.000 % (median) from Mar 2024 to 17 Mar 2024, with 8 observations. It reached an all-time high of 0.200 % on 13 Mar 2024 and a record low of 0.000 % on 17 Mar 2024. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Pakistan – Table PK.SC.IU: Internet Usage: Social Media Market Share.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Bahrain Internet Usage: Social Media Market Share: All Platforms: Youku data was reported at 0.000 % on 18 Apr 2024, unchanged from 0.000 % on 17 Apr 2024. The series is updated daily, averaging 0.000 % (median) from Apr 2024 to 18 Apr 2024, with 9 observations. It reached an all-time high of 0.070 % on 14 Apr 2024 and a record low of 0.000 % on 18 Apr 2024. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Bahrain – Table BH.SC.IU: Internet Usage: Social Media Market Share.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Oman Internet Usage: Social Media Market Share: Mobile: news.ycombinator.com data was reported at 0.000 % on 04 Jan 2025, unchanged from 0.000 % on 03 Jan 2025. The series is updated daily, averaging 0.000 % (median) from May 2024 to 04 Jan 2025, with 48 observations. It reached an all-time high of 0.190 % on 31 Dec 2024 and a record low of 0.000 % on 04 Jan 2025. The series remains active in CEIC and is reported by Statcounter Global Stats. It is categorized under Global Database’s Oman – Table OM.SC.IU: Internet Usage: Social Media Market Share.
The number of social media users in Ireland was forecast to increase continuously between 2024 and 2029 by a total of 0.6 million users (+12.85 percent). After the seventh consecutive year of growth, the social media user base is estimated to reach 5.24 million users in 2029, a new peak. The figures shown for social media users have been derived from survey data that has been processed to estimate missing demographics. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press, and they are processed to generate comparable data sets (see supplementary notes under details for more information).