100+ datasets found
  1. Daily Social Media Active Users

    • kaggle.com
    zip
    Updated May 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shaik Barood Mohammed Umar Adnaan Faiz (2025). Daily Social Media Active Users [Dataset]. https://www.kaggle.com/datasets/umeradnaan/daily-social-media-active-users
    Explore at:
    zip(126814 bytes)Available download formats
    Dataset updated
    May 5, 2025
    Authors
    Shaik Barood Mohammed Umar Adnaan Faiz
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Description:

    The "Daily Social Media Active Users" dataset provides a comprehensive and dynamic look into the digital presence and activity of global users across major social media platforms. The data was generated to simulate real-world usage patterns for 13 popular platforms, including Facebook, YouTube, WhatsApp, Instagram, WeChat, TikTok, Telegram, Snapchat, X (formerly Twitter), Pinterest, Reddit, Threads, LinkedIn, and Quora. This dataset contains 10,000 rows and includes several key fields that offer insights into user demographics, engagement, and usage habits.

    Dataset Breakdown:

    • Platform: The name of the social media platform where the user activity is tracked. It includes globally recognized platforms, such as Facebook, YouTube, and TikTok, that are known for their large, active user bases.

    • Owner: The company or entity that owns and operates the platform. Examples include Meta for Facebook, Instagram, and WhatsApp, Google for YouTube, and ByteDance for TikTok.

    • Primary Usage: This category identifies the primary function of each platform. Social media platforms differ in their primary usage, whether it's for social networking, messaging, multimedia sharing, professional networking, or more.

    • Country: The geographical region where the user is located. The dataset simulates global coverage, showcasing users from diverse locations and regions. It helps in understanding how user behavior varies across different countries.

    • Daily Time Spent (min): This field tracks how much time a user spends on a given platform on a daily basis, expressed in minutes. Time spent data is critical for understanding user engagement levels and the popularity of specific platforms.

    • Verified Account: Indicates whether the user has a verified account. This feature mimics real-world patterns where verified users (often public figures, businesses, or influencers) have enhanced status on social media platforms.

    • Date Joined: The date when the user registered or started using the platform. This data simulates user account history and can provide insights into user retention trends or platform growth over time.

    Context and Use Cases:

    • This synthetic dataset is designed to offer a privacy-friendly alternative for analytics, research, and machine learning purposes. Given the complexities and privacy concerns around using real user data, especially in the context of social media, this dataset offers a clean and secure way to develop, test, and fine-tune applications, models, and algorithms without the risks of handling sensitive or personal information.

    Researchers, data scientists, and developers can use this dataset to:

    • Model User Behavior: By analyzing patterns in daily time spent, verified status, and country of origin, users can model and predict social media engagement behavior.

    • Test Analytics Tools: Social media monitoring and analytics platforms can use this dataset to simulate user activity and optimize their tools for engagement tracking, reporting, and visualization.

    • Train Machine Learning Algorithms: The dataset can be used to train models for various tasks like user segmentation, recommendation systems, or churn prediction based on engagement metrics.

    • Create Dashboards: This dataset can serve as the foundation for creating user-friendly dashboards that visualize user trends, platform comparisons, and engagement patterns across the globe.

    • Conduct Market Research: Business intelligence teams can use the data to understand how various demographics use social media, offering valuable insights into the most engaged regions, platform preferences, and usage behaviors.

    • Sources of Inspiration: This dataset is inspired by public data from industry reports, such as those from Statista, DataReportal, and other market research platforms. These sources provide insights into the global user base and usage statistics of popular social media platforms. The synthetic nature of this dataset allows for the use of realistic engagement metrics without violating any privacy concerns, making it an ideal tool for educational, analytical, and research purposes.

    The structure and design of the dataset are based on real-world usage patterns and aim to represent a variety of users from different backgrounds, countries, and activity levels. This diversity makes it an ideal candidate for testing data-driven solutions and exploring social media trends.

    Future Considerations:

    As the social media landscape continues to evolve, this dataset can be updated or extended to include new platforms, engagement metrics, or user behaviors. Future iterations may incorporate features like post frequency, follower counts, engagement rates (likes, comments, shares), or even sentiment analysis from user-generated content.

    By leveraging this dataset, analysts and data scientists can create better, more effective strategies ...

  2. Social Media Dataset

    • kaggle.com
    zip
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nixie6254 (2025). Social Media Dataset [Dataset]. https://www.kaggle.com/datasets/nixie6254/social-media-dataset
    Explore at:
    zip(28057 bytes)Available download formats
    Dataset updated
    Apr 17, 2025
    Authors
    Nixie6254
    Description

    This dataset consists of 734 entries representing social media activity and performance from a local SME (Micro, Small, and Medium Enterprise) across TikTok, Instagram, and Twitter platforms. It captures key metrics related to audience interaction and content strategy effectiveness, and is valuable for evaluating and optimizing digital marketing efforts for small businesses.

    Area : Target location or customer region where the UMKM's content is directed. Category : The business content category (e.g., product promotion, education, seasonal campaign). Day : The day of the week the content was published. Month : The month the post went live. Platform : The social media platform used by the UMKM (TikTok, Instagram, or Twitter). Post Type : The format of the content posted: image, video, carousel, or text. Timestamp : The exact date and time when the content was posted. User : The username or business account that posted the content. Week : Week number within the year for time-based analysis. Year : The year the content was posted. Comments : Total number of comments received on the post. Engagement Rate : A calculated metric showing how engaging the content is (based on likes, comments, shares vs. reach/impressions). Hour : Hour of the day the post was published. Impressions : Number of times the content appeared on users' feeds. Likes : Number of likes the post received. Reach : Number of unique users who saw the content. Shares : Number of times users shared the content.

  3. Instagram accounts with the most followers worldwide 2024

    • statista.com
    • de.statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Instagram accounts with the most followers worldwide 2024 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    Cristiano Ronaldo has one of the most popular Instagram accounts as of April 2024.

                  The Portuguese footballer is the most-followed person on the photo sharing app platform with 628 million followers. Instagram's own account was ranked first with roughly 672 million followers.
    
                  How popular is Instagram?
    
                  Instagram is a photo-sharing social networking service that enables users to take pictures and edit them with filters. The platform allows users to post and share their images online and directly with their friends and followers on the social network. The cross-platform app reached one billion monthly active users in mid-2018. In 2020, there were over 114 million Instagram users in the United States and experts project this figure to surpass 127 million users in 2023.
    
                  Who uses Instagram?
    
                  Instagram audiences are predominantly young – recent data states that almost 60 percent of U.S. Instagram users are aged 34 years or younger. Fall 2020 data reveals that Instagram is also one of the most popular social media for teens and one of the social networks with the biggest reach among teens in the United States.
    
                  Celebrity influencers on Instagram
                  Many celebrities and athletes are brand spokespeople and generate additional income with social media advertising and sponsored content. Unsurprisingly, Ronaldo ranked first again, as the average media value of one of his Instagram posts was 985,441 U.S. dollars.
    
  4. Social Media Behavior Dataset

    • kaggle.com
    zip
    Updated Nov 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shibin Shereef (2024). Social Media Behavior Dataset [Dataset]. https://www.kaggle.com/datasets/shibinshereef1/social-media-behavior-dataset
    Explore at:
    zip(7429 bytes)Available download formats
    Dataset updated
    Nov 25, 2024
    Authors
    Shibin Shereef
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset contains 600 synthetic entries simulating social media activity across three major platforms: Twitter, Reddit, and Instagram. The data was generated to analyze trends, sentiments, and user engagement patterns based on hashtags and posts. It can be useful for researchers, data analysts, and machine learning enthusiasts interested in studying social media behavior.

    Dataset Structure The dataset includes the following columns:

    Date: The date of the post, ranging across a simulated timeline. Platform: The social media platform where the post was made (Twitter, Reddit, or Instagram). Hashtag: The main hashtag associated with the post, such as #AI, #MachineLearning, or #Python. Post Content: The text of the post, crafted to simulate common social media interactions. Sentiment: The sentiment of the post, classified as Positive, Neutral, or Negative. Likes: The number of likes the post received. Shares: The number of shares or retweets the post received. Potential Use Cases Sentiment analysis: Train machine learning models to detect sentiment in text. Hashtag popularity analysis: Determine which hashtags are most commonly used or generate the most engagement. Engagement trends: Explore correlations between post sentiment and engagement metrics (likes/shares). Platform comparison: Compare user behavior across different social media platforms. Acknowledgments This dataset is fully synthetic and was generated using Python. It does not contain any real user data and is intended for educational and research purposes.

  5. Social Media Datasets

    • brightdata.com
    .json, .csv, .xlsx
    Updated Sep 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2022). Social Media Datasets [Dataset]. https://brightdata.com/products/datasets/social-media
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Sep 7, 2022
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Gain valuable insights with our comprehensive Social Media Dataset, designed to help businesses, marketers, and analysts track trends, monitor engagement, and optimize strategies. This dataset provides structured and reliable social media data from multiple platforms.

    Dataset Features

    User Profiles: Access public social media profiles, including usernames, bios, follower counts, engagement metrics, and more. Ideal for audience analysis, influencer marketing, and competitive research. Posts & Content: Extract posts, captions, hashtags, media (images/videos), timestamps, and engagement metrics such as likes, shares, and comments. Useful for trend analysis, sentiment tracking, and content strategy optimization. Comments & Interactions: Analyze user interactions, including replies, mentions, and discussions. This data helps brands understand audience sentiment and engagement patterns. Hashtag & Trend Tracking: Monitor trending hashtags, topics, and viral content across platforms to stay ahead of industry trends and consumer interests.

    Customizable Subsets for Specific Needs Our Social Media Dataset is fully customizable, allowing you to filter data based on platform, region, keywords, engagement levels, or specific user profiles. Whether you need a broad dataset for market research or a focused subset for brand monitoring, we tailor the dataset to your needs.

    Popular Use Cases

    Brand Monitoring & Reputation Management: Track brand mentions, customer feedback, and sentiment analysis to manage online reputation effectively. Influencer Marketing & Audience Analysis: Identify key influencers, analyze engagement metrics, and optimize influencer partnerships. Competitive Intelligence: Monitor competitor activity, content performance, and audience engagement to refine marketing strategies. Market Research & Consumer Insights: Analyze social media trends, customer preferences, and emerging topics to inform business decisions. AI & Predictive Analytics: Leverage structured social media data for AI-driven trend forecasting, sentiment analysis, and automated content recommendations.

    Whether you're tracking brand sentiment, analyzing audience engagement, or monitoring industry trends, our Social Media Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.

  6. Top 100+ Social Media Platforms/Sites (2025)

    • kaggle.com
    zip
    Updated Jan 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Taimoor Khurshid Chughtai (2025). Top 100+ Social Media Platforms/Sites (2025) [Dataset]. https://www.kaggle.com/datasets/taimoor888/top-100-social-media-platformssites-2025
    Explore at:
    zip(2761 bytes)Available download formats
    Dataset updated
    Jan 12, 2025
    Authors
    Taimoor Khurshid Chughtai
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset provides detailed rankings and key metrics for 100+ social media platforms and sites in 2025. It includes information such as user base, popularity trends, and global reach. Ideal for analyzing social media growth, user engagement, and market trends. Whether you're a data scientist, marketer, or researcher, this dataset offers valuable insights into the evolving digital landscape.

  7. Social Media Platforms in the UK - Market Research Report (2015-2030)

    • ibisworld.com
    Updated Nov 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IBISWorld (2025). Social Media Platforms in the UK - Market Research Report (2015-2030) [Dataset]. https://www.ibisworld.com/united-kingdom/market-research-reports/social-media-platforms-industry/
    Explore at:
    Dataset updated
    Nov 15, 2025
    Dataset authored and provided by
    IBISWorld
    License

    https://www.ibisworld.com/about/termsofuse/https://www.ibisworld.com/about/termsofuse/

    Time period covered
    2015 - 2030
    Area covered
    United Kingdom
    Description

    Over the five years through 2025-26, industry revenue is forecast to expand at a compound annual rate of 20.3% to reach £12.5 billion. Social media platforms are integral to people's lives, offering ways to communicate, create and view content and share information. According to Ofcom, approximately 89% of UK internet users in 2023 used social media apps or sites. Teenagers and young adults are the biggest users. Advertising is the primary revenue source for social media platforms, although subscription-based services are gaining momentum as platforms seek to diversify their incomes. TikTok is the success story of the past five years, becoming the most downloaded app between 2020 and 2022, according to Apptopia. The short-form video platform has over 30 million monthly users in the UK in 2025. After Musk's takeover, X, formerly known as Twitter, adjusted its content moderation and allowed previously banned accounts to return. As a result, over 600 advertisers pulled their ads from the site because of fears their brand may be associated with malcontent. In response to falling ad revenue, X has introduced a subscription-based service which enables users to verify themselves and boosts the number of people who view their tweets. Meta-owned Facebook and Instagram have responded by introducing a similar service. In 2025, more social media platforms are using AI to boost user engagement. This improves click-through rates and drives higher advertising revenue. Industry revenue is expected to grow by 6.3% in 2025-26. Over the five years through 2030-31, social media platforms' revenue is projected to climb at an estimated 9.2% to reach £19.4 billion. Regulations relating to how data is collected, stored, and shared will force advertisers and platforms to rethink how they can target their desired demographics. The tightening of regulations will raise industry compliance costs, weighing on profit margin. Older age groups present a new revenue opportunity for social media platforms if they can bridge the gap between passive TV consumption and interactive digital engagement. Augmented Reality (AR) technology will move beyond filters to become standard for immersive product trials, interactive ads, and virtual meetups

  8. 🚀 Viral Social Media Trends & Engagement Analysis

    • kaggle.com
    zip
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Atharva Soundankar (2025). 🚀 Viral Social Media Trends & Engagement Analysis [Dataset]. https://www.kaggle.com/datasets/atharvasoundankar/viral-social-media-trends-and-engagement-analysis
    Explore at:
    zip(230834 bytes)Available download formats
    Dataset updated
    May 23, 2025
    Authors
    Atharva Soundankar
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset captures the pulse of viral social media trends across TikTok, Instagram, Twitter, and YouTube. It provides insights into the most popular hashtags, content types, and user engagement levels, offering a comprehensive view of how trends unfold across platforms. With regional data and influencer-driven content, this dataset is perfect for:

    • Trend analysis 🔍
    • Sentiment modeling 💭
    • Understanding influencer marketing 📈

    Dive in to explore what makes content go viral, the behaviors that drive engagement, and how trends evolve on a global scale! 🌍

  9. m

    Graph-Based Social Media Data on Mental Health Topics

    • data.mendeley.com
    Updated Nov 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Ady Sanjaya (2024). Graph-Based Social Media Data on Mental Health Topics [Dataset]. http://doi.org/10.17632/z45txpdp7f.2
    Explore at:
    Dataset updated
    Nov 4, 2024
    Authors
    Samuel Ady Sanjaya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is structured as a graph, where nodes represent users and edges capture their interactions, including tweets, retweets, replies, and mentions. Each node provides detailed user attributes, such as unique ID, follower and following counts, and verification status, offering insights into each user's identity, role, and influence in the mental health discourse. The edges illustrate user interactions, highlighting engagement patterns and types of content that drive responses, such as tweet impressions. This interconnected structure enables sentiment analysis and public reaction studies, allowing researchers to explore engagement trends and identify the mental health topics that resonate most with users.

    The dataset consists of three files: 1. Edges Data: Contains graph data essential for social network analysis, including fields for UserID (Source), UserID (Destination), Post/Tweet ID, and Date of Relationship. This file enables analysis of user connections without including tweet content, maintaining compliance with Twitter/X’s data-sharing policies. 2. Nodes Data: Offers user-specific details relevant to network analysis, including UserID, Account Creation Date, Follower and Following counts, Verified Status, and Date Joined Twitter. This file allows researchers to examine user behavior (e.g., identifying influential users or spam-like accounts) without direct reference to tweet content. 3. Twitter/X Content Data: This file contains only the raw tweet text as a single-column dataset, without associated user identifiers or metadata. By isolating the text, we ensure alignment with anonymization standards observed in similar published datasets, safeguarding user privacy in compliance with Twitter/X's data guidelines. This content is crucial for addressing the research focus on mental health discourse in social media. (References to prior Data in Brief publications involving Twitter/X data informed the dataset's structure.)

  10. Top 100 social media profiles

    • kaggle.com
    zip
    Updated Aug 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Medaxone (2022). Top 100 social media profiles [Dataset]. https://www.kaggle.com/datasets/medaxone/top-100-social-media-profiles
    Explore at:
    zip(9135 bytes)Available download formats
    Dataset updated
    Aug 7, 2022
    Authors
    Medaxone
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    A list of the most popular (top 100 by followers) Instagram, Twitter, YouTube, Twitch, and TikTok users. NB! For YouTube the followers are subscribers and the posts are videos.

  11. Social Media Engagement Report

    • kaggle.com
    zip
    Updated Apr 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ali Reda Elblgihy (2024). Social Media Engagement Report [Dataset]. https://www.kaggle.com/datasets/aliredaelblgihy/social-media-engagement-report
    Explore at:
    zip(49114657 bytes)Available download formats
    Dataset updated
    Apr 13, 2024
    Authors
    Ali Reda Elblgihy
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    *****Documentation Process***** 1. Data Preparation: - Upload the data into Power Query to assess quality and identify duplicate values, if any. - Verify data quality and types for each column, addressing any miswriting or inconsistencies. 2. Data Management: - Duplicate the original data sheet for future reference and label the new sheet as the "Working File" to preserve the integrity of the original dataset. 3. Understanding Metrics: - Clarify the meaning of column headers, particularly distinguishing between Impressions and Reach, and comprehend how Engagement Rate is calculated. - Engagement Rate formula: Total likes, comments, and shares divided by Reach. 4. Data Integrity Assurance: - Recognize that Impressions should outnumber Reach, reflecting total views versus unique audience size. - Investigate discrepancies between Reach and Impressions to ensure data integrity, identifying and resolving root causes for accurate reporting and analysis. 5. Data Correction: - Collaborate with the relevant team to rectify data inaccuracies, specifically addressing the discrepancy between Impressions and Reach. - Engage with the concerned team to understand the root cause of discrepancies between Impressions and Reach. - Identify instances where Impressions surpass Reach, potentially attributable to data transformation errors. - Following the rectification process, meticulously adjust the dataset to reflect the corrected Impressions and Reach values accurately. - Ensure diligent implementation of the corrections to maintain the integrity and reliability of the data. - Conduct a thorough recalculation of the Engagement Rate post-correction, adhering to rigorous data integrity standards to uphold the credibility of the analysis. 6. Data Enhancement: - Categorize Audience Age into three groups: "Senior Adults" (45+ years), "Mature Adults" (31-45 years), and "Adolescent Adults" (<30 years) within a new column named "Age Group." - Split date and time into separate columns using the text-to-columns option for improved analysis. 7. Temporal Analysis: - Introduce a new column for "Weekend and Weekday," renamed as "Weekday Type," to discern patterns and trends in engagement. - Define time periods by categorizing into "Morning," "Afternoon," "Evening," and "Night" based on time intervals. 8. Sentiment Analysis: - Populate blank cells in the Sentiment column with "Mixed Sentiment," denoting content containing both positive and negative sentiments or ambiguity. 9. Geographical Analysis: - Group countries and obtain additional continent data from an online source (e.g., https://statisticstimes.com/geography/countries-by-continents.php). - Add a new column for "Audience Continent" and utilize XLOOKUP function to retrieve corresponding continent data.

    *****Drawing Conclusions and Providing a Summary*****

    • The data is equally distributed across different categories, platforms, and over the years.
    • Most of our audience comprises senior adults (aged 45 and above).
    • Most of our audience exhibit mixed sentiments about our posts. However, an equal portion expresses consistent sentiments.
    • The majority of our posts were located in Africa.
    • The number of posts increased from the first year to the second year and remained relatively consistent for the third year.
    • The optimal time for posting is during the night on weekdays.
    • The highest engagement rates were observed in Croatia then Malawi.
    • The number of posts targeting senior adults is significantly higher than the other two categories. However, the engagement rates for mature and adolescent adults are also noteworthy, based on the number of targeted posts.
  12. Social Media Political Content Analysis Dataset

    • kaggle.com
    zip
    Updated May 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Faisal Hameed (2024). Social Media Political Content Analysis Dataset [Dataset]. https://www.kaggle.com/datasets/fysalhameed/impact-of-social-media-on-political-consent
    Explore at:
    zip(355107 bytes)Available download formats
    Dataset updated
    May 13, 2024
    Authors
    Faisal Hameed
    Description

    This dataset contains simulated data for social media users' demographics, behaviors, and perceptions related to political content. It includes features such as age, gender, education level, occupation, social media usage frequency, exposure to political content, and perceptions of accuracy and relevance.

    the features included in the "Social Media Political Content Analysis Dataset":

    1. Age: Age of the user.
    2. Gender: Gender identity of the user.
    3. Education Level: Highest level of education attained by the user.
    4. Occupation: Current occupation of the user.
    5. Political Affiliation: Political leaning or affiliation of the user (e.g., Liberal, Conservative, Independent).
    6. Geographic Location: Country or region where the user is located (e.g., USA, UK, Canada, Australia).
    7. Social Media Usage Frequency: Frequency of social media usage by the user (e.g., 0-1 hour, 1-2 hours, 2-4 hours, 4+ hours).
    8. Preferred Social Media: Social media platform preferred by the user (e.g., Facebook, Twitter, Instagram).
    9. Political Content Exposure: Frequency of exposure to political content on social media (e.g., Once a day, Few times a week, Rarely, Several times a day).
    10. Types of Political Content: Types of political content consumed by the user (e.g., News articles, Opinion pieces, Memes).
    11. Sources of Political Content: Sources from which the user obtains political content (e.g., Mainstream media, Political parties, Independent bloggers).
    12. Recency of Exposure: Recency of the user's exposure to political content (e.g., Within the last hour, Within the last 24 hours, Within the last week, Longer than a week ago).
    13. Interactions Frequency: Frequency of user interactions with political content on social media (e.g., Once a day, Few times a week, Rarely, Several times a day).
    14. Political Content Topics: Topics of political content that interest the user (e.g., Economy, Healthcare, Immigration, Environment).
    15. Perception of Accuracy: User's perception of the accuracy of political content on social media (e.g., Very accurate, Somewhat accurate, Not accurate).
    16. Awareness of Algorithms: Whether the user is aware of algorithms that determine their social media feed (e.g., Yes, No).
    17. Perception of Relevance: User's perception of the relevance of political content on social media (e.g., Very relevant, Somewhat relevant, Not relevant).
    18. Personal Impact: User's perception of the personal impact of political content on social media (e.g., Strong impact, Moderate impact, No impact).
    19. Trust in Social Media: User's level of trust in social media as a source of political information (e.g., Trust a lot, Trust somewhat, Do not trust).
    20. Concerns about Algorithms: User's level of concern about algorithms shaping their social media experience (e.g., Very concerned, Somewhat concerned, Not concerned).
    21. Overall Quality of Discourse: User's perception of the overall quality of political discourse on social media (e.g., High quality, Moderate quality, Low quality).
    22. Views on Influence: User's perception of the influence of political content on social media (e.g., Very influential, Somewhat influential, Not influential).
    23. Suggestions for Improvement: User's suggestions for improving the quality or experience of political content on social media (e.g., Increase transparency, Provide more diverse sources, Improve fact-checking, Enhance user controls).
  13. Iranian Credibility on Social Media

    • kaggle.com
    zip
    Updated Dec 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francis (2024). Iranian Credibility on Social Media [Dataset]. https://www.kaggle.com/datasets/noeyislearning/iranian-credibility-on-social-media
    Explore at:
    zip(9733 bytes)Available download formats
    Dataset updated
    Dec 5, 2024
    Authors
    Francis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Iran
    Description

    This dataset provides a comprehensive analysis of the factors influencing the credibility of information on social media among Iranian users. The research focuses on identifying the most significant factors that affect the perceived credibility of information shared on various social media platforms. The dataset includes demographic information, social media usage patterns, and ratings of various attributes related to information credibility.

    Key Features

    • Demographic Data: Includes age, gender, education level, study field, and university.
    • Social Media Usage: Details on the number of social media memberships, active platforms, and average hours spent per day.
    • Credibility Factors: Ratings on various attributes such as source trustworthiness, media structure, message accuracy, and more.
    • Comprehensive Coverage: Covers multiple dimensions of social media usage and information credibility.
    • User-Centric Insights: Provides insights into how users perceive and interact with information on social media.

    Potential Uses

    • Academic Research: Investigate the factors that influence information credibility on social media.
    • Social Media Analysis: Understand user behavior and preferences on social media platforms.
    • Policy Development: Inform policies related to information dissemination and credibility on social media.
    • Marketing and Advertising: Tailor content strategies based on user perceptions of credibility.
    • User Experience Design: Improve the design and functionality of social media platforms to enhance information credibility.
  14. d

    Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends,...

    • datarade.ai
    .json, .csv
    Updated Aug 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataplex (2024). Dataplex: Reddit Data | Global Social Media Data | 2.1M+ subreddits: trends, audience insights + more | Ideal for Interest-Based Segmentation [Dataset]. https://datarade.ai/data-products/dataplex-reddit-data-global-social-media-data-1-1m-mill-dataplex
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Aug 12, 2024
    Dataset authored and provided by
    Dataplex
    Area covered
    Mexico, Chile, Botswana, Martinique, Holy See, Gambia, Côte d'Ivoire, Jersey, Christmas Island, Macao
    Description

    The Reddit Subreddit Dataset by Dataplex offers a comprehensive and detailed view of Reddit’s vast ecosystem, now enhanced with appended AI-generated columns that provide additional insights and categorization. This dataset includes data from over 2.1 million subreddits, making it an invaluable resource for a wide range of analytical applications, from social media analysis to market research.

    Dataset Overview:

    This dataset includes detailed information on subreddit activities, user interactions, post frequency, comment data, and more. The inclusion of AI-generated columns adds an extra layer of analysis, offering sentiment analysis, topic categorization, and predictive insights that help users better understand the dynamics of each subreddit.

    2.1 Million Subreddits with Enhanced AI Insights: The dataset covers over 2.1 million subreddits and now includes AI-enhanced columns that provide: - Sentiment Analysis: AI-driven sentiment scores for posts and comments, allowing users to gauge community mood and reactions. - Topic Categorization: Automated categorization of subreddit content into relevant topics, making it easier to filter and analyze specific types of discussions. - Predictive Insights: AI models that predict trends, content virality, and user engagement, helping users anticipate future developments within subreddits.

    Sourced Directly from Reddit:

    All social media data in this dataset is sourced directly from Reddit, ensuring accuracy and authenticity. The dataset is updated regularly, reflecting the latest trends and user interactions on the platform. This ensures that users have access to the most current and relevant data for their analyses.

    Key Features:

    • Subreddit Metrics: Detailed data on subreddit activity, including the number of posts, comments, votes, and user participation.
    • User Engagement: Insights into how users interact with content, including comment threads, upvotes/downvotes, and participation rates.
    • Trending Topics: Track emerging trends and viral content across the platform, helping you stay ahead of the curve in understanding social media dynamics.
    • AI-Enhanced Analysis: Utilize AI-generated columns for sentiment analysis, topic categorization, and predictive insights, providing a deeper understanding of the data.

    Use Cases:

    • Social Media Analysis: Researchers and analysts can use this dataset to study online behavior, track the spread of information, and understand how content resonates with different audiences.
    • Market Research: Marketers can leverage the dataset to identify target audiences, understand consumer preferences, and tailor campaigns to specific communities.
    • Content Strategy: Content creators and strategists can use insights from the dataset to craft content that aligns with trending topics and user interests, maximizing engagement.
    • Academic Research: Academics can explore the dynamics of online communities, studying everything from the spread of misinformation to the formation of online subcultures.

    Data Quality and Reliability:

    The Reddit Subreddit Dataset emphasizes data quality and reliability. Each record is carefully compiled from Reddit’s vast database, ensuring that the information is both accurate and up-to-date. The AI-generated columns further enhance the dataset's value, providing automated insights that help users quickly identify key trends and sentiments.

    Integration and Usability:

    The dataset is provided in a format that is compatible with most data analysis tools and platforms, making it easy to integrate into existing workflows. Users can quickly import, analyze, and utilize the data for various applications, from market research to academic studies.

    User-Friendly Structure and Metadata:

    The data is organized for easy navigation and analysis, with metadata files included to help users identify relevant subreddits and data points. The AI-enhanced columns are clearly labeled and structured, allowing users to efficiently incorporate these insights into their analyses.

    Ideal For:

    • Data Analysts: Conduct in-depth analyses of subreddit trends, user engagement, and content virality. The dataset’s extensive coverage and AI-enhanced insights make it an invaluable tool for data-driven research.
    • Marketers: Use the dataset to better understand your target audience, tailor campaigns to specific interests, and track the effectiveness of marketing efforts across Reddit.
    • Researchers: Explore the social dynamics of online communities, analyze the spread of ideas and information, and study the impact of digital media on public discourse, all while leveraging AI-generated insights.

    This dataset is an essential resource for anyone looking to understand the intricacies of Reddit's vast ecosystem, offering the data and AI-enhanced insights needed to drive informed decisions and strategies across various fields. Whether you’re tracking emerging trends, analyzing user behavior, or conduc...

  15. Social Media Listening Market Analysis, Size, and Forecast 2025-2029: North...

    • technavio.com
    pdf
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Social Media Listening Market Analysis, Size, and Forecast 2025-2029: North America (US and Canada), Europe (France, Germany, UK), Middle East and Africa , APAC (China, India, Japan, South Korea), South America (Brazil), and Rest of World (ROW) [Dataset]. https://www.technavio.com/report/social-media-listening-market-industry-analysis
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Apr 17, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    License

    https://www.technavio.com/content/privacy-noticehttps://www.technavio.com/content/privacy-notice

    Time period covered
    2025 - 2029
    Area covered
    Germany, United States, France, Canada, United Kingdom
    Description

    Snapshot img

    Social Media Listening Market Size 2025-2029

    The social media listening market size is forecast to increase by USD 4.87 billion at a CAGR of 8.9% between 2024 and 2029.

    The market is experiencing significant growth, driven primarily by the increasing usage of social media platforms worldwide. With over 4.3 billion users as of 2021, social media has become a powerful tool for businesses to engage with their customers and gain valuable insights into consumer behavior and preferences. A key trend in this market is the integration of Artificial Intelligence (AI) and Machine Learning (ML) technologies in social media listening solutions, enabling more accurate and efficient data analysis. However, this market is not without challenges. Data privacy and regulatory compliance are becoming increasingly important, with stricter regulations being implemented to protect user data.
    Companies must ensure they have strong data security measures in place to comply with these regulations and maintain consumer trust. Additionally, the vast amount of data generated on social media requires sophisticated analytics tools to extract meaningful insights. As such, businesses seeking to capitalize on the opportunities presented by the market must invest in advanced analytics solutions and prioritize data security and privacy. By doing so, they can effectively navigate the challenges and stay ahead of the competition.
    

    What will be the Size of the Social Media Listening Market during the forecast period?

    Request Free Sample

    Social media listening has emerged as a crucial business tool, enabling organizations to gain valuable insights from the vast amount of data generated through social media activity. This data is analyzed using techniques such as topic modeling and sentiment scoring to understand consumer behavior, preferences, and trends. Social media geographics and demographics provide essential context, while social media reach and volume measure the scope and impact of conversations. Social media pulse and sentiment reflect the current sentiment and buzz surrounding specific topics, offering real-time insights into market dynamics and trends.
    Social media listening software is a vital component of the global market for social media analytics. Social media influence is assessed through the size and engagement of an audience, providing valuable information for marketing and brand management strategies. The social media landscape and heatmap offer a comprehensive view of the social media ecosystem, helping businesses stay informed and adapt to evolving patterns.
    

    How is this Social Media Listening Industry segmented?

    The social media listening industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.

    Type
    
      Software
      Services
    
    
    End-user
    
      Retail and e-commerce
      IT and telecom
      BFSI
      Media and entertainment
      Others
    
    
    Geography
    
      North America
    
        US
        Canada
    
    
      Europe
    
        France
        Germany
        UK
    
    
      Middle East and Africa
    
    
    
      APAC
    
        China
        India
        Japan
        South Korea
    
    
      South America
    
        Brazil
    
    
      Rest of World (ROW)
    

    By Type Insights

    The software segment is estimated to witness significant growth during the forecast period. This segment encompasses platforms and tools that offer real-time, automated, and scalable capabilities to monitor and analyze social media conversations across various channels such as Twitter, Facebook, Instagram, LinkedIn, TikTok, and Reddit. Real-time monitoring is a key feature of these solutions, empowering brands to identify mentions, trends, and sentiment as they emerge. By staying abreast of evolving topics, businesses can respond promptly to customer concerns, capitalize on viral events, and maintain a strong online presence. Artificial Intelligence (AI) and Machine Learning (ML) technologies are integral to social media listening software, enabling advanced topic identification, sentiment analysis, and trend recognition.

    These technologies enable businesses to gain valuable customer insights, inform product development, and enhance customer experience. Social media listening platforms also offer data visualization and reporting features, allowing businesses to analyze and present their findings in a clear and actionable manner. Additionally, they provide social media dashboards, alerts, and governance tools to ensure compliance with social media policies and ethical standards. In summary, social media listening software plays a pivotal role in the global market for social media analytics, offering real-time insights and advanced capabilities to help businesses navigate the complex social media landscape and engage effectively with their audience.

    Get a glance at the market report of share of v

  16. MicroBlog-Hot-Search-Labeled

    • kaggle.com
    zip
    Updated May 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ChaneMo (2024). MicroBlog-Hot-Search-Labeled [Dataset]. https://www.kaggle.com/datasets/chanemo/weibo-hot-searchlabeled
    Explore at:
    zip(4152220 bytes)Available download formats
    Dataset updated
    May 19, 2024
    Authors
    ChaneMo
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Weibo is one of the mainstream social media platforms in China. Among its features, trending topics serve as an important real-time information source for Weibo users, consisting of the most popular search terms at the moment. Weibo's official platform does not provide corresponding tag information for these trending topics, making it difficult for users to access specific categories of topics. To address this issue, we collected over 6,000 trending topic data entries from November 24th to December 23rd, 2020. Each entry was manually categorized into one of eight major categories: "(时政)Politics", "(科技)Technology", "(科普)Popular Science", "(娱乐)Entertainment", "(体育)Sports", "(社会讨论/话题)Social Discussions/Topics", "(时事)Current Affairs" and "(经济)Economy". This categorization aims to facilitate subsequent applications. Besides, we provide another dataset of hot search that are unlabeled. - Politics: The current political news happening now. - Technology: News related to high-tech products. - Popular Science: News topics about popularizing knowledge. - Entertainment: News related to celebrities or variety shows. - Sports: News related to sports events or sports celebrities. - Social Discussions/Topics: Hot topics being discussed by the general public. - Current Affairs: Current social events happening now. - Economy: News related to the economy.

  17. B

    Data from: The State of Social Media in Canada 2022

    • borealisdata.ca
    • dataone.org
    • +1more
    Updated Sep 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philip Mai; Anatoliy Gruzd (2022). The State of Social Media in Canada 2022 [Dataset]. http://doi.org/10.5683/SP3/BDFE7S
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 14, 2022
    Dataset provided by
    Borealis
    Authors
    Philip Mai; Anatoliy Gruzd
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Canada
    Description

    The report provides a snapshot of the social media usage trends amongst online Canadian adults based on an online survey of 1500 participants. Canada continues to be one of the most connected countries in the world. An overwhelming majority of online Canadian adults (94%) have an account on at least one social media platform. However, the 2022 survey results show that the COVID-19 pandemic has ushered in some changes in how and where Canadians are spending their time on social media. Dominant platforms such as Facebook, messaging apps and YouTube are still on top but are losing ground to newer platforms such as TikTok and more niche platforms such as Reddit and Twitch.

  18. d

    Removal and Enforcement Actions by Social Media Companies: Year and Month...

    • dataful.in
    Updated Nov 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataful (Factly) (2025). Removal and Enforcement Actions by Social Media Companies: Year and Month wise Number of Content Removed and Accounts Banned/Suspended by SSMIs and Violation Category [Dataset]. https://dataful.in/datasets/18652
    Explore at:
    csv, application/x-parquet, xlsxAvailable download formats
    Dataset updated
    Nov 5, 2025
    Dataset authored and provided by
    Dataful (Factly)
    License

    https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions

    Area covered
    India
    Variables measured
    Social Media Intermediaries Ban actions
    Description

    This dataset presents year and month wise enforcement actions taken by Significant Social Media Intermediaries (SSMIs) from 2021 to the present, compiled from the mandatory monthly transparency reports published under Rule 4(1)(d) of the Information Technology (Intermediary Guidelines and Digital Media Ethics Code) Rules, 2021. It includes counts of content removed, accounts suspended or banned, and chatrooms, comments, edit profiles and livestreams restricted, along with the policy or violation category (e.g., child sexual exploitation, terrorism, hate speech, bullying, violence, regulated goods, misinformation, etc.).

    To enable comparability across platforms with different reporting terms, the dataset uses a standardised enforcement classification:

    1. enforcement_type:

    The type of action taken: a. Content Actioned (any enforcement such as warning, downranking, age-gating), b. Content Removed (content deleted or made inaccessible), c. Account Banned (account suspension or disabling), d. Quality Metric (AI moderation accuracy indicators reported by some platforms).

    1. proactive_flag:

    Whether the platform identified and enforced before user reports: a. Proactive = Found via automated detection or internal review systems, b. Unknown = Platform did not specify proactive vs reactive.

    Notes: 1. SSMI denotes to Significant Social Media Intermediaries, with over 50,00,000 registered users in India, which primarily or solely enables online interaction between two or more users and allows them to create, upload, share, disseminate, modify or access information using its services

    1. Facebook & Instagram (Meta) a. Content Actioned counts any enforcement, not only removals (e.g., removals, warning screens/covering, age gates, downranking). b. Proactive Rate = (items found & actioned proactively) ÷ (total content actioned).

    2. X/Twitter a. Child Sexual Exploitation and terrorism suspensions are largely proactive, flagged using proprietary tools and industry hash-sharing systems. b. Data reflects global enforcement, not only India.

    3. Google / YouTube a. Number of removal actions as a result of automated detection captures actions triggered by automated systems (ML + human-trained models).

    4. ShareChat a. Content Removed / Taken Down / UGC discard / Comments/Chatrooms deleted are standardised as Content Removed. b. Also includes rights-holder reporting workflow for copyright/IP and automated proactive monitoring for harmful content.

    5. WhatsApp a. Reports Proactively Banned Accounts, meaning accounts banned before any user reports.

    6. Koo a. Distinguishes between Content Removed, Content Actioned (flagged/downranked), and Account Banned. b. Automation Correct/Wrong reflect AI moderation accuracy, not enforcement outcomes.

  19. Social Media Engagement: A Comprehensive Analysis

    • kaggle.com
    zip
    Updated Jul 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mehmet ISIK (2023). Social Media Engagement: A Comprehensive Analysis [Dataset]. https://www.kaggle.com/datasets/mehmetisik/livedataset
    Explore at:
    zip(101676 bytes)Available download formats
    Dataset updated
    Jul 12, 2023
    Authors
    Mehmet ISIK
    Description

    Dataset Overview

    This comprehensive dataset offers an in-depth analysis of social media engagements across various platforms. It captures the dynamics of user interactions by tracking the number of reactions, comments, shares, and types of posts. Ideal for social media analysts, marketers, and researchers, this dataset serves as a critical tool for understanding digital communication trends and enhancing social media strategies. Each entry provides detailed metrics on how posts are received by audiences, enabling data-driven insights into content performance.

    Key Features:

    📌 num_reactions: Total number of reactions a post receives, encapsulating the overall engagement. 📍 num_comments: Reflects the level of audience interaction through comments. 📸 num_shares: Indicates the virality of the post by counting how many times it has been shared. ❤️ num_likes: Tracks the number of likes, showing general approval of the content. 🥰 num_loves: Captures more intense affection reactions to posts. 😮 num_wows: Measures the surprise or awe factor of the post. 😂 num_hahas: Counts instances of amusement or laughter triggered by the post. 😢 num_sads: Reflects the number of sad reactions, indicating emotional impact. 😡 num_angrys: Tracks angry reactions, highlighting content that might be controversial or upsetting. 🔗 status_type_link: Binary indicator of whether the post includes a link, enhancing its informational value. 🖼️ status_type_photo: Identifies posts with photos, crucial for visual content analysis. 📝 status_type_status: Marks textual posts, focusing on written content engagement. 🎥 status_type_video: Distinguishes posts with videos, important for engagement in dynamic content.

    This dataset not only aids in measuring the effectiveness of social media campaigns but also supports the development of targeted marketing strategies and content optimization efforts to maximize audience engagement.

  20. Z

    #IndonesiaHumanRightsSOS Twitter Hashtag Tweets Dataset

    • data.niaid.nih.gov
    Updated Jul 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Azmi Nawwar (2024). #IndonesiaHumanRightsSOS Twitter Hashtag Tweets Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4362504
    Explore at:
    Dataset updated
    Jul 19, 2024
    Dataset provided by
    UIN Syarif Hidayatullah Jakarta
    Authors
    Azmi Nawwar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset ini merupakan hasil dari scraping pada media sosial twitter dengan menggunakan aplikasi twint yang ditujukan pada hashtag #IndonesiaHumanRightsSOS. Scraping data dilakukan untuk cuitan yang dibuat dari tanggal 18 Desember 2020 10:59 AM s/d 19 Desember 2020 23:18 PM.

    Pada dataset mengandung 106.903 Row data dengan informasi terkait: User ID, Username, Twitter Name,Tweets, dsb.

    Selain itu dilampirkan juga contoh data yang telah dianalisis berupa wordcloud,username cloud, 100 most used word & most active username.

    -

    This dataset is the result of scraping on social media twitter using the twint application aimed at the hashtag #IndonesiaHumanRightsSOS. Data scraping is done for tweets made from December 18 2020 10:59 AM to December 19 2020 23:18 PM.

    The dataset contains 106,903 rows of data with related information: User ID, Username, Twitter Name, Tweets, etc.

    Also there is an example of the data that has been analyzed in the form of wordcloud, username cloud, 100 most used words & most active username.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Shaik Barood Mohammed Umar Adnaan Faiz (2025). Daily Social Media Active Users [Dataset]. https://www.kaggle.com/datasets/umeradnaan/daily-social-media-active-users
Organization logo

Daily Social Media Active Users

"A thorough dataset that displays user activity on major social media platforms

Explore at:
zip(126814 bytes)Available download formats
Dataset updated
May 5, 2025
Authors
Shaik Barood Mohammed Umar Adnaan Faiz
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Description:

The "Daily Social Media Active Users" dataset provides a comprehensive and dynamic look into the digital presence and activity of global users across major social media platforms. The data was generated to simulate real-world usage patterns for 13 popular platforms, including Facebook, YouTube, WhatsApp, Instagram, WeChat, TikTok, Telegram, Snapchat, X (formerly Twitter), Pinterest, Reddit, Threads, LinkedIn, and Quora. This dataset contains 10,000 rows and includes several key fields that offer insights into user demographics, engagement, and usage habits.

Dataset Breakdown:

  • Platform: The name of the social media platform where the user activity is tracked. It includes globally recognized platforms, such as Facebook, YouTube, and TikTok, that are known for their large, active user bases.

  • Owner: The company or entity that owns and operates the platform. Examples include Meta for Facebook, Instagram, and WhatsApp, Google for YouTube, and ByteDance for TikTok.

  • Primary Usage: This category identifies the primary function of each platform. Social media platforms differ in their primary usage, whether it's for social networking, messaging, multimedia sharing, professional networking, or more.

  • Country: The geographical region where the user is located. The dataset simulates global coverage, showcasing users from diverse locations and regions. It helps in understanding how user behavior varies across different countries.

  • Daily Time Spent (min): This field tracks how much time a user spends on a given platform on a daily basis, expressed in minutes. Time spent data is critical for understanding user engagement levels and the popularity of specific platforms.

  • Verified Account: Indicates whether the user has a verified account. This feature mimics real-world patterns where verified users (often public figures, businesses, or influencers) have enhanced status on social media platforms.

  • Date Joined: The date when the user registered or started using the platform. This data simulates user account history and can provide insights into user retention trends or platform growth over time.

Context and Use Cases:

  • This synthetic dataset is designed to offer a privacy-friendly alternative for analytics, research, and machine learning purposes. Given the complexities and privacy concerns around using real user data, especially in the context of social media, this dataset offers a clean and secure way to develop, test, and fine-tune applications, models, and algorithms without the risks of handling sensitive or personal information.

Researchers, data scientists, and developers can use this dataset to:

  • Model User Behavior: By analyzing patterns in daily time spent, verified status, and country of origin, users can model and predict social media engagement behavior.

  • Test Analytics Tools: Social media monitoring and analytics platforms can use this dataset to simulate user activity and optimize their tools for engagement tracking, reporting, and visualization.

  • Train Machine Learning Algorithms: The dataset can be used to train models for various tasks like user segmentation, recommendation systems, or churn prediction based on engagement metrics.

  • Create Dashboards: This dataset can serve as the foundation for creating user-friendly dashboards that visualize user trends, platform comparisons, and engagement patterns across the globe.

  • Conduct Market Research: Business intelligence teams can use the data to understand how various demographics use social media, offering valuable insights into the most engaged regions, platform preferences, and usage behaviors.

  • Sources of Inspiration: This dataset is inspired by public data from industry reports, such as those from Statista, DataReportal, and other market research platforms. These sources provide insights into the global user base and usage statistics of popular social media platforms. The synthetic nature of this dataset allows for the use of realistic engagement metrics without violating any privacy concerns, making it an ideal tool for educational, analytical, and research purposes.

The structure and design of the dataset are based on real-world usage patterns and aim to represent a variety of users from different backgrounds, countries, and activity levels. This diversity makes it an ideal candidate for testing data-driven solutions and exploring social media trends.

Future Considerations:

As the social media landscape continues to evolve, this dataset can be updated or extended to include new platforms, engagement metrics, or user behaviors. Future iterations may incorporate features like post frequency, follower counts, engagement rates (likes, comments, shares), or even sentiment analysis from user-generated content.

By leveraging this dataset, analysts and data scientists can create better, more effective strategies ...

Search
Clear search
Close search
Google apps
Main menu