Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset contains LinkedIn Influencers' post details and other details(post dependent as well as independent) per post. This dataset can be used to analyze LinkedIn reach based on post content and related account details.
This dataset is great for Exploratory Data Analysis and NLP tasks.
The data was scraped using BeautifulSoup and Selenium.Last updated on 15th Feb,2021
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
📱 About Dataset Overview This Social Media Engagement Dataset contains comprehensive engagement metrics from 5,000 social media posts across six major platforms: Instagram, Twitter, Facebook, LinkedIn, TikTok, and YouTube. The dataset spans over 2 years (2024-2025) and provides valuable insights into content performance, audience engagement patterns, and influencer analytics.
Dataset Contents The dataset includes 20 detailed features covering various aspects of social media engagement:
Post Information Post_ID: Unique identifier for each post Timestamp: Date and time when the post was published Platform: Social media platform (Instagram, Twitter, Facebook, LinkedIn, TikTok, YouTube) Content_Type: Type of content (Photo, Video, Reel, Tweet, Story, etc.) Category: Content category (Technology, Fashion, Food, Travel, Fitness, Education, Entertainment, Business, Lifestyle, Gaming, Health, Sports) Engagement Metrics Likes: Number of likes/reactions received Comments: Number of comments on the post Shares: Number of shares/retweets/reposts Views: Total number of views Saves: Number of bookmarks/saves Engagement_Rate: Calculated engagement rate percentage Account Information Follower_Count: Number of followers of the account Influencer_Tier: Classification (Nano, Micro, Mid-tier, Macro) Is_Verified: Whether the account is verified (True/False) Content Characteristics Hashtag_Count: Number of hashtags used Content_Length: Length in characters (text) or seconds (video) Sentiment: Sentiment analysis (Positive, Neutral, Negative) Has_Media: Whether post contains media (True/False) Temporal Features Hour_of_Day: Hour when the post was published (0-23) Day_of_Week: Day of the week (Monday-Sunday) Use Cases This dataset is perfect for:
📊 Predictive Analytics: Build ML models to predict engagement rates 📈 Data Visualization: Create insightful dashboards and charts 🤖 Machine Learning: Classification, regression, and clustering tasks ⏰ Time Series Analysis: Analyze posting patterns and optimal timing 🎯 Content Strategy: Optimize content strategy based on data insights 🔍 Sentiment Analysis: Study correlation between sentiment and engagement 📱 Platform Comparison: Compare performance across different platforms 💼 Influencer Marketing: Analyze influencer tier performance Technical Details Format: CSV Size: ~651 KB Rows: 5,000 Columns: 20 Time Period: January 2024 - December 2025 Missing Values: None Potential Research Questions What time of day generates the most engagement? Which platform has the highest engagement rates? How does content type affect performance? Does verified status impact engagement? What's the optimal hashtag count? How does sentiment correlate with engagement? Notes Engagement metrics are platform-realistic and proportional All data is synthetically generated for educational and research purposes Suitable for beginners and advanced data scientists
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description:
The "Daily Social Media Active Users" dataset provides a comprehensive and dynamic look into the digital presence and activity of global users across major social media platforms. The data was generated to simulate real-world usage patterns for 13 popular platforms, including Facebook, YouTube, WhatsApp, Instagram, WeChat, TikTok, Telegram, Snapchat, X (formerly Twitter), Pinterest, Reddit, Threads, LinkedIn, and Quora. This dataset contains 10,000 rows and includes several key fields that offer insights into user demographics, engagement, and usage habits.
Dataset Breakdown:
Platform: The name of the social media platform where the user activity is tracked. It includes globally recognized platforms, such as Facebook, YouTube, and TikTok, that are known for their large, active user bases.
Owner: The company or entity that owns and operates the platform. Examples include Meta for Facebook, Instagram, and WhatsApp, Google for YouTube, and ByteDance for TikTok.
Primary Usage: This category identifies the primary function of each platform. Social media platforms differ in their primary usage, whether it's for social networking, messaging, multimedia sharing, professional networking, or more.
Country: The geographical region where the user is located. The dataset simulates global coverage, showcasing users from diverse locations and regions. It helps in understanding how user behavior varies across different countries.
Daily Time Spent (min): This field tracks how much time a user spends on a given platform on a daily basis, expressed in minutes. Time spent data is critical for understanding user engagement levels and the popularity of specific platforms.
Verified Account: Indicates whether the user has a verified account. This feature mimics real-world patterns where verified users (often public figures, businesses, or influencers) have enhanced status on social media platforms.
Date Joined: The date when the user registered or started using the platform. This data simulates user account history and can provide insights into user retention trends or platform growth over time.
Context and Use Cases:
Researchers, data scientists, and developers can use this dataset to:
Model User Behavior: By analyzing patterns in daily time spent, verified status, and country of origin, users can model and predict social media engagement behavior.
Test Analytics Tools: Social media monitoring and analytics platforms can use this dataset to simulate user activity and optimize their tools for engagement tracking, reporting, and visualization.
Train Machine Learning Algorithms: The dataset can be used to train models for various tasks like user segmentation, recommendation systems, or churn prediction based on engagement metrics.
Create Dashboards: This dataset can serve as the foundation for creating user-friendly dashboards that visualize user trends, platform comparisons, and engagement patterns across the globe.
Conduct Market Research: Business intelligence teams can use the data to understand how various demographics use social media, offering valuable insights into the most engaged regions, platform preferences, and usage behaviors.
Sources of Inspiration: This dataset is inspired by public data from industry reports, such as those from Statista, DataReportal, and other market research platforms. These sources provide insights into the global user base and usage statistics of popular social media platforms. The synthetic nature of this dataset allows for the use of realistic engagement metrics without violating any privacy concerns, making it an ideal tool for educational, analytical, and research purposes.
The structure and design of the dataset are based on real-world usage patterns and aim to represent a variety of users from different backgrounds, countries, and activity levels. This diversity makes it an ideal candidate for testing data-driven solutions and exploring social media trends.
Future Considerations:
As the social media landscape continues to evolve, this dataset can be updated or extended to include new platforms, engagement metrics, or user behaviors. Future iterations may incorporate features like post frequency, follower counts, engagement rates (likes, comments, shares), or even sentiment analysis from user-generated content.
By leveraging this dataset, analysts and data scientists can create better, more effective strategies ...
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset contains LinkedIn Influencers' post details and other details(post dependent as well as independent) per post. This dataset can be used to analyze LinkedIn reach based on post content and related account details.
This dataset is great for Exploratory Data Analysis and NLP tasks.
The data was scraped using BeautifulSoup and Selenium.Last updated on 15th Feb,2021