39 datasets found
  1. h

    Tiktok-Videos

    • huggingface.co
    Updated Oct 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DataHive AI (2025). Tiktok-Videos [Dataset]. https://huggingface.co/datasets/datahiveai/Tiktok-Videos
    Explore at:
    Dataset updated
    Oct 5, 2025
    Dataset authored and provided by
    DataHive AI
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    TikTok Video Analytics Dataset

    Sample TikTok video dataset with comprehensive engagement metrics and metadata. Each row represents a single TikTok video with content and detailed analytics. This is a sample dataset. To access the full version or request any custom dataset tailored to your needs, contact DataHive at contact@datahive.ai.

      Files Included
    

    train.csv – TikTok video analytics data

      What's included
    

    Video URLs and identifiers Comprehensive engagement… See the full description on the dataset page: https://huggingface.co/datasets/datahiveai/Tiktok-Videos.

  2. TikTok User Engagement Data

    • kaggle.com
    zip
    Updated Oct 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yakhyojon (2023). TikTok User Engagement Data [Dataset]. https://www.kaggle.com/datasets/yakhyojon/tiktok
    Explore at:
    zip(813245 bytes)Available download formats
    Dataset updated
    Oct 18, 2023
    Authors
    Yakhyojon
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    TikTok is the leading destination for short-form mobile video. The platform is built to help imaginations thrive. TikTok's mission is to create a place for inclusive, joyful, and authentic content–where people can safely discover, create, and connect.

    Column nameTypeDescription
    #intTikTok assigned number for video with claim/opinion.
    claim_statusobjWhether the published video has been identified as an “opinion” or a “claim.” In this dataset, an “opinion” refers to an individual’s or group’s personal belief or thought. A “claim” refers to information that is either unsourced or from an unverified source.
    video_idintRandom identifying number assigned to video upon publication on TikTok.
    video_duration_secintHow long the published video is measured in seconds.
    video_transcription_textobjTranscribed text of the words spoken in the published video.
    verified_statusobjIndicates the status of the TikTok user who published the video in terms of their verification, either “verified” or “not verified.”
    author_ban_statusobjIndicates the status of the TikTok user who published the video in terms of their permissions: “active,” “under scrutiny,” or “banned.”
    video_view_countfloatThe total number of times the published video has been viewed.
    video_like_countfloatThe total number of times the published video has been liked by other users.
    video_share_countfloatThe total number of times the published video has been shared by other users.
    video_download_countfloatThe total number of times the published video has been downloaded by other users.
    video_comment_countfloatThe total number of comments on the published video.
  3. TikTok Video Dataset

    • kaggle.com
    zip
    Updated Mar 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wasif Ullah (2025). TikTok Video Dataset [Dataset]. https://www.kaggle.com/datasets/wasifullahcs/tiktok-video-dataset
    Explore at:
    zip(1835515 bytes)Available download formats
    Dataset updated
    Mar 8, 2025
    Authors
    Wasif Ullah
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    his dataset contains a large collection of TikTok video metadata fetched using the TikTok Scraper API. It includes videos from multiple regions (e.g., US, India,) and categories (e.g., fyp, dance, comedy, food, travel, etc.). Each video entry

    provides detailed information such as:

    Video ID: Unique identifier for the video. Region: The region where the video is popular. Category: The keyword/category used to fetch the video (e.g., dance, comedy). Title: The title of the video. Duration: The length of the video in seconds. Play URL: Direct link to the video. Watermarked URL: Link to the watermarked version of the video. Cover Image: URL of the video's cover image. Music URL: Link to the music used in the video. Timestamp: The date and time when the data was fetched.

    How This Dataset Can Be Helpful

    Trend Analysis: Analyze trending videos across different regions and categories. Identify patterns in video popularity based on region, duration, or category.

    Machine Learning: Train models to predict video popularity based on features like duration, region, and category. Build recommendation systems for TikTok videos.

    Content Moderation: Use the dataset to analyze video content for moderation purposes.

    Sentiment Analysis: Perform sentiment analysis on video titles to understand user preferences.

    Cross-Region Insights: Compare video trends across different regions to understand cultural differences.

    How to Use This Dataset Filter by Region: Analyze videos from a specific region (e.g., US or India).

    Filter by Category: Focus on videos from a specific category (e.g., dance or comedy).

    Trend Analysis: Identify trending videos based on timestamp and region.

    Machine Learning: Use the dataset to train models for video popularity prediction or recommendation systems.

  4. Tiktok 2025 Dataset

    • kaggle.com
    zip
    Updated Jun 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haziq Halifi (2025). Tiktok 2025 Dataset [Dataset]. https://www.kaggle.com/datasets/haziqhalifi/tiktok-2025-dataset
    Explore at:
    zip(889553 bytes)Available download formats
    Dataset updated
    Jun 13, 2025
    Authors
    Haziq Halifi
    Description

    This dataset contains comprehensive information about TikTok posts, originally fetched from RapidAPI. It provides valuable insights into various aspects of TikTok content, including details about the videos, their creators, and audience engagement metrics.

    Here's a breakdown of the columns included in this dataset:

    video_id: A unique identifier for each TikTok video. author: The username or handle of the TikTok account that posted the video. description: The textual description or caption provided by the creator for the video. (Note: This column contains some missing values.) likes: The number of likes the video has received. comments: The number of comments on the video. shares: The number of times the video has been shared. plays: The total number of plays or views the video has accumulated. (Note: This column contains some missing values.) hashtags: A list of hashtags used in the video's description, which helps categorize content and improve discoverability. (Note: This column contains some missing values.) music: Information about the background music or sound used in the video. create_time: The timestamp indicating when the video was created or published. (Note: This column contains some missing values.) video_url: The direct URL to the TikTok video. fetch_time: The timestamp when the data for the video was fetched from the API. (Note: This column has a high number of missing values.) views: Another metric for the number of views. (Note: This column has a high number of missing values and appears to overlap with plays.) posted_time: The time the video was posted. (Note: This column has a high number of missing values and appears to overlap with create_time.) Potential Uses of This Dataset:

    Content Analysis: Analyze popular TikTok content by examining descriptions, hashtags, and engagement metrics. Trend Identification: Identify trending topics, music, and creators on TikTok. Audience Engagement Studies: Understand how different types of content generate likes, comments, shares, and plays. Creator Analysis: Study the posting habits and performance of various TikTok creators. Social Media Research: Conduct research on the dynamics of content dissemination and user interaction on short-form video platforms. Notes on Data Quality:

    The description, plays, hashtags, and create_time columns have some missing values, which may require handling (e.g., imputation or removal) depending on your analysis. The fetch_time, views, and posted_time columns are largely empty, suggesting they may not be reliable for comprehensive analysis. It is recommended to primarily rely on create_time for timestamps and plays for engagement metrics. This dataset can be a valuable resource for anyone looking to explore the vast and dynamic world of TikTok content and user engagement.

  5. f

    A Labelled Dataset for Sentiment Analysis of videos on YouTube, TikTok, and...

    • figshare.com
    • data.niaid.nih.gov
    • +1more
    application/csv
    Updated Jun 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nirmalya Thakur; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian (2024). A Labelled Dataset for Sentiment Analysis of videos on YouTube, TikTok, and other sources about the 2024 Outbreak of Measles [Dataset]. http://doi.org/10.6084/m9.figshare.26086492.v1
    Explore at:
    application/csvAvailable download formats
    Dataset updated
    Jun 24, 2024
    Dataset provided by
    figshare
    Authors
    Nirmalya Thakur; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    Please cite the following paper when using this dataset:N. Thakur, V. Su, M. Shao, K. Patel, H. Jeong, V. Knieling, and A.Bian “A labelled dataset for sentiment analysis of videos on YouTube, TikTok, and other sources about the 2024 outbreak of measles,” arXiv [cs.CY], 2024. Available: https://doi.org/10.48550/arXiv.2406.07693AbstractThis dataset contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder of the websites include Instagram and Facebook as well as the websites of various global and local news organizations. For each of these videos, the URL of the video, title of the post, description of the post, and the date of publication of the video are presented as separate attributes in the dataset. After developing this dataset, sentiment analysis (using VADER), subjectivity analysis (using TextBlob), and fine-grain sentiment analysis (using DistilRoBERTa-base) of the video titles and video descriptions were performed. This included classifying each video title and video description into (i) one of the sentiment classes i.e. positive, negative, or neutral, (ii) one of the subjectivity classes i.e. highly opinionated, neutral opinionated, or least opinionated, and (iii) one of the fine-grain sentiment classes i.e. fear, surprise, joy, sadness, anger, disgust, or neutral. These results are presented as separate attributes in the dataset for the training and testing of machine learning algorithms for performing sentiment analysis or subjectivity analysis in this field as well as for other applications. The paper associated with this dataset (please see the above-mentioned citation) also presents a list of open research questions that may be investigated using this dataset.

  6. Z

    Invasion of Ukraine Discourse on TikTok Dataset

    • data.niaid.nih.gov
    Updated May 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Steel, Benjamin; Parker, Sara; Ruths, Derek (2023). Invasion of Ukraine Discourse on TikTok Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7534951
    Explore at:
    Dataset updated
    May 11, 2023
    Dataset provided by
    Department of Political Science, McGill University, Montreal, QC, Canada
    School of Computer Science, McGill University, Montreal, QC, Canada
    Authors
    Steel, Benjamin; Parker, Sara; Ruths, Derek
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Ukraine
    Description

    This is a dataset of videos and comments related to the invasion of Ukraine, published on TikTok by a number of users over the year of 2022. It was compiled by Benjamin Steel, Sara Parker and Derek Ruths at the Network Dynamics Lab, McGill University. We created this dataset to facilitate the study of TikTok, and the nature of social interaction on the platform relevant to a major political event.

    The dataset has been released here on Zenodo: https://doi.org/10.5281/zenodo.7534952 as well as on Github: https://github.com/networkdynamics/data-and-code/tree/master/ukraine_tiktok

    To create the dataset, we identified hashtags and keywords explicitly related to the conflict to collect a core set of videos (or ”TikToks”). We then compiled comments associated with these videos. All of the data captured is publically available information, and contains personally identifiable information. In total we collected approximately 16 thousand videos and 12 million comments, from approximately 6 million users. There are approximately 1.9 comments on average per user captured, and 1.5 videos per user who posted a video. The author personally collected this data using the web scraping PyTok library, developed by the author: https://github.com/networkdynamics/pytok.

    Due to scraping duration, this is just a sample of the publically available discourse concerning the invasion of Ukraine on TikTok. Due to the fuzzy search functionality of the TikTok, the dataset contains videos with a range of relatedness to the invasion.

    We release here the unique video IDs of the dataset in a CSV format. The data was collected without the specific consent of the content creators, so we have released only the data required to re-create it, to allow users to delete content from TikTok and be removed from the dataset if they wish. Contained in this repository are scripts that will automatically pull the full dataset, which will take the form of JSON files organised into a folder for each video. The JSON files are the entirety of the data returned by the TikTok API. We include a script to parse the JSON files into CSV files with the most commonly used data. We plan to further expand this dataset as collection processes progress and the war continues. We will version the dataset to ensure reproducibility.

    To build this dataset from the IDs here:

    Go to https://github.com/networkdynamics/pytok and clone the repo locally

    Run pip install -e . in the pytok directory

    Run pip install pandas tqdm to install these libraries if not already installed

    Run get_videos.py to get the video data

    Run video_comments.py to get the comment data

    Run user_tiktoks.py to get the video history of the users

    Run hashtag_tiktoks.py or search_tiktoks.py to get more videos from other hashtags and search terms

    Run load_json_to_csv.py to compile the JSON files into two CSV files, comments.csv and videos.csv

    If you get an error about the wrong chrome version, use the command line argument get_videos.py --chrome-version YOUR_CHROME_VERSION Please note pulling data from TikTok takes a while! We recommend leaving the scripts running on a server for a while for them to finish downloading everything. Feel free to play around with the delay constants to either speed up the process or avoid TikTok rate limiting.

    Please do not hesitate to make an issue in this repo to get our help with this!

    The videos.csv will contain the following columns:

    video_id: Unique video ID

    createtime: UTC datetime of video creation time in YYYY-MM-DD HH:MM:SS format

    author_name: Unique author name

    author_id: Unique author ID

    desc: The full video description from the author

    hashtags: A list of hashtags used in the video description

    share_video_id: If the video is sharing another video, this is the video ID of that original video, else empty

    share_video_user_id: If the video is sharing another video, this the user ID of the author of that video, else empty

    share_video_user_name: If the video is sharing another video, this is the user name of the author of that video, else empty

    share_type: If the video is sharing another video, this is the type of the share, stitch, duet etc.

    mentions: A list of users mentioned in the video description, if any

    The comments.csv will contain the following columns:

    comment_id: Unique comment ID

    createtime: UTC datetime of comment creation time in YYYY-MM-DD HH:MM:SS format

    author_name: Unique author name

    author_id: Unique author ID

    text: Text of the comment

    mentions: A list of users that are tagged in the comment

    video_id: The ID of the video the comment is on

    comment_language: The language of the comment, as predicted by the TikTok API

    reply_comment_id: If the comment is replying to another comment, this is the ID of that comment

    The date can be compiled into a user interaction network to facilitate study of interaction dynamics. There is code to help with that here: https://github.com/networkdynamics/polar-seeds. Additional scripts for further preprocessing of this data can be found there too.

  7. TikTok global quarterly downloads 2018-2024

    • statista.com
    • de.statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department, TikTok global quarterly downloads 2018-2024 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    In the fourth quarter of 2024, TikTok generated around 186 million downloads from users worldwide. Initially launched in China first by ByteDance as Douyin, the short-video format was popularized by TikTok and took over the global social media environment in 2020. In the first quarter of 2020, TikTok downloads peaked at over 313.5 million worldwide, up by 62.3 percent compared to the first quarter of 2019.

                  TikTok interactions: is there a magic formula for content success?
    
                  In 2024, TikTok registered an engagement rate of approximately 4.64 percent on video content hosted on its platform. During the same examined year, the social video app recorded over 1,100 interactions on average. These interactions were primarily composed of likes, while only recording less than 20 comments per piece of content on average in 2024.
                  The platform has been actively monitoring the issue of fake interactions, as it removed around 236 million fake likes during the first quarter of 2024. Though there is no secret formula to get the maximum of these metrics, recommended video length can possibly contribute to the success of content on TikTok.
                  It was recommended that tiny TikTok accounts with up to 500 followers post videos that are around 2.6 minutes long as of the first quarter of 2024. While, the ideal video duration for huge TikTok accounts with over 50,000 followers was 7.28 minutes. The average length of TikTok videos posted by the creators in 2024 was around 43 seconds.
    
                  What’s trending on TikTok Shop?
    
                  Since its launch in September 2023, TikTok Shop has become one of the most popular online shopping platforms, offering consumers a wide variety of products. In 2023, TikTok shops featuring beauty and personal care items sold over 370 million products worldwide.
                  TikTok shops featuring womenswear and underwear, as well as food and beverages, followed with 285 and 138 million products sold, respectively. Similarly, in the United States market, health and beauty products were the most-selling items,
                  accounting for 85 percent of sales made via the TikTok Shop feature during the first month of its launch. In 2023, Indonesia was the market with the largest number of TikTok Shops, hosting over 20 percent of all TikTok Shops. Thailand and Vietnam followed with 18.29 and 17.54 percent of the total shops listed on the famous short video platform, respectively.
    
  8. TikTok Viral Trends 2025

    • kaggle.com
    zip
    Updated Sep 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imaad Mahmood (2025). TikTok Viral Trends 2025 [Dataset]. https://www.kaggle.com/datasets/imaadmahmood/tiktok-viral-trends-2025
    Explore at:
    zip(2940 bytes)Available download formats
    Dataset updated
    Sep 16, 2025
    Authors
    Imaad Mahmood
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    TikTok Viral Trends 2025

    September 2025 Viral Video Insights

    Overview

    This dataset, titled TikTok Viral Trends 2025, provides a curated snapshot of 50 trending TikTok videos from September 2025, capturing the platform's dynamic content landscape. Sourced from real-time web analyses and social media insights (e.g., X posts, trend reports from reputable sources like Ramdam, NapoleonCat, and Tokchart), it focuses on viral videos across diverse categories such as Entertainment, Music, Comedy, Lifestyle, Beauty, Sustainability, and Technology. The dataset is designed for data scientists, researchers, and enthusiasts interested in analyzing social media trends, predicting virality, or exploring multimodal machine learning applications (e.g., NLP, time-series, or clustering). It stands out from existing Kaggle datasets by offering fresh, 2025-specific data with rich metadata, including engagement metrics, hashtags, and sound/trend associations.

    Dataset Description

    • Size: 50 records, each representing a trending TikTok video or aggregated trend data from September 2025.
    • Format: CSV (tiktok_data.csv).
    • Source: Aggregated from public web sources and social media posts, ensuring authenticity and compliance with data-sharing guidelines. Specific sources are cited per record (e.g., post:72, web:65).
    • Update: Reflects trends as of September 16, 2025, making it more current than 2023-2024 TikTok datasets on Kaggle.

    Columns

    The dataset contains the following 12 columns: - video_id: Unique identifier for each video or trend (integer or hashtag-based). - author: Creator username or group (anonymized as "Unknown" where not specified). - description: Brief summary of the video content or trend, derived from source context. - upload_date: Approximate or exact posting date (YYYY-MM-DD). - views: Reported view count (e.g., millions, billions for hashtag aggregates; "N/A" if unavailable). - likes: Reported like count (e.g., thousands, millions; "N/A" if unavailable). - shares: Share count (often "N/A" due to limited public data). - comments: Comment count (often "N/A" due to limited public data). - hashtags: Key hashtags associated with the video or trend (e.g., #Kpop, #Viral). - category: Inferred content category (e.g., Entertainment, Music, Comedy, Lifestyle, Sustainability, Tech). - sound_or_trend: Associated audio track or challenge name driving the trend (e.g., "Soda Pop dance", "JUMP"). - source: Citation of data origin (e.g., post:72 for X post ID, web:65 for web source ID).

    Key Features

    • Diverse Categories: Includes K-pop (e.g., BLACKPINK, SEVENTEEN), dance challenges (e.g., Espresso Dance), AI-driven content (e.g., Identity Swap), comedy, lifestyle (e.g., SustainableSeptember), and beauty trends, reflecting TikTok's global appeal.
    • High Engagement: Videos with reported metrics show millions of views (e.g., 29.4M for BLACKPINK’s JUMP) and likes, with hashtag trends like #Perfume reaching 39.3B views.
    • Multimodal Potential: Supports text analysis (descriptions, hashtags), numerical analysis (views, likes), and categorical analysis (categories, sounds).
    • Timeliness: Captures September 2025 trends, including seasonal (e.g., Autumn Cozy Challenge) and cultural moments (e.g., K-pop releases, viral memes).

    Potential Use Cases

    This dataset is ideal for a variety of machine learning and data analysis tasks on Kaggle, including but not limited to: - Virality Prediction: Use views, likes, and hashtags to train regression or classification models (e.g., XGBoost, neural networks) to predict video success. - Trend Analysis: Apply clustering (e.g., K-means) or topic modeling (e.g., LDA) to identify emerging content themes or regional differences. - NLP Applications: Analyze descriptions and hashtags with BERT or word embeddings to study sentiment, cultural trends, or influencer impact. - Time-Series Forecasting: Leverage upload_date and engagement metrics for temporal analysis of trend lifecycles. - Recommendation Systems: Build content recommendation models based on category, sound, or hashtag similarities. - Social Media Ethics: Explore AI-driven trends (e.g., deepfake Identity Swaps) for studies on misinformation or content authenticity.

    Data Collection

    • Methodology: Data was aggregated from public web sources (e.g., trend reports, news snippets) and X posts discussing viral TikTok content. No private or restricted data was used, ensuring ethical sourcing.
    • Limitations: Some metrics (e.g., shares, comments) are "N/A" due to limited public availability. View and like counts are reported where available, with aggregates for trends (e.g., 686.4K videos for #Ominous). Exact metrics may vary slightly due to real-time fluctuations.
    • Verification: All entries ...
  9. TikTok Trending Videos

    • kaggle.com
    zip
    Updated Mar 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erik van de Ven (2021). TikTok Trending Videos [Dataset]. https://www.kaggle.com/datasets/erikvdven/tiktok-trending-december-2020/code
    Explore at:
    zip(3046350172 bytes)Available download formats
    Dataset updated
    Mar 27, 2021
    Authors
    Erik van de Ven
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Trending on TikTok

    We are probably all familiar with TikTok. People tend to spend hours each day scrolling through the millions of videos which are uploaded every single day. Not to mention the uploaders who are giving anything to get as many likes and followers as possible. But what makes one TikTok video a true hit or a miss? I give you an opportunity to figure this out ;)

    I scraped the first 1000 trending videos on TikTok, using an unofficial TikTok web-scraper. Note to mention I had to provide my user information to scrape the trending information, so trending might be a personalized page. But that doesn't change the fact that certain people and videos got a certain amount of likes and comments.

    I transformed the data into usable csv files and attached the actual videos as well.

    What's in the files

    Videos.zip This file contains the actual 1000 trending TikTok videos. Each filename corresponds to the id key in the trending.json file.

    trending.json The raw scraped dataset. I figured splitting up the dataset resulted in messy errors. For example: a user might have one avatar while posting a video and another while posting the next video. This resulted in multiple users with the same name, id etc. except for the avatar. So I decided to post the raw data and I will show you how to translate this multi-level JSON structure to a single DataFrame in my first Notebook.

    Acknowledgements

    Many thanks to Andrew Nord the creator of the tiktok-scraper, and his contributers.

    Inspiration

    So what does make a TikTok video a true hit? Is it the moment when a video is uploaded? Or perhaps the amount of followers is an important factor? Maybe the hashtags or even the music being used?

    So... are you the one who unlocks the mystery?

  10. TikTokData.xlsx

    • figshare.com
    xlsx
    Updated Jun 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emily Zawacki (2022). TikTokData.xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.20069333.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 14, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Emily Zawacki
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We used TikTok’s built-in account analytics to download and record video and account metrics for the period between 10/8/2021 and 2/6/2022. We collected the following summary data for each individual video: video views, likes, comments, shares, total cumulative play time, average duration the video was watched, percentage of viewers who watched the full video, unique reached audience, and the percentage of video views by section (For You, personal profile, Following, hashtags).
    We evaluated the “success” of videos based on reach and engagement metrics, as well as viewer retention (how long a video is watched). We used metrics of reach (number of unique users the video was seen by) and engagement (likes, comments, and shares) to calculate the engagement rate of each video. The engagement rate is calculated as the engagement parameter as a percentage of total reach (e.g., Likes / Audience Reached *100).

  11. TikTok: account removed 2020-2024, by reason

    • statista.com
    • de.statista.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department, TikTok: account removed 2020-2024, by reason [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    During the fourth quarter 2024, approximately 20.6 million TikTok accounts were removed from the platform due to suspicion of being operated by users under the age of 13. During the last measured period, around 185 million fake accounts were removed from fake accounts removed from TikTok.

  12. h

    TikTok_Most_Shared_Video_Transcription_Example

    • huggingface.co
    Updated Aug 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gopher AI (2025). TikTok_Most_Shared_Video_Transcription_Example [Dataset]. https://huggingface.co/datasets/Gopher-Lab/TikTok_Most_Shared_Video_Transcription_Example
    Explore at:
    Dataset updated
    Aug 11, 2025
    Dataset authored and provided by
    Gopher AI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    📲 Example Dataset: TikTok Scraper Tool

    👉 Start Scraping TikTok: TikTok Scraper Tool

      ✨ Key Features
    

    ⚡ Instant Transcription – Turn any TikTok video into an AI-ready transcript
    🎯 Metadata – Get the title, language description, and video hashtags
    🔗 URL-Based Access – Just drop in a TikTok video URL to start scraping
    🧩 LLM-Ready Output – Receive clean JSON ready for agents, RAG, or AI tools
    💸 Free Tier – Use up to 100 queries during the beta period
    💫 Easy… See the full description on the dataset page: https://huggingface.co/datasets/Gopher-Lab/TikTok_Most_Shared_Video_Transcription_Example.

  13. h

    Tiktok_Chatgpt_Prompt_Guide

    • huggingface.co
    Updated Aug 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gopher AI (2025). Tiktok_Chatgpt_Prompt_Guide [Dataset]. https://huggingface.co/datasets/Gopher-Lab/Tiktok_Chatgpt_Prompt_Guide
    Explore at:
    Dataset updated
    Aug 11, 2025
    Dataset authored and provided by
    Gopher AI
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    📲 Example Dataset: TikTok Scraper Tool

    👉 Start Scraping TikTok: TikTok Scraper Tool

      ✨ Key Features
    

    ⚡ Instant Transcription – Turn any TikTok video into an AI-ready transcript
    🎯 Metadata – Get the title, language, description, and video hashtags
    🔗 URL-Based Access – Just drop in a TikTok video URL to start scraping
    🧩 LLM-Ready Output – Receive clean JSON ready for agents, RAG, or AI tools
    💸 Free Tier – Use up to 100 queries during the beta period
    💫 Easy… See the full description on the dataset page: https://huggingface.co/datasets/Gopher-Lab/Tiktok_Chatgpt_Prompt_Guide.

  14. d

    TikTok Political Engagement Dataset

    • search.dataone.org
    Updated Oct 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Biswas, Ahana; Javadian Sabet, Alireza; Lin, Yu-Ru (2025). TikTok Political Engagement Dataset [Dataset]. http://doi.org/10.7910/DVN/CHYOPR
    Explore at:
    Dataset updated
    Oct 29, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Biswas, Ahana; Javadian Sabet, Alireza; Lin, Yu-Ru
    Description

    This repository contains all IDs for political TikTok posts used in the study “Toxic Politics and TikTok Engagement in the 2024 U.S. Election”, published in the Harvard Kennedy School Misinformation Review. The project investigates how political partisanship, toxicity, and topical content influence user engagement with TikTok videos during the 2024 U.S. presidential election cycle. If you use this dataset, please cite: Biswas, A., Javadian Sabet, A., & Lin, Y.-R. (2025). Toxic politics and TikTok engagement in the 2024 U.S. election. Harvard Kennedy School (HKS) Misinformation Review. https://doi.org/10.37016/mr-2020-181

  15. d

    Replication Data for: How effective are TikTok misinformation debunking...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhargava, Puneet (2023). Replication Data for: How effective are TikTok misinformation debunking videos? [Dataset]. http://doi.org/10.7910/DVN/0BL67B
    Explore at:
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Bhargava, Puneet
    Description

    Replication Data for: How effective are TikTok misinformation debunking videos? Data, Preregistration, Qualtrics, Scripts, Videos

  16. TikTok Celebrity

    • kaggle.com
    zip
    Updated Sep 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdullah Khan (2024). TikTok Celebrity [Dataset]. https://www.kaggle.com/datasets/abdullahkhan900/tiktok-celebrity
    Explore at:
    zip(1917 bytes)Available download formats
    Dataset updated
    Sep 22, 2024
    Authors
    Abdullah Khan
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    https://s3-prod.adage.com/s3fs-public/20230807_celeb_run_agencies_3x2.jpg" alt="Celebs"> The dataset you provided appears to focus on TikTok celebrities and contains the following columns:

    Celebrity: The name or handle of the TikTok celebrity. Followers: The number of followers the celebrity has, often represented in millions or billions. Following: The number of accounts the celebrity follows, which may be represented as thousands (K) or just a number. Likes: The total number of likes the celebrity’s videos have received, often represented in millions or billions. T.Videos: The total number of videos posted by the celebrity. Video Duration: The typical duration of their videos, which ranges from a few seconds (e.g., 10 - 15 seconds) to over a minute. Average Views: The average number of views their videos receive, often in millions. Net Worth: The estimated net worth of the celebrity, often represented in millions or billions of dollars. Most Viewed Video: The number of views for their most popular video, usually in millions or billions. Most Liked Video: The number of likes for their most popular video, represented in millions or billions. Video Category: The types or categories of videos the celebrity posts, such as comedy, dance, acting, challenges, etc.

  17. H

    Replication Data for "Beyond affective polarization: How emotion and...

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Jul 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sang Jung Kim; Isabel Villanueva; Kaiping Chen (2023). Replication Data for "Beyond affective polarization: How emotion and identity cues are used in anti-vaccination conspiracies on TikTok" [Dataset]. http://doi.org/10.7910/DVN/U6FIQW
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 29, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Sang Jung Kim; Isabel Villanueva; Kaiping Chen
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This deposit provides the analyzed dataset (anonymized) and the R scripts to reproduce the figure/tables in our manuscript. Our paper examines the emotional cues and identity cues used in TikTok videos about (anti) vaccination.

  18. Number of global social network users 2017-2028

    • statista.com
    • de.statista.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Number of global social network users 2017-2028 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    How many people use social media?

                  Social media usage is one of the most popular online activities. In 2024, over five billion people were using social media worldwide, a number projected to increase to over six billion in 2028.
    
                  Who uses social media?
                  Social networking is one of the most popular digital activities worldwide and it is no surprise that social networking penetration across all regions is constantly increasing. As of January 2023, the global social media usage rate stood at 59 percent. This figure is anticipated to grow as lesser developed digital markets catch up with other regions
                  when it comes to infrastructure development and the availability of cheap mobile devices. In fact, most of social media’s global growth is driven by the increasing usage of mobile devices. Mobile-first market Eastern Asia topped the global ranking of mobile social networking penetration, followed by established digital powerhouses such as the Americas and Northern Europe.
    
                  How much time do people spend on social media?
                  Social media is an integral part of daily internet usage. On average, internet users spend 151 minutes per day on social media and messaging apps, an increase of 40 minutes since 2015. On average, internet users in Latin America had the highest average time spent per day on social media.
    
                  What are the most popular social media platforms?
                  Market leader Facebook was the first social network to surpass one billion registered accounts and currently boasts approximately 2.9 billion monthly active users, making it the most popular social network worldwide. In June 2023, the top social media apps in the Apple App Store included mobile messaging apps WhatsApp and Telegram Messenger, as well as the ever-popular app version of Facebook.
    
  19. Average daily time spent on social media worldwide 2012-2024

    • statista.com
    • de.statista.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Average daily time spent on social media worldwide 2012-2024 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    How much time do people spend on social media?

                  As of 2024, the average daily social media usage of internet users worldwide amounted to 143 minutes per day, down from 151 minutes in the previous year. Currently, the country with the most time spent on social media per day is Brazil, with online users spending an average of three hours and 49 minutes on social media each day. In comparison, the daily time spent with social media in
                  the U.S. was just two hours and 16 minutes. Global social media usageCurrently, the global social network penetration rate is 62.3 percent. Northern Europe had an 81.7 percent social media penetration rate, topping the ranking of global social media usage by region. Eastern and Middle Africa closed the ranking with 10.1 and 9.6 percent usage reach, respectively.
                  People access social media for a variety of reasons. Users like to find funny or entertaining content and enjoy sharing photos and videos with friends, but mainly use social media to stay in touch with current events friends. Global impact of social mediaSocial media has a wide-reaching and significant impact on not only online activities but also offline behavior and life in general.
                  During a global online user survey in February 2019, a significant share of respondents stated that social media had increased their access to information, ease of communication, and freedom of expression. On the flip side, respondents also felt that social media had worsened their personal privacy, increased a polarization in politics and heightened everyday distractions.
    
  20. Global social network penetration 2019-2028

    • statista.com
    • de.statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Global social network penetration 2019-2028 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    The global social media penetration rate in was forecast to continuously increase between 2024 and 2028 by in total 11.6 (+18.19 percent). After the ninth consecutive increasing year, the penetration rate is estimated to reach 75.31 and therefore a new peak in 2028. Notably, the social media penetration rate of was continuously increasing over the past years.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
DataHive AI (2025). Tiktok-Videos [Dataset]. https://huggingface.co/datasets/datahiveai/Tiktok-Videos

Tiktok-Videos

datahiveai/Tiktok-Videos

TikTok Video Analytics Dataset

Explore at:
Dataset updated
Oct 5, 2025
Dataset authored and provided by
DataHive AI
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

TikTok Video Analytics Dataset

Sample TikTok video dataset with comprehensive engagement metrics and metadata. Each row represents a single TikTok video with content and detailed analytics. This is a sample dataset. To access the full version or request any custom dataset tailored to your needs, contact DataHive at contact@datahive.ai.

  Files Included

train.csv – TikTok video analytics data

  What's included

Video URLs and identifiers Comprehensive engagement… See the full description on the dataset page: https://huggingface.co/datasets/datahiveai/Tiktok-Videos.

Search
Clear search
Close search
Google apps
Main menu