100+ datasets found
  1. Instagram Hashtags Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Dec 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2024). Instagram Hashtags Dataset [Dataset]. https://brightdata.com/products/datasets/instagram/hashtags
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Dec 20, 2024
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Use our Instagram Hashtags dataset (public data) to extract insights by filtering hashtags, follower counts, account type, or engagement metrics. Depending on your needs, you can purchase the full dataset or a customized subset. Popular use cases include trend analysis, brand monitoring, hashtag optimization, and influencer marketing. The dataset includes key data points such as hashtags, engagement scores, associated posts, locations, account types (business/non-business), and much more.

  2. h

    instagram-dataset-frr

    • huggingface.co
    Updated Apr 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scene Genie (2024). instagram-dataset-frr [Dataset]. https://huggingface.co/datasets/scene-genie/instagram-dataset-frr
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 21, 2024
    Dataset authored and provided by
    Scene Genie
    Description

    scene-genie/instagram-dataset-frr dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. Instagram: most popular posts as of 2025

    • statista.com
    • ai-chatbox.pro
    Updated Jun 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon (2025). Instagram: most popular posts as of 2025 [Dataset]. https://www.statista.com/topics/1882/instagram/
    Explore at:
    Dataset updated
    Jun 3, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    Instagram’s most popular post As of June 2025, the most popular post on Instagram was Lionel Messi and his teammates after winning the 2022 FIFA World Cup with Argentina, posted by the account @leomessi. Messi's post, which racked up over 60 million likes within a day, knocked off the reigning post, which was 'Photo of an Egg'. Originally posted in January 2021, 'Photo of an Egg' surpassed the world’s most popular Instagram post at that time, which was a photo by Kylie Jenner’s daughter totaling 18 million likes. After several cryptic posts published by the account, World Record Egg revealed itself to be a part of a mental health campaign aimed at the pressures of social media use. Instagram’s most popular accounts As of April 2024, the official Instagram account @instagram had the most followers of any account on the platform, with 672 million followers. Portuguese footballer Cristiano Ronaldo (@cristiano) was the most followed individual with 628 million followers, while Selena Gomez (@selenagomez) was the most followed woman on the platform with 429 million. Additionally, Inter Miami CF striker Lionel Messi (@leomessi) had a total of 502 million. Celebrities such as The Rock, Kylie Jenner, and Ariana Grande all had over 380 million followers each. Instagram influencers In the United States, the leading content category of Instagram influencers was lifestyle, with 15.25 percent of influencers creating lifestyle content in 2021. Music ranked in second place with 10.96 percent, followed by family with 8.24 percent. Having a large audience can be very lucrative: Instagram influencers in the United States, Canada and the United Kingdom with over 90,000 followers made around 1,221 US dollars per post. Instagram around the globe Instagram’s worldwide popularity continues to grow, and India is the leading country in terms of number of users, with over 362.9 million users as of January 2024. The United States had 169.65 million Instagram users and Brazil had 134.6 million users. The social media platform was also very popular in Indonesia and Turkey, with 100.9 and 57.1, respectively. As of January 2024, Instagram was the fourth most popular social network in the world, behind Facebook, YouTube and WhatsApp.

  4. i

    Data from: Five Years of COVID-19 Discourse on Instagram: A Labeled...

    • ieee-dataport.org
    Updated Jan 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nirmalya Thakur (2025). Five Years of COVID-19 Discourse on Instagram: A Labeled Instagram Dataset of Over Half a Million Posts for Multilingual Sentiment Analysis [Dataset]. https://ieee-dataport.org/documents/five-years-covid-19-discourse-instagram-labeled-instagram-dataset-over-half-million-posts
    Explore at:
    Dataset updated
    Jan 22, 2025
    Authors
    Nirmalya Thakur
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To download this dataset without purchasing an IEEE Dataport subscription

  5. Instagram: distribution of global audiences 2025, by gender

    • statista.com
    • ai-chatbox.pro
    Updated Jun 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon (2025). Instagram: distribution of global audiences 2025, by gender [Dataset]. https://www.statista.com/topics/1882/instagram/
    Explore at:
    Dataset updated
    Jun 3, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    As of April 2025, Instagram was slightly more popular with men than women, with men accounting for 52.5 percent of the platform’s global users. Additionally, the social media app was most popular amongst younger audiences, with almost 31.6 percent of users aged between 25 and 34 years. Instagram’s Global Audience As of February 2025, Instagram was the third most popular social media platform globally, reaching two billion monthly active users (MAU). This number is projected to keep growing with no signs of slowing down, which is not a surprise as the global online social penetration rate across all regions is constantly increasing. As of February 2025, the country with the largest Instagram audience was India with 413.85 million users, followed by the United States with 171.7 million users. Who is winning over the generations? Even though Instagram’s audience is considerably larger than TikTok’s on a global scale, TikTok has shown itself to be a fierce competitor, particularly amongst younger audiences. TikTok was the most downloaded mobile app globally in 2024, generating 825.48 million downloads. As of 2022, Generation Z in the United States spent more time on TikTok than on Instagram monthly.

  6. Instagram: distribution of global audiences 2025, by age group

    • statista.com
    • ai-chatbox.pro
    Updated Jun 2, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Instagram: distribution of global audiences 2025, by age group [Dataset]. https://www.statista.com/statistics/325587/instagram-global-age-group/
    Explore at:
    Dataset updated
    Jun 2, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Apr 2025
    Area covered
    Worldwide
    Description

    As of April 2025, almost 32 percent of global Instagram audiences were aged between 25 and 34 years, and 29.5 percent of users were aged between 25 and 34 years. Overall, 16.3 percent of users belonged to the 35 to 44 year age group. Instagram users With roughly one billion monthly active users, Instagram belongs to the most popular social networks worldwide. The social photo sharing app is especially popular in India and in the United States, which have respectively 413.85 million and 171.7 million Instagram users each. Instagram features One of the most popular features of Instagram is Stories. Users can post photos and videos to their Stories stream and the content is live for others to view for 24 hours before it disappears. In January 2019, the company reported that there were 500 million daily active Instagram Stories users. Instagram Stories directly competes with Snapchat, another photo sharing app that initially became famous due to it’s “vanishing photos” feature. As of the first quarter of 2025, Snapchat had 460 million daily active users.

  7. Threads, an Instagram app Reviews

    • kaggle.com
    Updated Jul 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saloni Jhalani (2023). Threads, an Instagram app Reviews [Dataset]. https://www.kaggle.com/datasets/saloni1712/threads-an-instagram-app-reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Saloni Jhalani
    License

    Attribution-NoDerivs 4.0 (CC BY-ND 4.0)https://creativecommons.org/licenses/by-nd/4.0/
    License information was derived automatically

    Description

    The Threads, an Instagram App Reviews dataset is a comprehensive collection of user reviews from the Threads mobile app on Google Play Store & App Store, capturing valuable insights and sentiments. The dataset enables the understanding of user satisfaction, evaluation of app performance, and identification of emerging patterns.

    The way data was collected

    Scraping Threads App reviews on Google Play Store & App Store

    Ideas for using this dataset
    • Sentiment analysis
    • What makes the application receive 1-star and 5-star

    Note - It was last updated on July 26th 2023

  8. h

    instagram-images-with-captions

    • huggingface.co
    Updated Nov 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    K Cosmos (2024). instagram-images-with-captions [Dataset]. https://huggingface.co/datasets/kkcosmos/instagram-images-with-captions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 29, 2024
    Authors
    K Cosmos
    License

    https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

    Description

    kkcosmos/instagram-images-with-captions dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. h

    Instagram

    • huggingface.co
    Updated Apr 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WayleX (2024). Instagram [Dataset]. https://huggingface.co/datasets/WayleX/Instagram
    Explore at:
    Dataset updated
    Apr 15, 2024
    Authors
    WayleX
    Description

    WayleX/Instagram dataset hosted on Hugging Face and contributed by the HF Datasets community

  10. Instagram: number of global users 2020-2025

    • statista.com
    Updated May 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Instagram: number of global users 2020-2025 [Dataset]. https://www.statista.com/statistics/183585/instagram-number-of-global-users/
    Explore at:
    Dataset updated
    May 22, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    In 2021, there were 1.21 billion monthly active users of Meta's Instagram, making up over 28 percent of the world's internet users. By 2025, it has been forecast that there will be 1.44 billion monthly active users of the social media platform, which would account for 31.2 percent of global internet users.

    How popular is Instagram?

    Instagram, as of January 2022, was the fourth most popular social media platform in the world in terms of user numbers. YouTube and WhatsApp ranked in second and third place, respectively, whilst Facebook remained the most popular, with almost three billion monthly active users worldwide.

    India had the largest number of Instagram users as of January 2022, with a total of over 230 million users in the country. The second-largest Instagram audience could be found in the United States, with almost 160 million people subscribing to the photo and video sharing app.

    Gen Z and Instagram

    As of September 2021, Gen Z users in the United States spent an average of five hours per week on Instagram. Although Instagram ranked third in terms of hours per week spent on the platform, Gen Z users spent considerably more time on TikTok, amounting to a weekly average of over 10 hours being spent on the mobile-first video app.

    Most followed accounts on Instagram

    As of May 2022, Instagram’s own account had 504.37 million followers. In terms of celebrities, Portuguese footballer Cristiano Ronaldo (@chistiano) had over 440.41 million followers on the social network. Moreover, the average media value of an Instagram post by Ronaldo was over 985,000 U.S. dollars.

    The most liked post on Instagram as of May 2022 was Photo of an Egg, which was posted in 2019 by the account @world_record_egg. Photo of an Egg has not only exceeded 55 million likes on the platform, but it also has nearly 3.5 million comments, and the account itself has over 4.5 million Instagram followers. After mysterious posts published by the account, World Record Egg revealed itself as part of a mental health campaign aimed at the difficulties and demands of using social media.

  11. R

    Instagram Dataset

    • universe.roboflow.com
    zip
    Updated Oct 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alonso Bravo Moyano (2021). Instagram Dataset [Dataset]. https://universe.roboflow.com/alonso-bravo-moyano/instagram-j7ivu/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 12, 2021
    Dataset authored and provided by
    Alonso Bravo Moyano
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Tags Bounding Boxes
    Description

    Instagram

    ## Overview
    
    Instagram is a dataset for object detection tasks - it contains Tags annotations for 896 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  12. U.S. Instagram users 2025, by age group

    • statista.com
    • ai-chatbox.pro
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista, U.S. Instagram users 2025, by age group [Dataset]. https://www.statista.com/statistics/398166/us-instagram-user-age-distribution/
    Explore at:
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jan 2025
    Area covered
    United States
    Description

    As of January 2025, 28.3 percent of Instagram users in the United States were aged between 25 and 34 years, making up the platform’s largest audience in the country. Overall, the group with the second-highest usage was those aged 18 to 24 years, with 26.5 percent of users, followed by 19.4 percent that were aged 35 to 44 years. The age group that used Instagram the least was those older than 65 years, making up 5.8 percent of users, respectively. Popular social media platforms in the United States Instagram was the third most visited social media platform in the United States as of March 2024 across desktop, mobile, and tablet devices combined, accounting for 14.13 percent of all social media visits in the country. Despite Instagram’s popularity, Facebook remained the leading social media platform by far in the U.S., accounting for almost 50 percent of all social media visits. Pinterest and X (formerly Twitter) ranked second and fourth in terms of visits, with 21.2 percent and 10.91 percent of visits, respectively. Instagram’s global audience Instagram’s global popularity continues to surge, with no signs of slowing down. As of January 2024, the country with the largest Instagram audience was India with 362.9 million users, while the United States had the second-highest market with 169.65 million users, followed by Brazil, Indonesia, and Turkey, respectively. As of April 2024, Instagram was the third most popular social network globally, reaching two billion monthly active users.

  13. Z

    A set of generated Instagram Data Download Packages (DDPs) to investigate...

    • data.niaid.nih.gov
    Updated Jan 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laura Boeschoten (2021). A set of generated Instagram Data Download Packages (DDPs) to investigate their structure and content [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4472605
    Explore at:
    Dataset updated
    Jan 28, 2021
    Dataset provided by
    Ruben van den Goorbergh
    Laura Boeschoten
    Daniel Oberski
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Instagram data-download example dataset

    In this repository you can find a data-set consisting of 11 personal Instagram archives, or Data-Download Packages (DDPs).

    How the data was generated

    These Instagram accounts were all new and generated by a group of researchers who were interested to figure out in detail the structure and variety in structure of these Instagram DDPs. The participants user the Instagram account extensively for approximately a week. The participants also intensively communicated with each other so that the data can be used as an example of a network.

    The data was primarily generated to evaluate the performance of de-identification software. Therefore, the text in the DDPs particularly contain many randomly chosen (Dutch) first names, phone numbers, e-mail addresses and URLS. In addition, the images in the DDPs contain many faces and text as well. The DDPs contain faces and text (usernames) of third parties. However, only content of so-called `professional accounts' are shared, such as accounts of famous individuals or institutions who self-consciously and actively seek publicity, and these sources are easily publicly available. Furthermore, the DDPs do not contain sensitive personal data of these individuals.

    Obtaining your Instagram DDP

    After using the Instagram accounts intensively for approximately a week, the participants requested their personal Instagram DDPs by using the following steps. You can follow these steps yourself if you are interested in your personal Instagram DDP.

    1. Go to www.instagram.com and log in
    2. Click on your profile picture, go to Settings and Privacy and Security
    3. Scroll to Data download and click Request download
    4. Enter your email adress and click Next
    5. Enter your password and click Request download

    Instagram then delivered the data in a compressed zip folder with the format username_YYYYMMDD.zip (i.e., Instagram handle and date of download) to the participant, and the participants shared these DDPs with us.

    Data cleaning

    To comply with the Instagram user agreement, participants shared their full name, phone number and e-mail address. In addition, Instagram logged the i.p. addresses the participant used during their active period on Instagram. After colleting the DDPs, we manually replaced such information with random replacements such that the DDps shared here do not contain any personal data of the participants.

    How this data-set can be used

    This data-set was generated with the intention to evaluate the performance of the de-identification software. We invite other researchers to use this data-set for example to investigate what type of data can be found in Instagram DDPs or to investigate the structure of Instagram DDPs. The packages can also be used for example data-analyses, although no substantive research questions can be answered using this data as the data does not reflect how research subjects behave `in the wild'.

    Authors

    The data collection is executed by Laura Boeschoten, Ruben van den Goorbergh and Daniel Oberski of Utrecht University. For questions, please contact l.boeschoten@uu.nl.

    Acknowledgments

    The researchers would like to thank everyone who participated in this data-generation project.

  14. H

    Data from: Fashion conversation data on Instagram

    • dataverse.harvard.edu
    • search.dataone.org
    Updated May 11, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu-i Ha; Sejeong Kwon; Meeyoung Cha; Jungseock Joo (2017). Fashion conversation data on Instagram [Dataset]. http://doi.org/10.7910/DVN/K7AW6F
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 11, 2017
    Dataset provided by
    Harvard Dataverse
    Authors
    Yu-i Ha; Sejeong Kwon; Meeyoung Cha; Jungseock Joo
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Our fashion dataset is composed of information about 24,752 posts by 13,350 people on Instagram. The data collection was done over a month period in January, 2015. We searched for posts mentioning 48 internationally renowned fashion brand names as hashtag. Our data contain information about hashtags as well as image features based on deep learning (Convolutional Neural Network or CNN). The list of learned features include selfies, body snaps, marketing shots, non-fashion, faces, logo, etc. Please refer to our paper for the full description of how we built our deep learning model.

  15. Instagram dataset

    • kaggle.com
    Updated Oct 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rituraj Kumar (2021). Instagram dataset [Dataset]. https://www.kaggle.com/datasets/riturajsingh99/instagram-dataset/discussion?sort=undefined
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 11, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rituraj Kumar
    Description

    Dataset

    This dataset was created by Rituraj Kumar

    Contents

  16. Instagram fake and real accounts dataset

    • kaggle.com
    Updated Jan 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    reza jafari (2021). Instagram fake and real accounts dataset [Dataset]. https://www.kaggle.com/rezaunderfit/instagram-fake-and-real-accounts-dataset/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 7, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    reza jafari
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Fake users extracted from "Instagram fake accounts dataset". Real user are extracted from my private account on Instagram. This dataset can be use for detecting fake users.

    How was the data collected?

    For fake user i bought followers and save theme on a file.

  17. i

    Instagram

    • ieee-dataport.org
    Updated Nov 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Feihu Che (2024). Instagram [Dataset]. https://ieee-dataport.org/documents/instagram
    Explore at:
    Dataset updated
    Nov 13, 2024
    Authors
    Feihu Che
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    and the work adds the graph structure information through Instagram's public API. The main objective is distinguish between commercial and regular user accounts.

  18. n

    Instagram users in United States of America

    • napoleoncat.com
    png
    Updated Feb 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NapoleonCat (2025). Instagram users in United States of America [Dataset]. https://napoleoncat.com/stats/instagram-users-in-united_states_of_america/2025/02
    Explore at:
    pngAvailable download formats
    Dataset updated
    Feb 15, 2025
    Dataset authored and provided by
    NapoleonCat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Feb 2025
    Area covered
    United States
    Description

    There were 172 606 200 Instagram users in United States of America in February 2025, which accounted for 50.1% of its entire population. The majority of them were women - 55.4%. People aged 25 to 34 were the largest user group (48 800 000). The highest difference between men and women occurs within people aged 35 to 44, where women lead by 14 900 000.

  19. d

    Influencer Database 2025 | TikTok and Instagram Influencers Database | 10M+...

    • datarade.ai
    .csv, .xls
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lead for Business, Influencer Database 2025 | TikTok and Instagram Influencers Database | 10M+ Influencer Data [Dataset]. https://datarade.ai/data-products/influencer-database-2024-tiktok-and-instagram-influencers-d-lead-for-business
    Explore at:
    .csv, .xlsAvailable download formats
    Dataset authored and provided by
    Lead for Business
    Area covered
    Djibouti, Equatorial Guinea, Montenegro, Saint Martin (French part), Uzbekistan, Samoa, Colombia, Mali, Guadeloupe, Papua New Guinea
    Description

    Companies of all sizes seek out influencer collaborations that can provide a lasting ROI. Check out some of the brands that use our platform to manage the full life cycle of their influencer marketing campaigns.

    We know that contact records are at the heart of every influencer database. That's why we introduced custom properties to reflect the unique needs of your influencer data.

    • 10M+ Influencers • Get the Look Your Brand Is After • Increase Audience Size and Demographics • Gain Insights for a Stronger Campaign • Effectively track and measure the impact of your campaigns

  20. Data from: Five Years of COVID-19 Discourse on Instagram: A Labeled...

    • zenodo.org
    bin
    Updated Oct 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nirmalya Thakur, Ph.D.; Nirmalya Thakur, Ph.D. (2024). Five Years of COVID-19 Discourse on Instagram: A Labeled Instagram Dataset of Over Half a Million Posts for Multilingual Sentiment Analysis [Dataset]. http://doi.org/10.5281/zenodo.13896353
    Explore at:
    binAvailable download formats
    Dataset updated
    Oct 21, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nirmalya Thakur, Ph.D.; Nirmalya Thakur, Ph.D.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Oct 6, 2024
    Description

    Please cite the following paper when using this dataset:

    N. Thakur, “Five Years of COVID-19 Discourse on Instagram: A Labeled Instagram Dataset of Over Half a Million Posts for Multilingual Sentiment Analysis”, Proceedings of the 7th International Conference on Machine Learning and Natural Language Processing (MLNLP 2024), Chengdu, China, October 18-20, 2024 (Paper accepted for publication, Preprint available at: https://arxiv.org/abs/2410.03293)

    Abstract

    The outbreak of COVID-19 served as a catalyst for content creation and dissemination on social media platforms, as such platforms serve as virtual communities where people can connect and communicate with one another seamlessly. While there have been several works related to the mining and analysis of COVID-19-related posts on social media platforms such as Twitter (or X), YouTube, Facebook, and TikTok, there is still limited research that focuses on the public discourse on Instagram in this context. Furthermore, the prior works in this field have only focused on the development and analysis of datasets of Instagram posts published during the first few months of the outbreak. The work presented in this paper aims to address this research gap and presents a novel multilingual dataset of 500,153 Instagram posts about COVID-19 published between January 2020 and September 2024. This dataset contains Instagram posts in 161 different languages. After the development of this dataset, multilingual sentiment analysis was performed using VADER and twitter-xlm-roberta-base-sentiment. This process involved classifying each post as positive, negative, or neutral. The results of sentiment analysis are presented as a separate attribute in this dataset.

    For each of these posts, the Post ID, Post Description, Date of publication, language code, full version of the language, and sentiment label are presented as separate attributes in the dataset.

    The Instagram posts in this dataset are present in 161 different languages out of which the top 10 languages in terms of frequency are English (343041 posts), Spanish (30220 posts), Hindi (15832 posts), Portuguese (15779 posts), Indonesian (11491 posts), Tamil (9592 posts), Arabic (9416 posts), German (7822 posts), Italian (5162 posts), Turkish (4632 posts)

    There are 535,021 distinct hashtags in this dataset with the top 10 hashtags in terms of frequency being #covid19 (169865 posts), #covid (132485 posts), #coronavirus (117518 posts), #covid_19 (104069 posts), #covidtesting (95095 posts), #coronavirusupdates (75439 posts), #corona (39416 posts), #healthcare (38975 posts), #staysafe (36740 posts), #coronavirusoutbreak (34567 posts)

    The following is a description of the attributes present in this dataset

    • Post ID: Unique ID of each Instagram post
    • Post Description: Complete description of each post in the language in which it was originally published
    • Date: Date of publication in MM/DD/YYYY format
    • Language code: Language code (for example: “en”) that represents the language of the post as detected using the Google Translate API
    • Full Language: Full form of the language (for example: “English”) that represents the language of the post as detected using the Google Translate API
    • Sentiment: Results of sentiment analysis (using the preprocessed version of each post) where each post was classified as positive, negative, or neutral

    Open Research Questions

    This dataset is expected to be helpful for the investigation of the following research questions and even beyond:

    1. How does sentiment toward COVID-19 vary across different languages?
    2. How has public sentiment toward COVID-19 evolved from 2020 to the present?
    3. How do cultural differences affect social media discourse about COVID-19 across various languages?
    4. How has COVID-19 impacted mental health, as reflected in social media posts across different languages?
    5. How effective were public health campaigns in shifting public sentiment in different languages?
    6. What patterns of vaccine hesitancy or support are present in different languages?
    7. How did geopolitical events influence public sentiment about COVID-19 in multilingual social media discourse?
    8. What role does social media discourse play in shaping public behavior toward COVID-19 in different linguistic communities?
    9. How does the sentiment of minority or underrepresented languages compare to that of major world languages regarding COVID-19?
    10. What insights can be gained by comparing the sentiment of COVID-19 posts in widely spoken languages (e.g., English, Spanish) to those in less common languages?

    All the Instagram posts that were collected during this data mining process to develop this dataset were publicly available on Instagram and did not require a user to log in to Instagram to view the same (at the time of writing this paper).

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bright Data (2024). Instagram Hashtags Dataset [Dataset]. https://brightdata.com/products/datasets/instagram/hashtags
Organization logo

Instagram Hashtags Dataset

Explore at:
.json, .csv, .xlsxAvailable download formats
Dataset updated
Dec 20, 2024
Dataset authored and provided by
Bright Datahttps://brightdata.com/
License

https://brightdata.com/licensehttps://brightdata.com/license

Area covered
Worldwide
Description

Use our Instagram Hashtags dataset (public data) to extract insights by filtering hashtags, follower counts, account type, or engagement metrics. Depending on your needs, you can purchase the full dataset or a customized subset. Popular use cases include trend analysis, brand monitoring, hashtag optimization, and influencer marketing. The dataset includes key data points such as hashtags, engagement scores, associated posts, locations, account types (business/non-business), and much more.

Search
Clear search
Close search
Google apps
Main menu