100+ datasets found
  1. Twitter Friends

    • kaggle.com
    zip
    Updated Sep 2, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hubert Wassner (2016). Twitter Friends [Dataset]. https://www.kaggle.com/hwassner/TwitterFriends
    Explore at:
    zip(183520459 bytes)Available download formats
    Dataset updated
    Sep 2, 2016
    Authors
    Hubert Wassner
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Twitter Friends and hashtags

    Context

    This datasets is an extract of a wider database aimed at collecting Twitter user's friends (other accound one follows). The global goal is to study user's interest thru who they follow and connection to the hashtag they've used.

    Content

    It's a list of Twitter user's informations. In the JSON format one twitter user is stored in one object of this more that 40.000 objects list. Each object holds :

    • avatar : URL to the profile picture

    • followerCount : the number of followers of this user

    • friendsCount : the number of people following this user.

    • friendName : stores the @name (without the '@') of the user (beware this name can be changed by the user)

    • id : user ID, this number can not change (you can retrieve screen name with this service : https://tweeterid.com/)

    • friends : the list of IDs the user follows (data stored is IDs of users followed by this user)

    • lang : the language declared by the user (in this dataset there is only "en" (english))

    • lastSeen : the time stamp of the date when this user have post his last tweet.

    • tags : the hashtags (whith or without #) used by the user. It's the "trending topic" the user tweeted about.

    • tweetID : Id of the last tweet posted by this user.

    You also have the CSV format which uses the same naming convention.

    These users are selected because they tweeted on Twitter trending topics, I've selected users that have at least 100 followers and following at least 100 other account (in order to filter out spam and non-informative/empty accounts).

    Acknowledgements

    This data set is build by Hubert Wassner (me) using the Twitter public API. More data can be obtained on request (hubert.wassner AT gmail.com), at this time I've collected over 5 milions in different languages. Some more information can be found here (in french only) : http://wassner.blogspot.fr/2016/06/recuperer-des-profils-twitter-par.html

    Past Research

    No public research have been done (until now) on this dataset. I made a private application which is described here : http://wassner.blogspot.fr/2016/09/twitter-profiling.html (in French) which uses the full dataset (Millions of full profiles).

    Inspiration

    On can analyse a lot of stuff with this datasets :

    • stats about followers & followings
    • manyfold learning or unsupervised learning from friend list
    • hashtag prediction from friend list

    Contact

    Feel free to ask any question (or help request) via Twitter : @hwassner

    Enjoy! ;)

  2. w

    Websites using Twitter Friends

    • webtechsurvey.com
    csv
    Updated Nov 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WebTechSurvey (2025). Websites using Twitter Friends [Dataset]. https://webtechsurvey.com/technology/twitter-friends
    Explore at:
    csvAvailable download formats
    Dataset updated
    Nov 22, 2025
    Dataset authored and provided by
    WebTechSurvey
    License

    https://webtechsurvey.com/termshttps://webtechsurvey.com/terms

    Time period covered
    2025
    Area covered
    Global
    Description

    A complete list of live websites using the Twitter Friends technology, compiled through global website indexing conducted by WebTechSurvey.

  3. twitter-friends.parquet

    • kaggle.com
    zip
    Updated Nov 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    lixysc (2024). twitter-friends.parquet [Dataset]. https://www.kaggle.com/datasets/lixysc/twitter-friends-parquet
    Explore at:
    zip(258012659 bytes)Available download formats
    Dataset updated
    Nov 20, 2024
    Authors
    lixysc
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by lixysc

    Released under Apache 2.0

    Contents

  4. Friends auf Twitter

    • de.statista.com
    Updated Sep 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Friends auf Twitter [Dataset]. https://de.statista.com/statistik/daten/studie/71738/umfrage/anzahl-der-friends-auf-twitter-in-2009/
    Explore at:
    Dataset updated
    Sep 15, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Nov 2009
    Area covered
    Weltweit
    Description

    Die Grafik zeigt eine prozentuale Verteilung der Freundesanzahl von Twitter-Accounts. Accounts mit 6 bis 10 Friends machen einen Anteil von 8,7 Prozent aus.

  5. twitter-friends

    • kaggle.com
    zip
    Updated Dec 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    jettzfc (2024). twitter-friends [Dataset]. https://www.kaggle.com/datasets/jettzfc/twitter-friends/code
    Explore at:
    zip(258012659 bytes)Available download formats
    Dataset updated
    Dec 13, 2024
    Authors
    jettzfc
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by jettzfc

    Released under Apache 2.0

    Contents

  6. Number of Twitter followers of the New York Mets 2011-2024

    • statista.com
    Updated Dec 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Number of Twitter followers of the New York Mets 2011-2024 [Dataset]. https://www.statista.com/statistics/274736/twitter-followers-of-the-new-york-mets/
    Explore at:
    Dataset updated
    Dec 15, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    The number of X (Twitter) followers of the Major League Baseball team New York Mets increased considerably from September 2011 to November 2024. In the last recorded month, the team's social media account had around 1.27 million followers.

  7. m

    Raw Twitter Datasets Based on Depressive Words

    • data.mendeley.com
    • ieee-dataport.org
    Updated Sep 2, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sawrav Chowdhury (2020). Raw Twitter Datasets Based on Depressive Words [Dataset]. http://doi.org/10.17632/4rd637tddf.1
    Explore at:
    Dataset updated
    Sep 2, 2020
    Authors
    Sawrav Chowdhury
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Right now we see that depression is one of the most common problems in our society. Most of the time people are committed suicide only cause of depression. And till now there is no proper lab test way for detecting depression. Generally, doctors are detecting depression by asking some knowledge-base questions. On the other hand, there are a good number of people using social media platforms right now, where they are sharing their daily experiences, emotion, and other activity with their friends. Twitter is one of the common social platforms and also popular for data collection. I was collecting these datasets from twitter based on some depressive words. I hope that this twitter datasets will help researchers to detect depression more precisely.

  8. Twitter: number of followers of popular luxury brands 2020

    • statista.com
    Updated Nov 15, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2020). Twitter: number of followers of popular luxury brands 2020 [Dataset]. https://www.statista.com/statistics/693784/luxury-brands-follower-twitter/
    Explore at:
    Dataset updated
    Nov 15, 2020
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2020
    Area covered
    Worldwide
    Description

    This statistic provides information on the most popular luxury brands on Twitter, ranked by number of followers. In 2020, the French luxury brand Chanel was ranked first with 13 million Twitter followers, followed by Burberry, Dior, and Louis Vuitton with 8 million followers each.

  9. Z

    Following/Followers and Tags on 0.1 million Twitter Users

    • data.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yoshida, Mitsuo; Yamaguchi, Yuto (2020). Following/Followers and Tags on 0.1 million Twitter Users [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13966
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Toyohashi University of Technology
    University of Tsukuba
    Authors
    Yoshida, Mitsuo; Yamaguchi, Yuto
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Abstract (our paper)

    Why does Smith follow Johnson on Twitter? In most cases, the reason why users follow other users is unavailable. In this work, we answer this question by proposing TagF, which analyzes the who-follows-whom network (matrix) and the who-tags-whom network (tensor) simultaneously. Concretely, our method decomposes a coupled tensor constructed from these matrix and tensor. The experimental results on million-scale Twitter networks show that TagF uncovers different, but explainable reasons why users follow other users.

    Data

    coupled_tensor: The first column is the source user id (from user id), the second column is the destination user id (to user id), and the third column is the tag id.

    users.id: The first column is the user id for coupled_tensor, and the second column is the user id on Twitter.

    tags.id: The first column is the tag id for coupled_tensor, and the second column is the tag (i.e. slug or list name) on Twitter. On the tags, ###follow### and ###friend### are special tags expressing follower and following.

    Publication

    This dataset was created for our study. If you make use of this dataset, please cite: Yuto Yamaguchi, Mitsuo Yoshida, Christos Faloutsos, Hiroyuki Kitagawa. Why Do You Follow Him? Multilinear Analysis on Twitter. Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). pp.137-138, 2015. http://doi.org/10.1145/2740908.2742715

    Code

    Our code outputting experiment results made available at: https://github.com/yamaguchiyuto/tagf

    Note

    If you would like to use larger dataset, the dataset on 1 million seed users made available at: http://dx.doi.org/10.5281/zenodo.16267 (The dataset on 0.1 million seed users is not subset of the dataset on 1 million seed users.)

  10. H

    Data from: DISMISS: Database of Indian Social Media Influencers on Twitter

    • dataverse.harvard.edu
    • dataone.org
    Updated Apr 4, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arshia Arya; Soham De; Dibyendu Mishra; Gazal Shekhawat; Ankur Sharma; Anmol Panda; Faisal M Lalani; Parantak Singh; Ramaravind Kommiya Mothilal; Rynaa Grover; Sachita Nishal; Saloni Dash; Shehla Rashid Shora; Syeda Zainab Akbar; Joyojeet Pal (2022). DISMISS: Database of Indian Social Media Influencers on Twitter [Dataset]. http://doi.org/10.7910/DVN/BPY2JY
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 4, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Arshia Arya; Soham De; Dibyendu Mishra; Gazal Shekhawat; Ankur Sharma; Anmol Panda; Faisal M Lalani; Parantak Singh; Ramaravind Kommiya Mothilal; Rynaa Grover; Sachita Nishal; Saloni Dash; Shehla Rashid Shora; Syeda Zainab Akbar; Joyojeet Pal
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Databases of highly networked individuals have been indispensable in studying narratives and influence on social media. To support studies on Twitter in India, we present a systematically categorized database of accounts of influence on Twitter in India, identified and annotated through an iterative process of friends, networks, and self-described profile information, verified manually. We built an initial set of accounts based on the friend network of a seed set of accounts based on real-world renown in various fields, and then snowballed friends of friends" multiple times, and rank ordered individuals based on the number of in-group connections, and overall followers. We then manually classified identified accounts under the categories of entertainment, sports, business, government, institutions, journalism, civil society accounts that have independent standing outside of social media, as well as a category ofdigital first" referring to accounts that derive their primary influence from online activity. Overall, we annotated 11580 unique accounts across all categories. The database is useful studying various questions related to the role of influencers in polarisation, misinformation, extreme speech, political discourse etc.

  11. [Dataset] Analysis of ego-networks of two CS-related Twitter accounts

    • data.europa.eu
    unknown
    Updated Dec 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zenodo (2022). [Dataset] Analysis of ego-networks of two CS-related Twitter accounts [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-7372309?locale=ga
    Explore at:
    unknown(7024950)Available download formats
    Dataset updated
    Dec 24, 2022
    Dataset authored and provided by
    Zenodohttp://zenodo.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explanation/Overview: This is the dataset for the analyses and results on Twitter Ego-Networks of two CS-related accounts (@EuCitSci & @SciStarter). The username have been anonymised. Purpose: The purpose of this dataset is to provide the basis to reproduce the results reported in the associated deliverable. As such, it does not represent raw data, but rather files that already include certain analysis steps (like calculated degrees or other SNA-related measures), ready for analysis, visualisation and interpretation with R or any other network visualisation software (e.g., Gephi). The edges represent the follow relation and were retrieved using the Twitter API. All usernames except those of the two ego-accounts were anonymised by assigning a random number to each node. Due to the size of the network, we do not include any .gexf or .gml files in this upload, but rather resort to node and edge lists in the .json format Relatedness: The networks are the ego-networks for two related public accounts that are associated with citizen science (@EuCitSci & @SciStarter). Content: In this Zenodo entry, two files can be found. edges.json Represents the edge list, with the columns: source target wherefrom account EuCitSci 23929 friendslist EuCitSci Source and target are the necessary columns for the network creation and in this example indicate that EuCitSci follows 23929. wherefrom indicates the origin of this relation in the crawling (i.e., whether it was retrieved using the friends or follower list) and account indicates the ego-account it belongs to. Thus, the edges can also be separated using this account attribute as they represent two distinct networks. nodes.json Represents the nodes in the networks. The following data fields are contained: username CS followers_count friends_count ... 1116 CS 689 514 ... ... favourites_count listed_count statuses_count ... ... 2601 18 2141 ... ... degree in_degree out_degree ... ... 21 5 16 ... ... reciprocity account ... 0.48 EuCitSci Username represents the numerical and anonymised username, CS the community-membership. The different counts (e.g., followers_count) indicate the number of followers the user had at the time of the retrieval by the Twitter API. degree refers to the degree in the network (similarly the in- and out_degree), while reciprocity refers to the number of mutual edges in respect to the total number of edges per node (see here). Account is similar as described above. Grouping: The data is grouped according the ego-account it is associated to, as can be read above (i.e., the account attribute).

  12. Z

    Radical Right On Twitter (ROT)

    • data.niaid.nih.gov
    Updated May 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laila Sprejer; Helen Margetts; Kleber Oliveira; David O'Sullivan; Bertie Vidgen (2022). Radical Right On Twitter (ROT) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6393084
    Explore at:
    Dataset updated
    May 30, 2022
    Dataset provided by
    MASCI, University of Limerick
    The Alan Turing Institute
    Authors
    Laila Sprejer; Helen Margetts; Kleber Oliveira; David O'Sullivan; Bertie Vidgen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We collected the Radical Right On Twitter dataset (ROT7) to advance research into radical right activity online. The resource addresses a lack of data in this field, particularly data that relates to the activity of radical right actors. The dataset was funded without commercial support.

    ROT follows six months of Twitter activity (8th July 2020 to 9th January 2021) from 35 radical right actors. We follow the advice given by Williams, Burnap, and Sloan (2017) for publishing Twitter data on sensitive topics. ROT includes:

    It contains:

    Actors' content: all content produced by the actors, including posts (n = 22,131), replies (n = 19,947), quotes (n = 11,314), and retweets (n = 37,283).

    Actors' profiles: Twitter profile information for all 35 actors.

    Actors' followers: a list of each actor's followers, collected each day (combined n = 6,592,056).

    Actors' friends: a list of each actor's friends, collected each day (combined n = 262,856).

    Direct engagement: all tweets which engage with actors, including replies, quotes and retweets (n = 31,443,828).

    Engagers’ followers: List of followers of every user who replied, quoted or retweeted actors' content. We only collected users' list of followers once, even if they engaged with the actors multiple times during the period studied.

    Other engagement: all other tweets collected through Twitter API that mentions an actor (n = 10,939,868).

  13. Logistic regression model with all predictors using data without outliers.

    • plos.figshare.com
    xls
    Updated Jun 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karolina Sylwester; Matthew Purver (2023). Logistic regression model with all predictors using data without outliers. [Dataset]. http://doi.org/10.1371/journal.pone.0137422.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Karolina Sylwester; Matthew Purver
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Republican followers were coded as 0 and Democrat followers as 1.Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1Logistic regression model with all predictors using data without outliers.

  14. t

    Analyzing Twitter Ad Performance for the Last 9 Months - Data Analysis

    • tomtunguz.com
    Updated Aug 26, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomasz Tunguz (2013). Analyzing Twitter Ad Performance for the Last 9 Months - Data Analysis [Dataset]. https://tomtunguz.com/twitter-ads-performance/
    Explore at:
    Dataset updated
    Aug 26, 2013
    Dataset provided by
    Theory Ventures
    Authors
    Tomasz Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analyze 9 months of Twitter ad performance data: $2.9k spent, 1,650 followers gained at $1.73 CPF. Key insights on social media ROI for startups.

  15. m

    Twitter/X Dataset on Public Conversation Around COP30 (#COP30noBrasil)

    • data.mendeley.com
    Updated Nov 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafael Carrasco (2025). Twitter/X Dataset on Public Conversation Around COP30 (#COP30noBrasil) [Dataset]. http://doi.org/10.17632/t5kcr3nkfc.1
    Explore at:
    Dataset updated
    Nov 26, 2025
    Authors
    Rafael Carrasco
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset contains all public posts collected from the social media platform X (formerly Twitter) that included the official hashtag #COP30noBrasil during the COP30 climate summit, between 10 and 21 November 2025. The data were retrieved using Tweet Binder and comprise a total of 1,139 interactions, including original posts, retweets and replies. For each entry, the dataset includes metadata describing user characteristics (such as verification status and follower–following ratio), content features (format, text, language), and interaction metrics (likes, reposts, replies). The dataset also includes derived analytical variables used in the associated research article. These include engagement, calculated as a percentage based on interaction metrics, and sentiment polarity, computed using the VADER (Valence Aware Dictionary and sEntiment Reasoner) Python library. The text fields were kept in their original languages, reflecting the multilingual nature of the conversation around COP30. This resource enables the examination of public discourse on climate negotiations, content diffusion dynamics, and the emotional tone of climate-related communication. It provides a structured and reusable dataset for researchers interested in climate communication, digital public spheres, social media analytics, and environmental politics.

  16. Twitter Language Use Reflects Psychological Differences between Democrats...

    • plos.figshare.com
    docx
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karolina Sylwester; Matthew Purver (2023). Twitter Language Use Reflects Psychological Differences between Democrats and Republicans [Dataset]. http://doi.org/10.1371/journal.pone.0137422
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Karolina Sylwester; Matthew Purver
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Previous research has shown that political leanings correlate with various psychological factors. While surveys and experiments provide a rich source of information for political psychology, data from social networks can offer more naturalistic and robust material for analysis. This research investigates psychological differences between individuals of different political orientations on a social networking platform, Twitter. Based on previous findings, we hypothesized that the language used by liberals emphasizes their perception of uniqueness, contains more swear words, more anxiety-related words and more feeling-related words than conservatives’ language. Conversely, we predicted that the language of conservatives emphasizes group membership and contains more references to achievement and religion than liberals’ language. We analysed Twitter timelines of 5,373 followers of three Twitter accounts of the American Democratic and 5,386 followers of three accounts of the Republican parties’ Congressional Organizations. The results support most of the predictions and previous findings, confirming that Twitter behaviour offers valid insights to offline behaviour.

  17. Number of Twitter followers of the Miami Marlins 2011-2024

    • statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista, Number of Twitter followers of the Miami Marlins 2011-2024 [Dataset]. https://www.statista.com/statistics/274792/twitter-followers-of-the-miami-marlins/
    Explore at:
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    The number of X (Twitter) followers of the Major League Baseball team Miami Marlins increased substantially from September 2011 to November 2024. In the last recorded month, the team's social media account had around 0.42 million followers.

  18. Random twitter user and friends data

    • kaggle.com
    zip
    Updated Apr 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Selahattin Can Ölçer (2023). Random twitter user and friends data [Dataset]. https://www.kaggle.com/datasets/selahattincanler/random-twitter-user-and-friends-data
    Explore at:
    zip(11746 bytes)Available download formats
    Dataset updated
    Apr 29, 2023
    Authors
    Selahattin Can Ölçer
    Description

    First column : User Second column: Friends that user have (random selected 2 people)

  19. Initial logistic regression model.

    • figshare.com
    • plos.figshare.com
    xls
    Updated Jun 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karolina Sylwester; Matthew Purver (2023). Initial logistic regression model. [Dataset]. http://doi.org/10.1371/journal.pone.0137422.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 11, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Karolina Sylwester; Matthew Purver
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Republican followers were coded as 0 and Democrat followers as 1.Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1Initial logistic regression model.

  20. f

    The relationships between journalists’ ideology and their Twitter friends’...

    • plos.figshare.com
    xls
    Updated Oct 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qin Li; Hans J. G. Hassell; Robert M. Bond (2023). The relationships between journalists’ ideology and their Twitter friends’ and followers’ ideology. [Dataset]. http://doi.org/10.1371/journal.pone.0291544.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 18, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Qin Li; Hans J. G. Hassell; Robert M. Bond
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The relationships between journalists’ ideology and their Twitter friends’ and followers’ ideology.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hubert Wassner (2016). Twitter Friends [Dataset]. https://www.kaggle.com/hwassner/TwitterFriends
Organization logo

Twitter Friends

40k full Twitter user profile data (including who they follow!)

Explore at:
zip(183520459 bytes)Available download formats
Dataset updated
Sep 2, 2016
Authors
Hubert Wassner
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

Twitter Friends and hashtags

Context

This datasets is an extract of a wider database aimed at collecting Twitter user's friends (other accound one follows). The global goal is to study user's interest thru who they follow and connection to the hashtag they've used.

Content

It's a list of Twitter user's informations. In the JSON format one twitter user is stored in one object of this more that 40.000 objects list. Each object holds :

  • avatar : URL to the profile picture

  • followerCount : the number of followers of this user

  • friendsCount : the number of people following this user.

  • friendName : stores the @name (without the '@') of the user (beware this name can be changed by the user)

  • id : user ID, this number can not change (you can retrieve screen name with this service : https://tweeterid.com/)

  • friends : the list of IDs the user follows (data stored is IDs of users followed by this user)

  • lang : the language declared by the user (in this dataset there is only "en" (english))

  • lastSeen : the time stamp of the date when this user have post his last tweet.

  • tags : the hashtags (whith or without #) used by the user. It's the "trending topic" the user tweeted about.

  • tweetID : Id of the last tweet posted by this user.

You also have the CSV format which uses the same naming convention.

These users are selected because they tweeted on Twitter trending topics, I've selected users that have at least 100 followers and following at least 100 other account (in order to filter out spam and non-informative/empty accounts).

Acknowledgements

This data set is build by Hubert Wassner (me) using the Twitter public API. More data can be obtained on request (hubert.wassner AT gmail.com), at this time I've collected over 5 milions in different languages. Some more information can be found here (in french only) : http://wassner.blogspot.fr/2016/06/recuperer-des-profils-twitter-par.html

Past Research

No public research have been done (until now) on this dataset. I made a private application which is described here : http://wassner.blogspot.fr/2016/09/twitter-profiling.html (in French) which uses the full dataset (Millions of full profiles).

Inspiration

On can analyse a lot of stuff with this datasets :

  • stats about followers & followings
  • manyfold learning or unsupervised learning from friend list
  • hashtag prediction from friend list

Contact

Feel free to ask any question (or help request) via Twitter : @hwassner

Enjoy! ;)

Search
Clear search
Close search
Google apps
Main menu