100+ datasets found

Twitter Friends
kaggle.com
zip
Updated Sep 2, 2016
Cite
Hubert Wassner (2016). Twitter Friends [Dataset]. https://www.kaggle.com/hwassner/TwitterFriends
Explore at:
Available download formats: zip (183520459 bytes)
Dataset updated
Sep 2, 2016
Authors
Hubert Wassner
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
Twitter Friends and hashtags

Context

This dataset is an extract of a wider database aimed at collecting Twitter users' friends (the other accounts a user follows). The overall goal is to study users' interests through who they follow and the connection to the hashtags they have used.

Content

It is a list of Twitter user records. In the JSON format, each Twitter user is stored as one object in a list of more than 40,000 objects. Each object holds:

avatar : URL to the profile picture

followerCount : the number of followers of this user

friendsCount : the number of accounts this user follows.

friendName : stores the @name (without the '@') of the user (beware this name can be changed by the user)

id : user ID; this number cannot change (you can retrieve the screen name with this service: https://tweeterid.com/)

friends : the list of IDs the user follows (data stored is IDs of users followed by this user)

lang : the language declared by the user (in this dataset there is only "en", English)

lastSeen : the timestamp of when this user posted their last tweet.

tags : the hashtags (with or without #) used by the user. These are the trending topics the user tweeted about.

tweetID : Id of the last tweet posted by this user.

You also have the CSV format which uses the same naming convention.
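As a rough illustration of the record layout described above, the sketch below parses a JSON list of user objects using only the field names from the description. The sample record, all its values, and the names `TwitterUser` and `load_users` are made up for illustration; the actual file name and the exact value types (for example, whether IDs are stored as strings or numbers) are not stated here and should be checked against the download.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass
class TwitterUser:
    id: str
    screen_name: str      # taken from "friendName"
    follower_count: int   # taken from "followerCount"
    friends_count: int    # taken from "friendsCount"
    friends: List[str]    # IDs of the accounts this user follows
    tags: List[str]       # hashtags, normalised below


def load_users(text: str) -> List[TwitterUser]:
    """Parse a JSON list of user objects into TwitterUser records."""
    users = []
    for obj in json.loads(text):
        users.append(TwitterUser(
            id=str(obj["id"]),
            screen_name=obj["friendName"],
            follower_count=int(obj["followerCount"]),
            friends_count=int(obj["friendsCount"]),
            friends=[str(f) for f in obj.get("friends", [])],
            # Tags may come with or without '#', so strip it and lowercase.
            tags=[t.lstrip("#").lower() for t in obj.get("tags", [])],
        ))
    return users


# Hypothetical sample record; every value here is invented.
SAMPLE = """[{"avatar": "http://example.com/a.png", "followerCount": 250,
  "friendsCount": 3, "friendName": "example_user", "id": "1001",
  "friends": ["2001", "2002", "2003"], "lang": "en",
  "lastSeen": 1472800000000, "tags": ["#python", "DataScience"],
  "tweetID": "9001"}]"""

users = load_users(SAMPLE)
print(users[0].screen_name, len(users[0].friends), users[0].tags)
```

Lowercasing the tags is a choice made for this sketch so that "#Python" and "python" count as the same hashtag; drop it if case matters for your analysis.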

These users were selected because they tweeted on Twitter trending topics. I kept only users who have at least 100 followers and follow at least 100 other accounts, in order to filter out spam and non-informative or empty accounts.

Acknowledgements

This dataset was built by Hubert Wassner (me) using the Twitter public API. More data can be obtained on request (hubert.wassner AT gmail.com); at this time I have collected over 5 million profiles in different languages. Some more information can be found here (in French only): http://wassner.blogspot.fr/2016/06/recuperer-des-profils-twitter-par.html

Past Research

No public research has been done on this dataset so far. I built a private application, described here (in French): http://wassner.blogspot.fr/2016/09/twitter-profiling.html, which uses the full dataset (millions of full profiles).

Inspiration

One can analyse many things with this dataset:

stats about followers & followings

manifold learning or unsupervised learning from friend list

hashtag prediction from friend list
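One possible way to approach the last idea is sketched below: rank known users by the Jaccard similarity of their friend lists to a target user's friend list, then let the closest users vote on hashtags. This is only a toy nearest-neighbour baseline chosen for illustration, not a method taken from the dataset author; the function names, the parameters `k` and `top`, and all the data are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Size of the intersection divided by size of the union."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def predict_tags(target_friends: Set[str],
                 friends_by_user: Dict[str, Set[str]],
                 tags_by_user: Dict[str, Set[str]],
                 k: int = 2, top: int = 3) -> List[str]:
    """Return the hashtags most used by the k users with the most similar friend lists."""
    ranked = sorted(
        friends_by_user,
        key=lambda u: jaccard(target_friends, friends_by_user[u]),
        reverse=True,
    )
    votes: Counter = Counter()
    for user in ranked[:k]:
        votes.update(tags_by_user[user])
    return [tag for tag, _ in votes.most_common(top)]


# Invented toy data: user -> followed IDs, user -> hashtags used.
friends_by_user = {
    "u1": {"a", "b", "c"},
    "u2": {"a", "b", "d"},
    "u3": {"x", "y", "z"},
}
tags_by_user = {
    "u1": {"nba"},
    "u2": {"nba", "oscars"},
    "u3": {"brexit"},
}

predicted = predict_tags({"a", "b"}, friends_by_user, tags_by_user)
print(predicted)
```

With tens of thousands of users and long friend lists, a pairwise comparison like this becomes slow, so a sparse-matrix representation would probably be a better fit for the real data.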

Contact

Feel free to ask any question (or help request) via Twitter : @hwassner

Enjoy! ;)
Websites using Twitter Friends
webtechsurvey.com
csv
Updated Nov 22, 2025
Cite
WebTechSurvey (2025). Websites using Twitter Friends [Dataset]. https://webtechsurvey.com/technology/twitter-friends
Explore at:
Available download formats: csv
Dataset updated
Nov 22, 2025
Dataset authored and provided by
WebTechSurvey
License
https://webtechsurvey.com/terms
Time period covered
2025
Area covered
Global
Description
A complete list of live websites using the Twitter Friends technology, compiled through global website indexing conducted by WebTechSurvey.
twitter-friends.parquet
kaggle.com
zip
Updated Nov 20, 2024
Cite
lixysc (2024). twitter-friends.parquet [Dataset]. https://www.kaggle.com/datasets/lixysc/twitter-friends-parquet
Explore at:
Available download formats: zip (258012659 bytes)
Dataset updated
Nov 20, 2024
Authors
lixysc
License
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by lixysc

Released under Apache 2.0

Contents
Friends auf Twitter
de.statista.com
Updated Sep 15, 2024
Cite
Statista (2024). Friends auf Twitter [Dataset]. https://de.statista.com/statistik/daten/studie/71738/umfrage/anzahl-der-friends-auf-twitter-in-2009/
Explore at:
Dataset updated
Sep 15, 2024
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
Nov 2009
Area covered
Worldwide
Description
The chart shows the percentage distribution of the number of friends of Twitter accounts. Accounts with 6 to 10 friends make up 8.7 percent.
twitter-friends
kaggle.com
zip
Updated Dec 13, 2024
Cite
jettzfc (2024). twitter-friends [Dataset]. https://www.kaggle.com/datasets/jettzfc/twitter-friends/code
Explore at:
Available download formats: zip (258012659 bytes)
Dataset updated
Dec 13, 2024
Authors
jettzfc
License
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by jettzfc

Released under Apache 2.0

Contents
Number of Twitter followers of the New York Mets 2011-2024
statista.com
Updated Dec 15, 2024
Cite
Statista (2024). Number of Twitter followers of the New York Mets 2011-2024 [Dataset]. https://www.statista.com/statistics/274736/twitter-followers-of-the-new-york-mets/
Explore at:
Dataset updated
Dec 15, 2024
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
United States
Description
The number of X (Twitter) followers of the Major League Baseball team New York Mets increased considerably from September 2011 to November 2024. In the last recorded month, the team's social media account had around 1.27 million followers.
Raw Twitter Datasets Based on Depressive Words
data.mendeley.com
ieee-dataport.org
Updated Sep 2, 2020
Cite
Sawrav Chowdhury (2020). Raw Twitter Datasets Based on Depressive Words [Dataset]. http://doi.org/10.17632/4rd637tddf.1
Explore at:
Unique identifier
https://doi.org/10.17632/4rd637tddf.1
Dataset updated
Sep 2, 2020
Authors
Sawrav Chowdhury
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Depression is one of the most common problems in our society. Most of the time, people commit suicide because of depression. So far there is no proper laboratory test for detecting depression; doctors generally detect it by asking knowledge-based questions. Meanwhile, a large number of people use social media platforms, where they share their daily experiences, emotions, and other activities with their friends. Twitter is one of the most common social platforms and is also popular for data collection. I collected these datasets from Twitter based on some depressive words. I hope that these Twitter datasets will help researchers detect depression more precisely.
Twitter: number of followers of popular luxury brands 2020
statista.com
Updated Nov 15, 2020
Cite
Statista (2020). Twitter: number of followers of popular luxury brands 2020 [Dataset]. https://www.statista.com/statistics/693784/luxury-brands-follower-twitter/
Explore at:
Dataset updated
Nov 15, 2020
Dataset authored and provided by
Statista (http://statista.com/)
Time period covered
2020
Area covered
Worldwide
Description
This statistic provides information on the most popular luxury brands on Twitter, ranked by number of followers. In 2020, the French luxury brand Chanel was ranked first with 13 million Twitter followers, followed by Burberry, Dior, and Louis Vuitton with 8 million followers each.
Following/Followers and Tags on 0.1 million Twitter Users
data.niaid.nih.gov
Updated Jan 24, 2020
Cite
Yoshida, Mitsuo; Yamaguchi, Yuto (2020). Following/Followers and Tags on 0.1 million Twitter Users [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13966
Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
Toyohashi University of Technology
University of Tsukuba
Authors
Yoshida, Mitsuo; Yamaguchi, Yuto
License
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Abstract (our paper)

Why does Smith follow Johnson on Twitter? In most cases, the reason why users follow other users is unavailable. In this work, we answer this question by proposing TagF, which analyzes the who-follows-whom network (matrix) and the who-tags-whom network (tensor) simultaneously. Concretely, our method decomposes a coupled tensor constructed from this matrix and tensor. The experimental results on million-scale Twitter networks show that TagF uncovers different, but explainable reasons why users follow other users.

Data

coupled_tensor: The first column is the source user id (from user id), the second column is the destination user id (to user id), and the third column is the tag id.

users.id: The first column is the user id for coupled_tensor, and the second column is the user id on Twitter.

tags.id: The first column is the tag id for coupled_tensor, and the second column is the tag (i.e. slug or list name) on Twitter. On the tags, ###follow### and ###friend### are special tags expressing follower and following.
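To make the three-file layout above concrete, here is a minimal sketch that resolves tag ids through a `tags.id`-style mapping and separates the special follow/friend entries from ordinary tag entries. It assumes whitespace-separated columns, which the description does not actually state, and it runs on a few invented lines rather than the real files. The helper names `read_id_map` and `split_tensor` are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

# Special tags named in the description above.
SPECIAL_TAGS = {"###follow###", "###friend###"}

Edge = Tuple[int, int, str]


def read_id_map(lines: Iterable[str]) -> Dict[int, str]:
    """Map the internal id (first column) to the Twitter-side value (second column)."""
    mapping = {}
    for line in lines:
        parts = line.split(None, 1)  # assumption: whitespace-delimited
        if len(parts) == 2:
            mapping[int(parts[0])] = parts[1].strip()
    return mapping


def split_tensor(lines: Iterable[str],
                 tags: Dict[int, str]) -> Tuple[List[Edge], List[Edge]]:
    """Split (source, destination, tag id) rows into follow edges and tag edges."""
    follow_edges: List[Edge] = []
    tag_edges: List[Edge] = []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        src, dst, tag_id = (int(x) for x in line.split())
        tag = tags[tag_id]
        if tag in SPECIAL_TAGS:
            follow_edges.append((src, dst, tag))
        else:
            tag_edges.append((src, dst, tag))
    return follow_edges, tag_edges


# Invented sample rows, not taken from the real dataset.
tag_lines = ["0 ###follow###", "1 ###friend###", "2 data-science"]
tensor_lines = ["0 1 0", "1 0 1", "0 1 2"]

tags = read_id_map(tag_lines)
follow_edges, tag_edges = split_tensor(tensor_lines, tags)
print(follow_edges)
print(tag_edges)
```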

Publication

This dataset was created for our study. If you make use of this dataset, please cite: Yuto Yamaguchi, Mitsuo Yoshida, Christos Faloutsos, Hiroyuki Kitagawa. Why Do You Follow Him? Multilinear Analysis on Twitter. Proceedings of the 24th International Conference on World Wide Web (WWW '15 Companion). pp.137-138, 2015. http://doi.org/10.1145/2740908.2742715

Code

Our code for producing the experimental results is available at: https://github.com/yamaguchiyuto/tagf

Note

If you would like to use a larger dataset, the dataset on 1 million seed users is available at: http://dx.doi.org/10.5281/zenodo.16267 (The dataset on 0.1 million seed users is not a subset of the dataset on 1 million seed users.)
Data from: DISMISS: Database of Indian Social Media Influencers on Twitter
dataverse.harvard.edu
dataone.org
Updated Apr 4, 2022
Cite
Arshia Arya; Soham De; Dibyendu Mishra; Gazal Shekhawat; Ankur Sharma; Anmol Panda; Faisal M Lalani; Parantak Singh; Ramaravind Kommiya Mothilal; Rynaa Grover; Sachita Nishal; Saloni Dash; Shehla Rashid Shora; Syeda Zainab Akbar; Joyojeet Pal (2022). DISMISS: Database of Indian Social Media Influencers on Twitter [Dataset]. http://doi.org/10.7910/DVN/BPY2JY
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/BPY2JY
Dataset updated
Apr 4, 2022
Dataset provided by
Harvard Dataverse
Authors
Arshia Arya; Soham De; Dibyendu Mishra; Gazal Shekhawat; Ankur Sharma; Anmol Panda; Faisal M Lalani; Parantak Singh; Ramaravind Kommiya Mothilal; Rynaa Grover; Sachita Nishal; Saloni Dash; Shehla Rashid Shora; Syeda Zainab Akbar; Joyojeet Pal
License
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Databases of highly networked individuals have been indispensable in studying narratives and influence on social media. To support studies on Twitter in India, we present a systematically categorized database of accounts of influence on Twitter in India, identified and annotated through an iterative process of friends, networks, and self-described profile information, verified manually. We built an initial set of accounts based on the friend network of a seed set of accounts based on real-world renown in various fields, and then snowballed "friends of friends" multiple times, and rank ordered individuals based on the number of in-group connections, and overall followers. We then manually classified identified accounts under the categories of entertainment, sports, business, government, institutions, journalism, civil society accounts that have independent standing outside of social media, as well as a category of "digital first" referring to accounts that derive their primary influence from online activity. Overall, we annotated 11580 unique accounts across all categories. The database is useful for studying various questions related to the role of influencers in polarisation, misinformation, extreme speech, political discourse etc.
[Dataset] Analysis of ego-networks of two CS-related Twitter accounts
data.europa.eu
unknown
Updated Dec 24, 2022
Cite
Zenodo (2022). [Dataset] Analysis of ego-networks of two CS-related Twitter accounts [Dataset]. https://data.europa.eu/data/datasets/oai-zenodo-org-7372309?locale=ga
Explore at:
Available download formats: unknown (7024950)
Dataset updated
Dec 24, 2022
Dataset authored and provided by
Zenodo (http://zenodo.org/)
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explanation/Overview: This is the dataset for the analyses and results on Twitter ego-networks of two CS-related accounts (@EuCitSci & @SciStarter). The usernames have been anonymised.

Purpose: The purpose of this dataset is to provide the basis to reproduce the results reported in the associated deliverable. As such, it does not represent raw data, but rather files that already include certain analysis steps (like calculated degrees or other SNA-related measures), ready for analysis, visualisation and interpretation with R or any other network visualisation software (e.g., Gephi). The edges represent the follow relation and were retrieved using the Twitter API. All usernames except those of the two ego-accounts were anonymised by assigning a random number to each node. Due to the size of the network, we do not include any .gexf or .gml files in this upload, but rather resort to node and edge lists in the .json format.

Relatedness: The networks are the ego-networks for two related public accounts that are associated with citizen science (@EuCitSci & @SciStarter).

Content: In this Zenodo entry, two files can be found.

edges.json represents the edge list, with the columns:

source    target  wherefrom    account
EuCitSci  23929   friendslist  EuCitSci

Source and target are the necessary columns for the network creation and in this example indicate that EuCitSci follows 23929. wherefrom indicates the origin of this relation in the crawling (i.e., whether it was retrieved using the friends or follower list) and account indicates the ego-account it belongs to. Thus, the edges can also be separated using this account attribute as they represent two distinct networks.

nodes.json represents the nodes in the networks. The following data fields are contained:

username  CS  followers_count  friends_count  ...
1116      CS  689              514            ...

...  favourites_count  listed_count  statuses_count  ...
...  2601              18            2141            ...

...  degree  in_degree  out_degree  ...
...  21      5          16          ...

...  reciprocity  account
...  0.48         EuCitSci

Username represents the numerical and anonymised username, CS the community membership. The different counts (e.g., followers_count) indicate the number of followers the user had at the time of the retrieval by the Twitter API. degree refers to the degree in the network (similarly the in_degree and out_degree), while reciprocity refers to the number of mutual edges with respect to the total number of edges per node. Account is as described above.

Grouping: The data is grouped according to the ego-account it is associated with, as described above (i.e., the account attribute).
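As a rough sketch of how the per-node measures described above could be recomputed from an edge list, the code below filters edges by the account attribute and derives in_degree, out_degree, degree, and a reciprocity value. Several things here are assumptions rather than facts about the files: that edges.json is a JSON array of records with the four column names as keys, that the follower-list origin is labelled "followerslist", and that reciprocity is computed as twice the number of mutual pairs divided by the node's degree. The sample edges are invented.

```python
import json
from collections import defaultdict
from typing import Dict


def node_stats(edges: list, account: str) -> Dict[str, dict]:
    """Compute degree measures for one ego-network, selected by its account attribute."""
    out_nbrs = defaultdict(set)
    in_nbrs = defaultdict(set)
    for e in edges:
        if e["account"] != account:
            continue
        s, t = str(e["source"]), str(e["target"])
        # Sets ensure an edge found via both the friends and followers list counts once.
        out_nbrs[s].add(t)
        in_nbrs[t].add(s)

    stats = {}
    for node in set(out_nbrs) | set(in_nbrs):
        outs = out_nbrs.get(node, set())
        ins = in_nbrs.get(node, set())
        degree = len(outs) + len(ins)
        mutual = len(outs & ins)
        stats[node] = {
            "in_degree": len(ins),
            "out_degree": len(outs),
            "degree": degree,
            # Assumed definition: share of this node's edges that belong to a mutual pair.
            "reciprocity": 2 * mutual / degree if degree else 0.0,
        }
    return stats


# Invented sample; the "followerslist" label is a guess at the naming.
EDGES_JSON = """[
  {"source": "EuCitSci", "target": 23929, "wherefrom": "friendslist", "account": "EuCitSci"},
  {"source": 23929, "target": "EuCitSci", "wherefrom": "followerslist", "account": "EuCitSci"},
  {"source": "EuCitSci", "target": 42, "wherefrom": "friendslist", "account": "EuCitSci"},
  {"source": "SciStarter", "target": 42, "wherefrom": "friendslist", "account": "SciStarter"}
]"""

stats = node_stats(json.loads(EDGES_JSON), "EuCitSci")
print(stats["EuCitSci"])
```

Under this assumed definition, the example row shown above (in_degree 5, out_degree 16, degree 21) would give 10/21, roughly 0.48, if all five incoming edges were reciprocated; that is consistent with the listed value but does not confirm the formula.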
Radical Right On Twitter (ROT)
data.niaid.nih.gov
Updated May 30, 2022
Cite
Laila Sprejer; Helen Margetts; Kleber Oliveira; David O'Sullivan; Bertie Vidgen (2022). Radical Right On Twitter (ROT) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6393084
Explore at:
Dataset updated
May 30, 2022
Dataset provided by
MASCI, University of Limerick
The Alan Turing Institute
Authors
Laila Sprejer; Helen Margetts; Kleber Oliveira; David O'Sullivan; Bertie Vidgen
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We collected the Radical Right On Twitter dataset (ROT7) to advance research into radical right activity online. The resource addresses a lack of data in this field, particularly data that relates to the activity of radical right actors. The dataset was funded without commercial support.

ROT follows six months of Twitter activity (8th July 2020 to 9th January 2021) from 35 radical right actors. We follow the advice given by Williams, Burnap, and Sloan (2017) for publishing Twitter data on sensitive topics.

ROT includes:

Actors' content: all content produced by the actors, including posts (n = 22,131), replies (n = 19,947), quotes (n = 11,314), and retweets (n = 37,283).

Actors' profiles: Twitter profile information for all 35 actors.

Actors' followers: a list of each actor's followers, collected each day (combined n = 6,592,056).

Actors' friends: a list of each actor's friends, collected each day (combined n = 262,856).

Direct engagement: all tweets which engage with actors, including replies, quotes and retweets (n = 31,443,828).

Engagers’ followers: List of followers of every user who replied, quoted or retweeted actors' content. We only collected users' list of followers once, even if they engaged with the actors multiple times during the period studied.

Other engagement: all other tweets collected through Twitter API that mentions an actor (n = 10,939,868).
Logistic regression model with all predictors using data without outliers.
plos.figshare.com
xls
Updated Jun 2, 2023
Cite
Karolina Sylwester; Matthew Purver (2023). Logistic regression model with all predictors using data without outliers. [Dataset]. http://doi.org/10.1371/journal.pone.0137422.t007
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0137422.t007
Dataset updated
Jun 2, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Karolina Sylwester; Matthew Purver
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Republican followers were coded as 0 and Democrat followers as 1. Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1. Logistic regression model with all predictors using data without outliers.
Analyzing Twitter Ad Performance for the Last 9 Months - Data Analysis
tomtunguz.com
Updated Aug 26, 2013
Cite
Tomasz Tunguz (2013). Analyzing Twitter Ad Performance for the Last 9 Months - Data Analysis [Dataset]. https://tomtunguz.com/twitter-ads-performance/
Explore at:
Dataset updated
Aug 26, 2013
Dataset provided by
Theory Ventures
Authors
Tomasz Tunguz
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analyze 9 months of Twitter ad performance data: $2.9k spent, 1,650 followers gained at $1.73 CPF. Key insights on social media ROI for startups.
Twitter/X Dataset on Public Conversation Around COP30 (#COP30noBrasil)
data.mendeley.com
Updated Nov 26, 2025
Cite
Rafael Carrasco (2025). Twitter/X Dataset on Public Conversation Around COP30 (#COP30noBrasil) [Dataset]. http://doi.org/10.17632/t5kcr3nkfc.1
Explore at:
Unique identifier
https://doi.org/10.17632/t5kcr3nkfc.1
Dataset updated
Nov 26, 2025
Authors
Rafael Carrasco
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
This dataset contains all public posts collected from the social media platform X (formerly Twitter) that included the official hashtag #COP30noBrasil during the COP30 climate summit, between 10 and 21 November 2025. The data were retrieved using Tweet Binder and comprise a total of 1,139 interactions, including original posts, retweets and replies. For each entry, the dataset includes metadata describing user characteristics (such as verification status and follower–following ratio), content features (format, text, language), and interaction metrics (likes, reposts, replies). The dataset also includes derived analytical variables used in the associated research article. These include engagement, calculated as a percentage based on interaction metrics, and sentiment polarity, computed using the VADER (Valence Aware Dictionary and sEntiment Reasoner) Python library. The text fields were kept in their original languages, reflecting the multilingual nature of the conversation around COP30. This resource enables the examination of public discourse on climate negotiations, content diffusion dynamics, and the emotional tone of climate-related communication. It provides a structured and reusable dataset for researchers interested in climate communication, digital public spheres, social media analytics, and environmental politics.
Twitter Language Use Reflects Psychological Differences between Democrats...
plos.figshare.com
docx
Updated May 30, 2023
Cite
Karolina Sylwester; Matthew Purver (2023). Twitter Language Use Reflects Psychological Differences between Democrats and Republicans [Dataset]. http://doi.org/10.1371/journal.pone.0137422
Explore at:
Available download formats: docx
Unique identifier
https://doi.org/10.1371/journal.pone.0137422
Dataset updated
May 30, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Karolina Sylwester; Matthew Purver
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Previous research has shown that political leanings correlate with various psychological factors. While surveys and experiments provide a rich source of information for political psychology, data from social networks can offer more naturalistic and robust material for analysis. This research investigates psychological differences between individuals of different political orientations on a social networking platform, Twitter. Based on previous findings, we hypothesized that the language used by liberals emphasizes their perception of uniqueness, contains more swear words, more anxiety-related words and more feeling-related words than conservatives’ language. Conversely, we predicted that the language of conservatives emphasizes group membership and contains more references to achievement and religion than liberals’ language. We analysed Twitter timelines of 5,373 followers of three Twitter accounts of the American Democratic and 5,386 followers of three accounts of the Republican parties’ Congressional Organizations. The results support most of the predictions and previous findings, confirming that Twitter behaviour offers valid insights to offline behaviour.
Number of Twitter followers of the Miami Marlins 2011-2024
statista.com
Cite
Statista, Number of Twitter followers of the Miami Marlins 2011-2024 [Dataset]. https://www.statista.com/statistics/274792/twitter-followers-of-the-miami-marlins/
Explore at:
Dataset authored and provided by
Statista (http://statista.com/)
Area covered
United States
Description
The number of X (Twitter) followers of the Major League Baseball team Miami Marlins increased substantially from September 2011 to November 2024. In the last recorded month, the team's social media account had around 0.42 million followers.
Random twitter user and friends data
kaggle.com
zip
Updated Apr 29, 2023
Cite
Selahattin Can Ölçer (2023). Random twitter user and friends data [Dataset]. https://www.kaggle.com/datasets/selahattincanler/random-twitter-user-and-friends-data
Explore at:
Available download formats: zip (11746 bytes)
Dataset updated
Apr 29, 2023
Authors
Selahattin Can Ölçer
Description
First column: user. Second column: friends of that user (2 randomly selected people).
Initial logistic regression model.
figshare.com
plos.figshare.com
xls
Updated Jun 11, 2023
Cite
Karolina Sylwester; Matthew Purver (2023). Initial logistic regression model. [Dataset]. http://doi.org/10.1371/journal.pone.0137422.t005
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0137422.t005
Dataset updated
Jun 11, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Karolina Sylwester; Matthew Purver
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Republican followers were coded as 0 and Democrat followers as 1. Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1. Initial logistic regression model.
The relationships between journalists’ ideology and their Twitter friends’...
plos.figshare.com
xls
Updated Oct 18, 2023
Cite
Qin Li; Hans J. G. Hassell; Robert M. Bond (2023). The relationships between journalists’ ideology and their Twitter friends’ and followers’ ideology. [Dataset]. http://doi.org/10.1371/journal.pone.0291544.t003
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0291544.t003
Dataset updated
Oct 18, 2023
Dataset provided by
PLOS ONE
Authors
Qin Li; Hans J. G. Hassell; Robert M. Bond
License
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The relationships between journalists’ ideology and their Twitter friends’ and followers’ ideology.

