In March 2024, X's web page Twitter.com had *** billion website visits worldwide, up from *** billion site visits the previous month. Formerly known as Twitter, X is a microblogging and social networking service that allows most of its users to write short posts with a maximum of 280 characters.
In the six months ending March 2024, the United States accounted for 23.21 percent of web traffic to the Twitter.com URL. Japan ranked second, accounting for 16.06 percent of traffic to the web page, and was followed by the United Kingdom, representing 5.51 percent of the web address online volume.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Author: Víctor Yeste. Universitat Politècnica de Valencia.The object of this study is the design of a cybermetric methodology whose objectives are to measure the success of the content published in online media and the possible prediction of the selected success variables.In this case, due to the need to integrate data from two separate areas, such as web publishing and the analysis of their shares and related topics on Twitter, has opted for programming as you access both the Google Analytics v4 reporting API and Twitter Standard API, always respecting the limits of these.The website analyzed is hellofriki.com. It is an online media whose primary intention is to solve the need for information on some topics that provide daily a vast number of news in the form of news, as well as the possibility of analysis, reports, interviews, and many other information formats. All these contents are under the scope of the sections of cinema, series, video games, literature, and comics.This dataset has contributed to the elaboration of the PhD Thesis:Yeste Moreno, VM. (2021). Diseño de una metodología cibermétrica de cálculo del éxito para la optimización de contenidos web [Tesis doctoral]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/176009Data have been obtained from each last-minute news article published online according to the indicators described in the doctoral thesis. All related data are stored in a database, divided into the following tables:tesis_followers: User ID list of media account followers.tesis_hometimeline: data from tweets posted by the media account sharing breaking news from the web.status_id: Tweet IDcreated_at: date of publicationtext: content of the tweetpath: URL extracted after processing the shortened URL in textpost_shared: Article ID in WordPress that is being sharedretweet_count: number of retweetsfavorite_count: number of favoritestesis_hometimeline_other: data from tweets posted by the media account that do not share breaking news from the web. Other typologies, automatic Facebook shares, custom tweets without link to an article, etc. With the same fields as tesis_hometimeline.tesis_posts: data of articles published by the web and processed for some analysis.stats_id: Analysis IDpost_id: Article ID in WordPresspost_date: article publication date in WordPresspost_title: title of the articlepath: URL of the article in the middle webtags: Tags ID or WordPress tags related to the articleuniquepageviews: unique page viewsentrancerate: input ratioavgtimeonpage: average visit timeexitrate: output ratiopageviewspersession: page views per sessionadsense_adunitsviewed: number of ads viewed by usersadsense_viewableimpressionpercent: ad display ratioadsense_ctr: ad click ratioadsense_ecpm: estimated ad revenue per 1000 page viewstesis_stats: data from a particular analysis, performed at each published breaking news item. Fields with statistical values can be computed from the data in the other tables, but total and average calculations are saved for faster and easier further processing.id: ID of the analysisphase: phase of the thesis in which analysis has been carried out (right now all are 1)time: "0" if at the time of publication, "1" if 14 days laterstart_date: date and time of measurement on the day of publicationend_date: date and time when the measurement is made 14 days latermain_post_id: ID of the published article to be analysedmain_post_theme: Main section of the published article to analyzesuperheroes_theme: "1" if about superheroes, "0" if nottrailer_theme: "1" if trailer, "0" if notname: empty field, possibility to add a custom name manuallynotes: empty field, possibility to add personalized notes manually, as if some tag has been removed manually for being considered too generic, despite the fact that the editor put itnum_articles: number of articles analysednum_articles_with_traffic: number of articles analysed with traffic (which will be taken into account for traffic analysis)num_articles_with_tw_data: number of articles with data from when they were shared on the media’s Twitter accountnum_terms: number of terms analyzeduniquepageviews_total: total page viewsuniquepageviews_mean: average page viewsentrancerate_mean: average input ratioavgtimeonpage_mean: average duration of visitsexitrate_mean: average output ratiopageviewspersession_mean: average page views per sessiontotal: total of ads viewedadsense_adunitsviewed_mean: average of ads viewedadsense_viewableimpressionpercent_mean: average ad display ratioadsense_ctr_mean: average ad click ratioadsense_ecpm_mean: estimated ad revenue per 1000 page viewsTotal: total incomeretweet_count_mean: average incomefavorite_count_total: total of favoritesfavorite_count_mean: average of favoritesterms_ini_num_tweets: total tweets on the terms on the day of publicationterms_ini_retweet_count_total: total retweets on the terms on the day of publicationterms_ini_retweet_count_mean: average retweets on the terms on the day of publicationterms_ini_favorite_count_total: total of favorites on the terms on the day of publicationterms_ini_favorite_count_mean: average of favorites on the terms on the day of publicationterms_ini_followers_talking_rate: ratio of followers of the media Twitter account who have recently published a tweet talking about the terms on the day of publicationterms_ini_user_num_followers_mean: average followers of users who have spoken of the terms on the day of publicationterms_ini_user_num_tweets_mean: average number of tweets published by users who spoke about the terms on the day of publicationterms_ini_user_age_mean: average age in days of users who have spoken of the terms on the day of publicationterms_ini_ur_inclusion_rate: URL inclusion ratio of tweets talking about terms on the day of publicationterms_end_num_tweets: total tweets on terms 14 days after publicationterms_ini_retweet_count_total: total retweets on terms 14 days after publicationterms_ini_retweet_count_mean: average retweets on terms 14 days after publicationterms_ini_favorite_count_total: total bookmarks on terms 14 days after publicationterms_ini_favorite_count_mean: average of favorites on terms 14 days after publicationterms_ini_followers_talking_rate: ratio of media Twitter account followers who have recently posted a tweet talking about the terms 14 days after publicationterms_ini_user_num_followers_mean: average followers of users who have spoken of the terms 14 days after publicationterms_ini_user_num_tweets_mean: average number of tweets published by users who have spoken about the terms 14 days after publicationterms_ini_user_age_mean: the average age in days of users who have spoken of the terms 14 days after publicationterms_ini_ur_inclusion_rate: URL inclusion ratio of tweets talking about terms 14 days after publication.tesis_terms: data of the terms (tags) related to the processed articles.stats_id: Analysis IDtime: "0" if at the time of publication, "1" if 14 days laterterm_id: Term ID (tag) in WordPressname: Name of the termslug: URL of the termnum_tweets: number of tweetsretweet_count_total: total retweetsretweet_count_mean: average retweetsfavorite_count_total: total of favoritesfavorite_count_mean: average of favoritesfollowers_talking_rate: ratio of followers of the media Twitter account who have recently published a tweet talking about the termuser_num_followers_mean: average followers of users who were talking about the termuser_num_tweets_mean: average number of tweets published by users who were talking about the termuser_age_mean: average age in days of users who were talking about the termurl_inclusion_rate: URL inclusion ratio
Social network X/Twitter is particularly popular in the United States, and as of February 2025, the microblogging service had an audience reach of 103.9 million users in the country. Japan and the India were ranked second and third with more than 70 million and 25 million users respectively. Global Twitter usage As of the second quarter of 2021, X/Twitter had 206 million monetizable daily active users worldwide. The most-followed Twitter accounts include figures such as Elon Musk, Justin Bieber and former U.S. president Barack Obama. X/Twitter and politics X/Twitter has become an increasingly relevant tool in domestic and international politics. The platform has become a way to promote policies and interact with citizens and other officials, and most world leaders and foreign ministries have an official Twitter account. Former U.S. president Donald Trump used to be a prolific Twitter user before the platform permanently suspended his account in January 2021. During an August 2018 survey, 61 percent of respondents stated that Trump's use of Twitter as President of the United States was inappropriate.
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Updated daily
As of December 2022, X/Twitter's audience accounted for over *** million monthly active users worldwide. This figure was projected to ******** to approximately *** million by 2024, a ******* of around **** percent compared to 2022.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
With the increasing number of accidents in Indonesia, analysis of accident data is still need to considered and analyzed. Moreover, traffic accident information from social media such as Twitter is easy to obtain when compared to other data source from police or government institutions. In this work, we tried to create a traffic accident dataset. We do scraping on Twitter and get 157.613 tweets, then do a text processing which involves case folding, filtering, tokenizing, and stopword removal. To find out wheter the tweet is true about accident information, we take some data as testing data then do word embedding using FastText method and classification with SVM, KNN, and Naïve Bayes algorithms then test the model with K-fold validation and confusion matrix accuracy. The results show that the validation score and accuracy score is good with 86% and 88% when SVM algorithm with linear kernel was done. This result model then applied to all dataset containing 157,613 tweets taken through the scraping process since April 2019 until April 2020 and produce predictions that can be used for further research and uploaded to www.dodyagung.com/dataset/accident.
This dataset is part of my graduate thesis and international journal paper, please cite if you use this dataset. This is the example in American Psychological Association (APA), 6th Edition format.
Saputro, D. A., & Girsang, A. S. (2020). Classification of Traffic Accident Information Using Machine Learning from Social Media. International Journal of Emerging Trends in Engineering Research, 8(3), 630–637. https://doi.org/10.30534/ijeter/2020/04832020
Header Image Source : www.kornberglawfirm.com
Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
In December 2024, Facebook was the social media platform that directed the most traffic to third-party website links in Peru, with around ***** percent of the referrals. Pinterest and Instagram followed, providing around ***** percent and ***** percent of the traffic, respectively. Other networks generated smaller shares, such as YouTube with **** percent of social media referral traffic, and X (formerly Twitter), with **** percent of the clicks.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The database includes three datasets. All of them were extracted from a dataset published by X (Twitter Transparency Websites) that includes tweets from malicious accounts trying to manipulate public opinion in the Kingdom of Saudi Arabia. Although the propagandist tweets were published by malicious accounts, as X (Twitter) stated, the tweets at their level were not classified as propaganda or not. Propagandists usually mix propaganda and non-propaganda tweets in an attempt to hide their identities. Therefore, it was necessary to classify their tweets as propaganda or not, based on the propaganda technique used. Since the datasets are very large, we annotated a sample of 2,100 tweets. The datasets are made up of 16,355,558 tweets from propagandist users focused on sports and banking topics.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The object of this study is the design of a cybermetric methodology whose objectives are to measure the success of the content published in an online media and the possible prediction of the selected success variables.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
QAP regressions for popular websites (Alexa)/ videos (YouTube)/ topics (Twitter) similarity across countries (Final block, September).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Preliminary research efforts regarding Social Media Platforms and their contribution to website traffic in LAMs. Through the Similar Web API, the leading social networks (Facebook, Twitter, Youtube, Instagram, Reddit, Pinterest, LinkedIn) that drove traffic to each one of the 220 cases in our dataset were identified and analyzed in the first sheet. Aggregated results proved that Facebook platform was responsible for 46.1% of social traffic (second sheet).
In December 2023, Facebook was by far the social media platform that directed the most traffic to third-party website links in Colombia, with more than 65 percent of the referrals. Instagram and Pinterest followed, each providing around 7.67 percent and 5.22 percent of the traffic. Other networks generated smaller shares, such as YouTube with 3.51 percent of social media referral traffic, and Twitter, with 2.29 percent of clicks.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains a CSV file related to ChatGPT including keywords(chatgpt, chat gpt) #hashtags and @mentions about ChatGPT. OpenAI's conversational AI model. The file includes information on 500,000 tweets. The dataset aims to help understand public opinion, trends, and potential applications of ChatGPT by analyzing tweet volume, sentiment, user engagement, and the influence of key AI events. The dataset offers valuable insights for companies, researchers, and policymakers, allowing them to make informed decisions and shape the future of AI-powered conversational technologies.
Check out my Comprehensive Analysis on this dataset: Medium article "Cracking the ChatGPT Code: A Deep Dive into 500,000 Tweets using Advanced NLP Techniques"
Learn about the collection process in Medium article "Effortlessly Scraping Massive Twitter Data"
CC0
Original Data Source: 500k ChatGPT-related Tweets Jan-Mar 2023
As of March 2024, Twitter.com accounted for over 25.6 percent of social media referral traffic to Tumblr.com. Tumblr's second-largest social media traffic driver was youtube.com, which generated more than 19.9 percent of social media traffic to Tumblr. Overall, social media referrals made up 7.63 percent of the web traffic to the platform.
The number of Twitter users in the United States was forecast to continuously increase between 2024 and 2028 by in total 4.3 million users (+5.32 percent). After the ninth consecutive increasing year, the Twitter user base is estimated to reach 85.08 million users and therefore a new peak in 2028. Notably, the number of Twitter users of was continuously increasing over the past years.User figures, shown here regarding the platform twitter, have been estimated by taking into account company filings or press material, secondary research, app downloads and traffic data. They refer to the average monthly active users over the period.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of Twitter users in countries like Canada and Mexico.
The purpose of the dataset is identify tweets from the @Ma3Route twitter handles that report road traffic crash reports and to identify the location of the crashes. Using the Twitter API, tweets were scraped from Ma3Route, which is a mobile/web/SMS platform that crowdsources transport data and provides users with information on on road traffic crash reports as well as traffic, matatu directions, and driving reports. This dataset provides the tweet IDs of tweets from Ma3Route from May 2012 until July 2020.
Kenya (primarily Nairobi)
Road traffic crash reports
Tweets from Ma3Route
Observation data/ratings [obs]
Other [oth]
Provides information on router utilization for capacity planning.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Puff Bar, a disposable electronic nicotine delivery system (ENDS), was the ENDS brand most commonly used by U.S. youth in 2021. We explored whether Puff Bar’s rise in marketplace prominence was detectable through advertising, retail sales, social media, and web traffic data sources. We retrospectively documented potential signals of interest in and uptake of Puff Bar in the United States using metrics based on advertising (Numerator and Comperemedia), retail sales (NielsenIQ), social media (Twitter, via Sprinklr), and web traffic (Similarweb) data from January 2019 to June 2022. We selected metrics based on (1) data availability, (2) potential to graph metric longitudinally, and (3) variability in metric. We graphed metrics and assessed data patterns compared to data for Vuse, a comparator product, and in the context of regulatory events significant to Puff Bar. The number of Twitter posts that contained a Puff Bar term (social media), Puff Bar product sales measured in dollars (sales), and the number of visits to the Puff Bar website (web traffic) exhibited potential for surveilling Puff Bar due to ease of calculation, comprehensibility, and responsiveness to events. Advertising tracked through Numerator and Comperemedia did not appear to capture marketing from Puff Bar’s manufacturer or drive change in marketplace prominence. This study demonstrates how quantitative changes in metrics developed using advertising, retail sales, social media, and web traffic data sources detected changes in Puff Bar’s marketplace prominence. We conclude that low-effort, scalable, rapid signal detection capabilities can be an important part of a multi-component tobacco surveillance program.
Facebook was the leading social media website based on the share of visits in Estonia in 2023, having accounted for roughly 70 percent of the web traffic on third-party websites via clicks on social media sites. X, formerly Twitter, ranked second with around nine percent.
In March 2024, X's web page Twitter.com had *** billion website visits worldwide, up from *** billion site visits the previous month. Formerly known as Twitter, X is a microblogging and social networking service that allows most of its users to write short posts with a maximum of 280 characters.