100+ datasets found

Website Traffic
kaggle.com
zip
Updated Aug 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AnthonyTherrien (2024). Website Traffic [Dataset]. https://www.kaggle.com/datasets/anthonytherrien/website-traffic
Explore at:
zip(65228 bytes)Available download formats
Dataset updated
Aug 5, 2024
Authors
AnthonyTherrien
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Description
Dataset Overview

This dataset provides detailed information on website traffic, including page views, session duration, bounce rate, traffic source, time spent on page, previous visits, and conversion rate.

Dataset Description

Page Views: The number of pages viewed during a session.

Session Duration: The total duration of the session in minutes.

Bounce Rate: The percentage of visitors who navigate away from the site after viewing only one page.

Traffic Source: The origin of the traffic (e.g., Organic, Social, Paid).

Time on Page: The amount of time spent on the specific page.

Previous Visits: The number of previous visits by the same visitor.

Conversion Rate: The percentage of visitors who completed a desired action (e.g., making a purchase).

Data Summary

Total Records: 2000

Total Features: 7

Key Features

Page Views: This feature indicates the engagement level of the visitors by showing how many pages they visit during their session.

Session Duration: This feature measures the length of time a visitor stays on the website, which can indicate the quality of the content.

Bounce Rate: A critical metric for understanding user behavior. A high bounce rate may indicate that visitors are not finding what they are looking for.

Traffic Source: Understanding where your traffic comes from can help in optimizing marketing strategies.

Time on Page: This helps in analyzing which pages are retaining visitors' attention the most.

Previous Visits: This can be used to analyze the loyalty of visitors and the effectiveness of retention strategies.

Conversion Rate: The ultimate metric for measuring the effectiveness of the website in achieving its goals.

Usage

This dataset can be used for various analyses such as:

Identifying key drivers of engagement and conversion.

Analyzing the effectiveness of different traffic sources.

Understanding user behavior patterns and optimizing the website accordingly.

Improving marketing strategies based on traffic source performance.

Enhancing user experience by analyzing time spent on different pages.

Acknowledgments

This dataset was generated for educational purposes and is not from a real website. It serves as a tool for learning data analysis and machine learning techniques.
Data from: Website Traffic Analysis
kaggle.com
zip
Updated Sep 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bhanupratap Biswas (2024). Website Traffic Analysis [Dataset]. https://www.kaggle.com/datasets/bhanupratapbiswas/website-traffic-analysis
Explore at:
zip(5409593 bytes)Available download formats
Dataset updated
Sep 1, 2024
Authors
Bhanupratap Biswas
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Website Traffic Analysis

Website traffic analysis is the process of monitoring and evaluating the visitors to a website. It provides insights into how users are interacting with the site, where they are coming from, which pages they visit most often, and how long they stay. By analyzing this data, businesses can understand user behavior, improve site performance, and optimize content to increase engagement and conversions.

Key metrics include the number of visitors, page views, bounce rate, traffic sources (organic, referral, direct), and geographic location. Website traffic analysis is essential for enhancing SEO, refining marketing strategies, and boosting overall user experience.
Website traffic strategies by industry and size of enterprise
www150.statcan.gc.ca
datasets.ai
+1more
csv, html
Updated Jun 11, 2014
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Canada, Statistics Canada (2014). Website traffic strategies by industry and size of enterprise [Dataset]. http://doi.org/10.25318/2210001801-eng
Explore at:
csv, htmlAvailable download formats
Unique identifier
https://doi.org/10.25318/2210001801-eng
Dataset updated
Jun 11, 2014
Dataset provided by
Government of Canadahttp://www.gg.ca/
Statistics Canadahttps://statcan.gc.ca/en
Authors
Government of Canada, Statistics Canada
License
https://www.statcan.gc.ca/en/terms-conditions/open-licencehttps://www.statcan.gc.ca/en/terms-conditions/open-licence
Area covered
Canada
Description
Digital technology and Internet use, website traffic strategies, by North American Industry Classification System (NAICS) and size of enterprise for Canada from 2012 to 2013.
W
Website Traffic Analysis Tool Report
marketresearchforecast.com
doc, pdf, ppt
Updated Feb 26, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Forecast (2026). Website Traffic Analysis Tool Report [Dataset]. https://www.marketresearchforecast.com/reports/website-traffic-analysis-tool-541802
Explore at:
pdf, ppt, docAvailable download formats
Dataset updated
Feb 26, 2026
Dataset authored and provided by
Market Research Forecast
License
https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy
Time period covered
2026 - 2034
Area covered
Global
Variables measured
Market Size
Description
Unlock explosive website growth! Discover the booming $15 billion website traffic analysis tool market, projected to reach $45 billion by 2033. Explore key trends, leading companies (Semrush, Ahrefs, Google Analytics), and regional insights in our comprehensive market analysis.
W
Website Traffic Analysis Tool Report
archivemarketresearch.com
doc, pdf, ppt
Updated Jan 6, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2026). Website Traffic Analysis Tool Report [Dataset]. https://www.archivemarketresearch.com/reports/website-traffic-analysis-tool-30314
Explore at:
doc, pdf, pptAvailable download formats
Dataset updated
Jan 6, 2026
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2026 - 2034
Area covered
Global
Variables measured
Market Size
Description
The size of the Website Traffic Analysis Tool market was valued at USD XXX million in 2024 and is projected to reach USD XXX million by 2033, with an expected CAGR of XX % during the forecast period.
d
NYC.gov Web Analytics
catalog.data.gov
data.cityofnewyork.us
+2more
Updated Sep 30, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
data.cityofnewyork.us (2022). NYC.gov Web Analytics [Dataset]. https://catalog.data.gov/dataset/nyc-gov-web-analytics
Explore at:
Dataset updated
Sep 30, 2022
Dataset provided by
data.cityofnewyork.us
Area covered
New York
Description
Web traffic statistics for the top 2000 most visited pages on nyc.gov by month.
Network Traffic Analytics Market Size, Share, Trends Industry Analysis...
technavio.com
pdf
Updated Jun 21, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Technavio (2018). Network Traffic Analytics Market Size, Share, Trends Industry Analysis Forecast 2022 Technavio [Dataset]. https://www.technavio.com/report/global-network-traffic-analytics-market-analysis-share-2018
Explore at:
pdfAvailable download formats
Dataset updated
Jun 21, 2018
Dataset provided by
TechNavio
Authors
Technavio
License
https://www.technavio.com/content/privacy-noticehttps://www.technavio.com/content/privacy-notice
Description
snapshot-tab-pane Global network traffic analytics Industry OverviewTechnavio’s analysts have identified the increasing use of network traffic analytics solutions to be one of major factors driving market growth. With the rapidly changing IT infrastructure, security hackers can steal valuable information through various modes. With the increasing dependence on web applications and websites for day-to-day activities and financial transactions, the instances of theft have increased globally. Also, the emergence of social networking websites has aided the malicious attackers to extract valuable information from vulnerable users. The increasing consumer dependence on web applications and websites for day-to-day activities and financial transactions are further increasing the risks of theft. This encourages the organizations to adopt network traffic analytics solutions. Want a bigger picture? Try a FREE sample of this report now!See the complete table of contents and list of exhibits, as well as selected illustrations and example pages from this report. Companies coveredThe network traffic analytics market is fairly concentrated due to the presence of few established companies offering innovative and differentiated software and services. By offering a complete analysis of the competitiveness of the players in the network monitoring tools market offering varied software and services, this network traffic analytics industry analysis report will aid clients identify new growth opportunities and design new growth strategies. The report offers a complete analysis of a number of companies including: AllotCisco SystemsIBMJuniper NetworksMicrosoftSymantecNetwork traffic analytics market growth based on geographic regionsAmericasAPACEMEAWith a complete study of the growth opportunities for the companies across regions such as the Americas, APAC, and EMEA, our industry research analysts have estimated that countries in the Americas will contribute significantly to the growth of the network monitoring tools market throughout the predicted period.Network traffic analytics market growth based on end-userTelecomBFSIHealthcareMedia and entertainmentAccording to our market research experts, the telecom end-user industry will be the major end-user of the network monitoring tools market throughout the forecast period. Factors such as increasing use of network traffic analytics solutions and increasing use of mobile devices at workplaces will contribute to the growth of the market shares of the telecom industry in the network traffic analytics market. Key highlights of the global network traffic analytics market for the forecast years 2018-2022:CAGR of the market during the forecast period 2018-2022Detailed information on factors that will accelerate the growth of the network traffic analytics market during the next five yearsPrecise estimation of the global network traffic analytics market size and its contribution to the parent marketAccurate predictions on upcoming trends and changes in consumer behaviorGrowth of the network traffic analytics industry across various geographies such as the Americas, APAC, and EMEAA thorough analysis of the market’s competitive landscape and detailed information on several vendorsComprehensive information about factors that will challenge the growth of network traffic analytics companies Get more value with Technavio’s INSIGHTS subscription platform! Gain easy access to all of Technavio’s reports, along with on-demand services.Try the demo This market research report analyzes the market outlook and provides a list of key trends, drivers, and challenges that are anticipated to impact the global network traffic analytics market and its stakeholders over the forecast years.The global network traffic analytics market analysts at Technavio have also considered how the performance of other related markets in the vertical will impact the size of this market till 2022. Some of the markets most likely to influence the growth of the network traffic analytics market over the coming years are the Global Network as a Service Marketand the Global Data Analytics Outsourcing Market. Technavio’s collection of market research reports offer insights into the growth of markets across various industries. Additionally, we also provide customized reports based on the specific requirement of our clients.
t
Measuring Dark Social Using Google Analytics - Data Analysis
tomtunguz.com
Updated Mar 18, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tomasz Tunguz (2013). Measuring Dark Social Using Google Analytics - Data Analysis [Dataset]. https://tomtunguz.com/dark-social-on-google-analytics/
Explore at:
Dataset updated
Mar 18, 2013
Dataset provided by
Theory Ventures
Authors
Tomasz Tunguz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Learn how to measure dark social traffic in Google Analytics, which drives 40% of blog visitors. Essential tracking insights for startup founders and content marketers.
i
Data from: In-browser and network traffic based web response time...
ieee-dataport.org
Updated May 18, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carlos Lopez (2022). In-browser and network traffic based web response time measurements [Dataset]. https://ieee-dataport.org/open-access/browser-and-network-traffic-based-web-response-time-measurements
Explore at:
Dataset updated
May 18, 2022
Authors
Carlos Lopez
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
out of which 20 used plaintext HTTP browsing
Monthly web traffic to vestiairecollective.com 2024
statista.com
Updated Nov 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Monthly web traffic to vestiairecollective.com 2024 [Dataset]. https://www.statista.com/statistics/1549385/monthly-web-visits-to-vestiaire-collective/
Explore at:
Dataset updated
Nov 28, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Sep 2024 - Nov 2024
Area covered
Worldwide
Description
In the measured time period, November 2024 saw the highest figures for online traffic to the C2C fashion marketplace vestiairecollective.com. According to the data, desktop and mobile visits to vestiairecollective.com reached **** million visits that month.
Network Traffic Dataset
kaggle.com
zip
Updated Oct 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ravikumar Gattu (2023). Network Traffic Dataset [Dataset]. https://www.kaggle.com/datasets/ravikumargattu/network-traffic-dataset
Explore at:
zip(6783827 bytes)Available download formats
Dataset updated
Oct 31, 2023
Authors
Ravikumar Gattu
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

The data presented here was obtained in a Kali Machine from University of Cincinnati,Cincinnati,OHIO by carrying out packet captures for 1 hour during the evening on Oct 9th,2023 using Wireshark.This dataset consists of 394137 instances were obtained and stored in a CSV (Comma Separated Values) file.This large dataset could be used utilised for different machine learning applications for instance classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

The dataset can be used for a variety of machine learning tasks, such as network intrusion detection, traffic classification, and anomaly detection.

Content :

This network traffic dataset consists of 7 features.Each instance contains the information of source and destination IP addresses, The majority of the properties are numeric in nature, however there are also nominal and date kinds due to the Timestamp.

The network traffic flow statistics (No. Time Source Destination Protocol Length Info) were obtained using Wireshark (https://www.wireshark.org/).

Dataset Columns:

No : Number of Instance. Timestamp : Timestamp of instance of network traffic Source IP: IP address of Source Destination IP: IP address of Destination Portocol: Protocol used by the instance Length: Length of Instance Info: Information of Traffic Instance

Acknowledgements :

I would like thank University of Cincinnati for giving the infrastructure for generation of network traffic data set.

Ravikumar Gattu , Susmitha Choppadandi

Inspiration : This dataset goes beyond the majority of network traffic classification datasets, which only identify the type of application (WWW, DNS, ICMP,ARP,RARP) that an IP flow contains. Instead, it generates machine learning models that can identify specific applications (like Tiktok,Wikipedia,Instagram,Youtube,Websites,Blogs etc.) from IP flow statistics (there are currently 25 applications in total).

**Dataset License: ** CC0: Public Domain

Dataset Usages : This dataset can be used for different machine learning applications in the field of cybersecurity such as classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

ML techniques benefits from this Dataset :

This dataset is highly useful because it consists of 394137 instances of network traffic data obtained by using the 25 applications on a public,private and Enterprise networks.Also,the dataset consists of very important features that can be used for most of the applications of Machine learning in cybersecurity.Here are few of the potential machine learning applications that could be benefited from this dataset are :

Network Performance Monitoring : This large network traffic data set can be utilised for analysing the network traffic to identifying the network patterns in the network .This help in designing the network security algorithms for minimise the network probelms.

Anamoly Detection : Large network traffic dataset can be utilised training the machine learning models for finding the irregularitues in the traffic which could help identify the cyber attacks.

3.Network Intrusion Detection : This large dataset could be utilised for machine algorithms training and designing the models for detection of the traffic issues,Malicious traffic network attacks and DOS attacks as well.
Data from: Analysis of the Quantitative Impact of Social Networks General...
figshare.com
produccioncientifica.ucm.es
doc
Updated Oct 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David Parra; Santiago Martínez Arias; Sergio Mena Muñoz (2022). Analysis of the Quantitative Impact of Social Networks General Data.doc [Dataset]. http://doi.org/10.6084/m9.figshare.21329421.v1
Explore at:
docAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.21329421.v1
Dataset updated
Oct 14, 2022
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
David Parra; Santiago Martínez Arias; Sergio Mena Muñoz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
General data recollected for the studio " Analysis of the Quantitative Impact of Social Networks on Web Traffic of Cybermedia in the 27 Countries of the European Union". Four research questions are posed: what percentage of the total web traffic generated by cybermedia in the European Union comes from social networks? Is said percentage higher or lower than that provided through direct traffic and through the use of search engines via SEO positioning? Which social networks have a greater impact? And is there any degree of relationship between the specific weight of social networks in the web traffic of a cybermedia and circumstances such as the average duration of the user's visit, the number of page views or the bounce rate understood in its formal aspect of not performing any kind of interaction on the visited page beyond reading its content? To answer these questions, we have first proceeded to a selection of the cybermedia with the highest web traffic of the 27 countries that are currently part of the European Union after the United Kingdom left on December 31, 2020. In each nation we have selected five media using a combination of the global web traffic metrics provided by the tools Alexa (https://www.alexa.com/), which ceased to be operational on May 1, 2022, and SimilarWeb (https:// www.similarweb.com/). We have not used local metrics by country since the results obtained with these first two tools were sufficiently significant and our objective is not to establish a ranking of cybermedia by nation but to examine the relevance of social networks in their web traffic. In all cases, cybermedia whose property corresponds to a journalistic company have been selected, ruling out those belonging to telecommunications portals or service providers; in some cases they correspond to classic information companies (both newspapers and televisions) while in others they refer to digital natives, without this circumstance affecting the nature of the research proposed.
Below we have proceeded to examine the web traffic data of said cybermedia. The period corresponding to the months of October, November and December 2021 and January, February and March 2022 has been selected. We believe that this six-month stretch allows possible one-time variations to be overcome for a month, reinforcing the precision of the data obtained. To secure this data, we have used the SimilarWeb tool, currently the most precise tool that exists when examining the web traffic of a portal, although it is limited to that coming from desktops and laptops, without taking into account those that come from mobile devices, currently impossible to determine with existing measurement tools on the market. It includes:

Web traffic general data: average visit duration, pages per visit and bounce rate Web traffic origin by country Percentage of traffic generated from social media over total web traffic Distribution of web traffic generated from social networks Comparison of web traffic generated from social netwoks with direct and search procedures
a
TMS daily traffic counts CSV
hub.arcgis.com
catalogue.data.govt.nz
+1more
Updated Aug 30, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Waka Kotahi (2020). TMS daily traffic counts CSV [Dataset]. https://hub.arcgis.com/datasets/9cb86b342f2d4f228067a7437a7f7313
Explore at:
Dataset updated
Aug 30, 2020
Dataset authored and provided by
Waka Kotahi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
You can also access an API version of this dataset.

TMS

(traffic monitoring system) daily-updated traffic counts API

Important note: due to the size of this dataset, you won't be able to open it fully in Excel. Use notepad / R / any software package which can open more than a million rows.

Data reuse caveats: as per license.

Data quality

statement: please read the accompanying user manual, explaining:

how

this data is collected identification of count stations traffic monitoring technology monitoring hierarchy and conventions typical survey specification data calculation TMS operation.

Traffic

monitoring for state highways: user manual

[PDF 465 KB]

The data is at daily granularity. However, the actual update

frequency of the data depends on the contract the site falls within. For telemetry

sites it's once a week on a Wednesday. Some regional sites are fortnightly, and

some monthly or quarterly. Some are only 4 weeks a year, with timing depending

on contractors’ programme of work.

Data quality caveats: you must use this data in

conjunction with the user manual and the following caveats.

The

road sensors used in data collection are subject to both technical errors and environmental interference.Data is compiled from a variety of sources. Accuracy may vary and the data should only be used as a guide.As not all road sections are monitored, a direct calculation of Vehicle Kilometres Travelled (VKT) for a region is not possible.Data is sourced from Waka Kotahi New Zealand Transport Agency TMS data.For sites that use dual loops classification is by length. Vehicles with a length of less than 5.5m are classed as light vehicles. Vehicles over 11m long are classed as heavy vehicles. Vehicles between 5.5 and 11m are split 50:50 into light and heavy.In September 2022, the National Telemetry contract was handed to a new contractor. During the handover process, due to some missing documents and aged technology, 40 of the 96 national telemetry traffic count sites went offline. Current contractor has continued to upload data from all active sites and have gradually worked to bring most offline sites back online. Please note and account for possible gaps in data from National Telemetry Sites.

The NZTA Vehicle

Classification Relationships diagram below shows the length classification (typically dual loops) and axle classification (typically pneumatic tube counts),

and how these map to the Monetised benefits and costs manual, table A37,

page 254.

Monetised benefits and costs manual [PDF 9 MB]

For the full TMS

classification schema see Appendix A of the traffic counting manual vehicle

classification scheme (NZTA 2011), below.

Traffic monitoring for state highways: user manual [PDF 465 KB]

State highway traffic monitoring (map)

State highway traffic monitoring sites
app-measurement.com Website Traffic, Ranking, Analytics [January 2026]
semrush.ebundletools.com
Updated Feb 20, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Semrush (2026). app-measurement.com Website Traffic, Ranking, Analytics [January 2026] [Dataset]. https://semrush.ebundletools.com/website/app-measurement.com/overview/
Explore at:
Dataset updated
Feb 20, 2026
Dataset authored and provided by
Semrushhttps://fr.semrush.com/
License
https://semrush.ebundletools.com/company/legal/terms-of-service/https://semrush.ebundletools.com/company/legal/terms-of-service/
Time period covered
Feb 20, 2026
Area covered
Worldwide
Variables measured
visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
Measurement technique
Semrush Traffic Analytics; Click-stream data
Description
app-measurement.com is ranked #11840 in HK with 25.88K Traffic. Categories: . Learn more about website traffic, market share, and more!
Monthly web traffic to hm.com in 2025
statista.com
Updated Feb 3, 2026
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2026). Monthly web traffic to hm.com in 2025 [Dataset]. https://www.statista.com/statistics/1496371/monthly-web-visits-to-hm/
Explore at:
Dataset updated
Feb 3, 2026
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Oct 2025 - Dec 2025
Area covered
Worldwide
Description
In the measured time period, November 2025 saw the highest figures for online traffic to the fashion retail website hm.com. According to the data, desktop and mobile visits to hm.com reached nearly *** million visits that month.
Share of e-commerce traffic worldwide 2019, by source and medium
statista.com
Updated Nov 28, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Share of e-commerce traffic worldwide 2019, by source and medium [Dataset]. https://www.statista.com/statistics/820293/online-traffic-source-and-medium-e-commerce-sessions/
Explore at:
Dataset updated
Nov 28, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Nov 2018 - Oct 2019
Area covered
Worldwide
Description
This statistic presents the distribution of global e-commerce sessions as of October 2019, by source and medium. During the measured period, search traffic generated ** percent of total e-commerce session. Overall, ** percent were generated through organic search traffic and ** percent were generated through paid search.
Global share of human and bot web traffic 2013-2024
statista.com
Updated Nov 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Global share of human and bot web traffic 2013-2024 [Dataset]. https://www.statista.com/statistics/1264226/human-and-bot-web-traffic-share/
Explore at:
Dataset updated
Nov 25, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
In 2024, most of the global website traffic was still generated by humans, but bot traffic is constantly growing. Fraudulent traffic through bad bot actors accounted for 37 percent of global web traffic in the most recently measured period, representing an increase of 12 percent from the previous year. Sophistication of Bad Bots on the rise The complexity of malicious bot activity has dramatically increased in recent years. Advanced bad bots have doubled in prevalence over the past 2 years, indicating a surge in the sophistication of cyber threats. Simultaneously, the share of simple bad bots drastically increased over the last years, suggesting a shift in the landscape of automated threats. Meanwhile, areas like food and groceries, sports, gambling, and entertainment faced the highest amount of advanced bad bots, with more than 70 percent of their bot traffic affected by evasive applications. Good and bad bots across industries The impact of bot traffic varies across different sectors. Bad bots accounted for over 50 percent of the telecom and ISPs, community and society, and computing and IT segments web traffic. However, not all bot traffic is considered bad. Some of these applications help index websites for search engines or monitor website performance, assisting users throughout their online search. Therefore, areas like entertainment, food and groceries, and even areas targeted by bad bots themselves experienced notable levels of good bot traffic, demonstrating the diverse applications of benign automated systems across different sectors.
Z
Network Traffic Analysis: Data and Code
data-staging.niaid.nih.gov
data.niaid.nih.gov
Updated Jun 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Moran, Madeline; Honig, Joshua; Ferrell, Nathan; Soni, Shreena; Homan, Sophia; Chan-Tin, Eric (2024). Network Traffic Analysis: Data and Code [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_11479410
Explore at:
Dataset updated
Jun 12, 2024
Dataset provided by
Homan, Sophia
Moran, Madeline
Soni, Shreena
Chan-Tin, Eric
Ferrell, Nathan
Honig, Joshua
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Code:

Packet_Features_Generator.py & Features.py

To run this code:

pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j

-h, --help show this help message and exit -i TXTFILE input text file -x X Add first X number of total packets as features. -y Y Add first Y number of negative packets as features. -z Z Add first Z number of positive packets as features. -ml Output to text file all websites in the format of websiteNumber1,feature1,feature2,... -s S Generate samples using size s. -j

Purpose:

Turns a text file containing lists of incomeing and outgoing network packet sizes into separate website objects with associative features.

Uses Features.py to calcualte the features.

startMachineLearning.sh & machineLearning.py

To run this code:

bash startMachineLearning.sh

This code then runs machineLearning.py in a tmux session with the nessisary file paths and flags

Options (to be edited within this file):

--evaluate-only to test 5 fold cross validation accuracy

--test-scaling-normalization to test 6 different combinations of scalers and normalizers

Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use

--grid-search to test the best grid search hyperparameters - note: the possible hyperparameters must be added to train_model under 'if not evaluateOnly:' - once best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'

Purpose:

Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest Classifier on the provided data and provides results using cross validation. These results include the best scaling and normailzation options for each data set as well as the best grid search hyperparameters based on the provided ranges.

Data

Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queres (collected in the form of their autocomplete results and their results page), and different actions taken on a Virtual Reality head set.

Data for this experiment was stored and analyzed in the form of a txt file for each experiment which contains:

First number is a classification number to denote what website, query, or vr action is taking place.

The remaining numbers in each line denote:

The size of a packet,

and the direction it is traveling.

negative numbers denote incoming packets

positive numbers denote outgoing packets

Figure 4 Data

This data uses specific lines from the Virtual Reality.txt file.

The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.

The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.

The .xlsx and .csv file are identical

Each file includes (from right to left):

The origional packet data,

each line of data organized from smallest to largest packet size in order to calculate the mean and standard deviation of each packet capture,

and the final Cumulative Distrubution Function (CDF) caluclation that generated the Figure 4 Graph.
GTT23: A 2023 Dataset of Genuine Tor Traces
zenodo.org
data.niaid.nih.gov
+1more
Updated Apr 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rob Jansen; Rob Jansen; Ryan Wails; Ryan Wails; Aaron Johnson; Aaron Johnson (2024). GTT23: A 2023 Dataset of Genuine Tor Traces [Dataset]. http://doi.org/10.5281/zenodo.10620520
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.10620520
Dataset updated
Apr 11, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Rob Jansen; Rob Jansen; Ryan Wails; Ryan Wails; Aaron Johnson; Aaron Johnson
Time period covered
2023
Description
The GTT23 dataset contains network metadata of encrypted traffic measured from exit relays in the Tor network over a 13-week measurement period in 2023. The metadata is suitable for analyzing and evaluating website fingerprinting attacks and defenses.

Our dataset measurement process was designed to prioritize safety and privacy and was developed through consultation with the Tor Research Safety Board (TRSB, submission #37). Our TRSB interaction resulted in a “No Objections” score.

The measurement process, additional safety and ethical considerations, and a statistical analysis of the dataset will be presented in further detail in a forthcoming publication.
Traffic Monitoring Guide 2022/Updated 2024
planningdivisiongis-nmdot.hub.arcgis.com
Updated Dec 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NMDOT ArcGIS Online (2025). Traffic Monitoring Guide 2022/Updated 2024 [Dataset]. https://planningdivisiongis-nmdot.hub.arcgis.com/documents/2d072f1ddba841c0889c624990e40779
Explore at:
Dataset updated
Dec 17, 2025
Dataset provided by
New Mexico Department of Transportationhttps://www.dot.nm.gov/
Authors
NMDOT ArcGIS Online
Description
This edition of the Traffic Monitoring Guide (TMG) is intended to provide the most up-to-date guidance to State highway agencies about the policies, standards, procedures, reporting, and equipment utilized in a traffic monitoring program. The TMG presents recommendations that help improve and advance current programs with a view toward the future of traffic monitoring and with consideration for transportation regulations resulting from the Fixing America's Surface Transportation (FAST) Act and its predecessors.The needs for traffic monitoring data at both the Federal, State, and local levels require that agencies have a well-designed traffic monitoring program. Traffic data are needed to assess current and past system performance and to predict future performance. Improved traffic data, including data on ramps, are needed for reporting in the Highway Performance Monitoring System (HPMS), and there are now opportunities to use traffic data acquired from Intelligent Transportation Systems (ITS) to support coordination of planning and operations functions at the various agency levels.Continued improvements in traffic data collection technology have allowed States to improve their data collection processes and to streamline quality assurance and quality control (QA/QC) procedures. New technology also enables States to collect data on micromobility traffic, including bicycle and pedestrian traffic. The use of micromobility traffic data supports analyses regarding the impacts to the transportation network (on volumes and safety) resulting from the use of bicycles and other micromobility devices as alternative travel methods. The new technologies and procedures for traffic monitoring presented in the TMG are supplemented with practical examples from State experiences to improve traffic monitoring programs.The TMG is written to assist both experienced traffic data collection personnel and those who are less experienced or new to traffic data collection. Reference material that will benefit traffic data collection programs is found in the Appendices.This edition of the TMG also includes data formats for reporting traffic data, including the Individual Vehicle Record (IVR) format for reporting volume, vehicle speed, vehicle classification, and vehicle weight data. Data formats are also provided for reporting micromobility data for those agencies with capabilities to collect this type of data. This edition of the TMG has been developed with considerable input from State traffic data program managers. This approach has resulted in a practical guidance document that FHWA anticipates will be helpful to States in improving their business processes, technology, and equipment used to successfully manage their traffic monitoring programs.

Facebook

Twitter

Click to copy link

Link copied

Cite

AnthonyTherrien (2024). Website Traffic [Dataset]. https://www.kaggle.com/datasets/anthonytherrien/website-traffic

Website Traffic

Website Traffic and User Engagement Metrics

Explore at:

zip(65228 bytes)Available download formats

Dataset updated

Aug 5, 2024

Authors

AnthonyTherrien

License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Dataset Overview

This dataset provides detailed information on website traffic, including page views, session duration, bounce rate, traffic source, time spent on page, previous visits, and conversion rate.

Dataset Description

Page Views: The number of pages viewed during a session.
Session Duration: The total duration of the session in minutes.
Bounce Rate: The percentage of visitors who navigate away from the site after viewing only one page.
Traffic Source: The origin of the traffic (e.g., Organic, Social, Paid).
Time on Page: The amount of time spent on the specific page.
Previous Visits: The number of previous visits by the same visitor.
Conversion Rate: The percentage of visitors who completed a desired action (e.g., making a purchase).

Data Summary

Total Records: 2000
Total Features: 7

Key Features

Page Views: This feature indicates the engagement level of the visitors by showing how many pages they visit during their session.
Session Duration: This feature measures the length of time a visitor stays on the website, which can indicate the quality of the content.
Bounce Rate: A critical metric for understanding user behavior. A high bounce rate may indicate that visitors are not finding what they are looking for.
Traffic Source: Understanding where your traffic comes from can help in optimizing marketing strategies.
Time on Page: This helps in analyzing which pages are retaining visitors' attention the most.
Previous Visits: This can be used to analyze the loyalty of visitors and the effectiveness of retention strategies.
Conversion Rate: The ultimate metric for measuring the effectiveness of the website in achieving its goals.

Usage

This dataset can be used for various analyses such as:

Identifying key drivers of engagement and conversion.
Analyzing the effectiveness of different traffic sources.
Understanding user behavior patterns and optimizing the website accordingly.
Improving marketing strategies based on traffic source performance.
Enhancing user experience by analyzing time spent on different pages.

Acknowledgments

This dataset was generated for educational purposes and is not from a real website. It serves as a tool for learning data analysis and machine learning techniques.

Clear search

Close search

Google apps

Main menu

Website Traffic

Dataset Overview

Dataset Description

Data Summary

Key Features

Usage

Acknowledgments

Data from: Website Traffic Analysis

Website traffic strategies by industry and size of enterprise

Website Traffic Analysis Tool Report

Website Traffic Analysis Tool Report

NYC.gov Web Analytics

Network Traffic Analytics Market Size, Share, Trends Industry Analysis...

Measuring Dark Social Using Google Analytics - Data Analysis

Data from: In-browser and network traffic based web response time...

Monthly web traffic to vestiairecollective.com 2024

Network Traffic Dataset

Data from: Analysis of the Quantitative Impact of Social Networks General...

TMS daily traffic counts CSV

app-measurement.com Website Traffic, Ranking, Analytics [January 2026]

Monthly web traffic to hm.com in 2025

Share of e-commerce traffic worldwide 2019, by source and medium

Global share of human and bot web traffic 2013-2024

Network Traffic Analysis: Data and Code

GTT23: A 2023 Dataset of Genuine Tor Traces

Traffic Monitoring Guide 2022/Updated 2024

Website Traffic

Website Traffic and User Engagement Metrics

Dataset Overview

Dataset Description

Data Summary

Key Features

Usage

Acknowledgments