100+ datasets found
  1. Global data traffic 1H 2021, by category

    • statista.com
    Updated Nov 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Global data traffic 1H 2021, by category [Dataset]. https://www.statista.com/statistics/1312357/global-data-traffic-by-content-type/
    Explore at:
    Dataset updated
    Nov 27, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    In the first half of 2021, video accounted for over **** of global traffic. Social occupied the next largest share at **** percent, while web browsing accounted for around a *****. Audio accounted for only **** percent of traffic worldwide.

  2. Internet Traffic Data Set

    • kaggle.com
    zip
    Updated May 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asfand Yar (2023). Internet Traffic Data Set [Dataset]. https://www.kaggle.com/dsv/5658579
    Explore at:
    zip(826481210 bytes)Available download formats
    Dataset updated
    May 10, 2023
    Authors
    Asfand Yar
    Description

    This data set contains internet traffic data captured by an Internet Service Provider (ISP) using Mikrotik SDN Controller and packet sniffer tools. The data set includes traffic from over 2000 customers who use Fibre to the Home (FTTH) and Gpon internet connections. The data was collected over a period of several months and contains all traffic in its original format with headers and packets.

    The data set contains information on inbound and outbound traffic, including web browsing, email, file transfers, and more. The data set can be used for research in areas such as network security, traffic analysis, and machine learning.

    **Data Collection Method: ** The data was captured using Mikrotik SDN Controller and packet sniffer tools. These tools capture traffic data by monitoring network traffic in real-time. The data set contains all traffic data in its original format, including headers and packets.

    **Data Set Content: ** The data set is provided in a CSV format and includes the following fields:

    1. Timestamp: The date and time the traffic was captured
    2. Source IP Address: The IP address of the device that sent the traffic Destination IP Address: The IP address of the device that received the traffic Protocol: The network protocol used for the traffic (e.g. TCP, UDP) Source Port: The port used by the source device for the traffic Destination Port: The port used by the destination device for the traffic Packet Size: The size of the packet in bytes Payload: The payload data of the packet The data set contains a large volume of traffic data from over 2000 customers. The data is organized by timestamp and includes all traffic data in its original format, including headers and packets. The data set contains both inbound and outbound traffic, and covers various types of internet traffic, including web browsing, email, file transfers, and more. one of listed protocols: ipsec-ah - IPsec AH protocol *ipsec-esp - IPsec ESP protocol ddp - datagram delivery protocol egp - exterior gateway protocol ggp - gateway-gateway protocol gre - general routing encapsulation hmp - host monitoring protocol idpr-cmtp - idpr control message transport icmp - internet control message protocol icmpv6 - internet control message protocol v6 igmp - internet group management protocol ipencap - ip encapsulated in ip ipip - ip encapsulation encap - ip encapsulation iso-tp4 - iso transport protocol class 4 ospf - open shortest path first pup - parc universal packet protocol pim - protocol independent multicast rspf - radio shortest path first rdp - reliable datagram protocol st - st datagram mode tcp - transmission control protocol udp - user datagram protocol vmtp - versatile message transport vrrp - virtual router redundancy protocol xns-idp - xerox xns idp xtp - xpress transfer protocol

    MAC Protocol Examples 802.2 - 802.2 Frames (0x0004) arp - Address Resolution Protocol (0x0806) homeplug-av - HomePlug AV MME (0x88E1) ip - Internet Protocol version 4 (0x0800) ipv6 - Internet Protocol Version 6 (0x86DD) ipx - Internetwork Packet Exchange (0x8137) lldp - Link Layer Discovery Protocol (0x88CC) loop-protect - Loop Protect Protocol (0x9003) mpls-multicast - MPLS multicast (0x8848) mpls-unicast - MPLS unicast (0x8847) packing-compr - Encapsulated packets with compressed IP packing (0x9001) packing-simple - Encapsulated packets with simple IP packing (0x9000) pppoe - PPPoE Session Stage (0x8864) pppoe-discovery - PPPoE Discovery Stage (0x8863) rarp - Reverse Address Resolution Protocol (0x8035) service-vlan - Provider Bridging (IEEE 802.1ad) & Shortest Path Bridging IEEE 802.1aq (0x88A8) vlan - VLAN-tagged frame (IEEE 802.1Q) and Shortest Path Bridging IEEE 802.1aq with NNI compatibility (0x8100)

    **Data Usage: ** The data set can be used for research in areas such as network security, traffic analysis, and machine learning. Researchers can use the data to develop new algorithms for detecting and preventing cyber attacks, analyzing internet traffic patterns, and more.

    **Data Availability: ** If you are interested in using this data set for research purposes, please contact us at asfandyar250@gmail.com for more information and references. The data set is available for download on Kaggle and can be accessed by researchers who have obtained permission from the ISP.

    We hope this data set will be useful for researchers in the field of network security and traffic analysis. If you have any questions or need further information, please do not hesitate to contact us. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5985737%2F61c81ce9eb393f8fc7c15540c9819b95%2FData.PNG?generation=1683750473536727&alt=media" alt=""> You can use Wireshark or other software's to view files

  3. Number of internet users worldwide 2014-2029

    • statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department, Number of internet users worldwide 2014-2029 [Dataset]. https://www.statista.com/topics/1145/internet-usage-worldwide/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    World
    Description

    The global number of internet users in was forecast to continuously increase between 2024 and 2029 by in total 1.3 billion users (+23.66 percent). After the fifteenth consecutive increasing year, the number of users is estimated to reach 7 billion users and therefore a new peak in 2029. Notably, the number of internet users of was continuously increasing over the past years.Depicted is the estimated number of individuals in the country or region at hand, that use the internet. As the datasource clarifies, connection quality and usage frequency are distinct aspects, not taken into account here.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of internet users in countries like the Americas and Asia.

  4. Most common internet accesses by type in the U.S. 2025

    • statista.com
    Updated May 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Most common internet accesses by type in the U.S. 2025 [Dataset]. https://www.statista.com/topics/2237/internet-usage-in-the-united-states/
    Explore at:
    Dataset updated
    May 1, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    United States
    Description

    37 percent of U.S. respondents answer our survey on "Most common internet accesses by type" with "Broadband (DSL, cable, etc.)". The survey was conducted in 2025, among 15,496 consumers. Looking to gain valuable insights about users of internet providers worldwide? Check out our reports on consumers who use internet providers. These reports give readers a thorough picture of these customers, including their identities, preferences, opinions, and methods of communication.

  5. Share of mobile internet traffic in global regions 2025

    • statista.com
    Updated Nov 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Share of mobile internet traffic in global regions 2025 [Dataset]. https://www.statista.com/statistics/306528/share-of-mobile-internet-traffic-in-global-regions/
    Explore at:
    Dataset updated
    Nov 25, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Nov 2025
    Area covered
    Worldwide
    Description

    In November 2025, mobile devices excluding tablets accounted for over ***** percent of web page views worldwide. Meanwhile, over ***** percent of webpage views in Africa were generated via mobile. In contrast, just over half of web traffic in North America still took place via desktop connections, with mobile only accounting for ***** percent of total web traffic. While regional infrastructure remains an important factor in broadband vs. mobile coverage, most of the world has had their eyes on the recent 5G rollout across the globe, spearheaded by tech leaders China and the United States. The number of mobile 5G subscriptions worldwide is forecast to reach more than ***** billion by 2028. Social media: room for growth in Africa and southern Asia Overall, more than ** percent of the world’s mobile internet subscribers are also active on social media. A fast-growing market, with newcomers such as TikTok taking the world by storm, marketers have been cashing in on social media’s reach. Overall, social media penetration is highest in Europe and America, while in Africa and southern Asia, there is still room for growth. As of 2021, Facebook and Google-owned YouTube are the most popular social media platforms worldwide. Facebook and Instagram are most effective With nearly ***** billion users, it is no wonder that Facebook remains the social media avenue of choice for the majority of marketers across the world. Instagram, meanwhile, was the second most popular outlet. Both platforms are low-cost and support short-form content, known for its universal consumer appeal and answering to the most important benefits of using these kinds of platforms for business and advertising purposes.

  6. Network Traffic Dataset

    • kaggle.com
    zip
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ravikumar Gattu (2023). Network Traffic Dataset [Dataset]. https://www.kaggle.com/datasets/ravikumargattu/network-traffic-dataset
    Explore at:
    zip(6783827 bytes)Available download formats
    Dataset updated
    Oct 31, 2023
    Authors
    Ravikumar Gattu
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    The data presented here was obtained in a Kali Machine from University of Cincinnati,Cincinnati,OHIO by carrying out packet captures for 1 hour during the evening on Oct 9th,2023 using Wireshark.This dataset consists of 394137 instances were obtained and stored in a CSV (Comma Separated Values) file.This large dataset could be used utilised for different machine learning applications for instance classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

    The dataset can be used for a variety of machine learning tasks, such as network intrusion detection, traffic classification, and anomaly detection.

    Content :

    This network traffic dataset consists of 7 features.Each instance contains the information of source and destination IP addresses, The majority of the properties are numeric in nature, however there are also nominal and date kinds due to the Timestamp.

    The network traffic flow statistics (No. Time Source Destination Protocol Length Info) were obtained using Wireshark (https://www.wireshark.org/).

    Dataset Columns:

    No : Number of Instance. Timestamp : Timestamp of instance of network traffic Source IP: IP address of Source Destination IP: IP address of Destination Portocol: Protocol used by the instance Length: Length of Instance Info: Information of Traffic Instance

    Acknowledgements :

    I would like thank University of Cincinnati for giving the infrastructure for generation of network traffic data set.

    Ravikumar Gattu , Susmitha Choppadandi

    Inspiration : This dataset goes beyond the majority of network traffic classification datasets, which only identify the type of application (WWW, DNS, ICMP,ARP,RARP) that an IP flow contains. Instead, it generates machine learning models that can identify specific applications (like Tiktok,Wikipedia,Instagram,Youtube,Websites,Blogs etc.) from IP flow statistics (there are currently 25 applications in total).

    **Dataset License: ** CC0: Public Domain

    Dataset Usages : This dataset can be used for different machine learning applications in the field of cybersecurity such as classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

    ML techniques benefits from this Dataset :

    This dataset is highly useful because it consists of 394137 instances of network traffic data obtained by using the 25 applications on a public,private and Enterprise networks.Also,the dataset consists of very important features that can be used for most of the applications of Machine learning in cybersecurity.Here are few of the potential machine learning applications that could be benefited from this dataset are :

    1. Network Performance Monitoring : This large network traffic data set can be utilised for analysing the network traffic to identifying the network patterns in the network .This help in designing the network security algorithms for minimise the network probelms.

    2. Anamoly Detection : Large network traffic dataset can be utilised training the machine learning models for finding the irregularitues in the traffic which could help identify the cyber attacks.

    3.Network Intrusion Detection : This large dataset could be utilised for machine algorithms training and designing the models for detection of the traffic issues,Malicious traffic network attacks and DOS attacks as well.

  7. f

    YouTube Dataset on Mobile Streaming for Internet Traffic Modeling, Network...

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Apr 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Geißler, Stefan; Wamser, Florian; Loh, Frank; Hoßfeld, Tobias; Poignée, Fabian (2022). YouTube Dataset on Mobile Streaming for Internet Traffic Modeling, Network Management, and Streaming Analysis [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000199880
    Explore at:
    Dataset updated
    Apr 14, 2022
    Authors
    Geißler, Stefan; Wamser, Florian; Loh, Frank; Hoßfeld, Tobias; Poignée, Fabian
    Area covered
    YouTube
    Description

    Streaming is by far the predominant type of traffic in communication networks. With thispublic dataset, we provide 1,081 hours of time-synchronous video measurements at network, transport, and application layer with the native YouTube streaming client on mobile devices. The dataset includes 80 network scenarios with 171 different individual bandwidth settings measured in 5,181 runs with limited bandwidth, 1,939 runs with emulated 3G/4G traces, and 4,022 runs with pre-defined bandwidth changes. This corresponds to 332GB video payload. We present the most relevant quality indicators for scientific use, i.e., initial playback delay, streaming video quality, adaptive video quality changes, video rebuffering events, and streaming phases.

  8. Data from: HTTPS traffic classification

    • kaggle.com
    zip
    Updated Mar 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Đinh Ngọc Ân (2024). HTTPS traffic classification [Dataset]. https://www.kaggle.com/datasets/inhngcn/https-traffic-classification/data
    Explore at:
    zip(36287490 bytes)Available download formats
    Dataset updated
    Mar 11, 2024
    Authors
    Đinh Ngọc Ân
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    The people from Czech are publishing a dataset for the HTTPS traffic classification.

    Since the data were captured mainly in the real backbone network, they omitted IP addresses and ports. The datasets consist of calculated from bidirectional flows exported with flow probe Ipifixprobe. This exporter can export a sequence of packet lengths and times and a sequence of packet bursts and time. For more information, please visit ipfixprobe repository (Ipifixprobe).

    During research, they divided HTTPS into five categories: L -- Live Video Streaming, P -- Video Player, M -- Music Player, U -- File Upload, D -- File Download, W -- Website, and other traffic.

    They have chosen the service representatives known for particular traffic types based on the Alexa Top 1M list and Moz's list of the most popular 500 websites for each category. They also used several popular websites that primarily focus on the audience in Czech. The identified traffic classes and their representatives are provided below:

    Live Video Stream Twitch, Czech TV, YouTube Live Video Player DailyMotion, Stream.cz, Vimeo, YouTube Music Player AppleMusic, Spotify, SoundCloud File Upload/Download FileSender, OwnCloud, OneDrive, Google Drive Website and Other Traffic Websites from Alexa Top 1M list

  9. Website Traffic

    • kaggle.com
    zip
    Updated Aug 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AnthonyTherrien (2024). Website Traffic [Dataset]. https://www.kaggle.com/datasets/anthonytherrien/website-traffic
    Explore at:
    zip(65228 bytes)Available download formats
    Dataset updated
    Aug 5, 2024
    Authors
    AnthonyTherrien
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Overview

    This dataset provides detailed information on website traffic, including page views, session duration, bounce rate, traffic source, time spent on page, previous visits, and conversion rate.

    Dataset Description

    • Page Views: The number of pages viewed during a session.
    • Session Duration: The total duration of the session in minutes.
    • Bounce Rate: The percentage of visitors who navigate away from the site after viewing only one page.
    • Traffic Source: The origin of the traffic (e.g., Organic, Social, Paid).
    • Time on Page: The amount of time spent on the specific page.
    • Previous Visits: The number of previous visits by the same visitor.
    • Conversion Rate: The percentage of visitors who completed a desired action (e.g., making a purchase).

    Data Summary

    • Total Records: 2000
    • Total Features: 7

    Key Features

    1. Page Views: This feature indicates the engagement level of the visitors by showing how many pages they visit during their session.
    2. Session Duration: This feature measures the length of time a visitor stays on the website, which can indicate the quality of the content.
    3. Bounce Rate: A critical metric for understanding user behavior. A high bounce rate may indicate that visitors are not finding what they are looking for.
    4. Traffic Source: Understanding where your traffic comes from can help in optimizing marketing strategies.
    5. Time on Page: This helps in analyzing which pages are retaining visitors' attention the most.
    6. Previous Visits: This can be used to analyze the loyalty of visitors and the effectiveness of retention strategies.
    7. Conversion Rate: The ultimate metric for measuring the effectiveness of the website in achieving its goals.

    Usage

    This dataset can be used for various analyses such as:

    • Identifying key drivers of engagement and conversion.
    • Analyzing the effectiveness of different traffic sources.
    • Understanding user behavior patterns and optimizing the website accordingly.
    • Improving marketing strategies based on traffic source performance.
    • Enhancing user experience by analyzing time spent on different pages.

    Acknowledgments

    This dataset was generated for educational purposes and is not from a real website. It serves as a tool for learning data analysis and machine learning techniques.

  10. Global web traffic

    • kaggle.com
    zip
    Updated Feb 12, 2026
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maaz Shaikh (2026). Global web traffic [Dataset]. https://www.kaggle.com/datasets/maazshaikh05/global-web-traffic
    Explore at:
    zip(27700 bytes)Available download formats
    Dataset updated
    Feb 12, 2026
    Authors
    Maaz Shaikh
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    The Global Web Traffic Dataset is a synthetic dataset simulating 2,000 website visits from users across 20 countries. Each record represents a unique visit and includes date, country, traffic source, device type, browser, page visited, session duration, pages viewed, bounce, and conversion.

    This dataset is programmatically generated to mimic realistic user behavior. Devices, traffic sources, session durations, pages viewed, bounce rates, and conversions are generated using probability-weighted sampling and realistic ranges. All data is synthetic, containing no personal or sensitive information, making it safe and reusable for educational and professional purposes.

    Intended Uses:

    Exploratory Data Analysis (EDA): Identify trends in global web traffic.

    Data Visualization: Build dashboards, heatmaps, and charts.

    Machine Learning Practice: Train models for predicting bounce, session duration, or conversions.

    Educational & Research Projects: Learn web analytics, user behavior modeling, or data preprocessing.

  11. z

    Internet traffic data for different frame size ranges

    • zasobynauki.pl
    Updated 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aleksandra Knapińska; Piotr Lechowicz; Krzysztof Walkowiak (2020). Internet traffic data for different frame size ranges [Dataset]. https://zasobynauki.pl/zasoby/internet-traffic-data-for-different-frame-size-ranges,56566/
    Explore at:
    Dataset updated
    2020
    Authors
    Aleksandra Knapińska; Piotr Lechowicz; Krzysztof Walkowiak
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This resource includes input data used in the work "Machine-Learning Based Prediction of Multiple Types of Network Traffic" by Aleksandra Knapińska, Piotr Lechowicz, and Krzysztof Walkowiak; published in International Conference on Computational Science (ICCS) 2021, Lecture Notes in Computer Science, vol 12742. pp. 122-136. Springer, Cham. https://doi.org/10.1007/978-3-030-77961-0_12 The work was supported by the National Science Centre, Poland, under Grant 2019/35/B/ST7/04272. Both seattle_november.xml and seattle_december.xml files include internet traffic data from Seattle Internet Exchange Point. The european.xml file includes internet traffic data from one of the European Internet Exchange Points. Each file includes the traffic volume decomposed into specific frame size ranges. Each file starts with a metadata section providing general information. The period covered by a specific file is indicated by its 'start' and 'end' tags. They provide Unix timestamps in the GMT timezone. It should be noted that Seattle lies in the PST time zone, and the European IXP is located in the CET timezone, so the start and end times should be adjusted accordingly. The step parameter is given in seconds, so the samples are stored every 5 minutes in all three files. Each file has multiple columns providing traffic data in bits per second for different frame size ranges. Column names specify the ranges in bytes. The 'total' column stores information about the total aggregate traffic volume, which is a sum of values in all the remaining columns in each row.

  12. Most used websites and online services by type in the U.S. 2025

    • statista.com
    Updated May 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Most used websites and online services by type in the U.S. 2025 [Dataset]. https://www.statista.com/topics/2237/internet-usage-in-the-united-states/
    Explore at:
    Dataset updated
    May 1, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    United States
    Description

    72 percent of U.S. respondents answer our survey on "Most used websites and online services by type" with "Social media". The survey was conducted in 2025, among 15,502 consumers.

  13. Average daily mobile internet traffic per capita in Romania 2017-2024

    • statista.com
    Updated Nov 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Average daily mobile internet traffic per capita in Romania 2017-2024 [Dataset]. https://www.statista.com/statistics/1134617/romania-average-daily-internet-traffic-per-capita/
    Explore at:
    Dataset updated
    Nov 27, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Romania
    Description

    The average daily internet traffic per capita in Romania has been increasing over the observed period, for both broadband and mobile internet connections. As a result, the average daily mobile internet traffic per person was around *** megabytes in the first half of 2024.

  14. Data from: CESNET-QUIC22: A large one-month QUIC network traffic dataset...

    • data.niaid.nih.gov
    • nde-dev.biothings.io
    • +1more
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luxemburk, Jan; Hynek, Karel; Čejka, Tomáš; Lukačovič, Andrej; Šiška, Pavel (2024). CESNET-QUIC22: A large one-month QUIC network traffic dataset from backbone lines [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7409923
    Explore at:
    Dataset updated
    Feb 29, 2024
    Dataset provided by
    CESNEThttp://www.cesnet.cz/
    FIT Czech Technical University in Prague
    Authors
    Luxemburk, Jan; Hynek, Karel; Čejka, Tomáš; Lukačovič, Andrej; Šiška, Pavel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Please refer to the original data article for further data description: Jan Luxemburk et al. CESNET-QUIC22: A large one-month QUIC network traffic dataset from backbone lines, Data in Brief, 2023, 108888, ISSN 2352-3409, https://doi.org/10.1016/j.dib.2023.108888. We recommend using the CESNET DataZoo python library, which facilitates the work with large network traffic datasets. More information about the DataZoo project can be found in the GitHub repository https://github.com/CESNET/cesnet-datazoo. The QUIC (Quick UDP Internet Connection) protocol has the potential to replace TLS over TCP, which is the standard choice for reliable and secure Internet communication. Due to its design that makes the inspection of QUIC handshakes challenging and its usage in HTTP/3, there is an increasing demand for research in QUIC traffic analysis. This dataset contains one month of QUIC traffic collected in an ISP backbone network, which connects 500 large institutions and serves around half a million people. The data are delivered as enriched flows that can be useful for various network monitoring tasks. The provided server names and packet-level information allow research in the encrypted traffic classification area. Moreover, included QUIC versions and user agents (smartphone, web browser, and operating system identifiers) provide information for large-scale QUIC deployment studies. Data capture The data was captured in the flow monitoring infrastructure of the CESNET2 network. The capturing was done for four weeks between 31.10.2022 and 27.11.2022. The following list provides per-week flow count, capture period, and uncompressed size:

    W-2022-44

    Uncompressed Size: 19 GB Capture Period: 31.10.2022 - 6.11.2022 Number of flows: 32.6M W-2022-45

    Uncompressed Size: 25 GB Capture Period: 7.11.2022 - 13.11.2022 Number of flows: 42.6M W-2022-46

    Uncompressed Size: 20 GB Capture Period: 14.11.2022 - 20.11.2022 Number of flows: 33.7M W-2022-47

    Uncompressed Size: 25 GB Capture Period: 21.11.2022 - 27.11.2022 Number of flows: 44.1M CESNET-QUIC22

    Uncompressed Size: 89 GB Capture Period: 31.10.2022 - 27.11.2022 Number of flows: 153M

    Data description The dataset consists of network flows describing encrypted QUIC communications. Flows were created using ipfixprobe flow exporter and are extended with packet metadata sequences, packet histograms, and with fields extracted from the QUIC Initial Packet, which is the first packet of the QUIC connection handshake. The extracted handshake fields are the Server Name Indication (SNI) domain, the used version of the QUIC protocol, and the user agent string that is available in a subset of QUIC communications. Packet Sequences Flows in the dataset are extended with sequences of packet sizes, directions, and inter-packet times. For the packet sizes, we consider payload size after transport headers (UDP headers for the QUIC case). Packet directions are encoded as ±1, +1 meaning a packet sent from client to server, and -1 a packet from server to client. Inter-packet times depend on the location of communicating hosts, their distance, and on the network conditions on the path. However, it is still possible to extract relevant information that correlates with user interactions and, for example, with the time required for an API/server/database to process the received data and generate the response to be sent in the next packet. Packet metadata sequences have a length of 30, which is the default setting of the used flow exporter. We also derive three fields from each packet sequence: its length, time duration, and the number of roundtrips. The roundtrips are counted as the number of changes in the communication direction (from packet directions data); in other words, each client request and server response pair counts as one roundtrip. Flow statistics Flows also include standard flow statistics, which represent aggregated information about the entire bidirectional flow. The fields are: the number of transmitted bytes and packets in both directions, the duration of flow, and packet histograms. Packet histograms include binned counts of packet sizes and inter-packet times of the entire flow in both directions (more information in the PHISTS plugin documentation There are eight bins with a logarithmic scale; the intervals are 0-15, 16-31, 32-63, 64-127, 128-255, 256-511, 512-1024, >1024 [ms or B]. The units are milliseconds for inter-packet times and bytes for packet sizes. Moreover, each flow has its end reason - either it was idle, reached the active timeout, or ended due to other reasons. This corresponds with the official IANA IPFIX-specified values. The FLOW_ENDREASON_OTHER field represents the forced end and lack of resources reasons. The end of flow detected reason is not considered because it is not relevant for UDP connections. Dataset structure The dataset flows are delivered in compressed CSV files. CSV files contain one flow per row; data columns are summarized in the provided list below. For each flow data file, there is a JSON file with the number of saved and seen (before sampling) flows per service and total counts of all received (observed on the CESNET2 network), service (belonging to one of the dataset's services), and saved (provided in the dataset) flows. There is also the stats-week.json file aggregating flow counts of a whole week and the stats-dataset.json file aggregating flow counts for the entire dataset. Flow counts before sampling can be used to compute sampling ratios of individual services and to resample the dataset back to the original service distribution. Moreover, various dataset statistics, such as feature distributions and value counts of QUIC versions and user agents, are provided in the dataset-statistics folder. The mapping between services and service providers is provided in the servicemap.csv file, which also includes SNI domains used for ground truth labeling. The following list describes flow data fields in CSV files:

    ID: Unique identifier SRC_IP: Source IP address DST_IP: Destination IP address DST_ASN: Destination Autonomous System number SRC_PORT: Source port DST_PORT: Destination port PROTOCOL: Transport protocol QUIC_VERSION QUIC: protocol version QUIC_SNI: Server Name Indication domain QUIC_USER_AGENT: User agent string, if available in the QUIC Initial Packet TIME_FIRST: Timestamp of the first packet in format YYYY-MM-DDTHH-MM-SS.ffffff TIME_LAST: Timestamp of the last packet in format YYYY-MM-DDTHH-MM-SS.ffffff DURATION: Duration of the flow in seconds BYTES: Number of transmitted bytes from client to server BYTES_REV: Number of transmitted bytes from server to client PACKETS: Number of packets transmitted from client to server PACKETS_REV: Number of packets transmitted from server to client PPI: Packet metadata sequence in the format: [[inter-packet times], [packet directions], [packet sizes]] PPI_LEN: Number of packets in the PPI sequence PPI_DURATION: Duration of the PPI sequence in seconds PPI_ROUNDTRIPS: Number of roundtrips in the PPI sequence PHIST_SRC_SIZES: Histogram of packet sizes from client to server PHIST_DST_SIZES: Histogram of packet sizes from server to client PHIST_SRC_IPT: Histogram of inter-packet times from client to server PHIST_DST_IPT: Histogram of inter-packet times from server to client APP: Web service label CATEGORY: Service category FLOW_ENDREASON_IDLE: Flow was terminated because it was idle FLOW_ENDREASON_ACTIVE: Flow was terminated because it reached the active timeout FLOW_ENDREASON_OTHER: Flow was terminated for other reasons

    Link to other CESNET datasets

    https://www.liberouter.org/technology-v2/tools-services-datasets/datasets/ https://github.com/CESNET/cesnet-datazoo Please cite the original data article:

    @article{CESNETQUIC22, author = {Jan Luxemburk and Karel Hynek and Tomáš Čejka and Andrej Lukačovič and Pavel Šiška}, title = {CESNET-QUIC22: a large one-month QUIC network traffic dataset from backbone lines}, journal = {Data in Brief}, pages = {108888}, year = {2023}, issn = {2352-3409}, doi = {https://doi.org/10.1016/j.dib.2023.108888}, url = {https://www.sciencedirect.com/science/article/pii/S2352340923000069} }

  15. Average monthly mobile internet traffic per capita in Romania 2017-2023

    • statista.com
    Updated Nov 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Average monthly mobile internet traffic per capita in Romania 2017-2023 [Dataset]. https://www.statista.com/statistics/1189615/romania-average-monthly-mobile-internet-traffic-per-capita/
    Explore at:
    Dataset updated
    Nov 27, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Romania
    Description

    The average monthly internet traffic per capita in Romania had been increasing over the observed period, for both cable and mobile internet connections. As a result, the average monthly mobile internet traffic per person increased more than **** times since 2018, reaching **** GB by the end of 2023.

  16. m

    Data from: Packet-level and IEEE 802.11 MAC frame-level Network Traffic...

    • data.mendeley.com
    • narcis.nl
    Updated Jan 14, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rajarshi Roy Chowdhury (2021). Packet-level and IEEE 802.11 MAC frame-level Network Traffic Traces Data of the D-Link IoT devices [Dataset]. http://doi.org/10.17632/84cc8grtkt.1
    Explore at:
    Dataset updated
    Jan 14, 2021
    Authors
    Rajarshi Roy Chowdhury
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset presents network traffic traces data of the 14 D-Link IoT devices from different types including camera, network camera, smart-plug, door-window sensor, and home-hub. It consists of:

    • Network packet traces (inbound and outbound traffic) and
    • IEEE 802.11 MAC frame traces.
    

    The experimental testbed was set-up in the Network Systems and Signal Processing (NSSP) laboratory at Universiti Brunei Darussalam (UBD) to collect all the network traffic traces from 9th September 2020 to 10th January 2021 including an access point on a laptop. The network traffic traces were captured passively observing the Ethernet interface and the WiFi interface at the access point.

    In packet traces, typical communication protocols, such as TCP, UDP, IP, ICMP, ARP, DNS, SSDP, TLS/SSL etc, data are captured which IoT devices use for communication on the Internet. In the probe request frame (a subtype of management frames) traces, data are recorded which IoT devices use to connect access point on the local area network.

    The authors would like to thank the Faculty of Integrated Technologies, Universiti Brunei Darussalam, for the support to conduct this research experiment in the Network Systems and Signal Processing laboratory.

  17. SimilarWeb Top Websites [April 2024]

    • kaggle.com
    zip
    Updated Sep 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammed Kamal Alsyd (2024). SimilarWeb Top Websites [April 2024] [Dataset]. https://www.kaggle.com/datasets/mohammedkamalalsyd/similarweb-top-websites
    Explore at:
    zip(480522 bytes)Available download formats
    Dataset updated
    Sep 21, 2024
    Authors
    Mohammed Kamal Alsyd
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset provides detailed insights into website traffic metrics and user engagement statistics, collected from SimilarWeb. The data includes information on various websites, such as rank, category, average visit duration, pages per visit, and bounce rate. This data aims to facilitate an understanding of online behavior and performance trends across different sectors, making it a valuable resource for researchers, marketers, and data analysts. The dataset is ideal for exploring patterns in web traffic and user interaction and conducting comparative analyses across various website categories.

    Important Warning: Running this code within Kaggle may result in a ban, as scraping activities are prohibited on the platform. There is no guarantee that any ban will be lifted, as Kaggle staff may interpret scraping as a denial-of-service attack. Although I have implemented measures to reduce server load, such as adding sleep intervals, it is advisable to run this code locally to ensure compliance with Kaggle's policies.

  18. e

    Internet traffic data from Seattle Internet Exchange Point for different...

    • azon.e-science.pl
    • zasobynauki.pl
    Updated 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aleksandra Knapińska; Piotr Lechowicz; Krzysztof Walkowiak; Weronika Węgier (2021). Internet traffic data from Seattle Internet Exchange Point for different frame size ranges (2021) [Dataset]. https://azon.e-science.pl/zasoby/internet-traffic-data-from-seattle-internet-exchange-point-for-different-frame-size-ranges-2021,67873/
    Explore at:
    Dataset updated
    2021
    Authors
    Aleksandra Knapińska; Piotr Lechowicz; Krzysztof Walkowiak; Weronika Węgier
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This resource includes input data used in the work "Long-term prediction of multiple types of time-varying network traffic using chunk-based ensemble learning" by Aleksandra Knapińska, Piotr Lechowicz, Weronika Węgier, and Krzysztof Walkowiak. The work was supported by the National Science Centre, Poland, under Grants 2019/35/B/ST7/04272, 2018/31/D/ST6/0304, and 2019/35/B/ST6/04442.
    The SIX2021.xml file includes internet traffic data from the Seattle Internet Exchange Point collected for one year. The file contains information about the traffic volume decomposed into specific frame size ranges. It starts with a metadata section providing general information. The covered period is indicated by the 'start' and 'end' tags. They provide Unix timestamps in the GMT timezone. It should be noted that Seattle lies in the PST time zone, so the start and end times should be adjusted accordingly. The step parameter is given in seconds, so the samples are stored every 5 minutes. The file has multiple columns providing traffic data in bits per second for different frame size ranges. Column names specify the ranges in bytes. The 'total' column stores information about the total aggregate traffic volume, which is a sum of values in all the remaining columns in each row.

  19. Share of global mobile website traffic 2015-2025

    • statista.com
    Updated Nov 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Share of global mobile website traffic 2015-2025 [Dataset]. https://www.statista.com/statistics/277125/share-of-website-traffic-coming-from-mobile-devices/
    Explore at:
    Dataset updated
    Nov 19, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    In the second quarter of 2025, mobile devices (excluding tablets) accounted for 62.54 percent of global website traffic. Since consistently maintaining a share of around 50 percent beginning in 2017, mobile usage surpassed this threshold in 2020 and has demonstrated steady growth in its dominance of global web access. Mobile traffic Due to low infrastructure and financial restraints, many emerging digital markets skipped the desktop internet phase entirely and moved straight onto mobile internet via smartphone and tablet devices. India is a prime example of a market with a significant mobile-first online population. Other countries with a significant share of mobile internet traffic include Nigeria, Ghana and Kenya. In most African markets, mobile accounts for more than half of the web traffic. By contrast, mobile only makes up around 45.49 percent of online traffic in the United States. Mobile usage The most popular mobile internet activities worldwide include watching movies or videos online, e-mail usage and accessing social media. Apps are a very popular way to watch video on the go and the most-downloaded entertainment apps in the Apple App Store are Netflix, Tencent Video and Amazon Prime Video.

  20. a

    Traffic Control Permits Web Map WAZE

    • hub.arcgis.com
    • egisdata-dallasgis.hub.arcgis.com
    • +1more
    Updated Nov 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Dallas GIS Services (2020). Traffic Control Permits Web Map WAZE [Dataset]. https://hub.arcgis.com/maps/d27592d203d24e588107d37394e7d95d
    Explore at:
    Dataset updated
    Nov 20, 2020
    Dataset authored and provided by
    City of Dallas GIS Services
    Area covered
    Description

    This web map contains Traffic Control Permits lines and Points.Web map is used in this dashboard - https://dallasgis.maps.arcgis.com/home/item.html?id=725bef2567844d099b98ebdd3e5bc26bTraffic Control Permits Hosted Feature Service - https://dallasgis.maps.arcgis.com/home/item.html?id=ab1ba4920c45472bab563fd86f4726e8Waze Data Dallas Construction : Filtered from https://dallasgis.maps.arcgis.com/home/item.html?id=f71ddb118f8f4200ac6193b585df25bb layer with parameter Incident sub Type = "Construction" and City Name = "Dallas, TX".Waze Data Jam: Filtered from https://dallasgis.maps.arcgis.com/home/item.html?id=f71ddb118f8f4200ac6193b585df25bb layer with parameter Incident type = Jam and City Name = "Dallas, TX".This data is primarily crowdsourced, meaning it comes from Waze users who report incidents, traffic conditions, and other road-related information.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista (2025). Global data traffic 1H 2021, by category [Dataset]. https://www.statista.com/statistics/1312357/global-data-traffic-by-content-type/
Organization logo

Global data traffic 1H 2021, by category

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Nov 27, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description

In the first half of 2021, video accounted for over **** of global traffic. Social occupied the next largest share at **** percent, while web browsing accounted for around a *****. Audio accounted for only **** percent of traffic worldwide.

Search
Clear search
Close search
Google apps
Main menu