100+ datasets found
  1. Network Traffic Dataset

    • kaggle.com
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ravikumar Gattu (2023). Network Traffic Dataset [Dataset]. https://www.kaggle.com/datasets/ravikumargattu/network-traffic-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ravikumar Gattu
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    The data presented here was obtained in a Kali Machine from University of Cincinnati,Cincinnati,OHIO by carrying out packet captures for 1 hour during the evening on Oct 9th,2023 using Wireshark.This dataset consists of 394137 instances were obtained and stored in a CSV (Comma Separated Values) file.This large dataset could be used utilised for different machine learning applications for instance classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

    The dataset can be used for a variety of machine learning tasks, such as network intrusion detection, traffic classification, and anomaly detection.

    Content :

    This network traffic dataset consists of 7 features.Each instance contains the information of source and destination IP addresses, The majority of the properties are numeric in nature, however there are also nominal and date kinds due to the Timestamp.

    The network traffic flow statistics (No. Time Source Destination Protocol Length Info) were obtained using Wireshark (https://www.wireshark.org/).

    Dataset Columns:

    No : Number of Instance. Timestamp : Timestamp of instance of network traffic Source IP: IP address of Source Destination IP: IP address of Destination Portocol: Protocol used by the instance Length: Length of Instance Info: Information of Traffic Instance

    Acknowledgements :

    I would like thank University of Cincinnati for giving the infrastructure for generation of network traffic data set.

    Ravikumar Gattu , Susmitha Choppadandi

    Inspiration : This dataset goes beyond the majority of network traffic classification datasets, which only identify the type of application (WWW, DNS, ICMP,ARP,RARP) that an IP flow contains. Instead, it generates machine learning models that can identify specific applications (like Tiktok,Wikipedia,Instagram,Youtube,Websites,Blogs etc.) from IP flow statistics (there are currently 25 applications in total).

    **Dataset License: ** CC0: Public Domain

    Dataset Usages : This dataset can be used for different machine learning applications in the field of cybersecurity such as classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

    ML techniques benefits from this Dataset :

    This dataset is highly useful because it consists of 394137 instances of network traffic data obtained by using the 25 applications on a public,private and Enterprise networks.Also,the dataset consists of very important features that can be used for most of the applications of Machine learning in cybersecurity.Here are few of the potential machine learning applications that could be benefited from this dataset are :

    1. Network Performance Monitoring : This large network traffic data set can be utilised for analysing the network traffic to identifying the network patterns in the network .This help in designing the network security algorithms for minimise the network probelms.

    2. Anamoly Detection : Large network traffic dataset can be utilised training the machine learning models for finding the irregularitues in the traffic which could help identify the cyber attacks.

    3.Network Intrusion Detection : This large dataset could be utilised for machine algorithms training and designing the models for detection of the traffic issues,Malicious traffic network attacks and DOS attacks as well.

  2. Internet Traffic Data Set

    • kaggle.com
    Updated May 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asfand Yar (2023). Internet Traffic Data Set [Dataset]. http://doi.org/10.34740/kaggle/dsv/5658579
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 10, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Asfand Yar
    Description

    This data set contains internet traffic data captured by an Internet Service Provider (ISP) using Mikrotik SDN Controller and packet sniffer tools. The data set includes traffic from over 2000 customers who use Fibre to the Home (FTTH) and Gpon internet connections. The data was collected over a period of several months and contains all traffic in its original format with headers and packets.

    The data set contains information on inbound and outbound traffic, including web browsing, email, file transfers, and more. The data set can be used for research in areas such as network security, traffic analysis, and machine learning.

    **Data Collection Method: ** The data was captured using Mikrotik SDN Controller and packet sniffer tools. These tools capture traffic data by monitoring network traffic in real-time. The data set contains all traffic data in its original format, including headers and packets.

    **Data Set Content: ** The data set is provided in a CSV format and includes the following fields:

    1. Timestamp: The date and time the traffic was captured
    2. Source IP Address: The IP address of the device that sent the traffic Destination IP Address: The IP address of the device that received the traffic Protocol: The network protocol used for the traffic (e.g. TCP, UDP) Source Port: The port used by the source device for the traffic Destination Port: The port used by the destination device for the traffic Packet Size: The size of the packet in bytes Payload: The payload data of the packet The data set contains a large volume of traffic data from over 2000 customers. The data is organized by timestamp and includes all traffic data in its original format, including headers and packets. The data set contains both inbound and outbound traffic, and covers various types of internet traffic, including web browsing, email, file transfers, and more. one of listed protocols: ipsec-ah - IPsec AH protocol *ipsec-esp - IPsec ESP protocol ddp - datagram delivery protocol egp - exterior gateway protocol ggp - gateway-gateway protocol gre - general routing encapsulation hmp - host monitoring protocol idpr-cmtp - idpr control message transport icmp - internet control message protocol icmpv6 - internet control message protocol v6 igmp - internet group management protocol ipencap - ip encapsulated in ip ipip - ip encapsulation encap - ip encapsulation iso-tp4 - iso transport protocol class 4 ospf - open shortest path first pup - parc universal packet protocol pim - protocol independent multicast rspf - radio shortest path first rdp - reliable datagram protocol st - st datagram mode tcp - transmission control protocol udp - user datagram protocol vmtp - versatile message transport vrrp - virtual router redundancy protocol xns-idp - xerox xns idp xtp - xpress transfer protocol

    MAC Protocol Examples 802.2 - 802.2 Frames (0x0004) arp - Address Resolution Protocol (0x0806) homeplug-av - HomePlug AV MME (0x88E1) ip - Internet Protocol version 4 (0x0800) ipv6 - Internet Protocol Version 6 (0x86DD) ipx - Internetwork Packet Exchange (0x8137) lldp - Link Layer Discovery Protocol (0x88CC) loop-protect - Loop Protect Protocol (0x9003) mpls-multicast - MPLS multicast (0x8848) mpls-unicast - MPLS unicast (0x8847) packing-compr - Encapsulated packets with compressed IP packing (0x9001) packing-simple - Encapsulated packets with simple IP packing (0x9000) pppoe - PPPoE Session Stage (0x8864) pppoe-discovery - PPPoE Discovery Stage (0x8863) rarp - Reverse Address Resolution Protocol (0x8035) service-vlan - Provider Bridging (IEEE 802.1ad) & Shortest Path Bridging IEEE 802.1aq (0x88A8) vlan - VLAN-tagged frame (IEEE 802.1Q) and Shortest Path Bridging IEEE 802.1aq with NNI compatibility (0x8100)

    **Data Usage: ** The data set can be used for research in areas such as network security, traffic analysis, and machine learning. Researchers can use the data to develop new algorithms for detecting and preventing cyber attacks, analyzing internet traffic patterns, and more.

    **Data Availability: ** If you are interested in using this data set for research purposes, please contact us at asfandyar250@gmail.com for more information and references. The data set is available for download on Kaggle and can be accessed by researchers who have obtained permission from the ISP.

    We hope this data set will be useful for researchers in the field of network security and traffic analysis. If you have any questions or need further information, please do not hesitate to contact us. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5985737%2F61c81ce9eb393f8fc7c15540c9819b95%2FData.PNG?generation=1683750473536727&alt=media" alt=""> You can use Wireshark or other software's to view files

  3. Monthly mobile data traffic per user MENA 2018-2030

    • statista.com
    Updated Jul 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Monthly mobile data traffic per user MENA 2018-2030 [Dataset]. https://www.statista.com/statistics/1017457/mena-mobile-data-traffic-in-per-user/
    Explore at:
    Dataset updated
    Jul 1, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    MENA
    Description

    By 2030, the average mobile data connection was forecast to generate almost ** gigabytes of traffic per month in the Middle East and North Africa (MENA), increasing from *** gigabytes in 2023. The monthly mobile data traffic per subscriber has experienced a considerable growth from *** gigabytes in 2018.

  4. Most-valuable apps based on user traffic value in China 2024

    • statista.com
    Updated Jul 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Most-valuable apps based on user traffic value in China 2024 [Dataset]. https://www.statista.com/statistics/1362767/china-leading-apps-based-on-user-traffic-value/
    Explore at:
    Dataset updated
    Jul 8, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2024
    Area covered
    China
    Description

    Based on its user traffic value of **** billion yuan, WeChat ranked first among all Chinese mobile applications as of March 2024. In contrast, QQ, Tencent's other instant message app, generated a user traffic value of less than ** billion yuan.

  5. m

    Network traffic and code for machine learning classification

    • data.mendeley.com
    Updated Feb 20, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Víctor Labayen (2020). Network traffic and code for machine learning classification [Dataset]. http://doi.org/10.17632/5pmnkshffm.2
    Explore at:
    Dataset updated
    Feb 20, 2020
    Authors
    Víctor Labayen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset is a set of network traffic traces in pcap/csv format captured from a single user. The traffic is classified in 5 different activities (Video, Bulk, Idle, Web, and Interactive) and the label is shown in the filename. There is also a file (mapping.csv) with the mapping of the host's IP address, the csv/pcap filename and the activity label.

    Activities:

    Interactive: applications that perform real-time interactions in order to provide a suitable user experience, such as editing a file in google docs and remote CLI's sessions by SSH. Bulk data transfer: applications that perform a transfer of large data volume files over the network. Some examples are SCP/FTP applications and direct downloads of large files from web servers like Mediafire, Dropbox or the university repository among others. Web browsing: contains all the generated traffic while searching and consuming different web pages. Examples of those pages are several blogs and new sites and the moodle of the university. Vídeo playback: contains traffic from applications that consume video in streaming or pseudo-streaming. The most known server used are Twitch and Youtube but the university online classroom has also been used. Idle behaviour: is composed by the background traffic generated by the user computer when the user is idle. This traffic has been captured with every application closed and with some opened pages like google docs, YouTube and several web pages, but always without user interaction.

    The capture is performed in a network probe, attached to the router that forwards the user network traffic, using a SPAN port. The traffic is stored in pcap format with all the packet payload. In the csv file, every non TCP/UDP packet is filtered out, as well as every packet with no payload. The fields in the csv files are the following (one line per packet): Timestamp, protocol, payload size, IP address source and destination, UDP/TCP port source and destination. The fields are also included as a header in every csv file.

    The amount of data is stated as follows:

    Bulk : 19 traces, 3599 s of total duration, 8704 MBytes of pcap files Video : 23 traces, 4496 s, 1405 MBytes Web : 23 traces, 4203 s, 148 MBytes Interactive : 42 traces, 8934 s, 30.5 MBytes Idle : 52 traces, 6341 s, 0.69 MBytes

    The code of our machine learning approach is also included. There is a README.txt file with the documentation of how to use the code.

  6. Total global visitor traffic to user-generated content websites 2024

    • statista.com
    Updated Aug 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Total global visitor traffic to user-generated content websites 2024 [Dataset]. https://www.statista.com/statistics/1328702/web-visitor-traffic-top-websites-ugc/
    Explore at:
    Dataset updated
    Aug 20, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2024
    Area covered
    Worldwide
    Description

    In March 2024, the video platform YouTube reported around 32.5 billion visits from global users. Meta-owned Facebook.com reported around 16.1 billion visits from global users, as Instagram.com and Twitter.com followed, each with 7 billion and 6.1 billion visits from users worldwide during the examined month. Wikipedia.org, which hosts users-generated encyclopedic entries, recorded around 4.4 billion visits, while news aggregator and community platform Reddit.com saw approximately 2.2 billion visits during the examined period.

  7. a

    TMS daily traffic counts CSV

    • hub.arcgis.com
    Updated Aug 30, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waka Kotahi (2020). TMS daily traffic counts CSV [Dataset]. https://hub.arcgis.com/datasets/9cb86b342f2d4f228067a7437a7f7313
    Explore at:
    Dataset updated
    Aug 30, 2020
    Dataset authored and provided by
    Waka Kotahi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    You can also access an API version of this dataset.

    TMS

    (traffic monitoring system) daily-updated traffic counts API

    Important note: due to the size of this dataset, you won't be able to open it fully in Excel. Use notepad / R / any software package which can open more than a million rows.

    Data reuse caveats: as per license.

    Data quality

    statement: please read the accompanying user manual, explaining:

    how

     this data is collected identification 
    
     of count stations traffic 
    
     monitoring technology monitoring 
    
     hierarchy and conventions typical 
    
     survey specification data 
    
     calculation TMS 
    
     operation. 
    

    Traffic

    monitoring for state highways: user manual

    [PDF 465 KB]

    The data is at daily granularity. However, the actual update

    frequency of the data depends on the contract the site falls within. For telemetry

    sites it's once a week on a Wednesday. Some regional sites are fortnightly, and

    some monthly or quarterly. Some are only 4 weeks a year, with timing depending

    on contractors’ programme of work.

    Data quality caveats: you must use this data in

    conjunction with the user manual and the following caveats.

    The

     road sensors used in data collection are subject to both technical errors and 
    
     environmental interference.Data 
    
     is compiled from a variety of sources. Accuracy may vary and the data 
    
     should only be used as a guide.As 
    
     not all road sections are monitored, a direct calculation of Vehicle 
    
     Kilometres Travelled (VKT) for a region is not possible.Data 
    
     is sourced from Waka Kotahi New Zealand Transport Agency TMS data.For 
    
     sites that use dual loops classification is by length. Vehicles with a length of less than 5.5m are 
    
     classed as light vehicles. Vehicles over 11m long are classed as heavy 
    
     vehicles. Vehicles between 5.5 and 11m are split 50:50 into light and 
    
     heavy.In September 2022, the National Telemetry contract was handed to a new contractor. During the handover process, due to some missing documents and aged technology, 40 of the 96 national telemetry traffic count sites went offline. Current contractor has continued to upload data from all active sites and have gradually worked to bring most offline sites back online. Please note and account for possible gaps in data from National Telemetry Sites. 
    

    The NZTA Vehicle

    Classification Relationships diagram below shows the length classification (typically dual loops) and axle classification (typically pneumatic tube counts),

    and how these map to the Monetised benefits and costs manual, table A37,

    page 254.

    Monetised benefits and costs manual [PDF 9 MB]

    For the full TMS

    classification schema see Appendix A of the traffic counting manual vehicle

    classification scheme (NZTA 2011), below.

    Traffic monitoring for state highways: user manual [PDF 465 KB]

    State highway traffic monitoring (map)

    State highway traffic monitoring sites

  8. reddit.com Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). reddit.com Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/reddit.com/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    reddit.com is ranked #5 in US with 4.66B Traffic. Categories: Online Services. Learn more about website traffic, market share, and more!

  9. R

    Carla Traffic Dataset

    • universe.roboflow.com
    zip
    Updated Apr 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GP (2023). Carla Traffic Dataset [Dataset]. https://universe.roboflow.com/gp-rspur/carla-traffic-dataset-uao1b
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 15, 2023
    Dataset authored and provided by
    GP
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Car Pedestrian TrafficLight Bounding Boxes
    Description

    Here are a few use cases for this project:

    1. Autonomous Vehicles Navigation: The "Carla traffic dataset" can be used to develop and improve algorithms for autonomous vehicles, enabling them to effectively identify other road users, traffic lights, and various traffic signs, improving the cars’ ability to navigate safely in different weather conditions including fog.

    2. Traffic Management Systems: The dataset could be leveraged to create advanced traffic management systems, identifying car, bike, or pedestrian movement, detecting traffic light states, and understanding if road users respect speed limits (30, 60, 90 km/h signs). This could improve urban traffic flow and increase overall road safety.

    3. Driver Assistance Systems: The dataset could be used to develop advanced driver assistance systems (ADAS) that could alert drivers of pedestrians, other vehicles, traffic signs, and the status of traffic lights, particularly in foggy or difficult conditions.

    4. Safety Testing for Vehicle Manufacturers: Companies manufacturing cars, bikes, or motorbikes could use the data to carry out safety testing under different situations, including different weather conditions and traffic light changes.

    5. Virtual Driving Simulation: Game developers or driving schools could use this model to develop realistic driving simulations. The players or trainee drivers would need to respond correctly and promptly to real-world traffic situations like recognizing speed signs, traffic lights, and other road users.

  10. Share of global mobile website traffic 2015-2024

    • statista.com
    Updated Jan 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Share of global mobile website traffic 2015-2024 [Dataset]. https://www.statista.com/statistics/277125/share-of-website-traffic-coming-from-mobile-devices/
    Explore at:
    Dataset updated
    Jan 28, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    Mobile accounts for approximately half of web traffic worldwide. In the last quarter of 2024, mobile devices (excluding tablets) generated 62.54 percent of global website traffic. Mobiles and smartphones consistently hoovered around the 50 percent mark since the beginning of 2017, before surpassing it in 2020. Mobile traffic Due to low infrastructure and financial restraints, many emerging digital markets skipped the desktop internet phase entirely and moved straight onto mobile internet via smartphone and tablet devices. India is a prime example of a market with a significant mobile-first online population. Other countries with a significant share of mobile internet traffic include Nigeria, Ghana and Kenya. In most African markets, mobile accounts for more than half of the web traffic. By contrast, mobile only makes up around 45.49 percent of online traffic in the United States. Mobile usage The most popular mobile internet activities worldwide include watching movies or videos online, e-mail usage and accessing social media. Apps are a very popular way to watch video on the go and the most-downloaded entertainment apps in the Apple App Store are Netflix, Tencent Video and Amazon Prime Video.

  11. Pedestrians and Cyclists in Road Traffic: Trajectories, 3D Poses and...

    • zenodo.org
    • explore.openaire.eu
    zip
    Updated Jun 8, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Viktor Kress; Viktor Kress; Stefan Zernetsch; Stefan Zernetsch; Maarten Bieshaar; Maarten Bieshaar; Günther Reitberger; Günther Reitberger; Erich Fuchs; Konrad Doll; Konrad Doll; Bernhard Sick; Bernhard Sick; Erich Fuchs (2021). Pedestrians and Cyclists in Road Traffic: Trajectories, 3D Poses and Semantic Maps [Dataset]. http://doi.org/10.5281/zenodo.4898838
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 8, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Viktor Kress; Viktor Kress; Stefan Zernetsch; Stefan Zernetsch; Maarten Bieshaar; Maarten Bieshaar; Günther Reitberger; Günther Reitberger; Erich Fuchs; Konrad Doll; Konrad Doll; Bernhard Sick; Bernhard Sick; Erich Fuchs
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset consists of more than 2300 trajectories of pedestrians and 1000 trajectories of cyclists recorded by a research vehicle of the University of Applied Sciences Aschaffenburg (Kooperative Automatisierte Verkehrssysteme) in urban traffic. In addition to the actual trajectory, the data set contains 3D poses, a representation of the body posture in three-dimensional space, and semantic maps describing the surrounding of the respective vulnerable road user (VRU).

    The trajectories were sampled using a sliding window approach and split into a training, validation, and test dataset. Each sample contains the trajectory, 3D poses and semantic maps of the past second, as well as the sought future trajectory and semantic maps for the future 2.52 s. In addition, each pattern is assigned to a current type of motion. The motion types were annotated manually. For a more detailed description of the dataset, please refer to the following publication:

    Viktor Kress, Fabian Jeske, Stefan Zernetsch, Konrad Doll, Bernhard Sick: Pose and Semantic Map Based Probabilistic Forecast of Vulnerable Road Users' Trajectories. 2021, arXiv: 2106.02598, https://arxiv.org/abs/2106.02598

    We provide files for the training/validation dataset and the test dataset for pedestrians and cyclists, respectively. To read the provided data, unzip the files first. Each file contains a zarr directory. Zarr is a format for the storage of chunked, compressed, N-dimensional arrays (https://zarr.readthedocs.io). To read the data:

    import zarr
    
    data = zarr.open(

    Each zarr directory contains the following keys:

    Key:
      pre_trajectories_and_poses: input trajectories of 13 body joint positions, format: [sample, timestep, x,y,z coordinates (first 13 coordinates: x, 14- 26: y, 27:39: z)]
      pre_smaps: input semantic maps, format: [sample, timestep (-0.96s, -0.48a, 0.00s)], codes: static obstacles: 0, dynamic obstacles: 1, person: 2, sidewalk: 3, road: 4, walkable vegetation: 5, unknown obstacle: 6, unknown free space: 7, unkown: 8
      pos_trajectories: ground truth future trajectories of the head, format: [sample, x,y coordinates (first 63 coordinates: x, 64- 126: y for the timesteps +0.04s, +0.08s, ..., +2.52s))]
      pos_smaps: future semantic maps, format: [sample, timestep (+0.44s, +0.96s, +1.48s, +2.00s, 2.52s)]
      fold: affiliation to training/validation dataset, format: [sample], codes: test set: 0, validation set: 1, training set: 2
      augmentation: affiliation to the augmentation loop (0-2), format: [sample]
      move, start, stop, wait, tl, tr: current motion type as boolean arrays, format: [sample]

    This work was supported by “Zentrum Digitalisierung.Bayern”. In addition, the work is backed by the project DeCoInt2 , supported by the German Research Foundation (DFG) within the priority program SPP 1835: “Kooperativ interagierende Automobile”, grant numbers DO 1186/1-2 and SI 674/11-2.

  12. amazon.com Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    • stb2.digiseotools.com
    Updated Aug 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). amazon.com Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/amazon.com/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    amazon.com is ranked #3 in US with 2.82B Traffic. Categories: Online Services. Learn more about website traffic, market share, and more!

  13. faire.com Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). faire.com Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/faire.com/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    faire.com is ranked #2885 in US with 5.89M Traffic. Categories: Retail. Learn more about website traffic, market share, and more!

  14. chatgpt.com Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    • stb2.digiseotools.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). chatgpt.com Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/chatgpt.com/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    chatgpt.com is ranked #10 in US with 5.24B Traffic. Categories: AI. Learn more about website traffic, market share, and more!

  15. P

    People Counting in Retail Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). People Counting in Retail Report [Dataset]. https://www.marketreportanalytics.com/reports/people-counting-in-retail-75211
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Apr 10, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    IN
    Variables measured
    Market Size
    Description

    The retail people counting market, valued at $1556 million in 2025, is experiencing robust growth, projected to expand at a compound annual growth rate (CAGR) of 8.7% from 2025 to 2033. This expansion is fueled by several key factors. Firstly, the increasing adoption of advanced technologies like AI-powered video analytics, WiFi and Bluetooth sensing, and infrared sensors provides retailers with more accurate and granular data on customer traffic patterns. This allows for optimized store layouts, staffing levels, and marketing campaigns, leading to improved operational efficiency and enhanced customer experiences. Secondly, the growing demand for data-driven decision-making across the retail sector is driving the adoption of people counting systems. Retailers are increasingly realizing the importance of understanding customer behavior to personalize their offerings and optimize their strategies for better profitability. Finally, the increasing availability of affordable and user-friendly people counting solutions, including cloud-based platforms and mobile applications, is making this technology accessible to a wider range of businesses, from small and medium-sized enterprises (SMEs) to large multinational corporations. While the market faces challenges such as the initial investment costs associated with implementing these systems and concerns about data privacy, these are being mitigated by the long-term return on investment (ROI) generated through optimized operations and improved sales conversions. The market is segmented by application (SMEs and large enterprises) and technology (Wi-Fi and Bluetooth sensing, video-based counting, infrared sensors, time-of-flight sensors, and others). Key players in the market, including V-Count, Visionarea, Beonic (Blix), Retail Next, and ShopperTrak, are constantly innovating and expanding their product offerings to cater to the evolving needs of retailers. The competitive landscape is dynamic, with ongoing mergers, acquisitions, and the development of new technologies driving market evolution. The continued focus on enhancing the customer experience and leveraging data analytics will ensure sustained growth in the retail people counting market throughout the forecast period.

  16. Latin America: mobile data traffic per smartphone user 2023 vs. 2030

    • statista.com
    Updated Jul 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Latin America: mobile data traffic per smartphone user 2023 vs. 2030 [Dataset]. https://www.statista.com/statistics/218533/mobile-data-traffic-per-capita-in-latin-america/
    Explore at:
    Dataset updated
    Jul 7, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2023
    Area covered
    LAC, Latin America
    Description

    In 2023, an average mobile subscriber in Latin America generated ***** gigabytes of data traffic per month. It has been projected that the number will increase to ** gigabytes by 2030. The number of unique mobile subscribers is expected to increase from *** to *** million between 2023 and 2030.

  17. perplexity.ai Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). perplexity.ai Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/perplexity.ai/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    perplexity.ai is ranked #89 in IN with 173.58M Traffic. Categories: AI. Learn more about website traffic, market share, and more!

  18. craigslist.org Website Traffic, Ranking, Analytics [July 2025]

    • semrush.com
    • semrush.ebundletools.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semrush (2025). craigslist.org Website Traffic, Ranking, Analytics [July 2025] [Dataset]. https://www.semrush.com/website/craigslist.org/overview/
    Explore at:
    Dataset updated
    Aug 12, 2025
    Dataset authored and provided by
    Semrushhttps://fr.semrush.com/
    License

    https://www.semrush.com/company/legal/terms-of-service/https://www.semrush.com/company/legal/terms-of-service/

    Time period covered
    Aug 12, 2025
    Area covered
    Worldwide
    Variables measured
    visits, backlinks, bounceRate, pagesPerVisit, authorityScore, organicKeywords, avgVisitDuration, referringDomains, trafficByCountry, paidSearchTraffic, and 3 more
    Measurement technique
    Semrush Traffic Analytics; Click-stream data
    Description

    craigslist.org is ranked #73 in US with 125.7M Traffic. Categories: Online Services, Real Estate. Learn more about website traffic, market share, and more!

  19. 🏄🏻‍♂️ Darknet Surfing

    • kaggle.com
    Updated Jul 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mexwell (2024). 🏄🏻‍♂️ Darknet Surfing [Dataset]. https://www.kaggle.com/datasets/mexwell/darknet-surfing
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 23, 2024
    Dataset provided by
    Kaggle
    Authors
    mexwell
    Description

    This dataset contains user behavior traffic in Tor, I2P, ZeroNet and Freenet.

    We divide darknet user behaviors in 8 categories: Browsing, Chat, E-mail, Audio-streaming, Video-streaming, File Transfer, P2P and VoIP. We investigated the commonly used applications in Tor, I2P, ZeroNet, Freenet to simulate various user behaviors.

    After capturing pcap, we use CICFlowMeter for feature extraction. Since our user behavior hierarchical classifier consists of 6 local classifiers, we divide the dataset into 6 csv files. The statistics of traffic data are shown in the following table.

    BrowsingChatEmailFile TransferP2PAudioVideoVoIPTotal
    Tor128184155310771018156717035928632
    I2P1921442108417912910---8148
    ZeroNet79721531352215713948201251-15477
    Freenet4990112329804897--2397-16387

    Dataset

    Original Pcap Datasets: Google drive - download

    Citation

    @INPROCEEDINGS{9343185,
     author={Hu, Yuzong and Zou, Futai and Li, Linsen and Yi, Ping},
     booktitle={2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)}, 
     title={Traffic Classification of User Behaviors in Tor, I2P, ZeroNet, Freenet}, 
     year={2020},
     volume={},
     number={},
     pages={418-424},
     doi={10.1109/TrustCom50675.2020.00064}}
    

    Acknowlegement

    Foto von Leon Seibert auf Unsplash

  20. Quarterly smartphone mobile data traffic per user South Korea 2019-2025, by...

    • statista.com
    Updated Aug 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Quarterly smartphone mobile data traffic per user South Korea 2019-2025, by tech [Dataset]. https://www.statista.com/statistics/1108286/south-korea-smartphone-data-traffic-per-user-monhtly-by-technology/
    Explore at:
    Dataset updated
    Aug 5, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2019 - Mar 2025
    Area covered
    South Korea
    Description

    As of March 2025, smartphone users with a 5G data plan in South Korea used around **** GB per subscription in that month. The number of 5G subscribers in the country had reached around ***** million users in the same month that year.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ravikumar Gattu (2023). Network Traffic Dataset [Dataset]. https://www.kaggle.com/datasets/ravikumargattu/network-traffic-dataset
Organization logo

Network Traffic Dataset

Use this Dataset for analysis the network traffic and designing the applications

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 31, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ravikumar Gattu
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

The data presented here was obtained in a Kali Machine from University of Cincinnati,Cincinnati,OHIO by carrying out packet captures for 1 hour during the evening on Oct 9th,2023 using Wireshark.This dataset consists of 394137 instances were obtained and stored in a CSV (Comma Separated Values) file.This large dataset could be used utilised for different machine learning applications for instance classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

The dataset can be used for a variety of machine learning tasks, such as network intrusion detection, traffic classification, and anomaly detection.

Content :

This network traffic dataset consists of 7 features.Each instance contains the information of source and destination IP addresses, The majority of the properties are numeric in nature, however there are also nominal and date kinds due to the Timestamp.

The network traffic flow statistics (No. Time Source Destination Protocol Length Info) were obtained using Wireshark (https://www.wireshark.org/).

Dataset Columns:

No : Number of Instance. Timestamp : Timestamp of instance of network traffic Source IP: IP address of Source Destination IP: IP address of Destination Portocol: Protocol used by the instance Length: Length of Instance Info: Information of Traffic Instance

Acknowledgements :

I would like thank University of Cincinnati for giving the infrastructure for generation of network traffic data set.

Ravikumar Gattu , Susmitha Choppadandi

Inspiration : This dataset goes beyond the majority of network traffic classification datasets, which only identify the type of application (WWW, DNS, ICMP,ARP,RARP) that an IP flow contains. Instead, it generates machine learning models that can identify specific applications (like Tiktok,Wikipedia,Instagram,Youtube,Websites,Blogs etc.) from IP flow statistics (there are currently 25 applications in total).

**Dataset License: ** CC0: Public Domain

Dataset Usages : This dataset can be used for different machine learning applications in the field of cybersecurity such as classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

ML techniques benefits from this Dataset :

This dataset is highly useful because it consists of 394137 instances of network traffic data obtained by using the 25 applications on a public,private and Enterprise networks.Also,the dataset consists of very important features that can be used for most of the applications of Machine learning in cybersecurity.Here are few of the potential machine learning applications that could be benefited from this dataset are :

  1. Network Performance Monitoring : This large network traffic data set can be utilised for analysing the network traffic to identifying the network patterns in the network .This help in designing the network security algorithms for minimise the network probelms.

  2. Anamoly Detection : Large network traffic dataset can be utilised training the machine learning models for finding the irregularitues in the traffic which could help identify the cyber attacks.

3.Network Intrusion Detection : This large dataset could be utilised for machine algorithms training and designing the models for detection of the traffic issues,Malicious traffic network attacks and DOS attacks as well.

Search
Clear search
Close search
Google apps
Main menu