31 datasets found
  1. Z

    Network Traffic Analysis: Data and Code

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Homan, Sophia (2024). Network Traffic Analysis: Data and Code [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11479410
    Explore at:
    Dataset updated
    Jun 12, 2024
    Dataset provided by
    Soni, Shreena
    Honig, Joshua
    Chan-Tin, Eric
    Ferrell, Nathan
    Moran, Madeline
    Homan, Sophia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Code:

    Packet_Features_Generator.py & Features.py

    To run this code:

    pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j

    -h, --help show this help message and exit -i TXTFILE input text file -x X Add first X number of total packets as features. -y Y Add first Y number of negative packets as features. -z Z Add first Z number of positive packets as features. -ml Output to text file all websites in the format of websiteNumber1,feature1,feature2,... -s S Generate samples using size s. -j

    Purpose:

    Turns a text file containing lists of incomeing and outgoing network packet sizes into separate website objects with associative features.

    Uses Features.py to calcualte the features.

    startMachineLearning.sh & machineLearning.py

    To run this code:

    bash startMachineLearning.sh

    This code then runs machineLearning.py in a tmux session with the nessisary file paths and flags

    Options (to be edited within this file):

    --evaluate-only to test 5 fold cross validation accuracy

    --test-scaling-normalization to test 6 different combinations of scalers and normalizers

    Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use

    --grid-search to test the best grid search hyperparameters - note: the possible hyperparameters must be added to train_model under 'if not evaluateOnly:' - once best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'

    Purpose:

    Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest Classifier on the provided data and provides results using cross validation. These results include the best scaling and normailzation options for each data set as well as the best grid search hyperparameters based on the provided ranges.

    Data

    Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queres (collected in the form of their autocomplete results and their results page), and different actions taken on a Virtual Reality head set.

    Data for this experiment was stored and analyzed in the form of a txt file for each experiment which contains:

    First number is a classification number to denote what website, query, or vr action is taking place.

    The remaining numbers in each line denote:

    The size of a packet,

    and the direction it is traveling.

    negative numbers denote incoming packets

    positive numbers denote outgoing packets

    Figure 4 Data

    This data uses specific lines from the Virtual Reality.txt file.

    The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.

    The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.

    The .xlsx and .csv file are identical

    Each file includes (from right to left):

    The origional packet data,

    each line of data organized from smallest to largest packet size in order to calculate the mean and standard deviation of each packet capture,

    and the final Cumulative Distrubution Function (CDF) caluclation that generated the Figure 4 Graph.

  2. Monthly global traffic to bestbuy.com 2023-2024

    • statista.com
    Updated Nov 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Monthly global traffic to bestbuy.com 2023-2024 [Dataset]. https://www.statista.com/statistics/1337483/web-visits-to-bestbuycom/
    Explore at:
    Dataset updated
    Nov 21, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jul 2023 - Oct 2024
    Area covered
    Worldwide, North America
    Description

    Total visits to bestbuy.com peaked in November 2023 at 330 million before declining to about 123 million in April 2024. While this figure measures the site's global traffic, the consumer electronics retailer operates primarily in the U.S., Canada, and Mexico.

  3. Bounce rate of most visited retail websites traffic in Japan 2024

    • statista.com
    Updated Jul 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bounce rate of most visited retail websites traffic in Japan 2024 [Dataset]. https://www.statista.com/statistics/1484450/japan-bounce-rate-most-visited-retail-website/
    Explore at:
    Dataset updated
    Jul 10, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jul 2024
    Area covered
    Japan
    Description

    The Japanese review site my-best.com had the highest bounce rate among the most visited retail websites in Japan in July 2024. Operated by mybest, Inc. and part of LY Corporation, the website had a bounce of nearly ** percent, while ranking as the ****** most visited retail website in the same month.

  4. e

    best-hashtags.com Traffic Analytics Data

    • analytics.explodingtopics.com
    Updated May 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). best-hashtags.com Traffic Analytics Data [Dataset]. https://analytics.explodingtopics.com/website/best-hashtags.com
    Explore at:
    Dataset updated
    May 1, 2025
    Variables measured
    Global Rank, Monthly Visits, Authority Score, US Country Rank, Advertising & Marketing Category Rank
    Description

    Traffic analytics, rankings, and competitive metrics for best-hashtags.com as of May 2025

  5. Share of paid traffic on GAFAM companies' website in France 2019

    • ai-chatbox.pro
    • statista.com
    Updated Nov 30, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). Share of paid traffic on GAFAM companies' website in France 2019 [Dataset]. https://www.ai-chatbox.pro/?_=%2Fstatistics%2F1182402%2Fwebsite-paid-traffic-gafam-companies-france%2F%23XgboD02vawLYpGJjSPEePEUG%2FVFd%2Bik%3D
    Explore at:
    Dataset updated
    Nov 30, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Dec 2019
    Area covered
    France
    Description

    When Amazon was launched in July 16, 1995 as a website that only sold books, founder Jeff Bezos had a vision for the company's explosive growth and e-commerce domination. In December 2019, we note that around 6.5 percent of visits to Amazon came from paid traffic in France, that is to say, visits made through paid advertising. With a strong discrepancy to it's competitors, the next best performing company was Microsoft.

  6. d

    Traffic Volumes from SCATS Traffic Management System Jan-Jun 2025 DCC

    • datasalsa.com
    zip
    Updated Jun 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dublin City Council (2025). Traffic Volumes from SCATS Traffic Management System Jan-Jun 2025 DCC [Dataset]. https://datasalsa.com/dataset/?catalogue=data.gov.ie&name=dcc-scats-detector-volume-jan-jun-2025
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 19, 2025
    Dataset authored and provided by
    Dublin City Council
    Time period covered
    Jun 19, 2025
    Description

    Traffic Volumes from SCATS Traffic Management System Jan-Jun 2025 DCC. Published by Dublin City Council. Available under the license cc-by (CC-BY-4.0).Traffic volumes data across Dublin City from the SCATS traffic management system. The Sydney Coordinated Adaptive Traffic System (SCATS) is an intelligent transportation system used to manage timing of signal phases at traffic signals. SCATS uses sensors at each traffic signal to detect vehicle presence in each lane and pedestrians waiting to cross at the local site. The vehicle sensors are generally inductive loops installed within the road.

    3 resources are provided:

    SCATS Traffic Volumes Data (Monthly) Contained in this report are traffic counts taken from the SCATS traffic detectors located at junctions. The primary function for these traffic detectors is for traffic signal control. Such devices can also count general traffic volumes at defined locations on approach to a junction. These devices are set at specific locations on approaches to the junction but may not be on all approaches to a junction. As there are multiple junctions on any one route, it could be expected that a vehicle would be counted multiple times as it progress along the route. Thus the traffic volume counts here are best used to represent trends in vehicle movement by selecting a specific junction on the route which best represents the overall traffic flows.

    Information provided:

    End Time: time that one hour count period finishes.

    Region: location of the detector site (e.g. North City, West City, etc).

    Site: this can be matched with the SCATS Sites file to show location

    Detector: the detectors/ sensors at each site are numbered

    Sum volume: total traffic volumes in preceding hour

    Avg volume: average traffic volumes per 5 minute interval in preceding hour

    All Dates Traffic Volumes Data

    This file contains daily totals of traffic flow at each site location.

    SCATS Site Location Data Contained in this report, the location data for the SCATS sites is provided. The meta data provided includes the following;

    Site id – This is a unique identifier for each junction on SCATS

    Site description( CAP) – Descriptive location of the junction containing street name(s) intersecting streets

    Site description (lower) - – Descriptive location of the junction containing street name(s) intersecting streets

    Region – The area of the city, adjoining local authority, region that the site is located

    LAT/LONG – Coordinates

    Disclaimer: the location files are regularly updated to represent the locations of SCATS sites under the control of Dublin City Council. However site accuracy is not absolute. Information for LAT/LONG and region may not be available for all sites contained. It is at the discretion of the user to link the files for analysis and to create further data. Furthermore, detector communication issues or faulty detectors could also result in an inaccurate result for a given period, so values should not be taken as absolute but can be used to indicate trends....

  7. Global share of human and bot web traffic 2013-2023

    • statista.com
    Updated Dec 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Global share of human and bot web traffic 2013-2023 [Dataset]. https://www.statista.com/statistics/1264226/human-and-bot-web-traffic-share/
    Explore at:
    Dataset updated
    Dec 10, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    In 2023, most of the global website traffic was still generated by humans but bot traffic is constantly growing. Fraudulent traffic through bad bot actors accounted for 32 percent of global web traffic in the most recently measured period, representing an increase of 1.8 percent from the previous year. Sophistication of Bad Bots on the rise The complexity of malicious bot activity has dramatically increased in recent years. Advanced bad bots have doubled in prevalence over the past two years, indicating a surge in the sophistication of cyber threats. Simultaneously, simple bad bots saw a 6 percent increase compared to the previous year, suggesting a shift in the landscape of automated threats. Meanwhile, areas like entertainment, and law & government face the highest amount of advanced bad bots, with more than 78 percent of their bot traffic affected by evasive applications. Good and bad bots across industries The impact of bot traffic varies across different sectors. Bad bots accounted for over 57.2 percent of the gaming segment's web traffic. Meanwhile, almost half of the online traffic for telecom and ISPs was moved by malicious applications. However, not all bot traffic is considered bad. Some of these applications help index websites for search engines or monitor website performance, assisting users throughout their online search. Therefore, areas like entertainment, food and groceries, and financial services experienced notable levels of good bot traffic, demonstrating the diverse applications of benign automated systems across different sectors.

  8. t

    CRO for Small Businesses: Optimizing Low Traffic Sites for Conversions

    • thegood.com
    html
    Updated May 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Good (2025). CRO for Small Businesses: Optimizing Low Traffic Sites for Conversions [Dataset]. https://thegood.com/insights/cro-for-small-businesses/
    Explore at:
    htmlAvailable download formats
    Dataset updated
    May 14, 2025
    Dataset authored and provided by
    The Good
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    At The Good, we love to use the phrase, “Let’s test it.” There’s no sense in making assumptions or guessing what an audience wants. It’s smarter – and better for your budget – to run experiments in order to determine what actually boosts your conversion rate. A/B testing is the standard experimentation methodology for digital marketing. […]

  9. H

    Buy Guest Post on Zeenews.india

    • dataverse.harvard.edu
    Updated Jan 26, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2022). Buy Guest Post on Zeenews.india [Dataset]. http://doi.org/10.7910/DVN/FU7YI7
    Explore at:
    Dataset updated
    Jan 26, 2022
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    What is a high quality website? Over the years the whole SEO industry is talking about the need of producing high quality content and top experts came up with the clever quote ‘Content is king’, meaning that content is the success factor of any website. While this is true, does it mean that a website with good content is also a high quality website? The answer is NO. Good content is not enough. It is one of the factors (the most important) that separates low from high quality sites but good content alone does not complete the puzzle of what is considered by Google as a high quality website. Now you can get the high quality on high quality sites like Nytimes, Forbes etc. You can also buy Zeenews.india guest Post at a reasonable price from the best guest post service. What is SEO SEO is short for ‘Search Engine Optimization’. It refers to the process of increasing a websites traffic flow by optimizing several aspects of a website; such as your on-page SEO, technical SEO & off-site SEO,. Your SEO strategy should ideally be planned around your content strategy. For this you will require three elements, 1.) keywords, 2.) links and 3.) substance to piece your content strategy together. Guest Post on High quality sites can improve your SEO ranking. To improve ranking and boost ranking, buy Guest Post on Zeenews.india from the high quality guest post service. Characteristics of a high quality website A high quality website has the following characteristics: Unique content Content is unique both within the website itself (i.e. each page has unique content and not similar to other pages), but also compared to other websites. Demonstrate Expertise Content is produced by experts based on research and or experience. If for example the subject is health related, then the advice should be provided by qualified authors who can professionally give advice for the particular subject. Unbiased content Content is detail and describes both sides of a story and is not promoting a single product, idea or service. Accessibility A high quality website has versions for non PC users as well. It is important that mobile and tablet users can access the website without any usability issues. Usability Can the user navigate the website easily; is the website user friendly? Attention to detail Content is easy to read with images (if applicable) and free of spelling and grammar mistakes. Does it seem that the owner cares on what is published on the website or is it for the purpose of having content in order to run ads? SEO Optimized Optimizing a web site for search engines has many benefits but it is important not to overdo it. A good quality web site needs to have non-optimized content as well. This is my opinion and although some people may disagree it is a fact that over-optimization can sometimes generate the opposite results. The reason is that algorithms can sometimes interpret over-optimization as an attempt to game the system and they may take measures to prevent this from happening. Balance between content and ads It is not something bad for a website to have ads or promotions but these should not distract the users from finding the information they need. Speed A high quality website loads fast. A fast website will rank higher and create more conventions, sales and loyal readers. Social Social media changed our lives, the way we communicate but also the way we assess quality. It is expected for a good product to have good reviews, Facebook likes and Tweets. Before you make a decision to buy or not, you may examine these social factors as well. Likewise, It is also expected for a good website to be socially accepted and recognized i.e. have Facebook followers, RSS subscribers etc. User Engagement and Interaction Do users spend enough time on the site and read more than one pages before they leave? Do they interact with the content by adding comments, making suggestions, getting into conversations etc.? Better than the competition When you take a specific keyword, is your website better than your competitors? Does it deserve one of the top positions if judged without bias?

  10. w

    Websites susceptible to CVE-2024-3554

    • webtechsurvey.com
    csv
    Updated May 2, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WebTechSurvey (2024). Websites susceptible to CVE-2024-3554 [Dataset]. https://webtechsurvey.com/cve/CVE-2024-3554
    Explore at:
    csvAvailable download formats
    Dataset updated
    May 2, 2024
    Dataset authored and provided by
    WebTechSurvey
    License

    https://webtechsurvey.com/termshttps://webtechsurvey.com/terms

    Time period covered
    2025
    Area covered
    Global
    Description

    A complete list of live websites affected by CVE-2024-3554, compiled through global website indexing conducted by WebTechSurvey.

  11. g

    Traffic Volumes from scats Traffic Management System Jul-Dec 2021 DCC |...

    • gimi9.com
    Updated Dec 15, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Traffic Volumes from scats Traffic Management System Jul-Dec 2021 DCC | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_afd64930-ff9a-4bba-846b-7511edb0e4fa/
    Explore at:
    Dataset updated
    Dec 15, 2021
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Traffic Volumes data across Dublin City from the scats traffic management system. The Sydney Coordinated Adaptive Traffic System (scats) is an intelligent transportation system used to manage timing of signal Phases at traffic signals. Scats uses SENSORS at each traffic signal to detect vehicle presence in each lane and pedestrians waiting to cross at the local site. The vehicle SENSORS are Generally Inductive Loops installed within the road. 3 resources are provided: Traffic Volumes Data scats (Monthly) Contained in this report are traffic Counts taken from the scats traffic detectors located at junctions. The primary function for these traffic detectors is for traffic signal control. Such devices can also count general traffic Volumes at defined locations on approach to a junction. These devices are set at specific locations on approaches to the junction but may not be on all approaches to a junction. As there are multiple junctions on any one route, it could be expected that a vehicle would be counted multiple times as it progress along the route. Set the traffic volume Counts here are best used to Represent trends in vehicle movement by selecting a specific junction on the route which best represents the overall traffic flows. Information provided: End Time: time that one hour count period finishes. Region: location of the detector site (e.g. North City, West City, etc.). Site: this can be matched with the scats Sites file to show location Detector: the detectors/SENSORS at each site are numbered Sum volume: total traffic Volumes in preceding hour AVG volume: average traffic Volumes per 5 minute interval in preceding hour All Dates Traffic Volumes Data This file contains daily totals of traffic flow at each site location. Scats Site Location Data Contained in this report, the location data for the scats sites is provided. The meta data provided includes the following; Site id — This is a unique identifier for each junction on scats Site description(CAP) — Descriptive location of the junction containing street name(s) intersecting street streets Site description (lower) — – Descriptive location of the junction containing street name(s) intersecting street streets Region — The area of the city, adjoining local authority, region that the site is located Lat/LONG — Coordinates Disclaimer: the location files are regularly updated to Represent the locations of scats sites under the control of Dublin City Council. However site accuracy is not absolute. Information for LAT/LONG and region may not be available for all sites contained. It is at the discretion of the user to link the files for analysis and to create further data. Furthermore, detector communication issues or Faulty detectors could also result in an inaccurate result for a given period, so values should not be taken as absolute but can be used to indicate trends.

  12. t

    Website Navigation: 10 Conversion Tips & Winning Examples

    • thegood.com
    html
    Updated May 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Good (2025). Website Navigation: 10 Conversion Tips & Winning Examples [Dataset]. https://thegood.com/insights/website-navigation/
    Explore at:
    htmlAvailable download formats
    Dataset updated
    May 21, 2025
    Dataset authored and provided by
    The Good
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Website navigation can make or break your visitors’ experience. Attempting to navigate a website without a logical, well-defined structure is like being dropped in the middle of a complex maze with no map and no frame of reference. It’s overwhelming, frustrating, and all-around unpleasant — not exactly the user experience you’re hoping for. This means […]

  13. t

    Driving Paid Traffic To A Landing Page vs. Product Page: What Is Better?

    • thegood.com
    html
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Good (2025). Driving Paid Traffic To A Landing Page vs. Product Page: What Is Better? [Dataset]. https://thegood.com/insights/landing-page-vs-product-page/
    Explore at:
    htmlAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    The Good
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Imagine you click an ad on Facebook for a spiffy set of binoculars. The ad claims they are perfect for bird watchers like yourself. The link sends you to a product page on an ecommerce website. You see the same binoculars but no mention of birds. It seems like a great device, but you wonder […]

  14. Unacast Aggregated Foot Traffic Data for U.S. Locations

    • datarade.ai
    .csv
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unacast (2025). Unacast Aggregated Foot Traffic Data for U.S. Locations [Dataset]. https://datarade.ai/data-providers/gravy-analytics/data-products/unacast-aggregated-foot-traffic-data-for-u-s-locations-unacast
    Explore at:
    .csvAvailable download formats
    Dataset updated
    Jun 25, 2025
    Dataset provided by
    Unacast, Inc.
    Authors
    Unacast
    Area covered
    United States
    Description

    Aggregated Foot Traffic Data is derived from Unacast's proprietary machine learning model. Unlike typical aggregated products that rely solely on aggregating the underlying GPS device-level supply, our machine learning model is more robust and less dependent on GPS data fluctuations because it is based on a magnitude of data sources.

    Aggregated Foot Traffic Data is designed to enable users to analyze foot traffic trends to places of commercial interest. Unacast offers Aggregated Foot Traffic Data to millions of points of interest (POIs), Census Block Groups (CBGs), and custom locations within the United States.

    Companies use Unacast Aggregated Foot Traffic Data for: - Site performance - Site selection - Market analysis - Competitor analysis - Business intelligence - Advertising and marketing - Benchmarking - Operational and staffing strategies

    Aggregated Foot Traffic Data is best used along with Unacast’s Aggregated Trade Areas Data and Aggregated Demographic Data. Together, these datasets provide a comprehensive view of visitor profiles, activity, and traveler origin. These machine learning datasets are built with a privacy-first mindset to give you peace of mind as you solve your biggest business problems.

  15. A

    ‘Volume del traffico proveniente dal sistema di gestione del traffico sparso...

    • analyst-2.ai
    Updated Jul 1, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2020). ‘Volume del traffico proveniente dal sistema di gestione del traffico sparso DCC, gennaio-giugno 2020 ’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/data-europa-eu-volume-del-traffico-proveniente-dal-sistema-di-gestione-del-traffico-sparso-dcc-gennaio-giugno-2020-7c8b/4a1b8e31/?iid=002-073&v=presentation
    Explore at:
    Dataset updated
    Jul 1, 2020
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Volume del traffico proveniente dal sistema di gestione del traffico sparso DCC, gennaio-giugno 2020 ’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from http://data.europa.eu/88u/dataset/3495362f-06c2-4b06-90f5-3eaa307ab145 on 10 January 2022.

    --- Dataset description provided by original source is as follows ---

    Traffic volumes data across Dublin City from the SCATS traffic management system. The Sydney Coordinated Adaptive Traffic System (SCATS) is an intelligent transportation system used to manage timing of signal phases at traffic signals. SCATS uses sensors at each traffic signal to detect vehicle presence in each lane and pedestrians waiting to cross at the local site. The vehicle sensors are generally inductive loops installed within the road.

    3 resources are provided:

    • SCATS Traffic Volumes Data (Monthly)

    Contained in this report are traffic counts taken from the SCATS traffic detectors located at junctions. The primary function for these traffic detectors is for traffic signal control. Such devices can also count general traffic volumes at defined locations on approach to a junction. These devices are set at specific locations on approaches to the junction but may not be on all approaches to a junction. As there are multiple junctions on any one route, it could be expected that a vehicle would be counted multiple times as it progress along the route. Thus the traffic volume counts here are best used to represent trends in vehicle movement by selecting a specific junction on the route which best represents the overall traffic flows.

    Information provided:

    1. End Time: time that one hour count period finishes.

    2. Region: location of the detector site (e.g. North City, West City, etc).

    3. Site: this can be matched with the SCATS Sites file to show location

    4. Detector: the detectors/ sensors at each site are numbered

    5. Sum volume: total traffic volumes in preceding hour

    6. Avg volume: average traffic volumes per 5 minute interval in preceding hour

    • All Dates Traffic Volumes Data

    This file contains daily totals of traffic flow at each site location.

    • SCATS Site Location Data

    Contained in this report, the location data for the SCATS sites is provided. The meta data provided includes the following;

    1. Site id – This is a unique identifier for each junction on SCATS

    2. Site description( CAP) – Descriptive location of the junction containing street name(s) intersecting streets

    3. Site description (lower) - – Descriptive location of the junction containing street name(s) intersecting streets

    4. Region – The area of the city, adjoining local authority, region that the site is located

    5. LAT/LONG – Coordinates

    Disclaimer: the location files are regularly updated to represent the locations of SCATS sites under the control of Dublin City Council. However site accuracy is not absolute. Information for LAT/LONG and region may not be available for all sites contained. It is at the discretion of the user to link the files for analysis and to create further data. Furthermore, detector communication issues or faulty detectors could also result in an inaccurate result for a given period, so values should not be taken as absolute but can be used to indicate trends.

    --- Original source retains full ownership of the source dataset ---

  16. Total global visitor traffic to amazon.com 2024

    • statista.com
    Updated Feb 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Total global visitor traffic to amazon.com 2024 [Dataset]. https://www.statista.com/statistics/623566/web-visits-to-amazoncom/
    Explore at:
    Dataset updated
    Feb 18, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Oct 2023 - Mar 2024
    Area covered
    Worldwide
    Description

    In March 2024, Amazon.com had approximately 2.2 billion combined web visits, up from 2.1 billion visits in February. In the fourth quarter of 2024, Amazon’s net income amounted to approximately 20 billion U.S. dollars. Online retail in the United States Online retail in the United States is constantly growing. In the third quarter of 2023, e-commerce sales accounted for 15.6 percent of retail sales in the United States. During that quarter, U.S. retail e-commerce sales amounted to over 284 billion U.S. dollars. Amazon is the leading online store in the country, in terms of e-commerce net sales. Amazon.com generated around 130 billion U.S. dollars in online sales in 2022. Walmart ranked as the second-biggest online store, with revenues of 52 billion U.S. dollars. The king of Black Friday In 2023, Amazon ranked as U.S. shoppers' favorite place to go shopping during Black Friday, even surpassing in-store purchasing. Nearly six out of ten consumers chose Amazon as the number one place to go find the best Black Friday deals. Similar findings can be observed in the United Kingdom (UK), where Amazon is also ranked as the preferred Black Friday destination.

  17. Website traffic to selected recommerce marketplaces in France 2023-2024

    • statista.com
    • ai-chatbox.pro
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Website traffic to selected recommerce marketplaces in France 2023-2024 [Dataset]. https://www.statista.com/statistics/1331293/top-websites-for-secondhand-purchases-by-traffic-france/
    Explore at:
    Dataset updated
    Jun 24, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Apr 2023 - Apr 2024
    Area covered
    France
    Description

    With approximately ***** million visits in April 2024, French online marketplace leboncoin.fr was the most consulted recommerce website in France. Other prominent web stores that consumers used to buy and sell second-hand or reconditioned items included the Lithuanian fashion marketplace Vinted.fr, which recorded **** million visits that same month, and ebay.fr who was the only secondhand marketplace in France to experience an increase in monthly visits in April 2024 compared to the same month in the previous year.

  18. Best performing online sectors after the COVID-19 outbreak in France in 2020...

    • statista.com
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Best performing online sectors after the COVID-19 outbreak in France in 2020 [Dataset]. https://www.statista.com/statistics/1252824/performance-online-high-tech-furniture-groceries-traffic-conversion-france-covid/
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jan 2020 - Dec 2020
    Area covered
    France
    Description

    After the outbreak of the coronavirus (COVID-19) online general retailers and grocery shopping had the most positive development as compared to the luxury products category or online cosmetics industry, also analyzed by the source. In general, the three big winners of the crisis in terms of traffic, conversion rate and time spent per session were the online grocery sector, high-tech products and the home furnishing sector.

    More information about

  19. Share of traffic sources on LV's brand website in China 2019

    • statista.com
    Updated Dec 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Share of traffic sources on LV's brand website in China 2019 [Dataset]. https://www.statista.com/statistics/1098784/china-louis-vuitton-website-traffic-by-source/
    Explore at:
    Dataset updated
    Dec 20, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Oct 2019
    Area covered
    China
    Description

    In 2004, luxury French fashion brand Louis Vuitton Malletier (also Louis Vuitton or LV) established its official website in China. The website allows direct purchases online and is one of LV's best performing retail channels. More than 55 percent of the website traffic comes from search results.

  20. Leading websites worldwide 2024, by session length

    • statista.com
    • ai-chatbox.pro
    Updated Feb 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Leading websites worldwide 2024, by session length [Dataset]. https://www.statista.com/statistics/1201901/most-visited-websites-worldwide-time-visit/
    Explore at:
    Dataset updated
    Feb 11, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Nov 2024
    Area covered
    Worldwide
    Description

    In November 2024, WhatsApp.com was the most engaging website worldwide, with users spending approximately 31 minutes and 17 seconds per visit on the website. YouTube.com was second in user engagement, with an average visit duration of 22 minutes and 24 minutes and 15 seconds. X.com, which ranks as the tenth most visited website worldwide, reported an average session length of 15 minutes and 26 seconds.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Homan, Sophia (2024). Network Traffic Analysis: Data and Code [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11479410

Network Traffic Analysis: Data and Code

Explore at:
Dataset updated
Jun 12, 2024
Dataset provided by
Soni, Shreena
Honig, Joshua
Chan-Tin, Eric
Ferrell, Nathan
Moran, Madeline
Homan, Sophia
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Code:

Packet_Features_Generator.py & Features.py

To run this code:

pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j

-h, --help show this help message and exit -i TXTFILE input text file -x X Add first X number of total packets as features. -y Y Add first Y number of negative packets as features. -z Z Add first Z number of positive packets as features. -ml Output to text file all websites in the format of websiteNumber1,feature1,feature2,... -s S Generate samples using size s. -j

Purpose:

Turns a text file containing lists of incomeing and outgoing network packet sizes into separate website objects with associative features.

Uses Features.py to calcualte the features.

startMachineLearning.sh & machineLearning.py

To run this code:

bash startMachineLearning.sh

This code then runs machineLearning.py in a tmux session with the nessisary file paths and flags

Options (to be edited within this file):

--evaluate-only to test 5 fold cross validation accuracy

--test-scaling-normalization to test 6 different combinations of scalers and normalizers

Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use

--grid-search to test the best grid search hyperparameters - note: the possible hyperparameters must be added to train_model under 'if not evaluateOnly:' - once best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'

Purpose:

Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest Classifier on the provided data and provides results using cross validation. These results include the best scaling and normailzation options for each data set as well as the best grid search hyperparameters based on the provided ranges.

Data

Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queres (collected in the form of their autocomplete results and their results page), and different actions taken on a Virtual Reality head set.

Data for this experiment was stored and analyzed in the form of a txt file for each experiment which contains:

First number is a classification number to denote what website, query, or vr action is taking place.

The remaining numbers in each line denote:

The size of a packet,

and the direction it is traveling.

negative numbers denote incoming packets

positive numbers denote outgoing packets

Figure 4 Data

This data uses specific lines from the Virtual Reality.txt file.

The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.

The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.

The .xlsx and .csv file are identical

Each file includes (from right to left):

The origional packet data,

each line of data organized from smallest to largest packet size in order to calculate the mean and standard deviation of each packet capture,

and the final Cumulative Distrubution Function (CDF) caluclation that generated the Figure 4 Graph.

Search
Clear search
Close search
Google apps
Main menu