100+ datasets found
  1. Number of internet users worldwide 2014-2029

    • statista.com
    Updated Apr 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Number of internet users worldwide 2014-2029 [Dataset]. https://www.statista.com/topics/1145/internet-usage-worldwide/
    Explore at:
    Dataset updated
    Apr 11, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Area covered
    World
    Description

    The global number of internet users in was forecast to continuously increase between 2024 and 2029 by in total 1.3 billion users (+23.66 percent). After the fifteenth consecutive increasing year, the number of users is estimated to reach 7 billion users and therefore a new peak in 2029. Notably, the number of internet users of was continuously increasing over the past years.Depicted is the estimated number of individuals in the country or region at hand, that use the internet. As the datasource clarifies, connection quality and usage frequency are distinct aspects, not taken into account here.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of internet users in countries like the Americas and Asia.

  2. f

    Data from: Penalized and Constrained Optimization: An Application to...

    • tandf.figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gareth M. James; Courtney Paulson; Paat Rusmevichientong (2023). Penalized and Constrained Optimization: An Application to High-Dimensional Website Advertising [Dataset]. http://doi.org/10.6084/m9.figshare.8023382.v3
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Taylor & Francis
    Authors
    Gareth M. James; Courtney Paulson; Paat Rusmevichientong
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Firms are increasingly transitioning advertising budgets to Internet display campaigns, but this transition poses new challenges. These campaigns use numerous potential metrics for success (e.g., reach or click rate), and because each website represents a separate advertising opportunity, this is also an inherently high-dimensional problem. Further, advertisers often have constraints they wish to place on their campaign, such as targeting specific sub-populations or websites. These challenges require a method flexible enough to accommodate thousands of websites, as well as numerous metrics and campaign constraints. Motivated by this application, we consider the general constrained high-dimensional problem, where the parameters satisfy linear constraints. We develop the Penalized and Constrained optimization method (PaC) to compute the solution path for high-dimensional, linearly constrained criteria. PaC is extremely general; in addition to internet advertising, we show it encompasses many other potential applications, such as portfolio estimation, monotone curve estimation, and the generalized lasso. Computing the PaC coefficient path poses technical challenges, but we develop an efficient algorithm over a grid of tuning parameters. Through extensive simulations, we show PaC performs well. Finally, we apply PaC to a proprietary dataset in an exemplar Internet advertising case study and demonstrate its superiority over existing methods in this practical setting. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.

  3. Internet and Computer use, London - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Jun 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2025). Internet and Computer use, London - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/internet-and-computer-use-london
    Explore at:
    Dataset updated
    Jun 9, 2025
    Dataset provided by
    CKANhttps://ckan.org/
    Area covered
    London
    Description

    Statistics of how many adults access the internet and use different types of technology covering: home internet access how people connect to the web how often people use the web/computers whether people use mobile devices whether people buy goods over the web whether people carried out specified activities over the internet For more information see the ONS website and the UKDS website.

  4. E-commerce - Users of a French C2C fashion store

    • kaggle.com
    zip
    Updated Mar 17, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeffrey Mvutu Mabilama (2020). E-commerce - Users of a French C2C fashion store [Dataset]. https://www.kaggle.com/jmmvutu/ecommerce-users-of-a-french-c2c-fashion-store
    Explore at:
    zip(1906187 bytes)Available download formats
    Dataset updated
    Mar 17, 2020
    Authors
    Jeffrey Mvutu Mabilama
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Context

    There are a lot of unknowns when running an E-commerce store, even when you have analytics to guide your decisions.

    Users are an important factor in an e-commerce business. This is especially true in a C2C-oriented store, since they are both the suppliers (by uploading their products) AND the customers (by purchasing other user's articles).

    This dataset aims to serve as a benchmark for an e-commerce fashion store. Using this dataset, you may want to try and understand what you can expect of your users and determine in advance how your grows may be.

    • For instance, if you see that most of your users are not very active, you may look into this dataset to compare your store's performance.

    If you think this kind of dataset may be useful or if you liked it, don't forget to show your support or appreciation with an upvote/comment. You may even include how you think this dataset might be of use to you. This way, I will be more aware of specific needs and be able to adapt my datasets to suits more your needs.

    This dataset is part of a preview of a much larger dataset. Please contact me for more.

    Content

    What is inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    The data was scraped from a successful online C2C fashion store with over 9M registered users. The store was first launched in Europe around 2009 then expanded worldwide.

    Visitors vs Users: Visitors do not appear in this dataset. Only registered users are included. "Visitors" cannot purchase an article but can view the catalog.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Questions you might want to answer using this dataset:

    • Are e-commerce users interested in social network feature ?
    • Are my users active enough (compared to those of this dataset) ?
    • How likely are people from other countries to sign up in a C2C website ?
    • How many users are likely to drop off after years of using my service ?

    License

    CC-BY-NC-SA 4.0

    For other licensing options, contact me.

  5. Number of global social network users 2017-2028

    • statista.com
    • grusthub.com
    • +3more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stacy Jo Dixon, Number of global social network users 2017-2028 [Dataset]. https://www.statista.com/topics/1164/social-networks/
    Explore at:
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Stacy Jo Dixon
    Description

    How many people use social media?

                  Social media usage is one of the most popular online activities. In 2024, over five billion people were using social media worldwide, a number projected to increase to over six billion in 2028.
    
                  Who uses social media?
                  Social networking is one of the most popular digital activities worldwide and it is no surprise that social networking penetration across all regions is constantly increasing. As of January 2023, the global social media usage rate stood at 59 percent. This figure is anticipated to grow as lesser developed digital markets catch up with other regions
                  when it comes to infrastructure development and the availability of cheap mobile devices. In fact, most of social media’s global growth is driven by the increasing usage of mobile devices. Mobile-first market Eastern Asia topped the global ranking of mobile social networking penetration, followed by established digital powerhouses such as the Americas and Northern Europe.
    
                  How much time do people spend on social media?
                  Social media is an integral part of daily internet usage. On average, internet users spend 151 minutes per day on social media and messaging apps, an increase of 40 minutes since 2015. On average, internet users in Latin America had the highest average time spent per day on social media.
    
                  What are the most popular social media platforms?
                  Market leader Facebook was the first social network to surpass one billion registered accounts and currently boasts approximately 2.9 billion monthly active users, making it the most popular social network worldwide. In June 2023, the top social media apps in the Apple App Store included mobile messaging apps WhatsApp and Telegram Messenger, as well as the ever-popular app version of Facebook.
    
  6. Phishing Websites Detection

    • kaggle.com
    zip
    Updated May 28, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    J Akshaya (2020). Phishing Websites Detection [Dataset]. https://www.kaggle.com/akshaya1508/phishing-websites-detection
    Explore at:
    zip(80950 bytes)Available download formats
    Dataset updated
    May 28, 2020
    Authors
    J Akshaya
    Description

    Context

    Phishing is a form of identity theft that occurs when a malicious website impersonates a legitimate one in order to acquire sensitive information such as passwords, account details, or credit card numbers. People generally tend to fall pray to this very easily. Kudos to the commendable craftsmanship of the attackers which makes people believe that it is a legitimate website. There is a need to identify the potential phishing websites and differentiate them from the legitimate ones. This dataset identifies the prominent features of the phishing websites, 10 such features have been identified.

    Content

    Generally, the open source datasets available on the internet do not comes with the code and the logic which arises certain problems i.e.:

    1. Limited Data: The ML algorithms can only be tested with the existing phishing URLs and no new phishing URLS can be checked for its validity.
    2. Outdated URLs: The datasets available on the internet has been uploaded long time ago, there are new kind of phishing URLs arising in every second.
    3. Outdated Features: The datasets available on the internet has been uploaded long time ago, there are new methodologies arising in phishing techniques.
    4. No Access to Backend: There is no stepwise guide describing how the feature has been derived.

    On the contrary we are trying to overcome all the above-mentioned problems.

    1. Real Time Data: Before applying a Machine Learning algorithm, we can run the script and fetch real time URLs from Phishtank (for phishing URLs) and from moz (for legitimate URLs) 2. Scalable Data: We can also specify the number of URLs we want to feed the model and hence the web scrapper will fetch that much amount of data from the websites. Presently we are using 1401 URLs in this project i.e. 901 Phishing URLs and 500 Legitimate URLS. 3. New Features: We have tried to implement the prominent new features that is there in the current phishing URLs and since we own the code, new features can also be added. 4. Source code on Github: The source code is published on GitHub for public use and can be used for further scope of improvements. This way there will be transparency to the logic and more creators can add there meaningful additions to the code.

    Link to the source code

    https://github.com/akshaya1508/detection_of_phishing_websites.git

    Inspiration

    The idea to develop the dataset and the code for this dataset has been inspired by various other creators who have worked on the similar lines.

  7. Web Graphs

    • kaggle.com
    zip
    Updated Nov 11, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Subhajit Sahu (2021). Web Graphs [Dataset]. https://www.kaggle.com/wolfram77/graphs-web
    Explore at:
    zip(52848952 bytes)Available download formats
    Dataset updated
    Nov 11, 2021
    Authors
    Subhajit Sahu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dynamic face-to-face interaction networks represent the interactions that happen during discussions between a group of participants playing the Resistance game. This dataset contains networks extracted from 62 games. Each game is played by 5-8 participants and lasts between 45--60 minutes. We extract dynamically evolving networks from the free-form discussions using the ICAF algorithm. The extracted networks are used to characterize and detect group deceptive behavior using the DeceptionRank algorithm.

    The networks are weighted, directed and temporal. Each node represents a participant. At each 1/3 second, a directed edge from node u to v is weighted by the probability of participant u looking at participant v or the laptop. Additionally, we also provide a binary version where an edge from u to v indicates participant u looks at participant v (or the laptop).

    Stanford Network Analysis Platform (SNAP) is a general purpose, high performance system for analysis and manipulation of large networks. Graphs consists of nodes and directed/undirected/multiple edges between the graph nodes. Networks are graphs with data on nodes and/or edges of the network.

    The core SNAP library is written in C++ and optimized for maximum performance and compact graph representation. It easily scales to massive networks with hundreds of millions of nodes, and billions of edges. It efficiently manipulates large graphs, calculates structural properties, generates regular and random graphs, and supports attributes on nodes and edges. Besides scalability to large graphs, an additional strength of SNAP is that nodes, edges and attributes in a graph or a network can be changed dynamically during the computation.

    SNAP was originally developed by Jure Leskovec in the course of his PhD studies. The first release was made available in Nov, 2009. SNAP uses a general purpose STL (Standard Template Library)-like library GLib developed at Jozef Stefan Institute. SNAP and GLib are being actively developed and used in numerous academic and industrial projects.

    http://snap.stanford.edu/data/index.html#face2face

  8. Network Traffic Dataset

    • kaggle.com
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ravikumar Gattu (2023). Network Traffic Dataset [Dataset]. https://www.kaggle.com/datasets/ravikumargattu/network-traffic-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ravikumar Gattu
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    The data presented here was obtained in a Kali Machine from University of Cincinnati,Cincinnati,OHIO by carrying out packet captures for 1 hour during the evening on Oct 9th,2023 using Wireshark.This dataset consists of 394137 instances were obtained and stored in a CSV (Comma Separated Values) file.This large dataset could be used utilised for different machine learning applications for instance classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

    The dataset can be used for a variety of machine learning tasks, such as network intrusion detection, traffic classification, and anomaly detection.

    Content :

    This network traffic dataset consists of 7 features.Each instance contains the information of source and destination IP addresses, The majority of the properties are numeric in nature, however there are also nominal and date kinds due to the Timestamp.

    The network traffic flow statistics (No. Time Source Destination Protocol Length Info) were obtained using Wireshark (https://www.wireshark.org/).

    Dataset Columns:

    No : Number of Instance. Timestamp : Timestamp of instance of network traffic Source IP: IP address of Source Destination IP: IP address of Destination Portocol: Protocol used by the instance Length: Length of Instance Info: Information of Traffic Instance

    Acknowledgements :

    I would like thank University of Cincinnati for giving the infrastructure for generation of network traffic data set.

    Ravikumar Gattu , Susmitha Choppadandi

    Inspiration : This dataset goes beyond the majority of network traffic classification datasets, which only identify the type of application (WWW, DNS, ICMP,ARP,RARP) that an IP flow contains. Instead, it generates machine learning models that can identify specific applications (like Tiktok,Wikipedia,Instagram,Youtube,Websites,Blogs etc.) from IP flow statistics (there are currently 25 applications in total).

    **Dataset License: ** CC0: Public Domain

    Dataset Usages : This dataset can be used for different machine learning applications in the field of cybersecurity such as classification of Network traffic,Network performance monitoring,Network Security Management , Network Traffic Management ,network intrusion detection and anomaly detection.

    ML techniques benefits from this Dataset :

    This dataset is highly useful because it consists of 394137 instances of network traffic data obtained by using the 25 applications on a public,private and Enterprise networks.Also,the dataset consists of very important features that can be used for most of the applications of Machine learning in cybersecurity.Here are few of the potential machine learning applications that could be benefited from this dataset are :

    1. Network Performance Monitoring : This large network traffic data set can be utilised for analysing the network traffic to identifying the network patterns in the network .This help in designing the network security algorithms for minimise the network probelms.

    2. Anamoly Detection : Large network traffic dataset can be utilised training the machine learning models for finding the irregularitues in the traffic which could help identify the cyber attacks.

    3.Network Intrusion Detection : This large dataset could be utilised for machine algorithms training and designing the models for detection of the traffic issues,Malicious traffic network attacks and DOS attacks as well.

  9. phishing.arff

    • figshare.com
    txt
    Updated Jul 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ambroise Odonnat (2024). phishing.arff [Dataset]. http://doi.org/10.6084/m9.figshare.26232710.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jul 10, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Ambroise Odonnat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This folder contains the data from the Phishing Website dataset provided in [1]. All the features are categorical and were preprocessed in integer values. The data can be downloaded from https://archive.ics.uci.edu/dataset/327/phishing+websites. There are 11055 samples with 30 features. Websites belong to 2 domains: websites that use the IP address used instead of the domain name in the URL and websites that use the domain name in the URL. For reference, please refer to: [1] R. Mohammad, F. Thabtah, L. Mccluskey. An assessment of features related to phishing websites using an automated technique In International Conference for Internet Technology and Secured Transactions, 2012

  10. e

    Government websites hyperlink networks, multiple countries - Dataset -...

    • b2find.eudat.eu
    Updated Apr 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Government websites hyperlink networks, multiple countries - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/be3c6062-d352-5d76-884d-03d0b77e96d2
    Explore at:
    Dataset updated
    Apr 26, 2023
    Description

    The dataset includes the hyperlink network structure of the government websites in different countries. This version of the dataset includes data from Canada, Japan, and Spain. See 'Related Resources' section below for similar collections. This project aims to develop methodologies to study online political behaviour including use of the Internet to generate new data and experiments; to collect and analyse data on internet-mediated interactions at both individual and organisational levels; and to use this data to re-examine and where necessary develop political science knowledge and theory in light of widespread use of the Internet First, the project will re-examine the logic of collective action, assessing the impact of reduced communication and coordination costs; the changing nature of leadership; and the effects of different information environments on propensity to participate in political mobilisation. This part of the research will involve conducting laboratory and field experiments into online behaviour. Second, the research will develop the Digital-era Governance model for newer 'Web 2.0' applications and other technological developments such as cloud computing. The research will re-examine the nature of citizen-government interactions in this changing environment, examining the impact of Internet-based mediation on information exchange, organisational forms in government and citizen participation in policy-making. This part of the research will involve a comparison of government's online presence in eight countries, using webmetric techniques, and in-depth qualitative analysis of governance models, using elite interviewing and documentary analysis. We used the Heritrix web crawler (https://en.wikipedia.org/wiki/Heritrix) to capture the hyperlink structure of webpages witihin the .gc.ca, .go.jp, and .gob.es domains.

  11. d

    Performance Metrics - Innovation & Technology - City Website Availability

    • catalog.data.gov
    • data.cityofchicago.org
    • +3more
    Updated Dec 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.cityofchicago.org (2023). Performance Metrics - Innovation & Technology - City Website Availability [Dataset]. https://catalog.data.gov/dataset/performance-metrics-innovation-technology-city-website-availability
    Explore at:
    Dataset updated
    Dec 2, 2023
    Dataset provided by
    data.cityofchicago.org
    Description

    The City's Internet site allows residents to access City services online, learn more about the City of Chicago, and find other pertinent information. The percentage of the City’s Internet website uptime, the amount of time the site was available, and the target uptime for each week are available by mousing over columns. The target availability for this site is 99.5%.

  12. About Norwegian Agriculture

    • kaggle.com
    Updated Jul 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olena Bugaiova (2024). About Norwegian Agriculture [Dataset]. http://doi.org/10.34740/kaggle/dsv/9037685
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 26, 2024
    Dataset provided by
    Kaggle
    Authors
    Olena Bugaiova
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Area covered
    Norway
    Description

    Context

    The cleaned text data can be used to adapt LLM to the domain of Norwegian Agriculture within the Norwegian language. In addition, it can be valuable for various NLP tasks such as region classification, or analytical tasks, such as exploring common agricultural practices in Norway.

    Content

    This dataset focuses on agronomic management practices and production in Norway. It consists of 2292 articles in Norwegian. All data is derived from three Norwegian agricultural-related websites and includes data from the largest advisory service for the agricultural sector, Norsk landbruksrådgivning (Norwegian Agricultural Extension Service, NLR), the most prominent agricultural research institute in Norway, Norsk Institutt for Bioøkonomi (Norwegian Institute for Bioeconomy, NIBIO), and the most comprehensive web page dedicated to plant protection in agriculture, Plantevernleksikonet.

    Inspiration

    The emergence of LLMs marked a significant step forward, providing a single solution for generating human-like text. However, training an LLM requires substantial amounts of text data, which is not readily available for most natural languages, including Norwegian. And agriculture as an industry has not seen much penetration of AI, - what if we could provide location-specific insights to a farmer?

    Acknowledgements

    The data from NLR can be expanded in the future, gathering more text data.

  13. NYC STEW-MAP Staten Island organizations' website hyperlink webscrape

    • catalog.data.gov
    • s.cnmilf.com
    Updated Nov 21, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2022). NYC STEW-MAP Staten Island organizations' website hyperlink webscrape [Dataset]. https://catalog.data.gov/dataset/nyc-stew-map-staten-island-organizations-website-hyperlink-webscrape
    Explore at:
    Dataset updated
    Nov 21, 2022
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Area covered
    Staten Island, New York
    Description

    The data represent web-scraping of hyperlinks from a selection of environmental stewardship organizations that were identified in the 2017 NYC Stewardship Mapping and Assessment Project (STEW-MAP) (USDA 2017). There are two data sets: 1) the original scrape containing all hyperlinks within the websites and associated attribute values (see "README" file); 2) a cleaned and reduced dataset formatted for network analysis. For dataset 1: Organizations were selected from from the 2017 NYC Stewardship Mapping and Assessment Project (STEW-MAP) (USDA 2017), a publicly available, spatial data set about environmental stewardship organizations working in New York City, USA (N = 719). To create a smaller and more manageable sample to analyze, all organizations that intersected (i.e., worked entirely within or overlapped) the NYC borough of Staten Island were selected for a geographically bounded sample. Only organizations with working websites and that the web scraper could access were retained for the study (n = 78). The websites were scraped between 09 and 17 June 2020 to a maximum search depth of ten using the snaWeb package (version 1.0.1, Stockton 2020) in the R computational language environment (R Core Team 2020). For dataset 2: The complete scrape results were cleaned, reduced, and formatted as a standard edge-array (node1, node2, edge attribute) for network analysis. See "READ ME" file for further details. References: R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/. Version 4.0.3. Stockton, T. (2020). snaWeb Package: An R package for finding and building social networks for a website, version 1.0.1. USDA Forest Service. (2017). Stewardship Mapping and Assessment Project (STEW-MAP). New York City Data Set. Available online at https://www.nrs.fs.fed.us/STEW-MAP/data/. This dataset is associated with the following publication: Sayles, J., R. Furey, and M. Ten Brink. How deep to dig: effects of web-scraping search depth on hyperlink network analysis of environmental stewardship organizations. Applied Network Science. Springer Nature, New York, NY, 7: 36, (2022).

  14. Internet Use by Borough, and Population Sub-Groups - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Jun 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2025). Internet Use by Borough, and Population Sub-Groups - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/internet-use-by-borough-and-population-sub-groups
    Explore at:
    Dataset updated
    Jun 9, 2025
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    This table shows whether people aged 16 or over have ever used or never used the internet by a range of variables such as age, ethnicity, pay, occupation, qualifications, and disability. The question asked in the Labour Force Survey is "When did you last use the internet?" This question is only asked to people aged 16 and over. The first time this data was available was 2011 Q1. At borough level the data showed ever used or never used. For London and Rest of UK the data is broken down by a range of indicators, including age, ethnic group, weekly pay, occupation levels, qualification levels, and economic activity. The APS sampled around 333,000 people in the UK (around 27,000 in London). As such all figures must be treated with some caution. Data was supplied directly by ONS under request from the Greater London Authority. Numbers rounded to the nearest thousand. Other Internet Access data can be found on the ONS website. This is national data based on the Opinions and Lifestyle Survey.

  15. m

    United Internet AG NA - Ebitda

    • macro-rankings.com
    csv, excel
    Updated Jul 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). United Internet AG NA - Ebitda [Dataset]. https://www.macro-rankings.com/Markets/Stocks/UTDI-XETRA/Income-Statement/Ebitda
    Explore at:
    excel, csvAvailable download formats
    Dataset updated
    Jul 30, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    germany
    Description

    Ebitda Time Series for United Internet AG NA. United Internet AG, through its subsidiaries, operates as an Internet service provider worldwide. The company operates through Consumer Access, Business Access, Consumer Applications, and Business Applications segments. It offers landline-based broadband and mobile internet products, including home networks, online storage, smart home, and IPTV for private users; and telecommunication products ranging from fiber-optic direct connections to tailored ICT solutions, which include voice, data, and network solutions, as well as infrastructure services to national and international carriers and ISPs. The company also provides applications and services for home users, such as personal information management applications comprising email, to-do lists, appointments, and addresses; and online cloud storage and office software. In addition, it provides business applications for freelancers and small to medium enterprises, such as domains, websites, web hosting, servers, e-shops, group work, online cloud storage, and office software, as well as cloud solutions and infrastructure. It offers its access products through the yourfone, smartmobile.de, 1&1, and 1&1 Versatel brand names; and applications through GMX, mail.com, WEB.DE, home.pl, Arsys, STRATO, IONOS, Fasthosts, we22, InterNetX, united-domains, and World4You brand names. In addition, the company offers professional services in the fields of active domain management; performance-based advertising and sales services under the Sedo brand name; online advertising services under the United Internet Media brand name; and white-label website builder services under the we22 brand, as well as sells IT hardware. The company was founded in 1988 and is headquartered in Montabaur, Germany.

  16. VLC Data: A Multi-Class Network Traffic Dataset Covering Diverse...

    • zenodo.org
    • producciocientifica.uv.es
    • +1more
    bin
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francisco Rau; Francisco Rau; Carlos Herranz Claveras; Carlos Herranz Claveras; Iñaki Val; Iñaki Val; Joaquin Perez; Joaquin Perez (2025). VLC Data: A Multi-Class Network Traffic Dataset Covering Diverse Applications and Platforms [Dataset]. http://doi.org/10.5281/zenodo.15121418
    Explore at:
    binAvailable download formats
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Francisco Rau; Francisco Rau; Carlos Herranz Claveras; Carlos Herranz Claveras; Iñaki Val; Iñaki Val; Joaquin Perez; Joaquin Perez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Apr 1, 2025
    Description

    VLC Data: A Multi-Class Network Traffic Dataset Covering Diverse Applications and Platforms

    Valencia Data (VLC Data) is a network traffic dataset collected from various applications and platforms. It includes both encrypted and, when applicable, unencrypted protocols, capturing realistic usage scenarios and application-specific behavior.

    The dataset covers 18.5 hours, 58 pcapng files, and 24.26 GB, with traffic from:

    • Video streaming: Netflix and Prime Video (10–50 min) via Firefox.
    • Gaming: Roblox sessions on Windows (20–35 min), recorded outside of virtual machines, despite VM support.
    • Video conferencing: Microsoft Teams (20 min) via Firefox.
    • Web browsing: Wikipedia, BBC, Google, LinkedIn, Amazon, and OWIN6G (2–5 min) via Firefox or Chrome.
    • Audio streaming: Spotify (30–33 min) on multiple OS.
    • Web streaming: YouTube in 4K and Full HD (20–30 min).

    This dataset is publicly available for traffic analysis across different apps, protocols, and systems.

    Table Description:

    TypeApplicationsPlatformTime [min]CommentsFilenameSize (MB)
    Video StreamingNetflixLinux10Running Netflix on Firefox Browsernetflix_linux_10m_01 95.1
    Video StreamingNetflixLinux20Running Netflix on Firefox Browsernetflix_linux_20m_01 167.7
    Video StreamingNetflixLinux20Running Netflix on Firefox Browsernetflix_linux_20m_02 237.9
    Video StreamingNetflixLinux20Running Netflix on Firefox Browsernetflix_linux_20m_03 212.6
    Video StreamingNetflixLinux25Running Netflix on Firefox, but 2 min in Menunetflix_linux_25m_01 610.7
    Video StreamingNetflixLinux35Running Netflix on Firefox, but 1 min in Menunetflix_linux_35m_01 534.8
    Video StreamingNetflixLinux50Running Netflix on Firefox Browsernetflix_linux_50m_01 660.9
    Video StreamingNetflixWindows10Running Netflix on Firefox Browsernetflix_windows_10m_01 132.1
    Video StreamingNetflixWindows20Running Netflix on Firefox Browsernetflix_windows_20m_01 506.4
    Video StreamingPrime VideoLinux20Running Prime Video on Firefox Browserprime_linux_20m_01 767.3
    Video StreamingPrime VideoLinux20Running Prime Video on Firefox Browserprime_linux_20m_02 569.3
    Video StreamingPrime VideoWindows20Running Prime Video on Firefox Browserprime_windows_20m_01 512.3
    Video StreamingPrime VideoWindows20Running Prime Video on Firefox Browserprime_windows_20m_02 364.2
    GamingRobloxWindows20Doesn't run in VMroblox_windows_20m_01 127.5
    GamingRobloxWindows20Doesn't run in VMroblox_windows_20m_02 378.5
    GamingRobloxWindows20Doesn't run in VMroblox_windows_20m_03 458.9
    GamingRobloxWindows30Doesn't run in VMroblox_windows_30m_01 519.8
    GamingRobloxWindows30Doesn't run in VMroblox_windows_30m_02 357.3
    GamingRobloxWindows35Doesn't run in VMroblox_windows_35m_01 880.4
    Audio StreamingSpotifyLinux30Running Spotify app on Ubuntu-Linuxspotify_linux_30m_01 98.2
    Audio StreamingSpotifyLinux30Running Spotify app on Ubuntu-Linuxspotify_linux_30m_02 112.2
    Audio StreamingSpotifyLinux30Running Spotify app on Ubuntu-Linuxspotify_linux_30m_03 175.5
    Audio StreamingSpotifyWindows30Running Spotify app on Windowsspotify_windows_30m_01 50.7
    Audio StreamingSpotifyWindows30Doesn't run in VMspotify_windows_30m_02 63.2
    Audio StreamingSpotifyWindows33Running Spotify app on Windowsspotify_windows_33m_01 70.9
    Video ConferencingTeamsLinux20Running Teams on Firefox Browserteams_linux_20m_01 134.6
    Video ConferencingTeamsLinux20Running Teams on Firefox Browserteams_linux_20m_02 343.3
    Video ConferencingTeamsLinux20Running Teams on Firefox Browserteams_linux_20m_03 376.6
    Video ConferencingTeamsWindows20Running Teams on Firefox Browserteams_windows_20m_01 634.1
    Video ConferencingTeamsWindows20Running Teams on Firefox Browserteams_windows_20m_02 517.8
    Video ConferencingTeamsWindows20Running Teams on Firefox Browserteams_windows_20m_03 629.9
    Web BrowsingWebLinux2OWIN6G website on Firefox Browserweb_linux_2m_owin6g 1.2
    Web BrowsingWebLinux2Wikipedia website on Firefox Browserweb_linux_2m_wikipedia 19.7
    Web BrowsingWebLinux3OWIN6G website on Firefox Browserweb_linux_3m_owin6g 4.5
    Web BrowsingWebLinux3Wikipedia website on Firefox Browserweb_linux_3m_wikipedia 23.5
    Web BrowsingWebLinux5Amazon website on Chrome Browser web_linux_5m_amazon 262.9
    Web BrowsingWebLinux5BBC website on Firefox Browser web_linux_5m_bbc 55.7
    Web BrowsingWebLinux5Google website on Firefox Browser web_linux_5m_google 22.6
    Web BrowsingWebLinux5Linkedin website on Firefox Browserweb_linux_5m_linkedin 39.8
    Web BrowsingWebWindows3OWIN6G website on Firefox Browserweb_windows_3m_owin6g 32.6
    Web BrowsingWebWindows3Wikipedia website on Firefox Browserweb_windows_3m_wikipedia 94.9
    Web BrowsingWebWindows5Amazon website on Chrome Browser web_windows_5m_amazon 104.0
    Web BrowsingWebWindows5BBC website on Firefox Browser web_windows_5m_bbc 23.1
    Web BrowsingWebWindows5Google website on Firefox Browser web_windows_5m_google 31.5
    Web BrowsingWebWindows5Linkedin website on Firefox Browserweb_windows_5m_linkedin 104.1
    Web StreamingYoutubeLinux20One Video Streaming, 4Kyoutube_linux_20m_01 1,145.6
    Web StreamingYoutubeLinux20One Video Streaming, FullHDyoutube_linux_20m_02 389.4
    Web StreamingYoutubeLinux20One Video Streaming, FullHDyoutube_linux_20m_03 2,007.1
    Web StreamingYoutubeLinux20One Video Streaming, 4Kyoutube_linux_20m_04 390.4
    Web StreamingYoutubeLinux20One Video Streaming, FullHDyoutube_linux_20m_05 410.1
    Web

  17. e

    Geography of digital inequality - Dataset - B2FIND

    • b2find.eudat.eu
    Updated Jun 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Geography of digital inequality - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/aac47d03-1fdf-5f48-8021-627e02f643e9
    Explore at:
    Dataset updated
    Jun 27, 2023
    Description

    These data consist of measures of Internet use estimated using small area estimation. The small area estimation is based on census Output Areas (OAs) using the 2013 Oxford Internet Survey (OxIS) and the 2011 British census. There is an estimate for each OA in Great Britain. By combining the 2013 OxIS survey data with the comprehensive small area coverage of the 2011 British census we can use the strengths of one to offset the gaps in the other. Specifically, we follow a two-step process. First, we use the information that is reliably available in OxIS to create model that estimates the proportion of Internet users in OAs. Second, we use the parameters from this model combined with census data to estimate the proportion of Internet users each OA in Britain. Once these estimates are available, we aggregate the estimates up to higher levels of geography. In this way we can estimate Internet use in Glasgow, Manchester and Cardiff as well as other small areas in Britain. This procedure is referred to as indirect, model-based or synthetic estimation. In recent years such SAE techniques have been widely used throughout Europe and North America. See the project website for more details.The objective of the Geography of Digital Inequality project was to explore the geographical contours of Internet use and penetration in Britain. Specifically, the project assembled from existing datasets a new dataset which contains Internet information at fine-grained geographic levels, census output areas (OAs). From OAs we were able to aggregate to higher geographic levels such as counties, Welsh and Scottish Councils, metropolitan areas, or others. Through this unique dataset we explored digital divides and the geography of the Internet, a capability possessed by no other dataset. Specifically, we explored the extent of use versus non-use of the Internet. There were 2 datasets used to assemble this dataset. First, the 2013 Oxford Internet Survey (OxIS) is a random sample of the 2657 people age 14+ from the British population (England, Scotland & Wales). Interviews were conducted face-to-face by an independent survey research company. The response rate for 2013 was 51%. The data collection was a two-stage sample. A random sample of census output areas (OAs) was selected and respondents were randomly sampled within each selected OA. For details, see "Data collection technical report.pdf" which has been uploaded. We use six variables from OxIS: Internet use, region, age, lifestage, gender and education. The questionnaire for OxIS contains about 300 variables and it is available from the OxIS website, see the URL in the "related resources" section. Second, the 2011 British Census. For information on how the census was conducted,see the census website. The URL for the 2011 census is given below in "related resources".

  18. y

    York Cycle Network - Dataset - York Open Data

    • data.yorkopendata.org
    Updated Feb 10, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). York Cycle Network - Dataset - York Open Data [Dataset]. https://data.yorkopendata.org/dataset/york-cycle-network
    Explore at:
    Dataset updated
    Feb 10, 2016
    License

    Open Government Licence 2.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/2/
    License information was derived automatically

    Area covered
    York
    Description

    For further information about cycling - see the City of York Council website *Please note that the data published within this dataset is a live API link to CYC's GIS server. Any changes made to the master copy of the data will be immediately reflected in the resources of this dataset.The date shown in the "Last Updated" field of each GIS resource reflects when the data was first published.

  19. m

    United Internet AG NA - Total-Revenue

    • macro-rankings.com
    csv, excel
    Updated Aug 24, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    macro-rankings (2025). United Internet AG NA - Total-Revenue [Dataset]. https://www.macro-rankings.com/Markets/Stocks/UTDI-XETRA/Income-Statement/Total-Revenue
    Explore at:
    csv, excelAvailable download formats
    Dataset updated
    Aug 24, 2025
    Dataset authored and provided by
    macro-rankings
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    germany
    Description

    Total-Revenue Time Series for United Internet AG NA. United Internet AG, through its subsidiaries, operates as an Internet service provider worldwide. The company operates through Consumer Access, Business Access, Consumer Applications, and Business Applications segments. It offers landline-based broadband and mobile internet products, including home networks, online storage, smart home, and IPTV for private users; and telecommunication products ranging from fiber-optic direct connections to tailored ICT solutions, which include voice, data, and network solutions, as well as infrastructure services to national and international carriers and ISPs. The company also provides applications and services for home users, such as personal information management applications comprising email, to-do lists, appointments, and addresses; and online cloud storage and office software. In addition, it provides business applications for freelancers and small to medium enterprises, such as domains, websites, web hosting, servers, e-shops, group work, online cloud storage, and office software, as well as cloud solutions and infrastructure. It offers its access products through the yourfone, smartmobile.de, 1&1, and 1&1 Versatel brand names; and applications through GMX, mail.com, WEB.DE, home.pl, Arsys, STRATO, IONOS, Fasthosts, we22, InterNetX, united-domains, and World4You brand names. In addition, the company offers professional services in the fields of active domain management; performance-based advertising and sales services under the Sedo brand name; online advertising services under the United Internet Media brand name; and white-label website builder services under the we22 brand, as well as sells IT hardware. The company was founded in 1988 and is headquartered in Montabaur, Germany.

  20. C

    National Hydrography Data - NHD and 3DHP

    • data.cnra.ca.gov
    • data.ca.gov
    • +1more
    Updated Jul 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Water Resources (2025). National Hydrography Data - NHD and 3DHP [Dataset]. https://data.cnra.ca.gov/dataset/national-hydrography-dataset-nhd
    Explore at:
    website, zip(39288832), pdf(3684753), pdf, pdf(1175775), zip(13901824), pdf(182651), arcgis geoservices rest api, zip(972664), pdf(9867020), csv(12977), zip(4657694), zip(578260992), pdf(1436424), zip(1647291), zip(15824984), pdf(1634485), zip(73817620), zip(10029073), pdf(4856863), zip(128966494), pdf(437025), web videos, pdf(3932070)Available download formats
    Dataset updated
    Jul 16, 2025
    Dataset authored and provided by
    California Department of Water Resources
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    The USGS National Hydrography Dataset (NHD) downloadable data collection from The National Map (TNM) is a comprehensive set of digital spatial data that encodes information about naturally occurring and constructed bodies of surface water (lakes, ponds, and reservoirs), paths through which water flows (canals, ditches, streams, and rivers), and related entities such as point features (springs, wells, stream gages, and dams). The information encoded about these features includes classification and other characteristics, delineation, geographic name, position and related measures, a "reach code" through which other information can be related to the NHD, and the direction of water flow. The network of reach codes delineating water and transported material flow allows users to trace movement in upstream and downstream directions. In addition to this geographic information, the dataset contains metadata that supports the exchange of future updates and improvements to the data. The NHD supports many applications, such as making maps, geocoding observations, flow modeling, data maintenance, and stewardship. For additional information on NHD, go to https://www.usgs.gov/core-science-systems/ngp/national-hydrography.

    DWR was the steward for NHD and Watershed Boundary Dataset (WBD) in California. We worked with other organizations to edit and improve NHD and WBD, using the business rules for California. California's NHD improvements were sent to USGS for incorporation into the national database. The most up-to-date products are accessible from the USGS website. Please note that the California portion of the National Hydrography Dataset is appropriate for use at the 1:24,000 scale.

    For additional derivative products and resources, including the major features in geopackage format, please go to this page: https://data.cnra.ca.gov/dataset/nhd-major-features Archives of previous statewide extracts of the NHD going back to 2018 may be found at https://data.cnra.ca.gov/dataset/nhd-archive.

    In September 2022, USGS officially notified DWR that the NHD would become static as USGS resources will be devoted to the transition to the new 3D Hydrography Program (3DHP). 3DHP will consist of LiDAR-derived hydrography at a higher resolution than NHD. Upon completion, 3DHP data will be easier to maintain, based on a modern data model and architecture, and better meet the requirements of users that were documented in the Hydrography Requirements and Benefits Study (2016). The initial releases of 3DHP include NHD data cross-walked into the 3DHP data model. It will take several years for the 3DHP to be built out for California. Please refer to the resources on this page for more information.

    The FINAL,STATIC version of the National Hydrography Dataset for California was published for download by USGS on December 27, 2023. This dataset can no longer be edited by the state stewards. The next generation of national hydrography data is the USGS 3D Hydrography Program (3DHP).

    Questions about the California stewardship of these datasets may be directed to nhd_stewardship@water.ca.gov.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista Research Department (2025). Number of internet users worldwide 2014-2029 [Dataset]. https://www.statista.com/topics/1145/internet-usage-worldwide/
Organization logo

Number of internet users worldwide 2014-2029

Explore at:
306 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Apr 11, 2025
Dataset provided by
Statistahttp://statista.com/
Authors
Statista Research Department
Area covered
World
Description

The global number of internet users in was forecast to continuously increase between 2024 and 2029 by in total 1.3 billion users (+23.66 percent). After the fifteenth consecutive increasing year, the number of users is estimated to reach 7 billion users and therefore a new peak in 2029. Notably, the number of internet users of was continuously increasing over the past years.Depicted is the estimated number of individuals in the country or region at hand, that use the internet. As the datasource clarifies, connection quality and usage frequency are distinct aspects, not taken into account here.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of internet users in countries like the Americas and Asia.

Search
Clear search
Close search
Google apps
Main menu