In most cases, internet users across three generations spent their time online on similar types of websites. The top three spots were occupied by search engines (98 percent), social networking sites (90-93 percent), and mail services (84-85 percent). Bank websites and applications had the largest reach in the 60-69 age group: as many as 81 percent of this age group used internet banking.
This API provides information on press releases issued by authorized institutions, together with similar press releases issued by the HKMA in the past, regarding fraudulent bank websites, phishing e-mails, and similar scams.
https://webtechsurvey.com/terms
A complete list of live websites using the Same But Different technology, compiled through global website indexing conducted by WebTechSurvey.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Preliminary research on social media platforms and their contribution to website traffic in LAMs. Through the SimilarWeb API, the leading social networks (Facebook, Twitter, YouTube, Instagram, Reddit, Pinterest, LinkedIn) that drove traffic to each of the 220 cases in our dataset were identified and analyzed in the first sheet. Aggregated results showed that Facebook was responsible for 46.1% of social traffic (second sheet).
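As a rough illustration of that aggregation step (not the original analysis code; the input file name and column layout are assumptions), the per-network share of total social traffic can be computed from a table of SimilarWeb referral figures along these lines:

import pandas as pd

# Hypothetical input: one row per LAM website, one column per social network,
# values are the social referral traffic SimilarWeb attributes to that network.
df = pd.read_csv("social_traffic_by_case.csv")  # assumed file name
networks = ["Facebook", "Twitter", "YouTube", "Instagram", "Reddit", "Pinterest", "LinkedIn"]

totals = df[networks].sum()                  # total social traffic attributed to each network
shares = 100 * totals / totals.sum()         # percentage of all social traffic per network
print(shares.sort_values(ascending=False))   # Facebook came out at 46.1% in the published aggregation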
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains two subsets of labeled website data, specifically created to enhance the performance of Homepage2Vec, a multi-label model for website classification. The datasets were generated using Large Language Models (LLMs) to provide more accurate and diverse topic annotations for websites, addressing a limitation of existing Homepage2Vec training data.
Key Features:
Dataset Composition:
Intended Use:
Additional Information:
Acknowledgments:
This dataset was created as part of a project at EPFL's Data Science Lab (DLab) in collaboration with Prof. Robert West and Tiziano Piccardi.
https://webtechsurvey.com/terms
A complete list of live websites using the Comments Like Dislike technology, compiled through global website indexing conducted by WebTechSurvey.
https://webtechsurvey.com/terms
A complete list of live websites using the Sn Facebook Like technology, compiled through global website indexing conducted by WebTechSurvey.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Code:
Packet_Features_Generator.py & Features.py
To run this code:
pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j
-h, --help   show this help message and exit
-i TXTFILE   input text file
-x X         Add first X number of total packets as features.
-y Y         Add first Y number of negative packets as features.
-z Z         Add first Z number of positive packets as features.
-ml          Output to text file all websites in the format of websiteNumber1,feature1,feature2,...
-s S         Generate samples using size s.
-j
Purpose:
Turns a text file containing lists of incoming and outgoing network packet sizes into separate website objects with associated features.
Uses Features.py to calculate the features.
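A minimal sketch of the feature extraction that the -x, -y, and -z options describe (an illustrative reimplementation under the data format documented below, not the repository code; names and example values are made up):

def extract_features(packets, x, y, z):
    """packets: signed packet sizes for one website visit
    (negative = incoming, positive = outgoing, as described under Data below)."""
    negatives = [p for p in packets if p < 0]
    positives = [p for p in packets if p > 0]
    features = []
    features += packets[:x]       # first x packets overall
    features += negatives[:y]     # first y negative (incoming) packets
    features += positives[:z]     # first z positive (outgoing) packets
    return features

# Example with made-up packet sizes for one capture
print(extract_features([-1500, 520, -1500, -60, 340], x=3, y=2, z=1))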
startMachineLearning.sh & machineLearning.py
To run this code:
bash startMachineLearning.sh
This code then runs machineLearning.py in a tmux session with the necessary file paths and flags.
Options (to be edited within this file):
--evaluate-only to test 5-fold cross-validation accuracy
--test-scaling-normalization to test 6 different combinations of scalers and normalizers
Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use
--grid-search to search for the best hyperparameters via grid search. Note: the candidate hyperparameters must be added to train_model under 'if not evaluateOnly:'; once the best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'.
Purpose:
Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest classifier on the provided data and reports results using cross-validation. These results include the best scaling and normalization options for each data set as well as the best grid-search hyperparameters within the provided ranges.
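A minimal scikit-learn sketch of the evaluation described above (5-fold cross-validation of a random forest, comparing scaler choices, plus a small grid search); the file name, column layout, and parameter ranges are assumptions, not the script's actual settings:

import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler, StandardScaler

# Assumed .ml layout: websiteNumber,feature1,feature2,... per line (see the -ml option above)
data = np.loadtxt("websites.ml", delimiter=",")   # assumed file name
X, y = data[:, 1:], data[:, 0]

# --evaluate-only / --test-scaling-normalization style comparison
for scaler in (StandardScaler(), MinMaxScaler()):
    pipe = make_pipeline(scaler, RandomForestClassifier(random_state=0))
    scores = cross_val_score(pipe, X, y, cv=5)
    print(type(scaler).__name__, scores.mean())

# --grid-search style hyperparameter search (candidate ranges are placeholders)
grid = GridSearchCV(RandomForestClassifier(random_state=0),
                    {"n_estimators": [100, 300], "max_depth": [None, 20]},
                    cv=5)
grid.fit(X, y)
print(grid.best_params_)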
Data
Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queries (collected in the form of their autocomplete results and their results pages), and different actions taken on a virtual reality headset.
Data for this experiment was stored and analyzed as one .txt file per experiment, each of which contains the following (a minimal parsing sketch follows this list):
The first number in each line is a classification number denoting which website, query, or VR action is taking place.
The remaining numbers in each line denote:
the size of a packet,
and the direction it is traveling:
negative numbers denote incoming packets,
positive numbers denote outgoing packets.
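A minimal sketch of parsing one such line (whitespace-separated values are an assumption about the stored format; the example line is made up):

def parse_line(line):
    """Parse one capture line: a class label followed by signed packet sizes."""
    values = [int(v) for v in line.split()]
    label, packets = values[0], values[1:]
    incoming = [abs(p) for p in packets if p < 0]   # negative = incoming
    outgoing = [p for p in packets if p > 0]        # positive = outgoing
    return label, incoming, outgoing

label, incoming, outgoing = parse_line("3 -1500 520 -1500 -60 340")
print(label, incoming, outgoing)   # 3 [1500, 1500, 60] [520, 340]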
Figure 4 Data
This data uses specific lines from the Virtual Reality.txt file.
The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.
The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.
The .xlsx and .csv files are identical.
Each file includes (from right to left):
the original packet data,
each line of data sorted from smallest to largest packet size, used to calculate the mean and standard deviation of each packet capture,
and the final Cumulative Distribution Function (CDF) calculation that generated the Figure 4 graph.
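A minimal sketch of those spreadsheet steps (illustrative only, with made-up packet sizes; whether Figure 4 uses the empirical CDF or a normal CDF parameterized by the mean and standard deviation is not stated, so the empirical version is shown):

import numpy as np

sizes = np.array([60, 340, 520, 1500, 1500])   # example packet sizes from one capture
sizes_sorted = np.sort(sizes)                  # smallest to largest, as in the spreadsheet
mean, std = sizes_sorted.mean(), sizes_sorted.std()

# Empirical CDF: fraction of packets at or below each size
cdf = np.arange(1, len(sizes_sorted) + 1) / len(sizes_sorted)
for size, prob in zip(sizes_sorted, cdf):
    print(size, round(prob, 2))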
https://www.datainsightsmarket.com/privacy-policy
The website analytics market, encompassing solutions like product, traffic, and sales analytics, is a dynamic and rapidly growing sector. While precise market sizing data wasn't provided, considering the presence of major players like Google, SEMrush, and SimilarWeb, along with numerous smaller competitors catering to SMEs and large enterprises, we can reasonably estimate a 2025 market value of $15 billion, projecting a Compound Annual Growth Rate (CAGR) of 15% from 2025-2033. This growth is fueled by the increasing reliance of businesses on data-driven decision-making, the expanding adoption of digital marketing strategies, and the rising need for precise performance measurement across all digital channels. Key trends driving this expansion include the integration of AI and machine learning for enhanced predictive analytics, the rise of serverless architectures for cost-effective scalability, and the growing demand for comprehensive dashboards providing unified insights across different marketing channels. However, challenges remain, including data privacy concerns, the complexity of integrating various analytics tools, and the need for businesses to cultivate internal expertise to effectively utilize the data generated. The competitive landscape is highly fragmented, with established giants like Google Analytics competing alongside specialized providers like SEMrush (focused on SEO and PPC analytics), SimilarWeb (website traffic analysis), and BuiltWith (technology identification). Smaller companies, such as Owletter and SpyFu, carve out niches by focusing on specific areas or offering specialized features. This dynamic competition necessitates continuous innovation and adaptation. Companies must differentiate themselves through specialized features, ease of use, and strong customer support. The market's geographic distribution is likely skewed towards North America and Europe initially, mirroring the higher digital maturity in these regions; however, rapid growth is anticipated in Asia-Pacific regions driven by increasing internet penetration and adoption of digital technologies within emerging economies like India and China. Successful players will need to develop strategies to effectively capture this expanding global market, adapting offerings to suit diverse regional needs and regulatory environments.
Among selected consumer electronics retailers worldwide, apple.com recorded the highest bounce rate in April 2024, at approximately 55.3 percent. Rival samsung.com had a slightly lower bounce rate of nearly 54 percent. Among selected consumer electronics e-tailers, huawei.com had the lowest bounce rate, at 30.91 percent. Bounce rate is a marketing term used in web traffic analysis reflecting the percentage of visitors who enter the site and then leave ("bounce") without taking any further action, such as making a purchase or viewing other pages within the website.
A sector with growth potential
With one of the lowest online shopping cart abandonment rates globally in 2022, consumer electronics is a burgeoning e-commerce segment that places itself at the crossroads between technological progress and digital transformation. Boosted by the pandemic-induced surge in online shopping, the global market size of consumer electronics e-commerce was estimated at more than 340 billion U.S. dollars in 2021 and forecast to nearly double less than five years later.
Amazon and Apple lead the charts in electronics e-commerce
With more than 59 billion U.S. dollars in e-commerce net sales in the consumer electronics segment in 2022, apple.com was the uncontested industry leader. The global powerhouse surpassed e-commerce giants amazon.com and jd.com with a difference of more than ten billion U.S. dollars in online sales in the consumer electronics category.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
2022
https://creativecommons.org/publicdomain/zero/1.0/
There are lots of datasets available for different machine learning tasks like NLP, computer vision, etc. However, I couldn't find any dataset which catered to the domain of software testing. This is one area which has lots of potential for the application of machine learning techniques, especially deep learning.
This was the reason I wanted such a dataset to exist. So, I made one.
New version [28th Nov '20]: uploaded testing-related questions and related details from Stack Overflow. These are query results collected from Stack Overflow using its query viewer. The result set of this query contained posts which had the words "testing web pages".
New version [27th Nov '20]: created a CSV file containing pairs of test case titles and test case descriptions.
This dataset is very tiny (approximately 200 rows of data). I have collected sample test cases from around the web and created a text file which contains all the test cases that I have collected. This text file has sections and under each section there are numbered rows of test cases.
I would like to thank websites like guru99.com, softwaretestinghelp.com, and many other such websites which host a great many sample test cases. These were the source of the test cases in this dataset.
My inspiration to create this dataset was the scarcity of examples showcasing the application of machine learning to the domain of software testing. I would like to see if this dataset can be used to answer questions similar to the following (a minimal similarity sketch follows this list):
* Finding semantic similarity between different test cases ranging across products and applications.
* Automating the elimination of duplicate test cases in a test case repository.
* Can a recommendation system be built for suggesting domain-specific test cases to software testers?
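As one hedged illustration of the first question (not part of the dataset; a minimal sketch using TF-IDF and cosine similarity rather than any particular deep-learning model, with made-up test case titles):

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Made-up test case titles standing in for rows of the dataset
test_cases = [
    "Verify that the login page rejects an invalid password",
    "Check that logging in with a wrong password shows an error",
    "Verify that the shopping cart total updates when an item is removed",
]
vectors = TfidfVectorizer().fit_transform(test_cases)
print(cosine_similarity(vectors))   # pairwise similarity; near-duplicates score highest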
https://www.archivemarketresearch.com/privacy-policy
The Enterprise Website Construction market is experiencing robust growth, driven by the increasing digitalization of businesses and the escalating demand for sophisticated online presences. While the exact market size for 2025 is not provided, considering typical growth rates in the tech sector and assuming a moderately conservative estimate based on similar markets, let's posit a 2025 market size of $15 billion USD. This represents a significant opportunity for companies involved in designing, developing, and maintaining enterprise-grade websites. A Compound Annual Growth Rate (CAGR) of 12% is projected for the period 2025-2033, indicating a continued upward trajectory fueled by technological advancements like AI-powered website builders, enhanced security features, and increasing adoption of headless CMS architectures. This growth is expected across all segments, including large enterprises needing complex solutions and smaller businesses seeking scalable platforms. Factors restraining growth include the high initial investment costs for some enterprise solutions, a shortage of skilled developers in certain regions, and ongoing concerns about website security vulnerabilities. However, innovative solutions and increasing awareness of the crucial role of a robust online presence are expected to mitigate these challenges. The market is segmented by various factors, including website functionalities (e-commerce, content management, customer relationship management), deployment models (cloud-based, on-premise), and industry verticals (finance, healthcare, education). Key players like Global Data Solutions Limited, Equinix Inc., Digital Realty Trust, Inc., and others are aggressively competing to capture market share by offering tailored solutions and expanding their service portfolios. Geographical expansion, particularly in emerging markets with high internet penetration growth, presents further opportunities. The forecast period (2025-2033) signifies a window of significant growth potential for the Enterprise Website Construction market, attracting significant investment and innovation. The market is expected to witness consolidation among players and the emergence of niche players offering specialized services.
https://www.datainsightsmarket.com/privacy-policy
The competitive landscape of the website analytics market, encompassing players like Google, BuiltWith, SEMrush, and others, is dynamic and characterized by significant growth. The market's size in 2025 is estimated at $15 billion, reflecting a Compound Annual Growth Rate (CAGR) of 15% from 2019. This robust growth is driven by increasing reliance on data-driven decision-making across businesses, expanding digital marketing strategies, and the rise of e-commerce. Key trends include the integration of AI and machine learning for more sophisticated analysis, the increasing demand for real-time data, and a growing focus on personalized user experiences. While the market faces constraints such as data privacy concerns and the complexity of integrating diverse data sources, the overall outlook remains highly positive. The market is segmented by solution type (website analytics, social media analytics, app analytics), deployment mode (cloud, on-premise), and enterprise size (small, medium, large). Companies are focusing on developing advanced analytical capabilities, strengthening partnerships, and expanding their global reach to maintain their competitive edge. The competitive analysis reveals a clear dominance by established players such as Google Analytics, leveraging its massive user base and comprehensive feature set. However, specialized tools like SEMrush and Ahrefs cater to niche needs like SEO analysis and backlink profiling. Smaller players often differentiate themselves through specialized features, superior customer support, or cost-effectiveness, carving out space within the market. Future market share will largely depend on the ability of companies to innovate, adapt to changing privacy regulations, and successfully integrate cutting-edge technologies like AI and machine learning into their offerings. The competition is expected to intensify further with the emergence of new players and the constant evolution of analytical techniques. Strategic mergers and acquisitions are also likely to reshape the market structure in the coming years.
The WebUI dataset contains 400K web UIs captured over a period of 3 months, at a crawling cost of about $500. We grouped web pages together by their domain name, then generated training (70%), validation (10%), and testing (20%) splits. This ensured that similar pages from the same website appear in the same split. We created four versions of the training dataset. Three of these were generated by randomly sampling a subset of the training split: Web-7k, Web-70k, and Web-350k. We chose 70k as a baseline size, since it is approximately the size of existing UI datasets. We also generated an additional split (Web-7k-Resampled) to provide a small, higher-quality split for experimentation. Web-7k-Resampled was generated using a class-balancing sampling technique, and we removed screens with possible visual defects (e.g., very small, occluded, or invisible elements). The validation and test splits were always kept the same.
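A minimal sketch of a domain-grouped 70/10/20 split of the kind described (illustrative only, not the WebUI pipeline; it uses scikit-learn's GroupShuffleSplit, and the proportions apply to domains rather than pages, which only approximates the page-level ratios):

from sklearn.model_selection import GroupShuffleSplit

def split_by_domain(pages, domains, seed=0):
    """Split page indices 70/10/20 so that all pages from one domain share a split."""
    first = GroupShuffleSplit(n_splits=1, test_size=0.30, random_state=seed)
    train_idx, rest_idx = next(first.split(pages, groups=domains))
    rest_domains = [domains[i] for i in rest_idx]
    # Split the remaining 30% of domains into validation (1/3 of it, ~10%) and test (2/3, ~20%)
    second = GroupShuffleSplit(n_splits=1, test_size=2 / 3, random_state=seed)
    val_rel, test_rel = next(second.split(rest_idx, groups=rest_domains))
    return train_idx, rest_idx[val_rel], rest_idx[test_rel]

# Example with made-up URLs: 10 domains, 3 pages each
pages = [f"site{i}.com/page{j}" for i in range(10) for j in range(3)]
domains = [p.split("/")[0] for p in pages]
train, val, test = split_by_domain(pages, domains)
print(len(train), len(val), len(test))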
https://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Total Revenue for Museums, Historical Sites, and Similar Institutions, All Establishments (REV712ALLEST144QNSA) from Q1 2009 to Q1 2025 about museums, revenue, establishments, and USA.
The share of individuals watching paid content on websites like Netflix and HBO in Norway generally increased from 2009 to 2020. In 2009, the share amounted to three percent of respondents, whereas in 2020 it reached ** percent.
As of February 2025, English was the most popular language for web content, with over 49.4 percent of websites using it. Spanish ranked second, with six percent of web content, while content in the German language followed, with 5.6 percent.
English as the leading online language
The United States and India, the countries with the most internet users after China, are also the world's biggest English-speaking markets. The internet user base in both countries combined, as of January 2023, was over a billion individuals. This has led to most online information being created in English. Consequently, even those who are not native speakers may use it for convenience.
Global internet usage by regions
As of October 2024, the number of internet users worldwide was 5.52 billion. In the same period, Northern Europe and North America were leading in terms of internet penetration rates worldwide, with around 97 percent of their populations accessing the internet.
https://webtechsurvey.com/terms
A complete list of live websites using the Facebook Simple Like technology, compiled through global website indexing conducted by WebTechSurvey.
In 2023, most global website traffic was still generated by humans, but bot traffic is constantly growing. Fraudulent traffic from bad bot actors accounted for 32 percent of global web traffic in the most recently measured period, an increase of 1.8 percent from the previous year.
Sophistication of bad bots on the rise
The complexity of malicious bot activity has dramatically increased in recent years. Advanced bad bots have doubled in prevalence over the past two years, indicating a surge in the sophistication of cyber threats. Simultaneously, simple bad bots saw a 6 percent increase compared to the previous year, suggesting a shift in the landscape of automated threats. Meanwhile, areas like entertainment and law & government face the highest share of advanced bad bots, with more than 78 percent of their bot traffic affected by evasive applications.
Good and bad bots across industries
The impact of bot traffic varies across different sectors. Bad bots accounted for over 57.2 percent of the gaming segment's web traffic, while almost half of the online traffic for telecoms and ISPs came from malicious applications. However, not all bot traffic is considered bad. Some of these applications help index websites for search engines or monitor website performance, assisting users throughout their online searches. Areas like entertainment, food and groceries, and financial services therefore experienced notable levels of good bot traffic, demonstrating the diverse applications of benign automated systems across different sectors.