In most cases, internet users across three generations spent their time online on similar website types. The top three was occupied by search engines (98 percent), social networking sites (90-93 percent), and mail services (84-85 percent). Bank websites and applications had the largest reach in the group of 60-69 - as many as 81 percent of this age group used internet banking.
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Showeblogin Facebook Page Like Box technology, compiled through global website indexing conducted by WebTechSurvey.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual total students amount from 2009 to 2010 for Calcasieu Alternative Site For Elementary Students
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual total classroom teachers amount from 2008 to 2010 for Calcasieu Alternative Site For Elementary Students
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Different Menus In Different Pages technology, compiled through global website indexing conducted by WebTechSurvey.
This API is providing the information of press releases issued by the authorized institutions and other similar press releases issued by the HKMA in the past regarding fraudulent bank websites, phishing E-mails and similar scams information.
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Same But Different technology, compiled through global website indexing conducted by WebTechSurvey.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Advancing Homepage2Vec with LLM-Generated Datasets for Multilingual Website Classification
This dataset contains two subsets of labeled website data, specifically created to enhance the performance of Homepage2Vec, a multi-label model for website classification. The datasets were generated using Large Language Models (LLMs) to provide more accurate and diverse topic annotations for websites, addressing a limitation of existing Homepage2Vec training data.
Key Features:
LLM-generated annotations: Both datasets feature website topic labels generated using LLMs, a novel approach to creating high-quality training data for website classification models.
Improved multi-label classification: Fine-tuning Homepage2Vec with these datasets has been shown to improve its macro F1 score from 38% to 43% evaluated on a human-labeled dataset, demonstrating their effectiveness in capturing a broader range of website topics.
Multilingual applicability: The datasets facilitate classification of websites in multiple languages, reflecting the inherent multilingual nature of Homepage2Vec.
Dataset Composition:
curlie-gpt3.5-10k: 10,000 websites labeled using GPT-3.5, context 2 and 1-shot
curlie-gpt4-10k: 10,000 websites labeled using GPT-4, context 2 and zero-shot
Intended Use:
Fine-tuning and advancing Homepage2Vec or similar website classification models
Research on LLM-generated datasets for text classification tasks
Exploration of multilingual website classification
Additional Information:
Project and report repository: https://github.com/CS-433/ml-project-2-mlp
Acknowledgments:
This dataset was created as part of a project at EPFL's Data Science Lab (DLab) in collaboration with Prof. Robert West and Tiziano Piccardi.
Digital media has made possible the growth of alternative and partisan news websites catering to certain political ideologies. Like many other countries, India saw a rise of such news websites over the past few years. When survey respondents were asked whether they were aware of any such websites, ** percent said they knew of The Logical Indian, whereas, Oneindia.com came second with ** percent of respondents aware of the website.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Host country of organization for 86 websites in study.
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Easy Related Posts technology, compiled through global website indexing conducted by WebTechSurvey.
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Advanced Related Posts technology, compiled through global website indexing conducted by WebTechSurvey.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual black student percentage from 2009 to 2010 for Calcasieu Alternative Site For Elementary Students vs. Louisiana and Calcasieu Parish School District
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Facebook Simple Like technology, compiled through global website indexing conducted by WebTechSurvey.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Here’s an uncomfortable truth about digital optimization: Anyone who guarantees they can increase your conversion rates with just on-site optimization alone is either lucky or lying. There are over 50 factors that influence conversion rates across 8 categories. Some are within our control, and some are outside. On-site factors are only half of the equation. […]
The statistic shows the running related websites most often visited by runners in the United States in 2016. According to the survey, ** percent of respondents regularly visited local club websites.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Explore our detailed website traffic dataset featuring key metrics like page views, session duration, bounce rate, traffic source, and conversion rates.
A. SUMMARY This dataset includes aggregate data on the type, status, population served, and individuals placed at each alternative housing site under contract with HSA. B. HOW THE DATASET IS CREATED Site Type, Status, and Population The HSA DOC leadership inform the data tracker owner when the legal status, site type, or intended population to serve changes. Daily Census and Units Available The site monitors at each site inform the data tracker owner at the HSA DOC at least once daily with the updates to the daily census. C. UPDATE PROCESS Updated several times daily, whenever new information is shared with the data tracker owner. The data tracker owner inputs the data directly into the underlying SharePoint spreadsheet. D. HOW TO USE THIS DATASET Use the data for aggregate data on the site type, status, and daily census of individuals placed in the sites. Do not use this spreadsheet for individual-level information. There is no personally identifying or medical information in this dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States - Total Revenue for Museums, Historical Sites, and Similar Institutions, All Establishments was -19.20000 % Chg. in January of 2025, according to the United States Federal Reserve. Historically, United States - Total Revenue for Museums, Historical Sites, and Similar Institutions, All Establishments reached a record high of 48.90000 in April of 2009 and a record low of -49.30000 in January of 2020. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Total Revenue for Museums, Historical Sites, and Similar Institutions, All Establishments - last updated from the United States Federal Reserve on July of 2025.
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the Related technology, compiled through global website indexing conducted by WebTechSurvey.
In most cases, internet users across three generations spent their time online on similar website types. The top three was occupied by search engines (98 percent), social networking sites (90-93 percent), and mail services (84-85 percent). Bank websites and applications had the largest reach in the group of 60-69 - as many as 81 percent of this age group used internet banking.