Context There's a story behind every dataset and here's your opportunity to share yours.
Content What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.
Acknowledgements We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.
Inspiration Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Online conversion rates of e-commerce sites were the highest in the beauty & skincare sector, at ***** percent in the first quarter of 2025. Food & beverage followed, with a *** percent conversion rate. For comparison, the average conversion rate of e-commerce sites across all selected sectors stood at *** percent. How does conversion vary by region and device? The conversion rate, which indicates the proportion of visits to e-commerce websites that result in purchases, varies by country and region. For instance, since at least 2023, e-commerce sites have consistently recorded higher conversion rates among shoppers in Great Britain compared to those in the United States and other global regions. Furthermore, despite the increasing prevalence of mobile shopping worldwide, conversions remain more pronounced on larger screens such as tablets and desktops. Online shopping cart abandonment on the rise Recently, the rate at which consumers abandon their online shopping carts has been gradually rising to more than ** percent in 2025, showing a higher difficulty for e-commerce sites to convert website traffic into purchases. In 2024, food and beverage was one of the product categories with the lowest online cart abandonment rate, confirming the sector’s relatively high conversion rate. In the United States, the primary reason why customers abandoned their shopping carts is that extra costs such as shipping, tax, and service fees were too high at checkout.
Abstract: The task for this dataset is to forecast the spatio-temporal traffic volume based on the historical traffic volume and other features in neighboring locations.
Data Set Characteristics | Number of Instances | Area | Attribute Characteristics | Number of Attributes | Date Donated | Associated Tasks | Missing Values |
---|---|---|---|---|---|---|---|
Multivariate | 2101 | Computer | Real | 47 | 2020-11-17 | Regression | N/A |
Source: Liang Zhao, liang.zhao '@' emory.edu, Emory University.
Data Set Information: The task for this dataset is to forecast the spatio-temporal traffic volume based on the historical traffic volume and other features in neighboring locations. Specifically, the traffic volume is measured every 15 minutes at 36 sensor locations along two major highways in Northern Virginia/Washington D.C. capital region. The 47 features include: 1) the historical sequence of traffic volume sensed during the 10 most recent sample points (10 features), 2) week day (7 features), 3) hour of day (24 features), 4) road direction (4 features), 5) number of lanes (1 feature), and 6) name of the road (1 feature). The goal is to predict the traffic volume 15 minutes into the future for all sensor locations. With a given road network, we know the spatial connectivity between sensor locations. For the detailed data information, please refer to the file README.docx.
Attribute Information: The 47 features include: (1) the historical sequence of traffic volume sensed during the 10 most recent sample points (10 features), (2) week day (7 features), (3) hour of day (24 features), (4) road direction (4 features), (5) number of lanes (1 feature), and (6) name of the road (1 feature).
Relevant Papers: Liang Zhao, Olga Gkountouna, and Dieter Pfoser. 2019. Spatial Auto-regressive Dependency Interpretable Learning Based on Spatial Topological Constraints. ACM Trans. Spatial Algorithms Syst. 5, 3, Article 19 (August 2019), 28 pages. DOI:[Web Link]
Citation Request: To use these datasets, please cite the papers:
Liang Zhao, Olga Gkountouna, and Dieter Pfoser. 2019. Spatial Auto-regressive Dependency Interpretable Learning Based on Spatial Topological Constraints. ACM Trans. Spatial Algorithms Syst. 5, 3, Article 19 (August 2019), 28 pages. DOI:[Web Link]
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Benchmark Agent Meta and Traffic Dataset in AI Agent Marketplace | AI Agent Directory | AI Agent Index from DeepNLP
This dataset is collected from AI Agent Marketplace Index and Directory at http://www.deepnlp.org, which contains AI Agents's meta information such as agent's name, website, description, as well as the monthly updated Web performance metrics, including Google,Bing average search ranking positions, Github Stars, Arxiv References, etc. The dataset is helpful for AI… See the full description on the dataset page: https://huggingface.co/datasets/DeepNLP/ai-agent-benchmark.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The objective of this task was to determine if Virtual Reality-based captured behavioral data on responses to notifications are similar to what is expected in real-world settings. For this purpose, a real-world bench mark experiment was designed to capture participant response times to wearable watch alarms triggered upon simulated traffic near the mobile work zone on the experiment site in an urban setting. The proposed scope of data collection of the real-world study included the external environmental factors (e.g., site accessibility, weather). The key parameters of research are defined as reaction time to received alarms and the heart rate measures. Table 1 provides the list of parameters that were controlled and measured during the experiments.
Table 1. Key parameters measured and tracked during real-world experiments
Variable name
Descriptions
Key parameters captured
Reaction time
The time that one takes from getting the haptic or sound alarm from a wearable alarm device, herein referring to the apple watch, to the point when the participant gives a response by stopping the alarm by pressing on the screen of the smartwatch
Inter-beat interval (IBI, heart rate)
The time interval between individual beats of the heart; the data is measured by using E4 application provided by Empatica
External factors tracked
Ambient noise
The level of ambient noise in the area is a factor potentially influencing participants’ reactions and is considered in the experiment design
Temperature
Daytime temperature recorded at each experiment
Number of pedestrians on site
Number of participants counted during the time of the experiment to record on the varying factors in the external environment in real-world settings
In the experiment, each participant was asked to participate in the experiment three times. In each trial, data was recorded separately for each alarm sent to smartwatch from the administrator at triggering events (precisely, every time the remote-controlled toy car reaches the line 30 ft apart from the designated work area). Each alarm signal at each trial was recorded for all 31 participants to the experiment. Timestamps are automatically recorded in server in the events recorded in the format of Table 2:
Table 2. Format of raw data stored in the server, starting in December 2022.
Timestamp
From
Event
0
2022-12-08 13:37:53.101391
VR
Received car approaching alert, mode=3, id=1000
1
2022-12-08 15:53:05.098288
Watch
Start Simulation
2
2022-12-08 15:53:07.437488
VR
Received car approaching alert, mode=4, id=1004
3
2022-12-08 15:53:13.064067
Watch
Stop Simulation
4
2022-12-08 15:53:13.163635
Watch
Stop Simulation
...
2417
2023-03-03 16:17:46.166644
Watch
1398
2418
2023-03-03 16:18:00.004425
Watch
1398
2419
2023-03-03 16:18.01.272071
Watch
1398
2420
2023-03-03 16:18:07.359187
Watch
Stop Simulation
2421
2023-03-03 16:18:07.388183
Watch
Stop Simulation
Some intervals used different timestamps as benchmarks to calibrate on the vehicle speed and user response time to the alarm signals, which include the following cases:
1) At the beginning of each trial, vehicle travels 70 ft from start point to the 30 ft apart point, when the first alarm is signaled; given this travel distance, the travel time of the first trip the toy vehicle makes is calculated by subtracting tn_alarm1_sent from tn_start.
2) Similarly, user response times to all alarms are recorded by subtracting the timestamps when the alarm is received by participant from when the alarm is sent from the server. (tn_alarmn_sent - tn_alarmn_received)
Ambient noise level data were collected using a noise meter, allowing to save noise level by seconds to multiple seconds (i.e., 5, 10, 30, 60 seconds). All noise data recorded were recorded in the interval of one second using the meter. The collected data was processed to match the certain timestamps collected for user response time data collected in the experiment to allow comparisons and correlation analysis to be performed later on, which include the following: 1) worker response; 2) sending of alarm signals; 3) start and stop of experiments. All data points were later modified using the rolling mean function of pandas python module to replace the missing data points by moving average method.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Web Bench: A real-world benchmark for Browser Agents
WebBench is an open, task-oriented benchmark that measures how well browser agents handle realistic web workflows. It contains 2 ,454 tasks spread across 452 live websites selected from the global top-1000 by traffic. Last updated: May 28, 2025
Dataset Composition
Category Description Example Count (% of dataset)
READ Tasks that require searching and extracting information “Navigate to the news section and… See the full description on the dataset page: https://huggingface.co/datasets/Halluminate/WebBench.
Among selected consumer electronics retailers worldwide, apple.com recorded the highest bounce rate in April 2024, at approximately 55.3 percent. Rival samsung.com had a slightly lower bounce rate of nearly 54 percent. Among selected consumer electronics e-tailers, huawei.com had the lowest bounce rate at 30.91 percent. Bounce rate is a marketing term used in web traffic analysis reflecting the percentage of visitors who enter the site and then leave without taking any further action like making a purchase or viewing other pages within the website ("bounce"). A sector with growth potential With one of the lowest online shopping cart abandonment rates globally in 2022, consumer electronics is a burgeoning e-commerce segment that places itself at the crossroads between technological progress and digital transformation. Boosted by the pandemic-induced surge in online shopping, the global market size of consumer electronics e-commerce was estimated at more than 340 billion U.S. dollars in 2021 and forecast to nearly double less than five years later. Amazon and Apple lead the charts in electronics e-commerce With more than 59 billion U.S. dollars in e-commerce net sales in the consumer electronics segment in 2022, apple.com was the uncontested industry leader. The global powerhouse surpassed e-commerce giants amazon.com and jd.com with more than ten billion U.S. dollars difference in online sales in the consumer electronics category.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Context There's a story behind every dataset and here's your opportunity to share yours.
Content What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.
Acknowledgements We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.
Inspiration Your data will be in front of the world's largest data science community. What questions do you want to see answered?