100+ datasets found

Question words leading to PAA and FS results during online searches in the...
statista.com
Updated Jul 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Question words leading to PAA and FS results during online searches in the U.S. 2020 [Dataset]. https://www.statista.com/statistics/1181423/questions-paa-fs-search-engine-results-page/
Explore at:
Dataset updated
Jul 11, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Aug 2020
Area covered
United States
Description
As of August 2020, the keyword "can" triggered SERPs with featured snippet results that also contained a People Also Ask (PAA) box in approximately ** percent of searches. In ** percent of U.S. searches, the keyword "can" also triggered search engine result pages with PAA that also contained a featured snippet.
e
Statistics (ST), Question Paper, Graduate Aptitude Test in Engineering,...
paper.erudition.co.in
html
Updated Jul 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Einetic (2025). Statistics (ST), Question Paper, Graduate Aptitude Test in Engineering, Competitive Exams | Erudition Paper [Dataset]. https://paper.erudition.co.in/competitive-exams/gate/question-paper/statistics
Explore at:
htmlAvailable download formats
Dataset updated
Jul 14, 2025
Dataset authored and provided by
Einetic
License
https://paper.erudition.co.in/termshttps://paper.erudition.co.in/terms
Description
Question Paper Solutions of Statistics (ST),Question Paper,Graduate Aptitude Test in Engineering,Competitive Exams
QADO: An RDF Representation of Question Answering Datasets and their...
figshare.com
zip
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andreas Both; Oliver Schmidtke; Aleksandr Perevalov (2023). QADO: An RDF Representation of Question Answering Datasets and their Analyses for Improving Reproducibility [Dataset]. http://doi.org/10.6084/m9.figshare.21750029.v3
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.21750029.v3
Dataset updated
May 31, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Andreas Both; Oliver Schmidtke; Aleksandr Perevalov
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Measuring the quality of Question Answering (QA) systems is a crucial task to validate the results of novel approaches. However, there are already indicators of a reproducibility crisis as many published systems have used outdated datasets or use subsets of QA benchmarks, making it hard to compare results. We identified the following core problems: there is no standard data format, instead, proprietary data representations are used by the different partly inconsistent datasets; additionally, the characteristics of datasets are typically not reflected by the dataset maintainers nor by the system publishers. To overcome these problems, we established an ontology---Question Answering Dataset Ontology (QADO)---for representing the QA datasets in RDF. The following datasets were mapped into the ontology: the QALD series, LC-QuAD series, RuBQ series, ComplexWebQuestions, and Mintaka. Hence, the integrated data in QADO covers widely used datasets and multilinguality. Additionally, we did intensive analyses of the datasets to identify their characteristics to make it easier for researchers to identify specific research questions and to select well-defined subsets. The provided resource will enable the research community to improve the quality of their research and support the reproducibility of experiments.

Here, the mapping results of the QADO process, the SPARQL queries for data analytics, and the archived analytics results file are provided.

Up-to-date statistics can be created automatically by the script provided at the corresponding QADO GitHub RDFizer repository.
Share of questions answered by AI models in SimpleQA benchmark 2025
statista.com
Updated May 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Share of questions answered by AI models in SimpleQA benchmark 2025 [Dataset]. https://www.statista.com/statistics/1612496/ai-simpleqa-share-of-questions-answered/
Explore at:
Dataset updated
May 30, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2024
Area covered
Worldwide
Description
OpenAI's o1 had the highest share of questions answered when attempted in SimpleQA benchmark in 2025. Claude-3 had the highest share of simply not attempting questions, though whether this is due to lack of data or other reasons is unknown.
o
DOF Assembly Written Questions Performance Statistics - Dataset - Open Data...
admin.opendatani.gov.uk
Updated Jan 15, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). DOF Assembly Written Questions Performance Statistics - Dataset - Open Data NI [Dataset]. https://admin.opendatani.gov.uk/dataset/department-of-finance-performance-statistics-on-assembly-written-questions
Explore at:
Dataset updated
Jan 15, 2021
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
This dataset contains the Department of Finance Performance Statistics on Assembly Written Questions .
e
2021
paper.erudition.co.in
html
Updated Jul 14, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Einetic (2025). 2021 [Dataset]. https://paper.erudition.co.in/competitive-exams/gate/question-paper/statistics
Explore at:
htmlAvailable download formats
Dataset updated
Jul 14, 2025
Dataset authored and provided by
Einetic
License
https://paper.erudition.co.in/termshttps://paper.erudition.co.in/terms
Description
Question Paper Solutions of year 2021 of Statistics, Question Paper , Graduate Aptitude Test in Engineering
Most frequently asked questions on Google worldwide 2024
statista.com
Updated Jun 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Most frequently asked questions on Google worldwide 2024 [Dataset]. https://www.statista.com/statistics/267139/searched-questions-google/
Explore at:
Dataset updated
Jun 25, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Jan 2024 - Apr 2024
Area covered
Worldwide
Description
"What to watch" was the most frequently asked question on Google in 2023. This question generated an average of *** million online search queries per month. "What is my ip" and "do a barrel roll" followed as the most popular Google search questions worldwide with an average of *** million monthly searches each.
d
Data from: Reference Mysteries: The Quest for Answers
search.dataone.org
Updated Dec 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elizabeth Hamilton (2023). Reference Mysteries: The Quest for Answers [Dataset]. http://doi.org/10.5683/SP3/LH36YJ
Explore at:
Unique identifier
https://doi.org/10.5683/SP3/LH36YJ
Dataset updated
Dec 28, 2023
Dataset provided by
Borealis
Authors
Elizabeth Hamilton
Description
The solutions of mysteries can lead to salvation for those on the reference desk dealing with business students or difficult questions.
f
Data from: A nonparametric test for the two-sample problem based on order...
tandf.figshare.com
pdf
Updated Apr 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fazil Aliev; Levent Özbek; Mehmet Fedai Kaya; Coşkun Kuş; Hon Keung Tony Ng; Haikady N. Nagaraja (2024). A nonparametric test for the two-sample problem based on order statistics [Dataset]. http://doi.org/10.6084/m9.figshare.22009411.v2
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.22009411.v2
Dataset updated
Apr 5, 2024
Dataset provided by
Taylor & Francis
Authors
Fazil Aliev; Levent Özbek; Mehmet Fedai Kaya; Coşkun Kuş; Hon Keung Tony Ng; Haikady N. Nagaraja
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We study a nonparametric test procedure based on order statistics for testing the null hypothesis of equality of two continuous distributions. The exact null distribution of the proposed test statistic is obtained using an enumeration method and a novel combinatorial argument. A recurrence relation for the probability generating function and a sequential approach for computing the mean and variance of the distribution are given. Critical values and characteristics of the distribution for selected small sample sizes are presented. For the Lehmann alternative family, the exact power function of the new test is derived, and its power performance is examined. We also study the power performance of the proposed test under the location-shift and scale-shift alternatives using Monte Carlo simulations and observe its superior performance when compared to commonly used nonparametric tests under various scenarios. A generalization of the proposed procedure for unequal sample sizes is discussed. An illustrative example and some concluding remarks are provided.
o
DoJ Performance Statistics Assembly Written Questions - Dataset - Open Data...
admin.opendatani.gov.uk
Updated Mar 16, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). DoJ Performance Statistics Assembly Written Questions - Dataset - Open Data NI [Dataset]. https://admin.opendatani.gov.uk/dataset/doj-performance-statistics-assembly-written-questions
Explore at:
Dataset updated
Mar 16, 2021
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
This dataset contains the Department of Justice Performance Statistics on Assembly Written Questions
f
Questions assessing respondents' perceptions and behaviour relating to...
figshare.com
xls
Updated Jun 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kristina Blennow; Johannes Persson; Margarida Tomé; Marc Hanewinkel (2023). Questions assessing respondents' perceptions and behaviour relating to climate change, and socio-demographic variables; possible responses to the questions; and percentage responses of respondents (or other summary statistics, where noted) who answered yes and no to the question Have you adapted your forest management in response to climate change? (n = 828). [Dataset]. http://doi.org/10.1371/journal.pone.0050182.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0050182.t001
Dataset updated
Jun 8, 2023
Dataset provided by
PLOS ONE
Authors
Kristina Blennow; Johannes Persson; Margarida Tomé; Marc Hanewinkel
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
n = Numbers of responses. Test statistics for Wilcoxon rank sum test (W), Student's t-test (t), and χ2-test (χ2). Mean, median and ranges calculated from raw data before imputation.
T
CorStat - Reference Question Statistics
corstat.coronaca.gov
application/rdfxml +5
Updated Sep 5, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2019). CorStat - Reference Question Statistics [Dataset]. https://corstat.coronaca.gov/w/323w-i6sy/default?cur=tFOJ3wjCQdi&from=q0Rbuq9PTOJ
Explore at:
csv, tsv, xml, json, application/rdfxml, application/rssxmlAvailable download formats
Dataset updated
Sep 5, 2019
Description
Questions asked by library patrons and responded to by library staff. This assistance may be requested in person or remotely and from a variety of public desks. Data is provided by a monthly administration report created by the Library and Recreation Services management staff.
Number of questions in the PAA feature box in U.S. Google search results...
statista.com
Updated Jul 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Number of questions in the PAA feature box in U.S. Google search results 2020 [Dataset]. https://www.statista.com/statistics/1180969/number-questions-paa-google-search-results/
Explore at:
Dataset updated
Jul 10, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Aug 2020
Area covered
United States
Description
As of August 2020, the People Also Ask (PAA) feature box on Google's search results in the United States usually had * questions ***** percent of the time. In comparison, only **** percent of search results had * questions in the PAA feature box.
e
2019
paper.erudition.co.in
html
Updated Jul 14, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Einetic (2025). 2019 [Dataset]. https://paper.erudition.co.in/competitive-exams/gate/question-paper/statistics
Explore at:
htmlAvailable download formats
Dataset updated
Jul 14, 2025
Dataset authored and provided by
Einetic
License
https://paper.erudition.co.in/termshttps://paper.erudition.co.in/terms
Description
Question Paper Solutions of year 2019 of Statistics, Question Paper , Graduate Aptitude Test in Engineering
Data for stats.stackexchange question
zenodo.org
bin
Updated Mar 17, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Patrick Schratz; Patrick Schratz (2020). Data for stats.stackexchange question [Dataset]. http://doi.org/10.5281/zenodo.3713187
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3713187
Dataset updated
Mar 17, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Patrick Schratz; Patrick Schratz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Data for stats.stackexchange question: https://stats.stackexchange.com/q/454499/101464
h
dst-question-plus-que-vs-stat
huggingface.co
Updated Sep 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marcin Stankiewicz (2024). dst-question-plus-que-vs-stat [Dataset]. https://huggingface.co/datasets/jooni22/dst-question-plus-que-vs-stat
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 12, 2024
Authors
Marcin Stankiewicz
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
jooni22/dst-question-plus-que-vs-stat dataset hosted on Hugging Face and contributed by the HF Datasets community
March Madness Augmented Statistics
kaggle.com
Updated Apr 4, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Colin Siles (2021). March Madness Augmented Statistics [Dataset]. https://www.kaggle.com/colinsiles/march-madness-augmented-statistics
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 4, 2021
Dataset provided by
Kaggle
Authors
Colin Siles
Description
Context

A team's mean seasons statistics can be used as predictors for their performance in future games. However, these statistics gain additional meaning when placed in the context of their opponents' (and opponents' opponents') performance. This dataset provides this context for each team. Furthermore, predicting games based on post-season stats causes data leakage, which from experience can be significant in this context (15-20% loss in accuracy). Thus, this dataset provides each of these statistics prior to each game of the regular season, preventing any source of data leakage.

Content

All data is derived from the March Madness competition data. Each original column was renamed to "A" and "B" instead of "W" and "L," and the mirrored to represent both orderings of opponents. Each team's mean stats are computed (both their stats, and the mean "allowed" or "forced" statistics by their opponents). To compute the mean opponents' stats, we analyze the games played by each opponent (excluding games played against the team in question), and compute the mean statistics for those games. We then compute the mean of these mean statistics, weighted by the number of times the team in question played each opponent. The opponents' opponent's stats are computed as a weighted average of the opponents' average. This results in statistics similar to those used to compute strength of schedule or RPI, just that they go beyond win percentages (See: https://en.wikipedia.org/wiki/Rating_percentage_index)

The per game statistics are computed by pretending we don't have any of the data on or after the day in question.

Next Steps

Currently, the data isn't computed particularly efficiently. Computing the per game averages for every day of the season is necessary to compute fully accurate opponents' opponents' average, but takes about 90 minutes to obtain. It is probably possible to parallelize this, and the per-game averages involve a lot of repeated computation (basically computing the final averages over and over again for each day). Speeding this up will make it more convenient to make changes to the dataset.

I would like to transform these statistics to be per-possession, add shooting percentages, pace, and number of games played (to give an idea of the amount uncertainty that exists in the per-game averages). Some of these can be approximated with the given data (but the results won't be exact), while others will need to be computed from scratch.
d
Data from: Reference Mysteries
search.dataone.org
Updated Dec 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elizabeth Hamilton (2023). Reference Mysteries [Dataset]. http://doi.org/10.5683/SP3/2VLBGJ
Explore at:
Unique identifier
https://doi.org/10.5683/SP3/2VLBGJ
Dataset updated
Dec 28, 2023
Dataset provided by
Borealis
Authors
Elizabeth Hamilton
Description
The requests we receive at the Reference Desk keep surprising us. We'll take a look at some of the best examples from the year on data questions and data solutions.
Expected response time for social media questions or complaints in U.S. &...
statista.com
Updated Jul 6, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2022). Expected response time for social media questions or complaints in U.S. & global 2018 [Dataset]. https://www.statista.com/statistics/808477/expected-response-time-for-social-media-questions-or-complaints/
Explore at:
Dataset updated
Jul 6, 2022
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2018
Area covered
United States, Worldwide
Description
This survey shows the expected response time for social media questions or complaints in the United States and worldwide in 2018. During the survey, 31 percent of respondents from the United States, stated that they expect a response in 24 hours or less.
d
Data from: Reference Mysteries, Part One
search.dataone.org
Updated Dec 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Elizabeth Hamilton; Chuck Humphrey; Data Liberation Initiative (DLI) (2023). Reference Mysteries, Part One [Dataset]. http://doi.org/10.5683/SP3/JIHZ4U
Explore at:
Unique identifier
https://doi.org/10.5683/SP3/JIHZ4U
Dataset updated
Dec 28, 2023
Dataset provided by
Borealis
Authors
Elizabeth Hamilton; Chuck Humphrey; Data Liberation Initiative (DLI)
Description
The reference desk is the common challenge for all of us. We will be taking some great moments of the past year on the reference desk (yours included) and look at tools, referrals, and colleagial education that will make your life easier.

Facebook

Twitter

Click to copy link

Link copied

Cite

Statista (2025). Question words leading to PAA and FS results during online searches in the U.S. 2020 [Dataset]. https://www.statista.com/statistics/1181423/questions-paa-fs-search-engine-results-page/

Question words leading to PAA and FS results during online searches in the U.S. 2020

Explore at:

Dataset updated

Jul 11, 2025

Dataset authored and provided by

Statistahttp://statista.com/

Time period covered

Aug 2020

Area covered

United States

Description

As of August 2020, the keyword "can" triggered SERPs with featured snippet results that also contained a People Also Ask (PAA) box in approximately ** percent of searches. In ** percent of U.S. searches, the keyword "can" also triggered search engine result pages with PAA that also contained a featured snippet.

Clear search

Close search

Google apps

Main menu

Question words leading to PAA and FS results during online searches in the...

Statistics (ST), Question Paper, Graduate Aptitude Test in Engineering,...

QADO: An RDF Representation of Question Answering Datasets and their...

Share of questions answered by AI models in SimpleQA benchmark 2025

DOF Assembly Written Questions Performance Statistics - Dataset - Open Data...

2021

Most frequently asked questions on Google worldwide 2024

Data from: Reference Mysteries: The Quest for Answers

Data from: A nonparametric test for the two-sample problem based on order...

DoJ Performance Statistics Assembly Written Questions - Dataset - Open Data...

Questions assessing respondents' perceptions and behaviour relating to...

CorStat - Reference Question Statistics

Number of questions in the PAA feature box in U.S. Google search results...

2019

Data for stats.stackexchange question

dst-question-plus-que-vs-stat

March Madness Augmented Statistics

Context

Content

Next Steps

Data from: Reference Mysteries

Expected response time for social media questions or complaints in U.S. &...

Data from: Reference Mysteries, Part One

Question words leading to PAA and FS results during online searches in the U.S. 2020