1 dataset found
  1. P

    AQL-22 Dataset

    • paperswithcode.com
    Updated Apr 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Heinrich Reimer; Sebastian Schmidt; Maik Fröbe; Lukas Gienapp; Harrisen Scells; Benno Stein; Matthias Hagen; Martin Potthast (2023). AQL-22 Dataset [Dataset]. https://paperswithcode.com/dataset/aql-22
    Explore at:
    Dataset updated
    Apr 1, 2023
    Authors
    Jan Heinrich Reimer; Sebastian Schmidt; Maik Fröbe; Lukas Gienapp; Harrisen Scells; Benno Stein; Matthias Hagen; Martin Potthast
    Description

    The Archive Query Log (AQL) is a previously unused, comprehensive query log collected at the Internet Archive over the last 25 years. Its first version includes 356 million queries, 166 million search result pages, and 1.7 billion search results across 550 search providers. Although many query logs have been studied in the literature, the search providers that own them generally do not publish their logs to protect user privacy and vital business data. The AQL is the first publicly available query log that combines size, scope, and diversity, enabling research on new retrieval models and search engine analyses. Provided in a privacy-preserving manner, it promotes open research as well as more transparency and accountability in the search industry.

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jan Heinrich Reimer; Sebastian Schmidt; Maik Fröbe; Lukas Gienapp; Harrisen Scells; Benno Stein; Matthias Hagen; Martin Potthast (2023). AQL-22 Dataset [Dataset]. https://paperswithcode.com/dataset/aql-22

AQL-22 Dataset

Archive Query Log

Explore at:
157 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Apr 1, 2023
Authors
Jan Heinrich Reimer; Sebastian Schmidt; Maik Fröbe; Lukas Gienapp; Harrisen Scells; Benno Stein; Matthias Hagen; Martin Potthast
Description

The Archive Query Log (AQL) is a previously unused, comprehensive query log collected at the Internet Archive over the last 25 years. Its first version includes 356 million queries, 166 million search result pages, and 1.7 billion search results across 550 search providers. Although many query logs have been studied in the literature, the search providers that own them generally do not publish their logs to protect user privacy and vital business data. The AQL is the first publicly available query log that combines size, scope, and diversity, enabling research on new retrieval models and search engine analyses. Provided in a privacy-preserving manner, it promotes open research as well as more transparency and accountability in the search industry.

Search
Clear search
Close search
Google apps
Main menu