Saved datasets
Last updated
Download format
Croissant
Croissant is a format for Machine Learning datasets
Learn more about this at mlcommons.org/croissant.
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
7 datasets found
  1. Z

    Webis-Web-Errors-19

    • data.niaid.nih.gov
    • webis.de
    • +2more
    Updated Jul 24, 2024
  2. C

    Allegheny County COVID-19 Tests, Cases and Deaths (Archive)

    • data.wprdc.org
    csv, html
    Updated Jun 13, 2024
    + more versions
  3. d

    Johns Hopkins COVID-19 Case Tracker

    • data.world
    csv, zip
    Updated Mar 25, 2025
  4. High-Frequency Monitoring of COVID-19 Impacts on Households 2021-2022,...

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Jul 11, 2023
  5. w

    COVID-19 National Panel Phone Survey 2021 - Wave 4 - Refugee Sample -...

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Aug 28, 2023
  6. Protection Monitoring of Refugees in Response to COVID-19, 2020 - Iraq

    • microdata.unhcr.org
    • catalog.ihsn.org
    • +1more
    Updated Sep 24, 2022
    + more versions
  7. CDC COVID-19 Community Levels by County

    • opendata.ramseycounty.us
    application/rdfxml +5
    Updated Mar 27, 2025
    + more versions
  8. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Potthast, Martin (2024). Webis-Web-Errors-19 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2549837

Webis-Web-Errors-19

Explore at:
Dataset updated
Jul 24, 2024
Dataset provided by
Kiesel, Johannes
Stein, Benno
Potthast, Martin
Hubricht, Fabienne
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Webis-Web-Errors-19 comprises various annotations for the 10,000 web page archives of the Webis-Web-Archive-17. The annotations are whether the page is (1) mostly advertisement, (2) cut off, (3) still loading, (4) pornographic; and whether it shows (not/a bit/ very) (5) pop-ups, (6) CAPTCHAs, or (7) error messages. If you use this dataset in your research, please cite it using this paper.

Search
Clear search
Close search
Google apps
Main menu