Saved datasets
Last updated
Download format
Croissant
Croissant is a format for Machine Learning datasets
Learn more about this at mlcommons.org/croissant.
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Provider
Free
Cost to access
Described as free to access or have a license that allows redistribution.
4 datasets found
  1. Webis-Web-Errors-19

    • zenodo.org
    • webis.de
    • +1more
    csv, png, txt
    Updated Jul 24, 2024
  2. C

    Allegheny County COVID-19 Tests, Cases and Deaths (Archive)

    • data.wprdc.org
    csv, html
    Updated Jun 13, 2024
  3. High-Frequency Monitoring of COVID-19 Impacts on Households 2021-2022,...

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated Jul 11, 2023
    + more versions
  4. n

    Coronavirus (Covid-19) Data in the United States

    • nytimes.com
    • openicpsr.org
    • +3more
  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Johannes Kiesel; Johannes Kiesel; Fabienne Hubricht; Benno Stein; Martin Potthast; Martin Potthast; Fabienne Hubricht; Benno Stein (2024). Webis-Web-Errors-19 [Dataset]. http://doi.org/10.5281/zenodo.2640364
Organization logo

Webis-Web-Errors-19

Explore at:
csv, png, txtAvailable download formats
Dataset updated
Jul 24, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Johannes Kiesel; Johannes Kiesel; Fabienne Hubricht; Benno Stein; Martin Potthast; Martin Potthast; Fabienne Hubricht; Benno Stein
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Webis-Web-Errors-19 comprises various annotations for the 10,000 web page archives of the Webis-Web-Archive-17. The annotations are whether the page is (1) mostly advertisement, (2) cut off, (3) still loading, (4) pornographic; and whether it shows (not/a bit/ very) (5) pop-ups, (6) CAPTCHAs, or (7) error messages. If you use this dataset in your research, please cite it using this paper.

Search
Clear search
Close search
Google apps
Main menu