1 dataset found
  1. Z

    Webis-Web-Errors-19

    • data.niaid.nih.gov
    • webis.de
    Updated Jul 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kiesel, Johannes; Hubricht, Fabienne; Stein, Benno; Potthast, Martin (2024). Webis-Web-Errors-19 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2549837
    Explore at:
    Dataset updated
    Jul 24, 2024
    Dataset provided by
    Leipzig University
    Bauhaus-Universität Weimar
    Authors
    Kiesel, Johannes; Hubricht, Fabienne; Stein, Benno; Potthast, Martin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Webis-Web-Errors-19 comprises various annotations for the 10,000 web page archives of the Webis-Web-Archive-17. The annotations are whether the page is (1) mostly advertisement, (2) cut off, (3) still loading, (4) pornographic; and whether it shows (not/a bit/ very) (5) pop-ups, (6) CAPTCHAs, or (7) error messages. If you use this dataset in your research, please cite it using this paper.

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Kiesel, Johannes; Hubricht, Fabienne; Stein, Benno; Potthast, Martin (2024). Webis-Web-Errors-19 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_2549837

Webis-Web-Errors-19

Explore at:
Dataset updated
Jul 24, 2024
Dataset provided by
Leipzig University
Bauhaus-Universität Weimar
Authors
Kiesel, Johannes; Hubricht, Fabienne; Stein, Benno; Potthast, Martin
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Webis-Web-Errors-19 comprises various annotations for the 10,000 web page archives of the Webis-Web-Archive-17. The annotations are whether the page is (1) mostly advertisement, (2) cut off, (3) still loading, (4) pornographic; and whether it shows (not/a bit/ very) (5) pop-ups, (6) CAPTCHAs, or (7) error messages. If you use this dataset in your research, please cite it using this paper.

Search
Clear search
Close search
Google apps
Main menu