2 datasets found
  1. Webis-Web-Archive-Quality-22

    • anthology.aicmu.ac.cn
    • webis.de
    6881334
    Updated 2022
    Cite
    Martin Potthast; Johannes Kiesel; Benno Stein (2022). Webis-Web-Archive-Quality-22 [Dataset]. http://doi.org/10.5281/zenodo.6881334
    Explore at:
    Available download formats: 6881334
    Dataset updated
    2022
    Dataset provided by
    Bauhaus-Universität Weimar
    Leipzig University
    The Web Technology & Information Systems Network
    Authors
    Martin Potthast; Johannes Kiesel; Benno Stein
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Webis-Web-Archive-Quality-22 dataset comprises 6,500 pairs of web page screenshots, one showing the page as it was archived and one showing it as reproduced from that archive, along with archive quality annotations and information about the DOM elements in each screenshot.

  2. Polish-English parallel corpus from the website of the National Digital...

    • live.european-language-grid.eu
    • catalog.elra.info
    • +1 more
    tmx
    Updated Nov 14, 2018
    Cite
    (2018). Polish-English parallel corpus from the website of the National Digital Archives (Processed) [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/3174
    Explore at:
    Available download formats: tmx
    Dataset updated
    Nov 14, 2018
    License

    https://elrc-share.eu/terms/openUnderPSI.html

    Description

    Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl)


