    Webis-WikiSciTech-23

    • webis.de
    7845809
    Updated 2023
    Cite
    Wolfgang Kircheis; Arno Simons; Benno Stein; Martin Potthast (2023). Webis-WikiSciTech-23 [Dataset]. http://doi.org/10.5281/zenodo.7845809
    Dataset updated
    2023
    Dataset provided by
    University of Kassel, hessian.AI, and ScaDS.AI
    Bauhaus-Universität Weimar
    The Web Technology & Information Systems Network
    stoneball
    Authors
    Wolfgang Kircheis; Arno Simons; Benno Stein; Martin Potthast
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Priority conflicts and the attribution of contributions to important scientific breakthroughs to individuals and groups play an important role in science, its governance, and evaluation. Debates and dynamics around these processes are analyzed by science studies. Our objective is to transform Wikipedia into an accessible, traceable primary source for analyzing such debates. We introduce Webis-WikiSciTech-23, a corpus consisting of science and technology Wikipedia articles, focusing on the identification of their history sections. We extract such articles from Wikipedia dumps through iterative filtering of the category network. The identification of passages covering the historical development of innovations is achieved by combining heuristics for section heading analysis and classifiers trained on a ground truth of articles with designated history sections.

