4 datasets found
  1. o

    Kaggle Wikipedia Web Traffic Daily Dataset (without Missing Values)

    • explore.openaire.eu
    • data.niaid.nih.gov
    • +1more
    Updated Jun 13, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rakshitha Godahewa; Christoph Bergmeir; Geoff Webb (2020). Kaggle Wikipedia Web Traffic Daily Dataset (without Missing Values) [Dataset]. http://doi.org/10.5281/zenodo.3898473
    Explore at:
    Dataset updated
    Jun 13, 2020
    Authors
    Rakshitha Godahewa; Christoph Bergmeir; Geoff Webb
    Description

    This dataset was used in the Kaggle Wikipedia Web Traffic forecasting competition. It contains 145063 daily time series representing the number of hits or web traffic for a set of Wikipedia pages from 2015-07-01 to 2017-09-10. The original dataset contains missing values. They have been simply replaced by zeros. {"references": ["Google, 2017. Web traffic time series forecasting. URL https://www.kaggle.com/c/web-traffic-time-series-forecasting"]}

  2. Z

    Extended Wikipedia Web Traffic Daily Dataset (with Missing Values)

    • data.niaid.nih.gov
    Updated Nov 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Webb, Geoff (2022). Extended Wikipedia Web Traffic Daily Dataset (with Missing Values) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7370976
    Explore at:
    Dataset updated
    Nov 28, 2022
    Dataset provided by
    Montero-Manso, Pablo
    Webb, Geoff
    Godahewa, Rakshitha
    Hyndman, Rob
    Bergmeir, Christoph
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Description

    This dataset contains 145063 time series representing the number of hits or web traffic for a set of Wikipedia pages from 2015-07-01 to 2022-06-30. This is an extended version of the dataset that was used in the Kaggle Wikipedia Web Traffic forecasting competition. For consistency, the same Wikipedia pages that were used in the competition have been used in this dataset as well. The colons (:) in article names have been replaced by dashes (-) to make the .tsf file readable using our data loaders.

    The data were downloaded from the Wikimedia REST API. According to the conditions of the API, this dataset is licensed under CC-BY-SA 3.0 and GFDL licenses.

  3. Z

    Extended Wikipedia Web Traffic Daily Dataset (without Missing Values)

    • data.niaid.nih.gov
    Updated Nov 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bergmeir, Christoph (2022). Extended Wikipedia Web Traffic Daily Dataset (without Missing Values) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7371037
    Explore at:
    Dataset updated
    Nov 28, 2022
    Dataset provided by
    Montero-Manso, Pablo
    Webb, Geoff
    Godahewa, Rakshitha
    Hyndman, Rob
    Bergmeir, Christoph
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Description

    This dataset contains 145063 time series representing the number of hits or web traffic for a set of Wikipedia pages from 2015-07-01 to 2022-06-30. This is an extended version of the dataset that was used in the Kaggle Wikipedia Web Traffic forecasting competition. For consistency, the same Wikipedia pages that were used in the competition have been used in this dataset as well. The colons (:) in article names have been replaced by dashes (-) to make the .tsf file readable using our data loaders.

    The original dataset contains missing values. They have been simply replaced by zeros.

    The data were downloaded from the Wikimedia REST API. According to the conditions of the API, this dataset is licensed under CC-BY-SA 3.0 and GFDL licenses.

  4. Kaggle Wikipedia Web Traffic Daily Dataset (with Missing Values)

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Apr 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rakshitha Godahewa; Rakshitha Godahewa; Christoph Bergmeir; Christoph Bergmeir; Geoff Webb; Geoff Webb; Rob Hyndman; Rob Hyndman; Pablo Montero-Manso; Pablo Montero-Manso (2021). Kaggle Wikipedia Web Traffic Daily Dataset (with Missing Values) [Dataset]. http://doi.org/10.5281/zenodo.4656080
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 1, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Rakshitha Godahewa; Rakshitha Godahewa; Christoph Bergmeir; Christoph Bergmeir; Geoff Webb; Geoff Webb; Rob Hyndman; Rob Hyndman; Pablo Montero-Manso; Pablo Montero-Manso
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset was used in the Kaggle Wikipedia Web Traffic forecasting competition. It contains 145063 daily time series representing the number of hits or web traffic for a set of Wikipedia pages from 2015-07-01 to 2017-09-10.

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rakshitha Godahewa; Christoph Bergmeir; Geoff Webb (2020). Kaggle Wikipedia Web Traffic Daily Dataset (without Missing Values) [Dataset]. http://doi.org/10.5281/zenodo.3898473

Kaggle Wikipedia Web Traffic Daily Dataset (without Missing Values)

Explore at:
24 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 13, 2020
Authors
Rakshitha Godahewa; Christoph Bergmeir; Geoff Webb
Description

This dataset was used in the Kaggle Wikipedia Web Traffic forecasting competition. It contains 145063 daily time series representing the number of hits or web traffic for a set of Wikipedia pages from 2015-07-01 to 2017-09-10. The original dataset contains missing values. They have been simply replaced by zeros. {"references": ["Google, 2017. Web traffic time series forecasting. URL https://www.kaggle.com/c/web-traffic-time-series-forecasting"]}

Search
Clear search
Close search
Google apps
Main menu