6 datasets found
  1. Paderborn Genre Analysis Corpus 2012 (PaGA-12)
    • explore.openaire.eu
    • zenodo.org
    Updated Jan 1, 2012
  2. Paderborn Genre Analysis Corpus 2012
    • webis.de
    Updated 2012
    + more versions
  3. EMP12 — Paga del empleador durante la maternidad
    • data.europa.eu
    csv, excel xlsx +2
    Updated Oct 3, 2023
  4. Denmark Social Security Rate For Companies
    • tradingeconomics.com
    • ko.tradingeconomics.com
    • +16 more
    csv, excel, json, xml
    Updated Dec 15, 2023
    + more versions
  5. France Payroll Employment in the Private Sector
    • it.tradingeconomics.com
    • ar.tradingeconomics.com
    • +16 more
    csv, excel, json, xml
    Updated Jan 7, 2024
  6. France Payroll Employment in Manufacturing
    • it.tradingeconomics.com
    • tradingeconomics.com
    • +16 more
    csv, excel, json, xml

Cite
Michael Baumann; Theodor Lettmann; Benno Stein (2012). Paderborn Genre Analysis Corpus 2012 (PaGA-12) [Dataset]. http://doi.org/10.5281/zenodo.3250069

Paderborn Genre Analysis Corpus 2012 (PaGA-12)

19 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jan 1, 2012
Authors
Michael Baumann; Theodor Lettmann; Benno Stein
Area covered
Paderborn
Description

The Paderborn Genre Analysis Corpus 2012 (PaGA-12) contains 1,639 HTML documents covering 26 genres. All documents were collected between 2009-10-18 and 2009-11-20, and each document is manually assigned to exactly one genre. The corpus provides at least 50 documents for each genre. All HTML documents contain German text only, and framesets have been removed. The corpus is delivered in the form of a MySQL database dump; the database structure is detailed in a README file delivered with the corpus.
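
The figures in the description can serve as a quick sanity check after importing the dump. The following is a minimal sketch, not part of the corpus distribution: the function name `check_corpus` and the input format (a list of `(doc_id, genre)` pairs) are assumptions for illustration, and how to extract such pairs depends on the schema documented in the README.

```python
from collections import Counter

# Figures stated in the corpus description.
EXPECTED_DOCS = 1639
EXPECTED_GENRES = 26
MIN_DOCS_PER_GENRE = 50


def check_corpus(labels):
    """Compare (doc_id, genre) pairs against the figures in the description.

    `labels` is a hypothetical list of (doc_id, genre) tuples; obtaining it
    from the MySQL dump depends on the schema described in the README.
    Returns a list of problem descriptions (empty if every check passes).
    """
    problems = []

    # Each document must be assigned to exactly one genre.
    doc_ids = [doc_id for doc_id, _ in labels]
    unique_docs = set(doc_ids)
    if len(unique_docs) != len(doc_ids):
        problems.append("at least one document has more than one genre label")

    if len(unique_docs) != EXPECTED_DOCS:
        problems.append(
            f"expected {EXPECTED_DOCS} documents, found {len(unique_docs)}"
        )

    per_genre = Counter(genre for _, genre in labels)
    if len(per_genre) != EXPECTED_GENRES:
        problems.append(
            f"expected {EXPECTED_GENRES} genres, found {len(per_genre)}"
        )

    # Every genre should have at least 50 documents.
    too_small = sorted(g for g, n in per_genre.items() if n < MIN_DOCS_PER_GENRE)
    if too_small:
        problems.append(
            f"genres with fewer than {MIN_DOCS_PER_GENRE} documents: {too_small}"
        )

    return problems
```

An empty result means the imported data matches the stated document count, genre count, per-genre minimum, and one-genre-per-document constraint.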
