Saved datasets
Last updated
Download format
Usage rights
License from data provider
Please review the applicable license to make sure your contemplated use is permitted.
Topic
Free
Cost to access
Described as free to access or have a license that allows redistribution.
3 datasets found
  1. o

    Paderborn Genre Analysis Corpus 2012 (PaGA-12)

    • explore.openaire.eu
    • zenodo.org
    Updated Jan 1, 2012
  2. Paderborn Genre Analysis Corpus 2012

    • webis.de
    Updated 2012
  3. México: salario mínimo nominal 2010-2021

    • es.statista.com
    Updated Dec 23, 2020
  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Michael Baumann; Theodor Lettmann; Benno Stein (2012). Paderborn Genre Analysis Corpus 2012 (PaGA-12) [Dataset]. http://doi.org/10.5281/zenodo.3250069

Paderborn Genre Analysis Corpus 2012 (PaGA-12)

20 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jan 1, 2012
Authors
Michael Baumann; Theodor Lettmann; Benno Stein
Description

The Paderborn Genre Analysis 2012 corpus (PaGA-12) contains 1,639 HTML documents of 26 genres. All documents were collected from 2009-10-18 to 2009-11-20, and each document is manually assigned to exactly one genre. For each genre, the corpus provides at least 50 documents. All HTML documents contain German text only, and framesets are removed. The corpus is delivered in form of a MySQL database dump; the database structure is detailed in a README file delivered with the corpus.

Search
Clear search
Close search
Google apps
Main menu