Open Database License (ODbL) v1.0 — https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
This dataset contains all buildings in Germany with their footprint polygon and height. It is a partial dump of the ETHOS.BUILDA database (version v7_20240429). ETHOS.BUILDA is a database containing building-level data for the German building stock. It is based on various data sources that are combined and enriched with machine learning approaches to generate one consistent and complete building dataset.
ETHOS.BUILDA is made available under the Open Database License (ODbL). The licenses of the contents of the database depend on the data source. The sources of the building attributes and information on the type of processing that was done to assign the information from the raw data to the building in ETHOS.BUILDA are provided for each individual data point.
Building data is provided per federal state, the files are named according to the NUTS-1 region names. The building data has the following fields:
field name | description
ID | unique identifier of the building
source | the source of the building footprint
footprint | footprint polygon in WKT format, EPSG:3035
height_m | value: height of the building in [m]; source: source of the height data; lineage: height assignment method
A mapping of the abbreviations of "source" and "lineage" of individual data points to the descriptions is provided in sources.csv and lineages.csv. There is no source entry for the source "v7_model.json" in the sources.csv file, as this refers to the internally trained machine learning model and not to an external dataset.
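As a sketch of how the fields above could be consumed, the snippet below parses a hypothetical building record: the footprint WKT is read with a regular expression and its area computed with the shoelace formula (valid because EPSG:3035 is a metric projection), and the nested height_m entry is decoded. The row values and the JSON encoding of height_m are illustrative assumptions, not the actual file layout.

```python
import json
import re

# Hypothetical example row; the actual per-state files may differ in layout.
row = {
    "ID": "DEBW_0001",
    "source": "osm",
    "footprint": "POLYGON((4210000 2750000, 4210010 2750000, "
                 "4210010 2750012, 4210000 2750012, 4210000 2750000))",
    "height_m": '{"value": 7.5, "source": "lidar", "lineage": "footprint_intersection"}',
}

def parse_footprint(wkt: str) -> list[tuple[float, float]]:
    """Extract the outer-ring coordinates of a WKT POLYGON (EPSG:3035, metres)."""
    coords = re.findall(r"(-?\d+(?:\.\d+)?)\s+(-?\d+(?:\.\d+)?)", wkt)
    return [(float(x), float(y)) for x, y in coords]

def shoelace_area(ring: list[tuple[float, float]]) -> float:
    """Planar polygon area via the shoelace formula over consecutive ring edges."""
    area = 0.0
    for (x1, y1), (x2, y2) in zip(ring, ring[1:]):
        area += x1 * y2 - x2 * y1
    return abs(area) / 2.0

height = json.loads(row["height_m"])
ring = parse_footprint(row["footprint"])
print(shoelace_area(ring), height["value"])  # footprint area in m², height in m
```

For production use, a WKT-aware library such as Shapely would replace the regex parsing; the point here is only the field semantics.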
This work was supported by the Helmholtz Association under the program "Energy System Design".
Furthermore, the authors would like to express their gratitude to the Federal Ministry for Economic Affairs and Climate Action (BMWK.IIB4) for providing the necessary resources to conduct this study. Our research was supported by the WAAGE Grant Program (Grant No. 03EI1044/03EE 5031D), and we appreciate their financial assistance.
Attribution 4.0 (CC BY 4.0) — https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset tabulates the New Germany median household income by race. The dataset can be utilized to understand the racial distribution of New Germany income.
When applicable, the dataset includes the following subsets.
Please note: The 2020 1-Year ACS estimates data was not reported by the Census Bureau due to the impact on survey collection and analysis caused by COVID-19. Consequently, median household income data for 2020 is unavailable for large cities (population 65,000 and above).
Good to know
Margin of Error
Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.
Custom data
If you need custom data for your research project, report, or presentation, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.
The Neilsberg Research Team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.
Explore our comprehensive data analysis and visual representations for a deeper understanding of New Germany median household income by race. You can refer to it here.
In this article, we introduce a unique dataset containing all written communication published by the German Bundestag between 1949 and 2017. Increasing numbers of scholars make use of protocols of parliamentary speeches, parliamentary questions, or the texts of legislative drafts in various fields of comparative politics including representation, responsiveness, professionalization and political careers, or parliamentary agenda studies. Since preparing parliamentary documents is rather resource intense, these studies remain limited to single points in time, types of documents and/or policy areas. The long time horizon and various types of documents covered by our new comprehensive dataset will enable scholars interested in parliaments, parties and representatives to answer various innovative research questions related to legislative studies.
505 Economics is on a mission to make academic economics accessible. We've developed the first monthly sub-national GDP data for EU and UK regions from January 2015 onwards.
Our GDP dataset uses luminosity as a proxy for GDP. The brighter a place, the more economic activity that place tends to have.
We produce the data using high-resolution night time satellite imagery and Artificial Intelligence.
This builds on our academic research at the London School of Economics, and we're producing the dataset in collaboration with the European Space Agency BIC UK.
We have published peer-reviewed academic articles on the usage of luminosity as an accurate proxy for GDP.
Key features:
The dataset can be used by:
We have created this dataset for all UK sub-national regions, 28 EU Countries and Switzerland.
Attribution 4.0 (CC BY 4.0) — https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In order to raise the bar for non-English QA, we are releasing a high-quality, human-labeled German QA dataset consisting of 13,722 questions, including a three-way annotated test set. The creation of GermanQuAD is inspired by insights from existing datasets as well as our labeling experience from several industry projects. We combine the strengths of SQuAD, such as high out-of-domain performance, with self-sufficient questions that contain all relevant information for open-domain QA, as in the NaturalQuestions dataset. Unlike other popular datasets, our training and test datasets do not overlap, and they include complex questions that cannot be answered with a single entity or only a few words.
https://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the German Language In-car Speech Dataset, a comprehensive collection of audio recordings designed to facilitate the development of speech recognition models specifically tailored for in-car environments. This dataset aims to support research and innovation in automotive speech technology, enabling seamless and robust voice interactions within vehicles for drivers and co-passengers.
This dataset comprises over 5,000 high-quality audio recordings collected from various in-car environments. These recordings include scripted wake words and command-type prompts.
Participant Diversity:
- Speakers: 50+ native German speakers from the FutureBeeAI Community.
- Regions: Ensures a balanced representation of German accents, dialects, and demographics.
- Participant Profile: Participants range from 18 to 70 years old, with a 60:40 male-to-female ratio.
Recording Nature: Scripted wake-word and command-style audio recordings.
- Duration: Average duration of 5 to 20 seconds per audio recording.
- Format: WAV, mono channel, 16-bit depth. Recordings are provided at sample rates of 16 kHz and 48 kHz.
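Since the format properties above (mono, 16-bit, 16/48 kHz WAV) are exactly what a training pipeline should verify before ingesting audio, here is a minimal sketch using Python's standard `wave` module. The in-memory file is a synthetic stand-in for a dataset recording, not actual dataset content.

```python
import io
import wave

def wav_params(data: bytes) -> tuple[int, int, int]:
    """Return (channels, sample_width_bytes, sample_rate) of a WAV byte stream."""
    with wave.open(io.BytesIO(data), "rb") as w:
        return w.getnchannels(), w.getsampwidth(), w.getframerate()

# Build a 1-second silent mono / 16-bit / 16 kHz file in memory as a stand-in.
buf = io.BytesIO()
with wave.open(buf, "wb") as w:
    w.setnchannels(1)      # mono, as specified for this dataset
    w.setsampwidth(2)      # 2 bytes = 16-bit depth
    w.setframerate(16000)  # 16 kHz subset; 48 kHz files would report 48000
    w.writeframes(b"\x00\x00" * 16000)

channels, width, rate = wav_params(buf.getvalue())
print(channels, width * 8, rate)  # → 1 16 16000
```

A pipeline could reject or resample any file whose reported parameters deviate from the expected configuration.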
Apart from participant diversity, the dataset is diverse in terms of different wake words, voice commands, and recording environments.
Different Automobile Related Wake Words: Hey Mercedes, Hey BMW, Hey Porsche, Hey Volvo, Hey Audi, Hi Genesis, Hey Mini, Hey Toyota, Ok Ford, Hey Hyundai, Ok Honda, Hello Kia, Hey Dodge.
Different Cars: Data collection was carried out in different types and models of cars.
Different Types of Voice Commands:
- Navigational Voice Commands
- Mobile Control Voice Commands
- Car Control Voice Commands
- Multimedia & Entertainment Commands
- General, Question Answer, Search Commands
Recording Time: Participants recorded the given prompts at various times to make the dataset more diverse.
- Morning
- Afternoon
- Evening
Recording Environment: Various recording environments were captured to acquire more realistic data and to make the dataset inclusive of various types of noises. Some of the environment variables are as follows:
- Noise Level: Silent, Low Noise, Moderate Noise, High Noise
- Parking Location: Indoor, Outdoor
- Car Windows: Open, Closed
- Car AC: On, Off
- Car Engine: On, Off
- Car Movement: Stationary, Moving
The dataset provides comprehensive metadata for each audio recording and participant:
Participant Metadata: Unique identifier, age, gender, country, state, district, accent, and dialect.
Other Metadata: Recording transcript, recording environment, device details, sample rate, bit depth, file format, recording time.
This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of German voice assistant speech recognition models.
This In-car Speech Dataset is a valuable resource for various applications in the field of in-car voice recognition and AI-driven voice technology. This dataset can be leveraged to enhance the performance and functionality of voice-activated systems across different domains.
Speech Recognition Model Training: Provides high-quality audio data for training models to accurately recognize and respond to in-car voice commands.
Safety and Emergency Response: Supports the development of systems that recognize and respond to emergency commands and safety alerts.
Driver Assistance: Facilitates the creation of advanced driver-assistance systems (ADAS) that leverage voice commands for hands-free operation.
Our proprietary data collection platform, “Yugo,” was used throughout the process of this dataset creation.
Throughout the data collection process, the data remained within our secure platform and did not leave our environment, ensuring data security and confidentiality.
The data collection process adhered to strict ethical guidelines, ensuring the privacy and consent of all participants.
It does not include any personally identifiable information about any participant, which makes the dataset safe to use.
Understanding the importance of diverse environments for robust voice assistant models, our in-car voice dataset is regularly updated with new audio data captured in various real-world conditions.
Customization & Custom Collection Options:
- Environmental Conditions: Custom collection in specific environmental conditions upon request.
- Sample Rates: Customizable from 8kHz to 48kHz.
- Diverse Pace: Custom collection can be done at varied speaking paces upon request.
- Device Specific: Recording can be done with a specific mobile brand or operating system.
This German In-car audio dataset is created by FutureBeeAI and is available for commercial use.
Attribution 4.0 (CC BY 4.0) — https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Generation of multiple true-false questions
This project provides a natural language pipeline that takes German textbook sections as input and generates multiple true-false questions using GPT-2.
Assessments are an important part of the learning cycle and enable the development and promotion of competencies. However, the manual creation of assessments is very time-consuming. Therefore, the number of tasks in learning systems is often limited. In this repository, we provide an algorithm that can automatically generate an arbitrary number of German True False statements from a textbook using the GPT-2 model. The algorithm was evaluated with a selection of textbook chapters from four academic disciplines (see `data` folder) and rated by individual domain experts. One-third of the generated MTF Questions are suitable for learning. The algorithm provides instructors with an easier way to create assessments on chapters of textbooks to test factual knowledge.
As a type of Multiple-Choice question, Multiple True False (MTF) Questions are, among other question types, a simple and efficient way to objectively test factual knowledge. The learner is challenged to distinguish between true and false statements. MTF questions can be presented differently, e.g. by locating a true statement from a series of false statements, identifying false statements among a list of true statements, or separately evaluating each statement as either true or false. Learners must evaluate each statement individually because a question stem can contain both incorrect and correct statements. Thus, MTF Questions as a machine-gradable format have the potential to identify learners’ misconceptions and knowledge gaps.
Example MTF question:
Check the correct statements:
[ ] All trees have green leaves.
[ ] Trees grow towards the sky.
[ ] Leaves can fall from a tree.
Features
- generation of false statements
- automatic selection of true statements
- selection of an arbitrary similarity for true and false statements as well as the number of false statements
- generating false statements by adding or deleting negations as well as using a German GPT-2
Setup
Installation
1. Create a new environment: `conda create -n mtfenv python=3.9`
2. Activate the environment: `conda activate mtfenv`
3. Install dependencies using anaconda:
```
conda install -y -c conda-forge pdfplumber
conda install -y -c conda-forge nltk
conda install -y -c conda-forge pypdf2
conda install -y -c conda-forge pylatexenc
conda install -y -c conda-forge packaging
conda install -y -c conda-forge transformers
conda install -y -c conda-forge essential_generators
conda install -y -c conda-forge xlsxwriter
# spacy is required for the model download in the next step
conda install -y -c conda-forge spacy
```
4. Download the spaCy language model: `python3.9 -m spacy download de_core_news_lg`
Getting started
After installation, you can execute the bash script `bash run.sh` in the terminal to compile MTF questions for the provided textbook chapters.
To create MTF questions for your own texts use the following command:
`python3 main.py --answers 1 --similarity 0.66 --input ./`
The parameter `answers` indicates how many false answers should be generated.
By configuring the parameter `similarity` you can determine what portion of a sentence should remain the same. The remaining portion will be extracted and used to generate a false part of the sentence.
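To illustrate the semantics of the `similarity` parameter described above, here is a hedged sketch: the first `similarity` fraction of a sentence's tokens is kept verbatim, and the remainder is the span that would be replaced with generated text (the actual implementation in `main.py` may tokenize and split differently).

```python
def split_by_similarity(sentence: str, similarity: float) -> tuple[str, str]:
    """Keep the first `similarity` fraction of tokens; the remainder is the
    portion that would be regenerated (e.g. by GPT-2) to form a false statement."""
    tokens = sentence.split()
    keep = max(1, round(len(tokens) * similarity))
    return " ".join(tokens[:keep]), " ".join(tokens[keep:])

kept, to_replace = split_by_similarity(
    "Trees grow towards the sky because their shoots are negatively gravitropic",
    0.66,
)
print(kept)        # unchanged prefix of the statement
print(to_replace)  # suffix to be replaced with a generated false ending
```

With `similarity 0.66`, roughly two-thirds of the sentence stays intact, so the generated false statement remains superficially close to the original true one.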
History and roadmap
* Outlook third iteration: Automatic augmentation of text chapters with generated questions
* Second iteration: Generation of multiple true-false questions with improved text summarizer and German GPT2 sentence generator
* First iteration: Generation of multiple true false questions in the Bachelor thesis of Mirjam Wiemeler
Publications, citations, license
Publications
Citation of the Dataset
The source code and data are maintained at GitHub: https://github.com/D2L2/multiple-true-false-question-generation
Contact
License Distributed under the MIT License. See [LICENSE.txt](https://gitlab.pi6.fernuni-hagen.de/la-diva/adaptive-assessment/generationofmultipletruefalsequestions/-/blob/master/LICENSE.txt) for more information.
Acknowledgments This research was supported by CATALPA - Center of Advanced Technology for Assisted Learning and Predictive Analytics of the FernUniversität in Hagen, Germany.
This project was carried out as part of research in the CATALPA project [LA DIVA](https://www.fernuni-hagen.de/forschung/schwerpunkte/catalpa/forschung/projekte/la-diva.shtml)
Abstract
The Urban Green Raster Germany is a land cover classification for Germany that addresses in particular the urban vegetation areas. The raster dataset covers the terrestrial national territory of Germany and has a spatial resolution of 10 meters. The dataset is based on a fully automated classification of Sentinel-2 satellite data from a full 2018 vegetation period using reference data from the European LUCAS land use and land cover point dataset. The dataset identifies eight land cover classes. These include Built-up, Built-up with significant green share, Coniferous wood, Deciduous wood, Herbaceous vegetation (low perennial vegetation), Water, Open soil, Arable land (low seasonal vegetation). The land cover dataset provided here is offered as an integer raster in GeoTiff format. The assignment of the number coding to the corresponding land cover class is explained in the legend file.
Data acquisition
The data acquisition comprises two main processing steps: (1) Collection, processing, and automated classification of the multispectral Sentinel-2 satellite data with the “Land Cover DE method”, resulting in the raw land cover classification dataset, NDVI layer, and RF assignment frequency vector raster. (2) GIS-based postprocessing, including discrimination of (densely) built-up and loosely built-up pixels according to an NDVI threshold, creation of water-body and arable-land masks from geo-topographical base data (ATKIS Basic DLM), and reclassification of water and arable-land pixels based on the assignment frequency.
Data collection
Satellite data were searched and downloaded from the Copernicus Open Access Hub (https://scihub.copernicus.eu/).
The LUCAS reference and validation points were loaded from the Eurostat platform (https://ec.europa.eu/eurostat/web/lucas/data/database).
The processing of the satellite data was performed at the DLR data center in Oberpfaffenhofen.
GIS-based post-processing of the automatic classification result was performed at IOER in Dresden.
Value of the data
The dataset can be used to quantify the amount of green areas within cities on a homogeneous data base [5].
Thus it is possible to compare cities of different sizes regarding their greenery and with respect to their ratio of green and built-up areas [6].
Built-up areas within cities can be discriminated regarding their built-up density (dense built-up vs. built-up with higher green share).
Data description
A raster dataset in GeoTIFF format: The dataset is stored as an 8-bit integer raster with values ranging from 1 to 8 for the eight land cover classes. The coded values are: 1 = Built-up, 2 = Open soil, 3 = Coniferous wood, 4 = Deciduous wood, 5 = Arable land (low seasonal vegetation), 6 = Herbaceous vegetation (low perennial vegetation), 7 = Water, 8 = Built-up with significant green share. Name of the file: ugr2018_germany.tif. The dataset is zipped together with accompanying files: *.tfw (geo-referencing world file), *.ovr (overlay file for quick data preview in GIS), *.clr (color map file).
A text file with the integer value assignment of the land cover classes. Name of the file: Legend_LC-classes.txt.
Experimental design, materials and methods
The first essential step to create the dataset is the automatic classification of a satellite image mosaic of all available Sentinel-2 images from May to September 2018 with a maximum cloud cover of 60 percent. Points from the 2018 LUCAS (Land use and land cover survey) dataset from Eurostat [1] were used as reference and validation data. Using Random Forest (RF) classifier [2], seven land use classes (Deciduous wood, Coniferous wood, Herbaceous vegetation (low perennial vegetation), Built-up, Open soil, Water, Arable land (low seasonal vegetation)) were first derived, which is methodologically in line with the procedure used to create the dataset "Land Cover DE - Sentinel-2 - Germany, 2015" [3]. The overall accuracy of the data is 93 % [4].
Two downstream post-processing steps served to further qualify the product. The first step included the selective verification of pixels of the classes arable land and water. These are often misidentified by the classifier due to radiometric similarities with other land covers; in particular, radiometric signatures of water surfaces often resemble shadows or asphalt surfaces. Due to the heterogeneous inner-city structures, pixels are also frequently misclassified as cropland.
To mitigate these errors, all pixels classified as water and arable land were matched with another data source. This consisted of binary land cover masks for these two land cover classes originating from the Monitor of Settlement and Open Space Development (IOER Monitor). For all water and cropland pixels that were outside of their respective masks, the frequencies of class assignments from the RF classifier were checked. If the assignment frequency to water or arable land was at least twice that to the subsequent class, the classification was preserved. Otherwise, the classification strength was considered too weak and the pixel was recoded to the land cover with the second largest assignment frequency.
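The "at least twice the runner-up" rule described above can be sketched per pixel as follows. This is an illustrative reimplementation of the stated logic on single pixels with plain dictionaries, not the actual raster-based workflow; the class IDs follow the dataset legend (7 = Water, 1 = Built-up).

```python
def reclassify(label: int, freqs: dict[int, int], inside_mask: bool) -> int:
    """Post-process a water/arable pixel per the rule described above:
    a pixel inside its class mask keeps its label; outside the mask, the label
    survives only if its RF assignment frequency is at least twice that of the
    runner-up class, otherwise the pixel is recoded to the runner-up."""
    if inside_mask:
        return label
    ranked = sorted(freqs, key=freqs.get, reverse=True)
    first, second = ranked[0], ranked[1]
    if freqs[first] >= 2 * freqs[second]:
        return label
    return second

# Strong vote (80 vs 30): the Water label is preserved.
print(reclassify(7, {7: 80, 1: 30}, inside_mask=False))  # → 7
# Weak vote (45 vs 30, less than 2x): recoded to the runner-up, Built-up.
print(reclassify(7, {7: 45, 1: 30}, inside_mask=False))  # → 1
```

In the real workflow this decision is evaluated over the RF assignment frequency vector raster for every flagged pixel.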
Furthermore, an additional land cover class "Built-up with significant vegetation share" was introduced. For this purpose, all pixels of the Built-up class were intersected with the NDVI of the satellite image mosaic and assigned to the new category if an NDVI threshold was exceeded in the pixel. The associated NDVI threshold was previously determined using highest resolution reference data of urban green structures in the cities of Dresden, Leipzig and Potsdam, which were first used to determine the true green fractions within the 10m Sentinel pixels, and based on this to determine an NDVI value that could be used as an indicator of a significant green fraction within the built-up pixel. However, due to the wide dispersion of green fraction values within the built-up areas, it is not possible to establish a universally valid green percentage value for the land cover class of Built-up with significant vegetation share. Thus, the class essentially serves to the visual differentiability of densely and loosely (i.e., vegetation-dominated) built-up areas.
Acknowledgments
This work was supported by the Federal Institute for Research on Building, Urban Affairs and Spatial Development (BBSR) [10.06.03.18.101].The provided data has been developed and created in the framework of the research project “Wie grün sind bundesdeutsche Städte?- Fernerkundliche Erfassung und stadträumlich-funktionale Differenzierung der Grünausstattung von Städten in Deutschland (Erfassung der urbanen Grünausstattung)“ (How green are German cities?- Remote sensing and urban-functional differentiation of the green infrastructure of cities in Germany (Urban Green Infrastructure Inventory)). Further persons involved in the project were: Fabian Dosch (funding administrator at BBSR), Stefan Fina (research partner, group leader at ILS Dortmund), Annett Frick, Kathrin Wagner (research partners at LUP Potsdam).
References
[1] Eurostat (2021): Land cover / land use statistics database LUCAS. URL: https://ec.europa.eu/eurostat/web/lucas/data/database
[2] L. Breiman (2001). Random forests, Mach. Learn., 45, pp. 5-32
[3] M. Weigand, M. Wurm (2020). Land Cover DE - Sentinel-2—Germany, 2015 [Data set]. German Aerospace Center (DLR). doi: 10.15489/1CCMLAP3MN39
[4] M. Weigand, J. Staab, M. Wurm, H. Taubenböck, (2020). Spatial and semantic effects of LUCAS samples on fully automated land use/land cover classification in high-resolution Sentinel-2 data. Int J Appl Earth Obs, 88, 102065. doi: https://doi.org/10.1016/j.jag.2020.102065
[5] L. Eichler., T. Krüger, G. Meinel, G. (2020). Wie grün sind deutsche Städte? Indikatorgestützte fernerkundliche Erfassung des Stadtgrüns. AGIT Symposium 2020, 6, 306–315. doi: 10.14627/537698030
[6] H. Taubenböck, M. Reiter, F. Dosch, T. Leichtle, M. Weigand, M. Wurm (2021). Which city is the greenest? A multi-dimensional deconstruction of city rankings. Comput Environ Urban Syst, 89, 101687. doi: 10.1016/j.compenvurbsys.2021.101687
This is NOT a raw population dataset. We use our proprietary stack to combine detailed 'WorldPop' UN-adjusted, sex and age structured population data with a spatiotemporal OD matrix.
The result is a dataset where each record indicates how many people can be reached in a fixed timeframe (4 Hours in this case) from that record's location.
The dataset is broken down into sex and age bands at 5 year intervals, e.g - male 25-29 (m_25) and also contains a set of features detailing the representative percentage of the total that the count represents.
The dataset provides 76,174 records, one for each sampled location. These are labelled with an h3 index at resolution 7, which allows easy plotting and filtering in Kepler.gl / Deck.gl / Mapbox, or easy conversion to a centroid (lat/lng) or the hexagonal cell geometry for integration with your geospatial applications and analyses.
An h3 resolution of 7 corresponds to a hexagonal cell area of approximately 1.9928 sq miles (5.1613 sq km).
Higher resolutions or alternate geographies are available on request.
More information on the h3 system is available here: https://eng.uber.com/h3/
WorldPop data provides population counts on a grid at 1 arc-second intervals and is available for every geography.
More information on the WorldPop data is available here: https://www.worldpop.org/
One of the main historical use cases has been prospecting for site selection, comparative analysis, and network validation by asset investors and logistics companies. The data structure makes it simple to filter out areas that do not meet a requirement such as being able to access 70% of the German population within 4 hours by truck, and to show only the areas that do exhibit this characteristic.
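The 70%-reachability filter described above reduces to a simple threshold over the per-record totals. The sketch below assumes illustrative field names (`h3_7`, `total`) and toy records; the actual column names and population reference figure may differ.

```python
# Hypothetical records following the described schema: an h3 cell id at
# resolution 7 and the total population reachable within the timeframe.
records = [
    {"h3_7": "871f1d489ffffff", "total": 62_000_000},  # dense, well-connected cell
    {"h3_7": "871f1d48affffff", "total": 41_000_000},  # remote cell
]

GERMAN_POPULATION = 83_200_000  # rough recent figure, used only for the 70% example

# Keep only cells from which at least 70% of the German population
# is reachable within the dataset's fixed 4-hour timeframe.
reachable_70 = [r for r in records if r["total"] / GERMAN_POPULATION >= 0.70]
print([r["h3_7"] for r in reachable_70])
```

The surviving h3 indexes can be fed directly to Kepler.gl or converted to centroids for downstream analysis.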
Clients often combine different datasets either for different timeframes of interest, or to understand different populations, such as that of the unemployed, or those with particular qualifications within areas reachable as a commute.
https://www.futurebeeai.com/policies/ai-data-license-agreement
Introducing the German Scripted Monologue Speech Dataset for the Healthcare Domain, a voice dataset built to accelerate the development and deployment of German language automatic speech recognition (ASR) systems, with a sharp focus on real-world healthcare interactions.
This dataset includes over 6,000 high-quality scripted audio prompts recorded in German, representing typical voice interactions found in the healthcare industry. The data is tailored for use in voice technology systems that power virtual assistants, patient-facing AI tools, and intelligent customer service platforms.
The prompts span a broad range of healthcare-specific interactions, such as:
To maximize authenticity, the prompts integrate linguistic elements and healthcare-specific terms such as:
These elements make the dataset exceptionally suited for training AI systems to understand and respond to natural healthcare-related speech patterns.
Every audio recording is accompanied by a verbatim, manually verified transcription.
The number of small and medium-sized enterprises in Germany is forecast to increase continuously between 2024 and 2029 by a total of 0.8 thousand enterprises (+0.38 percent). According to this forecast, the number will have increased for the sixth consecutive year, reaching 212.45 thousand enterprises in 2029. The OECD defines an enterprise as the smallest combination of legal units, i.e. an organisational unit producing goods or services that benefits from a degree of autonomy with regard to the allocation of resources and decision making. Shown here are small and medium-sized enterprises, defined as companies with 1-249 employees. The data shown are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic, and technological environment in more than 150 countries and regions worldwide. All input data are sourced from international institutions, national statistical offices, and trade associations. All data are processed to generate comparable datasets (see supplementary notes under details for more information). Find more key insights for the number of small and medium-sized enterprises in countries like Austria and Switzerland.
Attribution 4.0 (CC BY 4.0) — https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Context
The dataset presents the distribution of median household income among distinct age brackets of householders in Germany township. Based on the latest 2019-2023 5-Year Estimates from the American Community Survey, it displays how income varies among householders of different ages in Germany township. It showcases how household incomes typically rise as the head of the household gets older. The dataset can be utilized to gain insights into age-based household income trends and explore the variations in incomes across households.
Key observations: Insights from 2023
In terms of income distribution across age cohorts, in Germany township, the median household income stands at $139,318 for householders within the 45 to 64 years age group, followed by $111,071 for the 25 to 44 years age group. Notably, householders within the 65 years and over age group, had the lowest median household income at $66,250.
When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates. All incomes have been adjusted for inflation and are presented in 2023-inflation-adjusted dollars.
Age groups classifications include:
Variables / Data Columns
Good to know
Margin of Error
Data in the dataset are based on estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presenting these estimates in your research.
Custom data
If you need custom data for your research project, report, or presentation, you can contact our research staff at research@neilsberg.com to discuss the feasibility of a custom tabulation on a fee-for-service basis.
The Neilsberg Research Team curates, analyzes, and publishes demographic and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights are made available for free download at https://www.neilsberg.com/research/.
This dataset is part of the main dataset for Germany township median household income by age. You can refer to it here.
Attribution 4.0 (CC BY 4.0) — https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset consists of 1,759,830 multi-spectral image patches from the Sentinel-2 mission, annotated with image- and pixel-level land cover and land usage labels from the German land cover model LBM-DE2018, with land cover classes based on the CORINE Land Cover database (CLC) 2018. It includes pixel-synchronous examples from each of the four seasons, plus an additional snowy set, spanning the time from April 2018 to February 2019. The patches were taken from 519,547 unique locations, covering the whole surface area of Germany, with each patch covering an area of 1.2km x 1.2km. The set is split into two overlapping grids of roughly 880,000 samples each, shifted by half the patch size in both dimensions. Within each grid, the images do not overlap.
Contents
Each sample includes:
3 10m resolution bands (RGB), 120px x 120px
1 10m resolution band (infrared), 120px x 120px
6 20m resolution bands, 60px x 60px
2 60m resolution bands, 20px x 20px
1 pixel-level label map
2 binary masks for cloud and snow coverage
2 binary masks for easy and medium segmentation difficulties, marks areas <300px and <100px respectively
1 JSON-file containing additional meta-information
The meta.csv contains the following information about each sample:
Which season it belongs to
Which of the two grids it belongs to
Coordinates of the patch center
Whether it was acquired from Sentinel-2 Satellite A or B
Date and time of image acquisition
Snow and cloud coverage percentages
Image-level multi-class labels
Three additional image-level urbanization labels, based on the center pixel (details below)
The path to the sample
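As a sketch of how the meta.csv fields above might be used for filtering, the following builds a tiny in-memory stand-in and selects nearly cloud-free patches from one grid. The column names here are assumptions based on the field list, not the dataset's actual schema:

```python
import pandas as pd

# Hypothetical subset of meta.csv rows; column names are assumptions
# derived from the fields described above.
meta = pd.DataFrame([
    {"season": "spring", "grid": 1, "cloud_cover": 5.0,  "snow_cover": 0.0,  "path": "spring/grid1/0001"},
    {"season": "winter", "grid": 2, "cloud_cover": 2.0,  "snow_cover": 80.0, "path": "winter/grid2/0002"},
    {"season": "summer", "grid": 1, "cloud_cover": 60.0, "snow_cover": 0.0,  "path": "summer/gr1/0003".replace("gr1", "grid1")},
])

# Keep only nearly cloud-free patches from grid 1, e.g. to build a clean
# training split from images that do not overlap each other.
clean = meta[(meta["grid"] == 1) & (meta["cloud_cover"] < 10.0)]
print(list(clean["path"]))  # → ['spring/grid1/0001']
```

Restricting to a single grid avoids mixing the two half-patch-shifted grids, whose samples overlap across grids but not within one.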
Classes
ID | Class |
1 | Continuous urban fabric |
2 | Discontinuous urban fabric |
3 | Industrial or commercial units |
4 | Road and rail networks and associated land |
5 | Port areas |
6 | Airports |
7 | Mineral extraction sites |
8 | Dump sites |
9 | Construction sites |
10 | Green urban areas |
11 | Sport and leisure facilities |
12 | Non-irrigated arable land |
13 | Vineyards |
14 | Fruit trees and berry plantations |
15 | Pastures |
16 | Broad-leaved forest |
17 | Coniferous forest |
18 | Mixed forest |
19 | Natural grasslands |
20 | Moors and heathland |
21 | Transitional woodland/shrub |
22 | Beaches, dunes, sands |
23 | Bare rock |
24 | Sparsely vegetated areas |
25 | Inland marshes |
26 | Peat bogs |
27 | Salt marshes |
28 | Intertidal flats |
29 | Water courses |
30 | Water bodies |
31 | Coastal lagoons |
32 | Estuaries |
33 | Sea and ocean |
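For convenience, the ID-to-name mapping above can be expressed as a lookup for decoding label-map pixel values (a sketch; the dataset itself does not ship this helper):

```python
# CORINE-derived land cover classes, copied from the table above.
CLC_CLASSES = {
    1: "Continuous urban fabric",
    2: "Discontinuous urban fabric",
    3: "Industrial or commercial units",
    4: "Road and rail networks and associated land",
    5: "Port areas",
    6: "Airports",
    7: "Mineral extraction sites",
    8: "Dump sites",
    9: "Construction sites",
    10: "Green urban areas",
    11: "Sport and leisure facilities",
    12: "Non-irrigated arable land",
    13: "Vineyards",
    14: "Fruit trees and berry plantations",
    15: "Pastures",
    16: "Broad-leaved forest",
    17: "Coniferous forest",
    18: "Mixed forest",
    19: "Natural grasslands",
    20: "Moors and heathland",
    21: "Transitional woodland/shrub",
    22: "Beaches, dunes, sands",
    23: "Bare rock",
    24: "Sparsely vegetated areas",
    25: "Inland marshes",
    26: "Peat bogs",
    27: "Salt marshes",
    28: "Intertidal flats",
    29: "Water courses",
    30: "Water bodies",
    31: "Coastal lagoons",
    32: "Estuaries",
    33: "Sea and ocean",
}

def class_name(class_id: int) -> str:
    """Map a label-map pixel value to its land cover class name."""
    return CLC_CLASSES[class_id]

print(class_name(17))  # → Coniferous forest
```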
Urbanization classes
SLRAUM
0: None
1: Ländlicher Raum (~ rural area)
2: Städtischer Raum (~ urban area)
RTYP3
0: None
1: Ländliche Regionen (~ rural areas)
2: Regionen mit Verstädterungsansätzen (~ urbanizing areas)
3: Städtische Regionen (~ urban areas)
KTYP4
0: None
1: Dünn besiedelte ländliche Kreise (~ sparsely populated rural districts)
2: Kreisfreie Großstädte (~ independent large cities)
3: Ländliche Kreise mit Verdichtungsansätzen (~ rural districts with densification tendencies)
4: Städtische Kreise (~ urban districts)
Further information on the urbanization classes can be found here:
SLRAUM
RTYP3
KTYP4
License of landcover model
Bundesamt für Kartographie und Geodäsie
dl-de/by-2-0 from https://www.govdata.de/dl-de/by-2-0
© GeoBasis-DE / BKG 2022
Source of landcover model
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Government Revenues in Germany increased to 590.21 EUR Billion in the fourth quarter of 2024 from 485.91 EUR Billion in the third quarter of 2024. This dataset provides Germany Government Revenues: actual values, historical data, forecast, chart, statistics, economic calendar and news.
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the German General Conversation Speech Dataset — a rich, linguistically diverse corpus purpose-built to accelerate the development of German speech technologies. This dataset is designed to train and fine-tune ASR systems, spoken language understanding models, and generative voice AI tailored to real-world German communication.
Curated by FutureBeeAI, this 30-hour dataset offers unscripted, spontaneous two-speaker conversations across a wide array of real-life topics. It enables researchers, AI developers, and voice-first product teams to build robust, production-grade German speech models that understand and respond to authentic German accents and dialects.
The dataset comprises 30 hours of high-quality audio, featuring natural, free-flowing dialogue between native speakers of German. These sessions range from informal daily talks to deeper, topic-specific discussions, ensuring variability and context richness for diverse use cases.
The dataset spans a wide variety of everyday and domain-relevant themes. This topic diversity ensures the resulting models are adaptable to broad speech contexts.
Each audio file is paired with a human-verified, verbatim transcription available in JSON format.
These transcriptions are production-ready, enabling seamless integration into ASR model pipelines or conversational AI workflows.
The dataset comes with granular metadata for both speakers and recordings:
Such metadata helps developers fine-tune model training and supports use-case-specific filtering or demographic analysis.
This dataset is a versatile resource for multiple German speech and language AI applications:
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Germany Demo is a dataset for object detection tasks - it contains Person annotations for 586 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Exports in Germany decreased to 130.20 EUR Billion in July from 130.90 EUR Billion in June of 2025. This dataset provides Germany Exports: actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
License Plate Recognition - 177,827 Images
This dataset provides 177,827 vehicle images captured in Germany, serving as a robust foundation for license plate recognition, license plate detection, and OCR tasks, supporting autonomous vehicles, traffic management, and smart city applications. - Get the data
Dataset characteristics:
Characteristic Data
Description License plate images with labeling for OCR tasks
Data types Image
Tasks Detection… See the full description on the dataset page: https://huggingface.co/datasets/ud-smart-city/germany-license-plate-dataset.
In this study, the development of the prices of grain, the staple food throughout Germany since the 17th century, is presented from the end of the 18th century onwards. This survey was carried out within the scope of a general historical examination of wholesale prices in Germany by the Reich Statistical Office ("Statistisches Reichsamt") in cooperation with the German Institute for Economic Research ("Institut für Konjunkturforschung"). Since the records of prices for rye, wheat, barley, and oats available in the primary sources are incomplete for the whole of the above-mentioned period, several values have been converted in order to make a comparison possible (conversion into German mark/Reichsmark per 1,000 kg). Furthermore, index numbers for the German grain prices have been calculated so that a continuous development becomes visible (base year: 1913 = 100). Apart from grain harvests and consumption in Germany since 1878/79, the study also gives an overview of the foreign trade in rye, wheat, barley, and oats.
Topics: List of data tables within the HISTAT research and download system:
A. Grain harvest, foreign trade, and consumption in Germany: rye, wheat, barley, and oats (1836–1934).
B. Index numbers of grain prices in Germany, 1913 = 100 (1792–1934).
C. Prices of different types of grain: Germany, other countries, and world market (rye, wheat, barley, and oats, 1,000 kg in German mark and Reichsmark) (1836–1934).
Sources: One part of these documents was taken from earlier publications by the Statistisches Reichsamt and the former Prussian Statistical Authority (Preußisches Statistisches Landesamt), as well as from other official and non-official bodies; a second part has been retrieved from official files.
Grain prices: cf. Jacobs, A. / Richter, H., 1935: Die Großhandelspreise in Deutschland von 1792 bis 1934 ("Wholesale Prices in Germany 1792–1934"). Sonderhefte des Instituts für Konjunkturforschung, 37. Berlin: Hanseat. Verl.-Anst. Hamburg, pp. 52–55.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is about book subjects. It has 7 rows and is filtered where the books is The conquest of nature : water, landscape and the making of modern Germany. It features 4 columns, including authors, books, and publication dates.