2 datasets found
  1. Z

    Data from: OPTIMAP: A Dataset for Open Public Transport Infrastructure and...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fischer, Maximilian T.; Fürst, Daniel; Metz, Yannick; Schmidt, Manuel; Rauscher, Julius; Keim, Daniel A. (2025). OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_14772646
    Explore at:
    Dataset updated
    Feb 1, 2025
    Dataset provided by
    University of Konstanz
    Authors
    Fischer, Maximilian T.; Fürst, Daniel; Metz, Yannick; Schmidt, Manuel; Rauscher, Julius; Keim, Daniel A.
    License

    Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
    License information was derived automatically

    Description

    Introduction

    This dataset provides a comprehensive assessment of public transport connectivity across Germany by analyzing both walking distances to the nearest public transport stops as well as the quality of public transport connections for daily usage scenarios with housing-level-granularity on a country-wide scale. The data was generated through a novel approach that integrates multiple open data sources, simulation models, and visual analytics techniques, enabling researchers, policymakers, and urban planners to identify gaps and opportunities for transit network improvements. ewline

    Why does it matter?

    Efficient and accessible public transportation is a critical component of sustainable urban development. However, many transit networks struggle to adequately serve diverse populations due to infrastructural, financial, and urban planning limitations. Traditional transit planning often relies on aggregated statistics, expert opinions, or limited surveys, making it difficult to assess transport accessibility at an individual household level. This dataset provides a data-driven and reproducible methodology for unbiased country-wide comparisons.

    Find more information at https://mobility.dbvis.de.

    Key Facts, Download, Citation

    Title OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles

    Acronym OPTIMAP

    Download https://mobility.dbvis.de/data-results/OPTIMAP_v2025-02-01.parquet (478MB, parquet)

    License Datenlizenz Deutschland - Namensnennung - Version 2.0 (dl-de-by/2.0)

    Please cite the dataset as:Maximilian T. Fischer, Daniel Fürst, Yannick Metz, Manuel Schmidt, Julius Rauscher, and Daniel A. Keim. OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles. Zenodo, 2025. doi: 10.5281/zenodo.14772646.

    or, when using Bibtex

    @dataset{MobilityProfiles.DatasetGermany.2025, author = {Fischer, Maximilian T. and Fürst, Daniel and Metz, Yannick and Schmidt, Manuel and Rauscher, Julius and Keim, Daniel A.}, title = {OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles}, year = 2025, publisher = {Zenodo}, doi = {10.5281/zenodo.14772646}}

    Dataset Description

    The dataset in the PARQUET format includes detailed accessibility measures for public transport at a fine-grained, housing-level resolution. It consists of four columns:

    lat, lng (float32): GPS coordinates (EPSG:4326) of each house in Germany, expensively compiled from the house coordinates (HK-DE) data provided by the 16 federal states under the EU INSPIRE regulations.

    MinDistanceWalking (int32): An approximate walking distance (in meters) to the nearest public transport stop from each registered building in Germany.

    scores_OVERALL (float32): A simulated, demographic- and scenario-weighted measure of public transport quality for daily usage, considering travel times, frequency, and coverage across various daily scenarios (e.g., commuting, shopping, medical visits). The results are represented in an artificial time unit to allow comparative analysis across locations.

    Methodology

    The dataset was generated using a combination of open geospatial data and advanced transport simulation techniques.

    Data Sources: Public transit information from the German national access point (DELFI NeTEx), housing geolocation data from various state authorities, and routing information from OpenStreetMap.

    Walking Distance Calculation: The shortest path to the nearest transit stop was computed using the Dijkstra algorithm on a graph network of publicly available pathways sourced from OSM, considering the ten aerial-nearest public transport stops.

    Public Transport Quality Estimation: The dataset incorporates a scenario-based simulation model, analyzing weight-averaged travel times and connection frequency to typical daily POIs such as the individually nearest train stations, kindergartens, schools, institutions of higher education, fitness centers, cinemas, places of worship, supermarkets, shopping malls, restaurants, doctors, parks, and cultural institutions. It includes walking distances to the start and from the destination public transport stops as well as the averaged travel and waiting times on the shortest route calculated via a modified Dijkstra algorithm. The results are aggregated using a demographically- and scenario-weighted metric to ensure comparability. The value is in the unit of time, although it should not be interpreted directly as real minutes.

    Visualization and Validation: A WebGL-based interactive tool and static precomputed maps were developed to allow users to interactively explore transport accessibility metrics dynamically, available at https://mobility.dbvis.de.

    Potential Applications

    The dataset enables multiple use cases across research, policy, and urban planning:

    Public Accessibility Studies: Provides insights into transport equity by evaluating mobility gaps affecting different demographic groups, different regional areas, and comparing county and state efforts in improving public transport quality.

    Urban Planning and Transport Policy: Supports data-driven decision-making for optimizing transit networks, adjusting service schedules, or identifying underserved areas.

    Smart City Development: Assists in integrating mobility analytics into broader smart city initiatives for efficient resource allocation and sustainability planning.

    Academic Research: Facilitates studies in transportation engineering, urban geography, and mobility behavior analysis.

    Conclusion

    By offering high-resolution public transport accessibility data at housing-level granularity, this dataset contributes to a more transparent and objective understanding of urban mobility challenges. The integration of simulation models, demographic considerations, and scalable analytics provides a novel approach to evaluating and improving public transit systems. Researchers, city officials, and policymakers are encouraged to leverage this dataset to enhance transport infrastructure planning and accessibility.

    This dataset contains both the approximate walking distances in meters and a weighted overall quality score in an artificial time unit for each individual house in Germany. More advanced versions are currently not publicly available. This base dataset is publicly available and adheres to open data licensing principles, enabling its reuse for scientific and policy-oriented studies.

    Source Data Licenses

    While not part of this dataset, the scientific simulation used to create the results leverages public transit information via the National Access Point (NAP) DELFI as NeTEx, provided via GTFS feeds of Germany (CC BY 4.0).

    Also, routing information used during the processing was based on Open Street Map contributors (CC BY 4.0).

    Primarily, this dataset contains original and slightly processed housing locations (lat, lng) that were made available as part of the EU INSPIRE regulations, based on Directive (EU) 2019/1024 (of the European Parliament and of the Council of 20 June 2019 on open data and the re-use of public sector information (recast)).

    In Germany, the respective data is provided individually by the 16 federal states, with the following required attributions and license indications:

    BB: EU INSPIRE / © GeoBasis-DE/LGB, dl-de-by/2.0 (data modified)

    BE: EU INSPIRE / © Geoportal Berlin / Hauskoordinaten, dl-de-by/2.0 (data modified)

    BW: EU INSPIRE / © LGL, www.lgl-bw.de, dl-de-by/2.0 (data modified)

    BY: EU INSPIRE / © Bayerische Vermessungsverwaltung, CC BY 4.0 (data modified)

    HB: EU INSPIRE / © Landesamt GeoInformation Bremen, CC BY 4.0 (data modified)

    HE: EU INSPIRE / © HVBG, dl-de-by-zero/2.0 (data modified)

    HH: EU INSPIRE / © FHH (LGV), dl-de-by/2.0 (data modified)

    MV: EU INSPIRE / © LAiV M-V, CC BY 4.0 (data modified)

    NI: EU INSPIRE / © LGLN 2024, CC BY 4.0 (data modified)

    NW: EU INSPIRE / © Geobasis NRW, dl-de-by-zero/2.0 (data modified)

    RP: EU INSPIRE / © GeoBasis-DE / LVermGeoRP 2024, dl-de-by/2.0 (data modified)

    SH: EU INSPIRE / © GeoBasis-DE/LVermGeo SH, CC BY 4.0 (data modified)

    SL: EU INSPIRE / © GeoBasis DE/LVGL-SL (2024), dl-de-by/2.0 (data modified)

    SN: EU INSPIRE / © GeoSN, dl-de-by/2.0 (data modified)

    ST: EU INSPIRE / © GeoBasis-DE / LVermGeo LSA, dl-de-by/2.0 (data modified)

    TH: EU INSPIRE / © GDI-Th, dl-de-by/2.0 (data modified)

    Original Research

    The methodology and techniques are described in an original research article published in 2024. When referring to our approach, please cite the following publication:Yannick Metz, Dennis Ackermann, Daniel A. Keim, and Maximilian T. Fischer. Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent. In: 2024 IEEE Visualization in Data Science (VDS). VDS. IEEE, 2024, p. 9. doi: 10.1109/VDS63897.2024.00006

    or, when using bibtex:

    @inproceedings{MobilityProfiles.VDS.2024, author = {Metz, Yannick and Ackermann, Dennis and Keim, Daniel A. and Fischer, Maximilian T.}, title = {Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent}, booktitle = {2024 IEEE Visualization in Data Science (VDS)}, doi = {10.1109/VDS63897.2024.00006}, pages = {9}, publisher = {IEEE}, series = {VDS}, year = {2024}}

  2. Brasileirão Serie A

    • kaggle.com
    zip
    Updated Dec 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RafaelCSouza (2023). Brasileirão Serie A [Dataset]. https://www.kaggle.com/datasets/rafaelcsouza/brasileirao
    Explore at:
    zip(1218429 bytes)Available download formats
    Dataset updated
    Dec 12, 2023
    Authors
    RafaelCSouza
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Initially, I downloaded a brazilian championship dataset that contained the initial data to start the project.

    Link: https://www.kaggle.com/datasets/adaoduque/campeonato-brasileiro-de-futebol

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5998145%2F37b1d543a60a13264542f70911969712%2Fjogos.png?generation=1696378064972992&alt=media" alt="">

    Then, I started to process the data, keeping only the information that interested me most, and then created the insights in the "Data" spreadsheet. Later, I created a dynamic table to calculate the total number of editions that were played each time, and finally, I created a comparison between two times in the "BI" spreadsheet to facilitate visualization.

    As the initial dataset only has data from 2003 to 2022, I did a webscrapping from "ogol" website using Excel and PowerQuery to import data from other editions.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5998145%2F5c15d430af2302d78445f219859b999b%2Fdados.png?generation=1696375867626695&alt=media" alt="">

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5998145%2Fdfc5cc2844e86ae1ed151b069f044b37%2FBI.png?generation=1696375904810893&alt=media" alt="">

    I also made a PowerBI version of this data.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5998145%2Fc6e147d61cf290ce7818da27e85c6736%2Fpowerbi.png?generation=1696375990685823&alt=media" alt="">

    I plan to continue updating this data until I unite all editions of the Brazilian championship (until 1971) with as much interesting insights and design as possible, especially for the dashboards in PowerBI

    **UPDATE 12/12/2023

    I just discovered using Julius AI that were inconsistent data in 2007-2008, as you can see below:

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5998145%2Fa963a366f1bd8f46f2f3ca90d38263da%2FJulius.png?generation=1702421496018997&alt=media" alt="">

    Looks like there is 3 more games in 2007 and 3 less games in 2008. I asked AI which teams had more games than others, and here's what it returned:

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5998145%2Fa991bf445bc569e4747fc7d4925693bd%2Fresultado%20AI.png?generation=1702421525832660&alt=media" alt="">

    Then I filtered those teams and realized that those 3 more games in 2007 belong to 2008. Same games, same results, same hours, same everything, except the Year. I manually adjusted and updated the new file.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Fischer, Maximilian T.; Fürst, Daniel; Metz, Yannick; Schmidt, Manuel; Rauscher, Julius; Keim, Daniel A. (2025). OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_14772646

Data from: OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles

Related Article
Explore at:
Dataset updated
Feb 1, 2025
Dataset provided by
University of Konstanz
Authors
Fischer, Maximilian T.; Fürst, Daniel; Metz, Yannick; Schmidt, Manuel; Rauscher, Julius; Keim, Daniel A.
License

Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
License information was derived automatically

Description

Introduction

This dataset provides a comprehensive assessment of public transport connectivity across Germany by analyzing both walking distances to the nearest public transport stops as well as the quality of public transport connections for daily usage scenarios with housing-level-granularity on a country-wide scale. The data was generated through a novel approach that integrates multiple open data sources, simulation models, and visual analytics techniques, enabling researchers, policymakers, and urban planners to identify gaps and opportunities for transit network improvements. ewline

Why does it matter?

Efficient and accessible public transportation is a critical component of sustainable urban development. However, many transit networks struggle to adequately serve diverse populations due to infrastructural, financial, and urban planning limitations. Traditional transit planning often relies on aggregated statistics, expert opinions, or limited surveys, making it difficult to assess transport accessibility at an individual household level. This dataset provides a data-driven and reproducible methodology for unbiased country-wide comparisons.

Find more information at https://mobility.dbvis.de.

Key Facts, Download, Citation

Title OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles

Acronym OPTIMAP

Download https://mobility.dbvis.de/data-results/OPTIMAP_v2025-02-01.parquet (478MB, parquet)

License Datenlizenz Deutschland - Namensnennung - Version 2.0 (dl-de-by/2.0)

Please cite the dataset as:Maximilian T. Fischer, Daniel Fürst, Yannick Metz, Manuel Schmidt, Julius Rauscher, and Daniel A. Keim. OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles. Zenodo, 2025. doi: 10.5281/zenodo.14772646.

or, when using Bibtex

@dataset{MobilityProfiles.DatasetGermany.2025, author = {Fischer, Maximilian T. and Fürst, Daniel and Metz, Yannick and Schmidt, Manuel and Rauscher, Julius and Keim, Daniel A.}, title = {OPTIMAP: A Dataset for Open Public Transport Infrastructure and Mobility Accessibility Profiles}, year = 2025, publisher = {Zenodo}, doi = {10.5281/zenodo.14772646}}

Dataset Description

The dataset in the PARQUET format includes detailed accessibility measures for public transport at a fine-grained, housing-level resolution. It consists of four columns:

lat, lng (float32): GPS coordinates (EPSG:4326) of each house in Germany, expensively compiled from the house coordinates (HK-DE) data provided by the 16 federal states under the EU INSPIRE regulations.

MinDistanceWalking (int32): An approximate walking distance (in meters) to the nearest public transport stop from each registered building in Germany.

scores_OVERALL (float32): A simulated, demographic- and scenario-weighted measure of public transport quality for daily usage, considering travel times, frequency, and coverage across various daily scenarios (e.g., commuting, shopping, medical visits). The results are represented in an artificial time unit to allow comparative analysis across locations.

Methodology

The dataset was generated using a combination of open geospatial data and advanced transport simulation techniques.

Data Sources: Public transit information from the German national access point (DELFI NeTEx), housing geolocation data from various state authorities, and routing information from OpenStreetMap.

Walking Distance Calculation: The shortest path to the nearest transit stop was computed using the Dijkstra algorithm on a graph network of publicly available pathways sourced from OSM, considering the ten aerial-nearest public transport stops.

Public Transport Quality Estimation: The dataset incorporates a scenario-based simulation model, analyzing weight-averaged travel times and connection frequency to typical daily POIs such as the individually nearest train stations, kindergartens, schools, institutions of higher education, fitness centers, cinemas, places of worship, supermarkets, shopping malls, restaurants, doctors, parks, and cultural institutions. It includes walking distances to the start and from the destination public transport stops as well as the averaged travel and waiting times on the shortest route calculated via a modified Dijkstra algorithm. The results are aggregated using a demographically- and scenario-weighted metric to ensure comparability. The value is in the unit of time, although it should not be interpreted directly as real minutes.

Visualization and Validation: A WebGL-based interactive tool and static precomputed maps were developed to allow users to interactively explore transport accessibility metrics dynamically, available at https://mobility.dbvis.de.

Potential Applications

The dataset enables multiple use cases across research, policy, and urban planning:

Public Accessibility Studies: Provides insights into transport equity by evaluating mobility gaps affecting different demographic groups, different regional areas, and comparing county and state efforts in improving public transport quality.

Urban Planning and Transport Policy: Supports data-driven decision-making for optimizing transit networks, adjusting service schedules, or identifying underserved areas.

Smart City Development: Assists in integrating mobility analytics into broader smart city initiatives for efficient resource allocation and sustainability planning.

Academic Research: Facilitates studies in transportation engineering, urban geography, and mobility behavior analysis.

Conclusion

By offering high-resolution public transport accessibility data at housing-level granularity, this dataset contributes to a more transparent and objective understanding of urban mobility challenges. The integration of simulation models, demographic considerations, and scalable analytics provides a novel approach to evaluating and improving public transit systems. Researchers, city officials, and policymakers are encouraged to leverage this dataset to enhance transport infrastructure planning and accessibility.

This dataset contains both the approximate walking distances in meters and a weighted overall quality score in an artificial time unit for each individual house in Germany. More advanced versions are currently not publicly available. This base dataset is publicly available and adheres to open data licensing principles, enabling its reuse for scientific and policy-oriented studies.

Source Data Licenses

While not part of this dataset, the scientific simulation used to create the results leverages public transit information via the National Access Point (NAP) DELFI as NeTEx, provided via GTFS feeds of Germany (CC BY 4.0).

Also, routing information used during the processing was based on Open Street Map contributors (CC BY 4.0).

Primarily, this dataset contains original and slightly processed housing locations (lat, lng) that were made available as part of the EU INSPIRE regulations, based on Directive (EU) 2019/1024 (of the European Parliament and of the Council of 20 June 2019 on open data and the re-use of public sector information (recast)).

In Germany, the respective data is provided individually by the 16 federal states, with the following required attributions and license indications:

BB: EU INSPIRE / © GeoBasis-DE/LGB, dl-de-by/2.0 (data modified)

BE: EU INSPIRE / © Geoportal Berlin / Hauskoordinaten, dl-de-by/2.0 (data modified)

BW: EU INSPIRE / © LGL, www.lgl-bw.de, dl-de-by/2.0 (data modified)

BY: EU INSPIRE / © Bayerische Vermessungsverwaltung, CC BY 4.0 (data modified)

HB: EU INSPIRE / © Landesamt GeoInformation Bremen, CC BY 4.0 (data modified)

HE: EU INSPIRE / © HVBG, dl-de-by-zero/2.0 (data modified)

HH: EU INSPIRE / © FHH (LGV), dl-de-by/2.0 (data modified)

MV: EU INSPIRE / © LAiV M-V, CC BY 4.0 (data modified)

NI: EU INSPIRE / © LGLN 2024, CC BY 4.0 (data modified)

NW: EU INSPIRE / © Geobasis NRW, dl-de-by-zero/2.0 (data modified)

RP: EU INSPIRE / © GeoBasis-DE / LVermGeoRP 2024, dl-de-by/2.0 (data modified)

SH: EU INSPIRE / © GeoBasis-DE/LVermGeo SH, CC BY 4.0 (data modified)

SL: EU INSPIRE / © GeoBasis DE/LVGL-SL (2024), dl-de-by/2.0 (data modified)

SN: EU INSPIRE / © GeoSN, dl-de-by/2.0 (data modified)

ST: EU INSPIRE / © GeoBasis-DE / LVermGeo LSA, dl-de-by/2.0 (data modified)

TH: EU INSPIRE / © GDI-Th, dl-de-by/2.0 (data modified)

Original Research

The methodology and techniques are described in an original research article published in 2024. When referring to our approach, please cite the following publication:Yannick Metz, Dennis Ackermann, Daniel A. Keim, and Maximilian T. Fischer. Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent. In: 2024 IEEE Visualization in Data Science (VDS). VDS. IEEE, 2024, p. 9. doi: 10.1109/VDS63897.2024.00006

or, when using bibtex:

@inproceedings{MobilityProfiles.VDS.2024, author = {Metz, Yannick and Ackermann, Dennis and Keim, Daniel A. and Fischer, Maximilian T.}, title = {Interactive Public Transport Infrastructure Analysis through Mobility Profiles: Making the Mobility Transition Transparent}, booktitle = {2024 IEEE Visualization in Data Science (VDS)}, doi = {10.1109/VDS63897.2024.00006}, pages = {9}, publisher = {IEEE}, series = {VDS}, year = {2024}}

Search
Clear search
Close search
Google apps
Main menu