Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This file includes the code and parameters for the baseline MATSim-NYC model, network calibration, and other additional features. All the required input files are saved in the input folder.
The synthetic population as well as the data dictionary are also incorporated.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
An agent-based model of West Midlands Combined Area for 2022 composed of:
The model can be used as input for MATSim simulations.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Synthetic Sweden Mobility (SySMo) model provides a simplified yet statistically realistic microscopic representation of the real population of Sweden. The agents in this synthetic population contain socioeconomic attributes, household characteristics, and corresponding activity plans for an average weekday. This agent-based modelling approach derives the transportation demand from the agents’ planned activities using various transport modes (e.g., car, public transport, bike, and walking).
This open data repository contains four datasets:
(1) Synthetic Agents,
(2) Activity Plans of the Agents,
(3) Travel Trajectories of the Agents, and
(4) Road Network (EPSG: 3006)
(OpenStreetMap data were retrieved on August 28, 2023, from https://download.geofabrik.de/europe.html, and GTFS data were retrieved on September 6, 2023 from https://samtrafiken.se/)
The database can serve as input to assess the potential impacts of new transportation technologies, infrastructure changes, and policy interventions on the mobility patterns of the Swedish population.
This dataset contains statistically simulated 10.2 million agents representing the population of Sweden, their socio-economic characteristics and the activity plan for an average weekday. For preparing data for the MATSim simulation, we randomly divided all the agents into 10 batches. Each batch's agents are then simulated in MATSim using the multi-modal network combining road networks and public transit data in Sweden using the package pt2matsim (https://github.com/matsim-org/pt2matsim).
The agents' daily activity plans along with the road network serve as the primary inputs in the MATSim environment which ensures iterative replanning while aiming for a convergence on optimal activity plans for all the agents. Subsequently, the individual mobility trajectories of the agents from the MATSim simulation are retrieved.
The activity plans of the individual agents extracted from the MATSim simulation output data are then further processed. All agents with negative utility score and negative activity time corresponding to at least one activity are filtered out as the ‘infeasible’ agents. The dataset ‘Synthetic Agents’ contains all synthetic agents regardless of their ‘feasibility’ (0=excluded & 1=included in plans and trajectories). In the other datasets, only agents with feasible activity plans are included.
The simulation setup adheres to the MATSim 13.0 benchmark scenario, with slight adjustments. The strategy for replanning integrates BestScore (60%), TimeAllocationMutator (30%), and ReRoute (10%)— the percentages denote the proportion of agents utilizing these strategies. In each iteration of the simulation, the agents adopt these strategies to adjust their activity plans. The "BestScore" strategy retains the plan with the highest score from the previous iteration, selecting the most successful strategy an agent has employed up until that point. The "TimeAllocationMutator" modifies the end times of activities by introducing random shifts within a specified range, allowing for the exploration of different schedules. The "ReRoute" strategy enables agents to alter their current routes, potentially optimizing travel based on updated information or preferences. These strategies are detailed further in W. Axhausen et al. (2016) work, which provides comprehensive insights into their implementation and impact within the context of transport simulation modeling.
This dataset contains all agents in Sweden and their socioeconomic characteristics.
The attribute ‘feasibility’ has two categories: feasible agents (73%), and infeasible agents (27%). Infeasible agents are agents with negative utility score and negative activity time corresponding to at least one activity.
File name: 1_syn_pop_all.parquet
Column |
Description |
Data type |
Unit |
PId |
Agent ID |
Integer |
- |
Deso | Zone code of Demographic statistical areas (DeSO)1 | String | - |
kommun | Municipality code | Integer | - |
marital | Marital Status (single/ couple/ child) | String | - |
sex | Gender (0 = Male, 1 = Female) | Integer | - |
age | Age | Integer | - |
HId | A unique identifier for households | Integer | - |
HHtype | Type of households (single/ couple/ other) | String | - |
HHsize | Number of people living in the households | Integer | - |
num_babies | Number of children less than six years old in the household | Integer | - |
employment | Employment Status (0 = Not Employed, 1 = Employed) | Integer | - |
studenthood | Studenthood Status (0 = Not Student, 1 = Student) | Integer | - |
income_class | Income Class (0 = No Income, 1 = Low Income, 2 = Lower-middle Income, 3 = Upper-middle Income, 4 = High Income) | Integer | - |
num_cars | Number of cars owned by an individual | Integer | - |
HHcars | Number of cars in the household | Integer | - |
feasibility | Status of the individual (1=feasible, 0=infeasible) | Integer | - |
1 https://www.scb.se/vara-tjanster/oppna-data/oppna-geodata/deso--demografiska-statistikomraden/
The dataset contains the car agents’ (agents that use cars on the simulated day) activity plans for a simulated average weekday.
File name: 2_plans_i.parquet, i = 0, 1, 2, ..., 8, 9. (10 files in total)
Column |
Description |
Data type |
Unit |
act_purpose |
Activity purpose (work/ home/ school/ other) |
String |
- |
PId |
Agent ID |
Integer |
- |
act_end |
End time of activity (0:00:00 – 23:59:59) |
String |
hour:minute:seco nd |
act_id |
Activity index of each agent |
Integer |
- |
mode |
Transport mode to reach the activity location |
String |
- |
POINT_X |
Coordinate X of activity location (SWEREF99TM) |
Float |
metre |
POINT_Y |
Coordinate Y of activity location (SWEREF99TM) |
Float |
metre |
dep_time |
Departure time (0:00:00 – 23:59:59) |
String |
hour:minute:seco nd |
score |
Utility score of the simulation day as obtained from MATSim |
Float |
- |
trav_time |
Travel time to reach the activity location |
String |
hour:minute:seco nd |
trav_time_min |
Travel time in decimal minute |
Float |
minute |
act_time |
Activity duration in decimal minute |
Float |
minute |
distance |
Travel distance between the origin and the destination |
Float |
km |
speed |
Travel speed to reach the activity location |
Float |
km/h |
This dataset contains the driving trajectories of all the agents on the road network, and the public transit vehicles used by these agents, including buses, ferries, trams etc. The files are produced by MATSim simulations and organised into 10 *.parquet’ files (representing different batches of simulation) corresponding to each plan file.
File name: 3_events_i.parquet, i = 0, 1, 2, ..., 8, 9. (10 files in total)
Column |
Description |
Data type |
Unit |
time |
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Behavioural simulator requires 4 input data files. Each file contains the following information:
1. Network.xml file contains road network information based on no of lanes, speed limits, vehicle access details derived from open street maps.
2. Plan.xml file contains synthetic population along with activity-travel information. These activity-travel patterns are output generated from activity-based models.
3. Schedules.xml file have information about public transport schedules with stops details, timetables etc drived from GTFS data
4. Vehicels.xml file is comprised of Public Transport Fleet information e.g no of buses.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Background
Battery electric vehicles (BEVs) are crucial for a sustainable transportation system. As more people adopt BEVs, it becomes increasingly important to accurately assess the demand for charging infrastructure. However, much of the current research on charging infrastructure relies on outdated assumptions, such as the assumption that all BEV owners have access to home chargers and the "Liquid-fuel" mental model. To address this issue, we simulate the travel and charging demand on three charging behavior archetypes. We use a large synthetic population of Sweden, including detailed individual characteristics, such as dwelling types (detached house vs. apartment) and activity plans (for an average weekday). This data repository aims to provide the BEV simulation's input, assumptions, and output so that other studies can use them to study sizing and location design of charging infrastructure, grid impact, etc.
A journal paper published in Transportation Research Part D: Transport and Environment details the method to create the data (particularly Section 2.2 BEV simulation).
https://doi.org/10.1016/j.trd.2023.103645
Methodology
This data product is centered on the 1.7 million inhabitants of the Västra Götaland (VG) region, which includes the second largest city in Sweden, Gothenburg. We specifically simulated 284,000 car agents who live in VG, representing 35% of all car users and 18% of the total population in the region. They spend their simulation day (representing an average weekday) in a variety of locations throughout Sweden.
This open data repository contains the core model inputs and outputs. The numbers in parentheses correspond to the data sets. We use individual agents' activity plans (1) and travel trajectories from MATSim simulation for the BEV simulation (2), in which we consider overnight charger access (3), car fleet composition referencing the current private car fleet in Sweden (4), and Swedish road network with slope information (5) with realistic BEV charging & discharging dynamics. For the BEV simulation, we tested ten scenarios of charging behavior archetypes and fast charging powers (6). The output includes the time history of travel trajectories and charging of the simulated BEVs across the different scenarios (7).
Data description
The current data product covers seven data files.
(1) Agents' experienced activity plans
File name: 1_activity_plans.csv
Column |
Description |
Data type |
Unit |
person |
Agent ID |
Integer |
- |
act_id |
Activity index of each agent |
Integer |
- |
deso |
Zone code of Demographic statistical areas (DeSO)1 |
String |
- |
POINT_X |
Coordinate X of activity location (SWEREF99TM) |
Float |
meter |
POINT_Y |
Coordinate Y of activity location (SWEREF99TM) |
Float |
meter |
act_purpose |
Activity purpose (work, home, other) |
String |
- |
mode |
Transport mode to reach the activity location (car) |
String |
- |
dep_time |
Departure time in decimal hour (0-23.99) |
Float |
hour |
trav_time |
Travel time to reach the activity location |
String |
hour:minute:second |
trav_time_min |
Travel time in decimal minute |
Float |
minute |
speed |
Travel speed to reach the activity location |
Float |
km/h |
distance |
Travel distance between the origin and the destination |
Float |
km |
act_start |
Start time of activity in minute (0-1439) |
Integer |
minute |
act_time |
Activity duration in decimal minute |
Float |
minute |
act_end |
End time of activity in decimal hour (0-23.99) |
Float |
hour |
score |
Utility score of the simulation day given by MATSim |
Float |
- |
1 https://www.scb.se/vara-tjanster/oppna-data/oppna-geodata/deso--demografiska-statistikomraden/
(2) Travel trajectories
File name: 2_input_zip
Produced by MATSim simulation, the zip folder contains ten files (events_batch_X.csv.gz, X=1, 2, …, 10) of input events for the BEV simulation. They are the moving trajectories of the car agents in their simulation days.
Column |
Description |
Data type |
Unit |
time |
Time in second in a simulation day (0-86399) |
Integer |
Second |
type |
Event type defined by MATSim simulation2 |
String |
- |
person |
Agent ID |
Integer |
- |
link |
Nearest road link consistent with (5) |
String |
- |
vehicle |
Vehicle ID identical to person |
Integer |
- |
2 One typical episode of MATSim simulation events: Activity ends (actend) -> Agent’s vehicle enters traffic (vehicle enters traffic) -> Agent’s vehicle moves from previous road segment to its next connected one (left link) -> Agent’s vehicle leaves traffic for activity (vehicle leaves traffic) -> Activity starts (actstart)
(3) Overnight charger access
File name: 3_home_charger_access.csv
Column |
Description |
Data type |
Unit |
person |
Agent ID |
Integer |
- |
home_charger |
Whether an agent has access to a home garage charger/living in a detached house (0=no, 1=yes) |
Integer |
- |
(4) Car fleet composition
File name: 4_car_fleet.csv
Column |
Description |
Data type |
Unit |
person |
Agent ID |
Integer |
- |
income_class |
Income group (0=None, 1=below 180K, 2=180K-300K, 3=300K-420K, 4=above 420K) |
Integer |
- |
car |
Car model class (B=40 kWh, C=60 kWh, D=100 kWh) |
String |
- |
(5) Road network with slope information
File name: 5_road_network_with_slope.shp (5 files in total)
Column |
Description |
Data type |
Unit |
length |
The length of road |
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This file includes the code and parameters for the baseline MATSim-NYC model, network calibration, and other additional features. All the required input files are saved in the input folder.
The synthetic population as well as the data dictionary are also incorporated.