Integrating diverse data sources to support future-proof transport modelling

Abstract

Transport models historically rely on limited input datasets, such as ’trip generators’ and simplified networks, leading to biases and blind-spots. This lack of data diversity can lead to biases and blind-spots in model outputs. For example, over-reliance on commuting data over-emphasises arterial routes to historic employment centres, while motorised traffic datasets disproportionately highlight long-distance car trips and neglect active travel. Transport models were developed at a time when data was scarce and expensive to collect but the ‘data revolution’ has changed this. We argue that models should be capable of integrating open, proprietary, and crowdsourced, datasets, with ease of integrating new data sources being a key design principle. We present a case study of this approach in the Network Planning Tool for Scotland (NPT), which is publicly available at npt.scot. The NPT integrates data on transport infrastructure from 4 sources: OpenStreetMap, Ordnance Survey (OS) OpenRoads, OS MasterMap Highways, and OS Mastermap Topography, and we are planning to add more, including from the Scottish Spatial Hub, that integrates datasets from Scottish local authorities and partners. Furthermore, the NPT integrates multiple datasets on transport behaviour (including from the Census, the Scottish Household Travel Survey and the British National Travel Survey), and scenarios of change based on international datasets, supporting more data-driven cycling strategies. The results highlight the benefits of data integration, with results tending to improve as more data sources are added, and diminishing returns highlighting the importance of careful selection of input datasets. The approach, based on reproducible code written in open source languages, can be generalised and packaged for benefit of others seeking develop future-proof modelling solutions to transport challenges. We argue that integrating diverse data sources is essential for future-proof transport modelling, enabling adaptation to evolving travel patterns and behaviours.

Date
Jun 25, 2025 9:00 AM — 10:00 AM
Location
University College Dublin
University College Dublin, Dublin,

I presented an extended abstract at the 57th Universities Transport Studies Group (UTSG) Annual Conference, held at University College Dublin on 25th June 2025. The abstract is titled “Integrating diverse data sources to support future-proof transport modelling” and discusses the limitations of traditional transport models that rely on limited input datasets, leading to biases and blind-spots in model outputs. The presentation highlights a case study of the Network Planning Tool for Scotland (NPT), which integrates multiple datasets to support more evidence-based and data-driven transport strategies.

The extended abstract can be found here.

Robin Lovelace
Robin Lovelace
Professor of Transport Data Science

My research interests include geocomputation, data science for transport applications, active travel uptake and decarbonising transport systems