Salvador Urban Network Transportation (SUNT)

Published: 28 April 2025| Version 1 | DOI: 10.17632/85fdtx3kr5.1
Contributors:
Marcos Vinícius,
,
,
,
,
,
,

Description

Efficient public transportation management is essential for the development of large urban centers, providing several benefits such as comprehensive coverage of population mobility, the decrease of transport costs, better control of traffic congestion, and significant reduction of environmental impact limiting gas emissions and pollution. Realizing these benefits requires a deeply understanding the population and transit patterns and the adoption of approaches to model multiple relations and characteristics efficiently. This work addresses these challenges by providing a novel dataset that includes various public transportation components from three different systems: regular buses, subway, and BRT (Bus Rapid Transit). Our dataset comprises daily information from about 710,000 passengers in Salvador, one of Brazil's largest cities, and local public transportation data with approximately 2,000 vehicles operating across nearly 400 lines, connecting almost 3,000 stops and stations. With data collected from March 2024 to March 2025 at a frequency lower than one minute, SUNT stands as one of the largest, most comprehensive, and openly available urban datasets in the literature.

Files

Steps to reproduce

The source code, models, and processed datasets (Alighting, Boarding, and OD) are freely available at https://github.com/LabIA-UFBA/SUNT.

Institutions

Universidade Federal da Bahia, Universidade Federal da Bahia Departamento de Ciencia da Computacao

Categories

Artificial Intelligence, Machine Learning, Big Data, Time Series, Public Transport, Deep Neural Network, Database, Road Public Transport, Graph Neural Network

Funding

National Council for Scientific and Technological Development

404771/2024-6

National Council for Scientific and Technological Development

406354/2023-5

National Council for Scientific and Technological Development

312755/2023-6

National Council for Scientific and Technological Development

313053/2023-5

Licence