Warsaw Bike-Sharing Stations Dataset (RAW) - Season 2023
Description
This dataset contains information about Warsaw’s bikesharing system, collected every five minutes between May 6 and November 29, 2023. The data cover various aspects of the system, including detailed records of bike trips, station attributes (e.g., distances to points of interest), and comprehensive station snapshots (number of available bikes, bike IDs, etc.). Additionally, it includes geographical data (a 5×5 km grid covering Warsaw) as well as historical weather data (temperature, precipitation, wind) for the same period, aligned with the grid’s centroids. Data Collection and Scope 1. Frequency of Data Collection: - Every 5 minutes from May 6 to November 29, 2023. 2. Location: - City of Warsaw (Poland). 3. Data Types: - Bikesharing usage (bike movements, station statuses, bike IDs). - Geospatial attributes of stations and a 5×5 km grid of Warsaw. - Weather data (recorded at the grid centroids to reduce API calls). Files Included 1. bike_paths.json Contains trip records for individual bikes (by bike ID). - Each entry specifies: a) from: the station ID where the trip started. b) to: the station ID where the trip ended. c) departure_time: date and time the bike left the station. d) arrival_time: date and time the bike arrived at the next station. 2. bike_stations_with_attributes.geojson A GeoJSON file describing various attributes of each station, including: - Distances to points of interest (city center, metro station, bus/tram stop, etc.). - Population density near the station (e.g., pop_2023). 3. stations.rar A RAR archive containing 59,694 JSON files. Each file captures the state of all bikesharing stations at a given 5-minute interval. Information per file (one snapshot in time): - Date and time (year, month, day, hour, minute). - List of stations (stations_data), where each station includes: - Unique station ID (uid), station name, station type. - Location coordinates (lat, lng). - Availability status (number of bikes, free racks, etc.). - Detailed list of bikes currently at the station (bike IDs, types). - Centroid coordinates for higher-level aggregation. 4. warszawa_centroidy_5km.geojson A GeoJSON file containing coordinates of centroids for 5×5 km grid cells covering the Warsaw area. These centroid points were used to minimize the number of calls to the weather API (i.e., stations in the same cell share one centroid for weather queries). 5. warszawa_siatka_5km.geojson A GeoJSON file containing the polygons (squares) of the 5×5 km grid covering Warsaw. Useful for spatial analyses (e.g., aggregating station usage by grid cell). 6. weather_data.csv A CSV file with historical weather data for each centroid during the study period (May 6 – November 29, 2023). Each record includes: - Timestamp of the measurement, - Centroid coordinates (lat, lng), - Temperature (°C), - Weather description (e.g., Clear, Rain), - Precipitation (mm), - Wind speed (km/h).