Loading...
Loading...
Traffic data, public transit, aviation, shipping, ride-hailing, accident records
8,959 datasets
33,345 records of slave voyages and captive estimates across 40 routes from the 1650s to the 1860s. Created by Patrick Manning for the African Population and Migration Dataverse, the dataset uses Bayesian statistics to impute missing data on captive flows. It includes R-language code for simulating routes and populations based on these historical estimates.
Novel-Crafting-250x is a text dataset created by Crownelius and hosted on Hugging Face, last updated on March 15, 2026. It contains 1,179,878 total tokens, comprising 157,900 prompt tokens and 1,021,978 completion tokens. The data was generated with an average of 1.00 turns per interaction and no tool calls.
A final report from a follow-on study to the Crash Avoidance Metrics Partnership (CAMP) human factors work. The research gathered data from drivers performing last-second braking and steering maneuvers under normal or hard intensity instructions across a wide variety of vehicle-to-vehicle kinematic scenarios. Results validated a prior Required Deceleration Model and introduced a new 3-Tiered Inverse Time-To-Collision Model for crash alert timing.
airline-satisfaction-project-sample is a dataset hosted on Kaggle. The title suggests it likely contains records related to passenger feedback and airline service quality. Metadata is minimal; the actual content, scale, and origin require verification after download.
A dataset of traffic sign images, likely sourced from Kaggle. The specific number of images, collection method, and temporal coverage are unknown from the provided metadata. The dataset appears to be intended for machine learning tasks related to road infrastructure.
New York City Taxi and Limousine Commission (TLC) Trip Record Data contains records of trips taken by taxis and for-hire vehicles in New York City. The dataset is hosted on AWS Open Data and is provided by the City of New York Taxi and Limousine Commission. The license is the City of New York's Terms of Use.
GLM-5.0-8000x-formatted-fixed is a dataset of formatted text interactions, likely for training or evaluating language models. The dataset contains 4,090,360 total tokens, comprising 512,812 prompt tokens and 3,577,548 completion tokens, with an average of 261.87 tokens per row. It was uploaded by Crownelius to Hugging Face and was last updated on March 15, 2026.
Ohio restaurant market data sample from BeamStation includes traffic changes exceeding 30% and sentiment analysis. The dataset is a free sample posted on Kaggle, but its full size, specific time range, and detailed column structure are unknown.
Datasets intended for forecasting urban traffic at a metropolis scale. The specific number of records, data collection method, and temporal coverage are not provided in the available metadata. The data is hosted on Kaggle, but the author, organization, and last update date are unknown.
Accident_file likely contains records of traffic incidents. The dataset is published on Kaggle, but its specific origin, size, and creation date are unknown. Columns and data characteristics require verification after download.
The dataset likely examines bicycle usage trends during the year 2020. It was published on Kaggle, though the specific author and organization are unknown. The data may reflect changes in mobility patterns associated with the COVID-19 pandemic.
Flight-related data covering the period from 2024 to 2025. The dataset is hosted on Kaggle, but its specific origin and collection method are not detailed. The exact number of records and the complete set of data fields are unknown.
Raw flight data likely containing records of air travel for the 2024-2025 period. The dataset is hosted on Kaggle, but its specific origin, collection method, and detailed contents are not provided. Metadata such as column definitions, file size, and license information are currently unknown.
E-commerce shipping data published on Kaggle. The dataset likely contains records related to order fulfillment and delivery logistics. Metadata is minimal; the specific content, scale, and origin require verification after download.
Information about vehicles serving passenger routes in Poltava, Ukraine, sourced from the States site of Ukraine. The dataset likely contains details on vehicles serving city bus routes under public agreements, including vehicle count per route, brand, model, state number, and passenger capacity. The dataset was last updated on 2026-02-20.
Final_flight is a dataset published on Kaggle. The title suggests it contains information related to aviation, such as flight records or operational data. Specific details on size, columns, and origin are not provided in the available metadata.
Kaggle hosts a dataset titled 'smart-parking-db'. The dataset likely contains records related to parking space availability, occupancy, or sensor data. Metadata is minimal; actual content requires verification after download.
Mountain Project provides a listing of rock climbing routes across the United States. The dataset's specific temporal coverage, size, and update frequency are not detailed in the provided metadata. It likely contains route details such as location, difficulty, and type, aggregated from the Mountain Project platform.
Processed New York City taxi data from 2025 to 2026, updated with available TLC data. The dataset appears to contain metrics related to taxi revenue and trips. The original author and specific license are unknown.
28-day update cycles from the FAA provide current geospatial data on runway infrastructure. The dataset contains physical characteristics of runways at all official and operational aerodromes, linking to associated airports in the National Transportation Atlas Database. Derived from the FAA's National Airspace System Resource Aeronautical Data.