Loading...
Loading...
Traffic data, public transit, aviation, shipping, ride-hailing, accident records
8,960 datasets
The operation of a fuel cell inverter during grid-forming mode, grid-following mode, and transitions between these modes. It is provided by the Department of Energy and is intended for machine learning and artificial intelligence training applications. Specific data dimensions such as row count and column features are not detailed in the input.
Projected fuel demand per vessel type in the year 2050 at 1377 ports worldwide. The dataset includes baseline fuel demand and projected port throughput, considering fourteen different trade scenarios and excluding trips shorter than 1000 km. Projections are given in tonnes of Heavy Fuel Oil.
Audio recordings of conversations with a voicebot for flight booking, likely involving Passenger Name Record (PNR) interactions. The dataset was uploaded by author 'devansh502' to Hugging Face and last updated on April 1, 2026. Its specific content, size, and structure require verification after download.
May/June 2017 data from the ACLOUD campaign, measuring meteorological parameters at 100 Hz frequency from two aircraft. The dataset, authored by JΓΆrg Hartmann, contains time-stamped, geolocated measurements of temperature, three wind components, pressure, and aircraft attitude for flights based in Longyearbyen, Svalbard. Data processing and accuracy details are provided in referenced publications.
Over 2 million traffic sign images were created by the OLIVES Lab at Georgia Tech to test algorithm robustness. Real-world images from BelgiumTS and synthetic images from Unreal Engine were processed with Adobe After Effects to simulate 12 challenging conditions like rain, snow, and blur. The dataset includes 14 sign types, such as speed limit and stop, with challenge severity levels ranging from 1 to 5.
A dataset containing text prompts and model completions for evaluating creative reasoning capabilities of the Qwen3.5 large language model. It includes 318,934 total tokens generated by the model, with an average of 2,327.99 tokens per interaction. The dataset was created by Crownelius and was last updated in March 2026.
Continuum of Care (CoC) areas are geographic boundaries defined for competitive U.S. Department of Housing and Urban Development (HUD) homeless assistance programs. The data likely contains polygon boundaries for communities coordinating housing and support services to address homelessness. The dataset was last updated on 2026 03 11.
This project produces Ecolabels to compare the environmental impact of popular passenger aircraft, considering different aircraft types, cabin layouts, and engine configurations. It demonstrates the application using four case examples comparing low-cost and legacy carriers, different fleets, engine configurations, and manufacturers.
Google Mobility by Borough provides evidence of how movement in London was affected by COVID-19 control measures and subsequent recovery. The data, aggregated by the Greater London Authority, summarises changes in activity from a baseline for categories like retail, parks, transit, and workplaces. Google collected location data from Android smartphones, comparing visits to a baseline from early 2020, with updates stopping on 15 October 2022.
A report from paperswithcode discusses research on graduated driver licensing programs in the United States and Canada over the past twenty years. It analyzes the effectiveness of these programs in reducing traffic accidents involving young drivers and investigates reasons for their continued higher crash rates. The report targets three specific issues: provision effectiveness, compliance erosion, and accident conditions not addressed by licensing.
Performance and Objective Workload Evaluation Research (POWER) software was developed by Carol A. Manning to provide objective measures of Air Traffic Controller (ATC) taskload and performance. A study investigated the relationship of POWER measures with sector complexity, controller workload, and performance using data from National Airspace System (NAS) System Analysis Recording (SAR) files and traffic samples from Kansas City Center. The exploratory study involved sixteen instructors from the FAA Academy in Oklahoma City watching eight traffic samples via the Systematic Air Traffic Operations Research Initiative (SATORI) system.
A dataset titled 'ROUTER' published on Kaggle. The dataset's content likely relates to network routing hardware or protocols, as suggested by its title. Specific details regarding its size, origin, and creation date are unavailable from the provided metadata.
A review of driving patterns and crash involvement for elderly drivers in the United States, with emphasis on the role of medical conditions and functional limitations. The dataset likely contains analysis and statistics related to crash rates, fatality rates per mile driven, and behavioral adaptations. It was authored by John W. Eberhard and sourced from the paperswithcode platform.
PersonaRoute Bench provides datasets for training and testing personalized routing models, as presented in the 'PersonalizedRouter' paper. The repository, created by ulab-ai, includes data generated via multi-cost-efficiency and LLM-as-a-Judge simulation strategies. The dataset page was last updated in February 2026.
City of Chicago provides individual Divvy bike sharing trip records, including origin, destination, and timestamps. Trips with a subscriber pass include associated basic demographic data like gender and age. The dataset is updated as of March 2026.
A list of stations for the Divvy bicycle sharing system in Chicago. The dataset includes all stations, with separate resources available for active stations and real-time status. It is maintained by the City of Chicago and was last updated in March 2026.
Traffic violation information from all electronic traffic violations issued in Montgomery County, Maryland. The dataset is updated daily by the Montgomery County of Maryland organization. Specific row and column counts are not provided.
May 2016 onward, this dataset contains parking and camera violation records from the City of New York, with weekly updates for new violations and daily updates for status changes. The data is provided in multiple formats including XML, RDF, JSON, and CSV.
Aggregating 1,365 observations for a predictive modeling exercise on income mobility across three generations using Panel Study of Income Dynamics (PSID) data. The task is to predict log income in generation 3 using log incomes from prior generations, education levels, race, and sex. It includes separate learning and holdout files for model training and evaluation.
California Department of Transportation data documents non-motorized traffic volumes on state highways, collected via vendor technologies like video analytics. The dataset includes core variables such as Loc_id, Date/Hour, Direction, Count, and geospatial coordinates. It originates from Caltrans district pilots initiated in 2023 to support active transportation planning.