Loading...
Loading...
Traffic data, public transit, aviation, shipping, ride-hailing, accident records
8,918 datasets
December 2016 trip records for New York City's green taxi line, originally provided by the NYC Taxi and Limousine Commission. The dataset was transformed for a tabular regression benchmark, with 'tip_amount' as the target variable and specific columns removed to alter feature importance. It includes only credit card payment trips, as tips are not reliably recorded for other payment types.
December 2016 trip records for New York City green taxis, provided by the NYC Taxi and Limousine Commission. The dataset was transformed for a tabular data benchmark, focusing on predicting tip amounts from credit card trips. It includes numeric features derived from datetime strings and location IDs, with certain financial columns removed to adjust feature importance.
Over 9 million rows of U.S. domestic flight data from 2018, originally sourced from the Bureau of Transportation Statistics and refined by a data science student for a price prediction project. The dataset includes 13 columns covering origin, destination, miles flown, airline, and price per ticket. It aggregates data from all four quarters of 2018, derived from an original source of over 27 million rows.
Functioning as titled 'nyc-taxi-green-dec-2016'. No information is available on its contents, size, structure, or origin.
Known as titled 'nyc-taxi-green-dec-2016'. No information is available on its contents, size, structure, or origin.
Career Transition Analytics 2025 is a dataset hosted on Kaggle. The title suggests it contains data related to job changes, career paths, or workforce mobility for the year 2025. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Network traffic data from IoT devices designed for botnet attack detection. The dataset includes traffic labeled as benign, Mirai, and Gafgyt. It was sourced from Kaggle, but specific details about its creation, size, and update frequency are not provided.
A collection of resources related to fixing crashed notebooks, likely from the Kaggle platform. The dataset may contain examples, scripts, or documentation aimed at recovering data science workflows after interruptions. Its specific content, size, and authorship are unverified from the provided metadata.
June to October 2021 bathymetry survey of Van Diemen Gulf, Northern Territory, acquired for the Australian Hydrographic Office. Data was processed into a 30-meter resolution, 32-bit floating point GeoTIFF grid covering the survey area.
NASA STEREO mission data from the Extreme Ultraviolet Imager (EUVI), part of the SECCHI instrument suite. The EUVI captures solar imagery in four spectral channels (30.4 nm, 17.1 nm, 19.5 nm, 21.1 nm) with a 2048 x 2048 pixel resolution and a field of view out to 1.7 solar radii. This data is designed to observe coronal mass ejections (CMEs) from two vantage points, offering improved resolution and cadence over its predecessor.
A bilingual French/Arabic instruction-based dataset designed for legal AI systems specializing in the Moroccan Road Code (Law 52-05). It contains structured instruction-following examples where a legal question is posed, excerpts from legal texts are provided as input, and a clear, structured output is given. The dataset was created by author ApyHTML19 and was last updated on March 18, 2026.
Nathan Ashby's 1.5 GB dataset supports research on NCAA labor market design, containing STATA files to replicate tables from the paper 'Tipping the Balance: NCAA Labor Restrictions, Mobility, and Productivity'. The data includes variables for analyzing monopsony, matching, and labor mobility within collegiate sports.
Travel in London 3 report data includes aggregate daily journey volumes from 1993-2009, modal shares for 2009, and annual public transport passenger kilometres from 2008/09-2010/11. The spreadsheet compiles over 20 tables from Transport for London, covering road traffic indices, cycle flows, airport passengers, and road casualties. Data was published by the Greater London Authority and last updated on the platform in March 2026.
A paper by Douglas A. Stall discusses the application of a highly realistic driving simulator for evaluating in-vehicle Intelligent Transportation Systems (ITS) and Automated Highway System (AHS) technologies. The work is motivated by the U.S. Department of Transportation's goal to deploy ITS infrastructure in 75 metropolitan areas to reduce daily travel time by 15%. The simulator is presented as a method for assessing safety, driver workload, and traffic efficiency improvements.
Flight_price_prediction is a dataset hosted on Kaggle. Its title suggests it contains data for predicting commercial airfare prices. The dataset's specific contents, size, and origin are not detailed in the available metadata.
Taxi trajectory data from Porto, Portugal, converted into graph-structured snapshots. The dataset is hosted on Kaggle, but its author, organization, and specific collection details are unknown. The original data likely contains GPS coordinates and timestamps, which have been processed into a graph representation.
A collection of user-generated content related to Uber, hosted on Hugging Face. The dataset was created by author TITAN-2 and was last updated on May 7, 2026. The specific content and scale of the data require verification after download.
SmartParking API provides real-time bay sensor data from a 12-month trial designed to ease traffic congestion. The data is managed by the ACT Government Open Data team and was last updated in March 2026. Registration is required for API access.
Global data related to aviation, likely containing information on flights, airports, or airlines. The dataset is published on Kaggle, but its specific contents, creator, and update history are not detailed in the provided metadata. Further details about the data's origin, size, and structure require inspection after download.
Inspection results from the New York City Department of Transportation's Commercial Bicyclist Unit (CBU). The dataset lists outcomes from inspections of businesses that use bicycles for commercial purposes. The data was last updated on March 22, 2026.