Loading...
Loading...
Traffic data, public transit, aviation, shipping, ride-hailing, accident records
8,988 datasets
A processed and curated dataset for network traffic anomaly detection, derived from the CSE-CIC-IDS2018 intrusion detection dataset. It is designed for machine learning and deep learning research on network security and intrusion detection. The dataset was uploaded by author 'abmallick' and was last updated on December 15, 2025.
New York City's MTA bus system performance data from 2023-2024 provides average speeds and travel times between consecutive timepoints for every bus route. The dataset aggregates millions of bus trips by month, day of week, and hour of day, and includes route type, borough, and stop coordinates. It is published by data.ny.gov and was last updated in November 2025.
U.S. Census Bureau TIGER/Line shapefiles provide a seamless geographic representation of California's state-managed road infrastructure. The dataset distinguishes between primary roads (MTFCC S1100), such as interstate highways, and secondary roads (MTFCC S1200), including U.S. and State Highways. It is extracted from the national Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) System.
Quarterly bus fare evasion rates for New York City Transit, estimated systemwide using Automated Passenger Counters. Data covers Local/Limited, Express, and Select Bus Service categories, with methodology shifting from traffic checker surveys to automated counters in Q1 2020.
PhysX-Mobility is a dataset designed to address a gap in physics-annotated 3D data. It is the first dataset systematically annotated across five foundational dimensions: absolute scale, material, affordance, kinematics, and function description. The dataset was created by Caoza and was last updated on Hugging Face in December 2025.
Google Global Mobility Data provides anonymized, aggregated trends in human movement across geographic regions. The dataset originates from Google's analysis of location history data from users who enabled this setting. It was published to inform public health responses during the COVID-19 pandemic.
Road Accident Dataset (Cleaned & Structured) is a dataset hosted on Kaggle. The dataset's title suggests it contains information about traffic incidents. The platform tags indicate a potential focus on Africa and applications in data visualization and analytics.
Reasonmap is a benchmark dataset for evaluating multimodal large language models on fine-grained visual reasoning tasks using transit maps. The dataset was created by FSCCS and is associated with a research paper and project page. It was last updated on January 6,ๆไปฌๅ็ฐไบไธไธช้ฎ้ขใ
Taxi_Trip_Data_CSV is a dataset published on Kaggle. The dataset likely contains records of taxi journeys, which may include details like pickup and drop-off times, locations, and fares. Metadata is minimal; actual content requires verification after download.
MobilityData maintains this curated repository of transit-related APIs, software, and datasets, with the most recent update recorded in January 2026. It serves as a central directory for the transit community, focusing heavily on General Transit Feed Specification (GTFS) tools and real-time data resources.
171 Starbucks locations in Manhattan enriched with urban planning and demographic data from PLUTO, MTA, and Census sources. The dataset integrates pedestrian counts and Labor Force Survey (LFS) metrics to provide a spatial context for retail competition.
250,000 images of Chinese license plates across categories like weather, tilt, and illumination for oriented bounding box detection. Each image includes annotations for plate localization and character recognition across various parking lot environments.
Cyclistic Case Study dataset, associated with a Google Certificate program, is hosted on Kaggle. The dataset relates to the Divvy bicycle sharing service, likely containing trip records for analysis. The specific scope, size, and time range of the data are not detailed in the provided metadata.
Published on Kaggle, this dataset likely contains metrics on how people's movement patterns changed due to the COVID-19 pandemic. The raw description indicates the data is intended to help understand mobility shifts. Specific details on the time range, geography, and data collection method are not provided in the available metadata.
Nemotron Equation Candidate Critique Router v1 is a dataset published on Kaggle. The title suggests it contains evaluations or critiques of candidate outputs from a large language model, likely for routing or ranking purposes. The dataset's specific content, size, and authorship are not detailed in the provided metadata.
Flight_Ticket is a dataset hosted on Kaggle, likely containing information related to air travel bookings. The specific content, such as pricing, routes, or booking details, must be verified after download due to minimal metadata. Its origin and collection method are currently unspecified.
Traffic cameras are deployed at street intersections in Arlington County, Virginia, for traffic management purposes. The dataset lists all cameras currently in use and their specific geographic positions. The data is provided by Arlington County, VA, and was last updated in February 2026.
NASA's Launch Services Program provides a historical record of spacecraft launches from 1999 to the present. The data is maintained by the National Aeronautics and Space Administration and was last updated in January 2026.
Flight delay records spanning an 11-year period from 2014 to 2024. The dataset is hosted on Kaggle, but its specific origin, size, and detailed contents are not described in the provided metadata. Columns and data volume are unknown.
A dataset on road accidents in the United Kingdom, published on Kaggle. The specific temporal coverage, data volume, and collection methodology are not detailed in the available metadata. The dataset likely contains records of traffic incidents.