2013 trip records from yellow taxi Technology Service Providers in New York City. Each row represents a single taxi trip, capturing pick-up and drop-off details, fares, distances, and payment types. The data is published by data.cityofnewyork.us and was last updated in December 2023.
Use Cases
- Predicting total_amount from trip_distance, fare_amount, and tolls_amount using regression models.
- Analyzing peak-hour demand by clustering trips based on tpep_pickup_datetime and pickup_location.
- Classifying payment_type based on fare_amount, trip_distance, and passenger_count.
- Modeling spatial trip patterns by correlating pickup_longitude and pickup_latitude with dropoff_longitude and dropoff_latitude.
- Estimating tip_amount as a function of total_amount, payment_type, and trip_distance.
Strengths
- Includes 20 detailed features per trip record such as GPS coordinates, timestamps, and itemized fares.
- Data covers a full calendar year (2013) of taxi activity.
Limitations
- Exact row count is unknown, making sample size and representativeness unclear.
- Data is over a decade old, limiting analysis of current transportation trends.
- Geographic coverage is limited to New York City yellow taxis only.
Provenance
- Source
- data.cityofnewyork.us
- Collection Method
- Generated from trip record submissions by yellow taxi Technology Service Providers.
- Time Range
- 2013
- Freshness
- Last updated December 2023, but contains data from 2013.
- Geography
- New York City