2021 trip records for yellow taxis in New York City, generated from submissions by Technology Service Providers. Each row represents a single trip, capturing pick-up and drop-off times and locations, distances, itemized fares, and passenger counts. The data is hosted by data.cityofnewyork.us and was last updated in December 2023.
Use Cases
- Predict total_amount from trip_distance, fare_amount, and tolls_amount using regression models.
- Analyze peak travel times and congestion patterns using tpep_pickup_datetime and tpep_dropoff_datetime.
- Model passenger demand and distribution across taxi zones using PULocationID, DOLocationID, and passenger_count.
- Study fare composition and tip behavior using fare_amount, tip_amount, extra, and payment_type.
Strengths
- Includes detailed itemized fare columns such as fare_amount, tip_amount, mta_tax, and tolls_amount.
- Contains precise temporal and spatial identifiers with tpep_pickup_datetime, tpep_dropoff_datetime, PULocationID, and DOLocationID.
- Data is provided in multiple accessible formats including CSV, JSON, XML, and RDF.
Limitations
- Row count and total dataset size are unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- data.cityofnewyork.us
- Collection Method
- Generated from trip record submissions made by yellow taxi Technology Service Providers (TSPs).
- Time Range
- 2021
- Freshness
- Last updated 2023-12-14 20:46:51; freshness should be verified.
- Geography
- New York City, based on taxi zone location IDs.