2022 trip records for yellow taxis in New York City, generated from submissions by Technology Service Providers. Each row represents a single trip, with columns capturing pick-up and drop-off times and locations, distances, itemized fares, and passenger counts. The data is hosted by data.cityofnewyork.us and was last updated in December 2023.
Use Cases
- Predict total_amount from trip_distance, fare_amount, and tolls_amount using regression models.
- Analyze trip patterns and demand hotspots using PULocationID and DOLocationID for geospatial analysis.
- Model passenger_count distribution and its relationship to fare_amount and trip_distance.
- Analyze payment_type trends and their correlation with tip_amount and time of day from tpep_pickup_datetime.
Strengths
- Includes 19 detailed columns such as itemized fares, precise timestamps, and taxi zone IDs.
- Data is sourced directly from official Technology Service Provider submissions.
- Available in multiple machine-readable formats including CSV, JSON, XML, and RDF.
Limitations
- Row count is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- data.cityofnewyork.us
- Collection Method
- Generated from trip record submissions made by yellow taxi Technology Service Providers (TSPs).
- Time Range
- 2022
- Freshness
- Last updated 2023-12-14 20:59:46; freshness should be verified.
- Geography
- New York City (based on taxi zone location IDs)