MIT License https://opensource.org/licenses/MIT
License information was derived automatically
New York City's Taxi & Limousine Commission (TLC) has defined Taxi Zones, "which are meant to approximate neighborhoods, so people can see which neighborhood a passenger was picked up in, and which neighborhood they were dropped off in" [1], in their TLC Trip Record Data.
Files from https://data.cityofnewyork.us/Transportation/NYC-Taxi-Zones/d3c5-ddgc
[1] TLC Trip Record User Guide https://www.nyc.gov/assets/tlc/downloads/pdf/trip_record_user_guide.pdf
CC0 1.0 https://creativecommons.org/publicdomain/zero/1.0/
The New York City Taxi and Limousine Commission (TLC) oversees the licensing and regulation of taxi cabs and for-hire vehicles in the city. The TLC gathers data from over 200,000 license holders, including taxi drivers and limousine operators, who collectively complete around one million trips each day.
Note: The dataset used for this project was designed for educational purposes and may not accurately represent the behavior of taxi cab riders in New York City.
| Column name | Description |
|---|---|
| ID | Trip identification number |
| VendorID | A code indicating the TPEP provider that provided the record. 1= Creative Mobile Technologies, LLC; 2= VeriFone Inc. |
| tpep_pickup_datetime | The date and time when the meter was engaged |
| tpep_dropoff_datetime | The date and time when the meter was disengaged |
| Passenger_count | The number of passengers in the vehicle. This is a driver-entered value |
| Trip_distance | The elapsed trip distance in miles reported by the taximeter |
| RateCodeID | The final rate code in effect at the end of the trip. 1 = Standard rate; 2 = JFK; 3 = Newark; 4 = Nassau or Westchester; 5 = Negotiated fare; 6 = Group ride |
| Store_and_fwd_flag | This flag indicates whether the trip record was held in vehicle memory before being sent to the vendor, aka “store and forward,” because the vehicle did not have a connection to the server. Y = store and forward trip; N = not a store and forward trip |
| PULocationID | TLC Taxi Zone in which the taximeter was engaged |
| DOLocationID | TLC Taxi Zone in which the taximeter was disengaged |
| Payment_type | A numeric code signifying how the passenger paid for the trip. 1 = Credit card; 2 = Cash; 3 = No charge; 4 = Dispute; 5 = Unknown; 6 = Voided trip |
| Fare_amount | The time-and-distance fare calculated by the meter |
| Extra | Miscellaneous extras and surcharges. Currently, this only includes the $0.50 and $1 rush hour and overnight charges |
| MTA_tax | $0.50 MTA tax that is automatically triggered based on the metered rate in use |
| Tip_amount | Tip amount – This field is automatically populated for credit card tips. Cash tips are not included |
| Tolls_amount | Total amount of all tolls paid in trip |
| Improvement_surcharge | $0.30 improvement surcharge assessed on trips at the flag drop. The improvement surcharge began being levied in 2015 |
| Total_amount | The total amount charged to passengers. Does not include cash tips |
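The coded columns in the table above are easier to read once mapped to their labels. The following is a minimal sketch, assuming each trip record is available as a Python dictionary keyed by the column names in the table; the `decode_trip` helper and the sample record are hypothetical and not part of the dataset.

```python
# Code-to-label mappings copied from the data dictionary above.
PAYMENT_TYPES = {
    1: "Credit card",
    2: "Cash",
    3: "No charge",
    4: "Dispute",
    5: "Unknown",
    6: "Voided trip",
}
RATE_CODES = {
    1: "Standard rate",
    2: "JFK",
    3: "Newark",
    4: "Nassau or Westchester",
    5: "Negotiated fare",
    6: "Group ride",
}


def decode_trip(row):
    """Return a copy of a trip record with readable labels added.

    Codes that are not listed in the data dictionary are labelled
    "Unrecognized code" instead of raising an error.
    """
    decoded = dict(row)
    decoded["payment_label"] = PAYMENT_TYPES.get(row.get("Payment_type"), "Unrecognized code")
    decoded["rate_label"] = RATE_CODES.get(row.get("RateCodeID"), "Unrecognized code")
    return decoded


# Hypothetical record, for illustration only.
sample = {"ID": 1, "Payment_type": 2, "RateCodeID": 1}
print(decode_trip(sample))
```

Keeping the mappings as plain dictionaries makes it straightforward to extend them if a later data dictionary adds codes.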
This dataset consists of a set of polygons that will allow users of New York City Taxi and Limousine Commission data on trip records for taxi and for-hire vehicle trips to determine which trips originate and/or end in a Taxi Zone in the Central Business District and are thus subject to the CBD Tolling Program. TLC trip record data is available on NYC Open Data in separate datasets each year for yellow taxis, green taxis, for-hire vehicles (FHVs) dispatched by a high-volume for-hire vehicle service (HVFHVs), and other FHVs.
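Once the zone IDs covered by the CBD polygons are known, flagging trips reduces to a set-membership check on the pickup and drop-off zones. The sketch below assumes trip records are dictionaries with `PULocationID` and `DOLocationID` keys; the zone IDs in `CBD_ZONE_IDS` are placeholders, not the real CBD zones, and `touches_cbd` is a hypothetical helper.

```python
# Placeholder zone IDs standing in for the zones covered by the CBD polygons.
# The real IDs must be taken from the published dataset.
CBD_ZONE_IDS = frozenset({1001, 1002, 1003})


def touches_cbd(trip, cbd_zones=CBD_ZONE_IDS):
    """Return True if the trip starts and/or ends in a CBD taxi zone."""
    return trip["PULocationID"] in cbd_zones or trip["DOLocationID"] in cbd_zones


# Hypothetical trips, for illustration only.
trips = [
    {"PULocationID": 1001, "DOLocationID": 2000},  # starts in a CBD zone
    {"PULocationID": 2000, "DOLocationID": 2001},  # never touches the CBD
    {"PULocationID": 2001, "DOLocationID": 1003},  # ends in a CBD zone
]
cbd_trips = [trip for trip in trips if touches_cbd(trip)]
print(len(cbd_trips))
```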
These records are generated from the trip record submissions made by yellow taxi Technology Service Providers (TSPs). Each row represents a single trip in a yellow taxi. The trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off taxi zone locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by ahmadreza rostamani
Released under Apache 2.0
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Yellow taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemised fares, rate types, payment types, and driver-reported passenger counts. The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorised under the Taxicab & Livery Passenger Enhancement Programs (TPEP/LPEP). The trip data was not created by the TLC, and TLC makes no representations as to the accuracy of these data.
For-Hire Vehicle (“FHV”) trip records include fields capturing the dispatching base license number and the pick-up date, time, and taxi zone location ID (shape file below). These records are generated from the FHV Trip Record submissions made by bases. Note: The TLC publishes base trip record data as submitted by the bases, and we cannot guarantee or confirm their accuracy or completeness. Therefore, this may not represent the total amount of trips dispatched by all TLC-licensed bases. The TLC performs routine reviews of the records and takes enforcement actions when necessary to ensure, to the extent possible, complete and accurate information.
https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf
| Sr no. | Field Name | Description |
|---|---|---|
| 1. | VendorID | A code indicating the TPEP provider that provided the record. 1 = Creative Mobile Technologies, LLC; 2 = VeriFone Inc. |
| 2. | tpep_pickup_datetime | The date and time when the meter was engaged. |
| 3. | tpep_dropoff_datetime | The date and time when the meter was disengaged. |
| 4. | Passenger_count | The number of passengers in the vehicle. (Driver-entered value) |
| 5. | Trip_distance | The elapsed trip distance in miles reported by the taximeter. |
| 6. | PULocationID | TLC Taxi Zone in which the taximeter was engaged. |
| 7. | DOLocationID | TLC Taxi Zone in which the taximeter was disengaged. |
| 8. | RateCodeID | The final rate code in effect at the end of the trip. 1 = Standard rate; 2 = JFK; 3 = Newark; 4 = Nassau or Westchester; 5 = Negotiated fare; 6 = Group ride |
| 9. | Store_and_fwd_flag | This flag indicates whether the trip record was held in vehicle memory before being sent to the vendor. Y = store and forward trip; N = not a store and forward trip |
| 10. | Payment_type | A numeric code signifying how the passenger paid for the trip. 1 = Credit card; 2 = Cash; 3 = No charge; 4 = Dispute; 5 = Unknown; 6 = Voided trip |
| 11. | Fare_amount | The time-and-distance fare calculated by the meter. |
| 12. | Extra | Miscellaneous extras and surcharges. Currently, this only includes the $0.50 and $1 rush hour and overnight charges. |
| 13. | MTA_tax | $0.50 MTA tax that is automatically triggered based on the metered rate in use. |
| 14. | Improvement_surcharge | $0.30 improvement surcharge assessed on trips at the flag drop. The improvement surcharge began being levied in 2015. |
| 15. | Tip_amount | Tip amount – This field is automatically populated for credit card tips. Cash tips are not included. |
| 16. | Tolls_amount | Total amount of all tolls paid in trip. |
| 17. | Total_amount | The total amount charged to passengers. Does not include cash tips. |
| 18. | Congestion_Surcharge | Total amount collected in trip for NYS congestion surcharge. |
| 19. | Airport_fee | $1.25 for pick up only at LaGuardia and John F. Kennedy Airports. |
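The pickup and drop-off timestamps together with `Trip_distance` are enough to derive a trip's duration and average speed. The following is a minimal sketch, assuming the timestamps have been read as strings in `YYYY-MM-DD HH:MM:SS` form (an assumption about how the file was loaded, not something the data dictionary states); both helper functions are hypothetical.

```python
from datetime import datetime

# Assumed timestamp layout; adjust if the data is loaded in another form.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S"


def trip_duration_minutes(pickup, dropoff):
    """Minutes between meter engagement and disengagement."""
    start = datetime.strptime(pickup, TIMESTAMP_FORMAT)
    end = datetime.strptime(dropoff, TIMESTAMP_FORMAT)
    return (end - start).total_seconds() / 60.0


def average_speed_mph(trip_distance_miles, duration_minutes):
    """Average speed in miles per hour, or None for non-positive durations."""
    if duration_minutes <= 0:
        return None
    return trip_distance_miles / (duration_minutes / 60.0)


# Hypothetical trip: 6 miles in 30 minutes.
minutes = trip_duration_minutes("2024-01-01 08:00:00", "2024-01-01 08:30:00")
print(minutes, average_speed_mph(6.0, minutes))
```

Returning `None` for zero or negative durations keeps records with inconsistent timestamps from producing division errors or negative speeds.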
Photo by Mourad Saadi on Unsplash
This dataset includes trip records from all trips completed in green taxis in NYC in 2014. Records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts. The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorized under the Livery Passenger Enhancement Program (LPEP). The trip data was not created by the TLC, and TLC makes no representations as to the accuracy of these data.
CC0 1.0 https://creativecommons.org/publicdomain/zero/1.0/
In New York City, all taxis are regulated by the Taxi and Limousine Commission (TLC). Here is a brief description of the TLC:
The New York City Taxi and Limousine Commission (TLC), created in 1971, is the agency responsible for licensing and regulating New York City's Medallion (Yellow) taxi cabs, for-hire vehicles (community-based liveries, black cars and luxury limousines), commuter vans, and paratransit vehicles. The Commission's Board consists of nine members, eight of whom are unsalaried Commissioners. The salaried Chair/ Commissioner presides over regularly scheduled public commission meetings and is the head of the agency, which maintains a staff of approximately 600 TLC employees. Over 200,000 TLC licensees complete approximately 1,000,000 trips each day. To operate for hire, drivers must first undergo a background check, have a safe driving record, and complete 24 hours of driver training. TLC-licensed vehicles are inspected for safety and emissions at TLC's Woodside Inspection Facility.
The TLC has released its trip record data to the public for research and study purposes. There are three main taxi types in NYC:
- Yellow taxis are traditionally hailed by signaling to a driver who is on duty and seeking a passenger (street hail), but they may now also be hailed using an e-hail app like Curb or Arro. Yellow taxis are the only vehicles permitted to respond to a street hail from a passenger in all five boroughs.
- Green taxis, also known as boro taxis and street-hail liveries, were introduced in August 2013 to improve taxi service and availability in the boroughs. Green taxis may respond to street hails, but only in the areas indicated in green on the map (i.e. above W 110 St/E 96th St in Manhattan and in the boroughs).
- FHV data includes trip data from high-volume for-hire vehicle bases (bases for companies dispatching 10,000+ trips per day, meaning Uber, Lyft, Via, and Juno), community livery bases, luxury limousine bases, and black car bases.
Uber is one of the biggest ride-hailing service providers, so its trip records are also collected in the High Volume For-Hire Vehicle Trip Records.
Based on this dataset, there are some business goals we want to achieve to improve Uber's ride-hailing service:
- Exploratory data analysis: explore fhvhv_tripdata_2021 and identify the underlying trip patterns in 2021.
- Based on fhvhv_tripdata_2021 and the weather data, build a model to predict peak footfall.
- Explore Uber's user profile in NYC (which orders are urgent, and what kind of users should be given higher priority?).
Some useful tips about this dataset:
- The trip data files for the for-hire vehicles are named like fhvhv_tripdata_2021-0X.parquet.
- For descriptions of the trip data columns, refer to data_dictionary_trip_records_hvfhs.pdf.
- The taxi_zones folder contains the geospatial data of the NYC taxi zones (geopandas would be helpful).
- taxi_zone_lookup.csv stores the taxi zones' zip codes and other relevant information.
- nyc 2021-01-01 to 2021-12-31.csv records the weather data for 2021.
- taxi+_zone_lookup.csv stores the zone information for all taxis.
- Files ending in .parquet can be read with the pyarrow package and converted to a pandas DataFrame.
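The monthly trip files follow a regular naming pattern, so the full list for a year can be generated instead of typed by hand. This is a small sketch that assumes months 10 to 12 use the same zero-padded two-digit suffix as the `0X` pattern shown in the tips; `monthly_file_names` is a hypothetical helper.

```python
def monthly_file_names(year=2021, prefix="fhvhv_tripdata"):
    """Build the twelve monthly Parquet file names for one year.

    Assumes every month uses a zero-padded two-digit suffix,
    e.g. fhvhv_tripdata_2021-01.parquet.
    """
    return [f"{prefix}_{year}-{month:02d}.parquet" for month in range(1, 13)]


names = monthly_file_names()
print(names[0], names[-1])

# Each name could then be read with pyarrow, for example via
# pyarrow.parquet.read_table(name).to_pandas(), if pyarrow and pandas are installed.
```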
If you find this dataset helpful, please up-vote and more high-quality datasets will be published in future!❤️
The For-Hire Vehicle (FHV) trip records include fields capturing the dispatching base license number and the pick-up date, time, and taxi zone location ID (shape file below). These records are generated from the FHV Trip Record submissions made by bases. Note: The TLC publishes base trip record data as submitted by the bases, and we cannot guarantee or confirm their accuracy or completeness. Therefore, this may not represent the total amount of trips dispatched by all TLC-licensed bases. The TLC performs routine reviews of the records and takes enforcement actions when necessary to ensure, to the extent possible, complete and accurate information. For trip record data that includes TLC taxi zone location IDs, the location names and corresponding boroughs for each ID can be found here. A shapefile containing the boundaries for the taxi zones can be found here.
These records are generated from trip record submissions made by FHV - High Volume companies and green and yellow taxi Technology Service Providers. This dataset aggregates the trip records to monthly pickup and drop-off counts for different TLC-regulated industries (FHV - High Volume, Green Cab, Yellow Taxi) by taxi zone. It is designed to make this information accessible to a wider audience, including those without the technical expertise to work with raw trip record data.
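The aggregation described above can be pictured as counting raw trips per industry, month, and pickup zone. The sketch below illustrates that idea only and is not the TLC's actual pipeline; the `industry` and `pickup_datetime` keys, the timestamp layout, and the sample trips are all assumptions made for the example.

```python
from collections import Counter
from datetime import datetime


def monthly_pickup_counts(trips):
    """Count trips per (industry, "YYYY-MM" month, pickup zone)."""
    counts = Counter()
    for trip in trips:
        picked_up = datetime.strptime(trip["pickup_datetime"], "%Y-%m-%d %H:%M:%S")
        key = (trip["industry"], picked_up.strftime("%Y-%m"), trip["PULocationID"])
        counts[key] += 1
    return counts


# Hypothetical raw trips, for illustration only.
trips = [
    {"industry": "Yellow Taxi", "pickup_datetime": "2024-01-03 09:15:00", "PULocationID": 10},
    {"industry": "Yellow Taxi", "pickup_datetime": "2024-01-20 18:40:00", "PULocationID": 10},
    {"industry": "Green Cab", "pickup_datetime": "2024-02-01 07:05:00", "PULocationID": 10},
]
for key, count in sorted(monthly_pickup_counts(trips).items()):
    print(key, count)
```

Drop-off counts would follow the same pattern with the drop-off timestamp and `DOLocationID` in the key.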
These records are generated from the For-Hire Vehicle (“FHV”) Trip Record submissions made by traditional livery, luxury, and black car bases. The FHV trip records include fields capturing the dispatching base license number and the pick-up date, time, and taxi zone location ID, which correspond with the NYC Taxi Zones open dataset. Each row represents a single trip in an FHV.
Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. See the Splitgraph documentation for more information.
Open Database License (ODbL) v1.0 https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
The yellow taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts. The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorized under the Taxicab & Livery Passenger Enhancement Programs (TPEP/LPEP).
Column Description
Data is obtained from the NYC Taxi & Limousine Commission website: https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
These records are generated from the trip record submissions made by green taxi Technology Service Providers (TSPs). Each row represents a single trip in a green taxi. The trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off taxi zone locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.
Splitgraph serves as an HTTP API that lets you run SQL queries directly on this data to power Web applications. See the Splitgraph documentation for more information.
These records are generated from the trip record submissions made by High Volume For-Hire Vehicle (FHV) bases. On August 14, 2018, Mayor de Blasio signed Local Law 149 of 2018, creating a new license category for TLC-licensed FHV businesses that currently dispatch or plan to dispatch more than 10,000 FHV trips in New York City per day under a single brand, trade, or operating name, referred to as High-Volume For-Hire Services (HVFHS). This law went into effect on February 1, 2019. Each row represents a single trip in an FHV dispatched by a high-volume base. The trip records include fields capturing the high-volume license number and the pickup and drop-off date, time, and taxi zone location ID, which correspond with the NYC Taxi Zones open dataset.
Apache License, v2.0 https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset is distributed in Parquet format because of its large size.
This dataset offers detailed trip records of taxi rides, featuring information such as pickup and drop-off times, passenger counts, trip distances, and fare details. It includes unique identifiers for pickup and drop-off locations, breakdowns of fare components like tips, tolls, surcharges, and payment methods. Additionally, it provides data on congestion surcharges and the rate codes applied to trips.
The dataset is well-suited for transport analysis, predictive modeling, and fare optimization. Data scientists can leverage it to examine traffic patterns, forecast trip durations, study passenger behavior, and assess taxi service performance. It serves as a valuable resource for understanding New York City's transportation landscape and urban mobility trends.
The data was originally published by the NYC Taxi and Limousine Commission (TLC).
The data contains the Yellow Taxi Trip Records from January in Parquet format.
MIT License https://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains detailed records of Yellow Taxi trips in New York City from January to October 2024. It is a collection of files provided by the New York City Taxi and Limousine Commission (TLC) which can be used for analysis and insights into the city's taxi transportation patterns. The dataset is composed of the following components:
Data Source: The data is sourced from the New York City Taxi and Limousine Commission data page
For metadata look here
Features Description

| Field Name | Description |
|---|---|
| VendorID | A code indicating the TPEP provider that provided the record. 1= Creative Mobile Technologies, LLC; 2= VeriFone Inc. |
| tpep_pickup_datetime | The date and time when the meter was engaged. |
| tpep_dropoff_datetime | The date and time when the meter was disengaged. |
| Passenger_count | The number of passengers in the vehicle. This is a driver-entered value. |
| Trip_distance | The elapsed trip distance in miles reported by the taximeter. |
| PULocationID | TLC Taxi Zone in which the taximeter was engaged |
| DOLocationID | TLC Taxi Zone in which the taximeter was disengaged |
| RateCodeID | The final rate code in effect at the end of the trip. 1 = Standard rate; 2 = JFK; 3 = Newark; 4 = Nassau or Westchester; 5 = Negotiated fare; 6 = Group ride |
| Store_and_fwd_flag | This flag indicates whether the trip record was held in vehicle memory before being sent to the vendor, aka “store and forward,” because the vehicle did not have a connection to the server. Y = store and forward trip; N = not a store and forward trip |
| Payment_type | A numeric code signifying how the passenger paid for the trip. 1 = Credit card; 2 = Cash; 3 = No charge; 4 = Dispute; 5 = Unknown; 6 = Voided trip |
| Fare_amount | The time-and-distance fare calculated by the meter. |
| Extra | Miscellaneous extras and surcharges. Currently, this only includes the $0.50 and $1 rush hour and overnight charges. |
| MTA_tax | $0.50 MTA tax that is automatically triggered base... |
CC0 1.0 https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains trip records for yellow and green taxis operating in New York City. Each trip includes detailed information such as pickup and dropoff times and locations, passenger count, trip distance, payment type, fare amount, and various surcharges. The data can be used for urban mobility research, fare prediction, traffic analysis, and more.
Green taxi (LPEP) fields:
- VendorID: LPEP provider ID (e.g., CMT, Curb, Myle)
- lpep_pickup_datetime, lpep_dropoff_datetime: Pickup and dropoff times
- passenger_count, trip_distance
- RatecodeID: Final rate applied
- store_and_fwd_flag: Whether the trip was stored in vehicle memory
- PULocationID, DOLocationID: Pickup and dropoff TLC taxi zones
- fare_amount, extra, mta_tax, tip_amount, tolls_amount, improvement_surcharge, total_amount
- payment_type, trip_type, congestion_surcharge, cbd_congestion_fee
Yellow taxi (TPEP) fields:
- VendorID: TPEP provider ID (e.g., CMT, Curb, Myle, Helix)
- tpep_pickup_datetime, tpep_dropoff_datetime
- airport_fee
For more information, refer to the NYC TLC website: http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml
This dataset is from https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
Yellow and green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts. The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorized under the Taxicab & Livery Passenger Enhancement Programs (TPEP/LPEP). The trip data was not created by the TLC, and TLC makes no representations as to the accuracy of these data.
For-Hire Vehicle (“FHV”) trip records include fields capturing the dispatching base license number and the pick-up date, time, and taxi zone location ID (shape file below). These records are generated from the FHV Trip Record submissions made by bases. Note: The TLC publishes base trip record data as submitted by the bases, and we cannot guarantee or confirm their accuracy or completeness. Therefore, this may not represent the total amount of trips dispatched by all TLC-licensed bases. The TLC performs routine reviews of the records and takes enforcement actions when necessary to ensure, to the extent possible, complete and accurate information.