Taxi trips, starting January 2024, reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. For earlier trips, see the link in the Featured Content section below. Due to the data reporting process, not all trips are reported but the City believes that most are.
This dataset ends with 2023. Please see the Featured Content link below for the dataset that starts in 2024. Taxi trips from 2013 to 2023 reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. Due to the data reporting process, not all trips are reported but the City believes that most are.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/07/01/taxi-dataset-relaunch.html for discussion of significant changes in July 2019.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2017/10/24/taxi-data-behind.html for a note on data freshness.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset accompanies a paper examining how tipping practices changed during the pandemic. I download data on unique taxi rides taken in Chicago freely available from the Chicago data portal for Jan 2018-March 2021. The data include a number of variables for each taxi trip, including the fare amount, the tip amount, and the passenger's location. I merge in demographic data available from CMAP for the community area the passenger came from in Chicago as well as daily data on COVID-19 hospitalizations to enrich the analysis. I filter the dataset to taxi rides payed with credit card, and remove trips with exceptionally weird data (e.g. 0 second trip duration, fares greater than $1000). I use the dataset to estimate the effect of the pandemic on whether passengers tip and if so, the average percent tipped. I find that the likelihood that a passenger leaves a tip declines by roughly 5 percentage points during the pandemic but the average non-zero tip left increased by roughly 2 percentage points higher. I exploit geographic and temporal heterogeneity in the data to explore the possible mechanisms behind these patterns in tipping.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2017/10/24/taxi-data-behind.html for a note on data freshness.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This is a table with names of all Chicago community areas for the Chicago Taxi Trips Dataset: https://www.kaggle.com/chicago/chicago-taxi-trips-bq
This table is useful if you want to name Chicago community areas which are represented only by numbers in the original Chicago Taxi Trips Dataset.
For this table I used information from Wikipedia: https://en.wikipedia.org/wiki/Community_areas_in_Chicago.
What is the name of Chicago community area number 76 and why does it have the highest average tip for taxi drivers?
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset accompanies a paper examining how tipping practices changed during the pandemic. I download data on unique taxi rides taken in Chicago freely available from the Chicago data portal for Jan 2018-March 2021. The data include a number of variables for each taxi trip, including the fare amount, the tip amount, and the passenger's location. I merge in demographic data available from CMAP for the community area the passenger came from in Chicago as well as daily data on COVID-19 hospitalizations to enrich the analysis. I filter the dataset to taxi rides payed with credit card, and remove trips with exceptionally weird data (e.g. 0 second trip duration, fares greater than $1000). I use the dataset to estimate the effect of the pandemic on whether passengers tip and if so, the average percent tipped. I find that the likelihood that a passengers tips during the pandemic is about 5 percentage points lower but the percentage tipped of the fare when they do is about 2 percentage points higher.
The City of Chicago has published trip-level data for every TNC trip since November 1, 2018. To the best of our knowledge, this dataset is the only one that includes trip fare variables. As we wrote this paper in Oct 2022, the dataset includes approximately 263 million trip records (rows) and 21 features (columns) for trips dated from November 1, 2018, through October 1, 2022. The features of this data include Trip ID, Trip Start Timestamp (rounded to the nearest 15 minutes), Trip End Timestamp (rounded to the nearest 15 minutes), Trip Seconds, Trip Miles, Pickup Census Tract, Dropoff Census Tract, Pickup Community Area, Drop Off Community Area, Trip Fare, Tip, Additional Charges, Total Trip Fare, Shared Trip Authorized, Trips Pooled, Pickup Centroid Latitude, Pickup Centroid Longitude, Pickup Centroid Location, Dropoff Centroid Latitude, Dropoff Centroid Longitude, Dropoff Centroid Location. As the dataset is too large to be processed without a supercomputer, we generated a random sample of 2 million trips from Nov 2018 to June 2022 with valid pickup and drop-down area information. To explore the data, we processed the features to extract date information from the timestamp. We created new variables, including each trip's average fare per mile (excluding tips and additional charges, mainly taxes). In dataset (1), the sampled TNC trips data was processed and summarized to include the average daily fare per mile (USD/mile), and exogenous variables that impact the price were added to the data including holidays (Christmas, thanksgiving, Independence Day, easter and new year) and other variables including gas prices, and climate (snow, precipitation, and average daily temperature). The City of Chicago also publishes taxi trips from 2013 to the present. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, and times are rounded to the nearest 15 minutes. Due to the data reporting process, not all but most trips are reported. Taxicabs in Chicago, Illinois, are operated by private companies and licensed by the city. About seven thousand licensed cabs are operating within the city limits. As the dataset is too large to be processed without a supercomputer, we generated a random sample of 2 million trips from Nov 2018 to June 2022 with valid pickup and drop-down area information. To explore the data, we processed the features to extract date information from the timestamp. We created new variables, including each trip's average fare per mile (excluding tips and additional charges, mainly taxes). In dataset (2), the taxi trips data was processed and summarized to include the average daily fare per mile (USD/mile), and exogenous variables that impact the price were added to the data including holidays (Christmas, thanksgiving, Independence Day, easter and new year) and other variables including gas prices, and climate (snow, precipitation, and average daily temperature).
Taxi trips, starting January 2024, reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. For earlier trips, see the link in the Featured Content section below. Due to the data reporting process, not all trips are reported but the City believes that most are.
Read about privacy protections in TNP and Taxi trip datasets.
Note: As discussed in http://dev.cityofchicago.org/open%20data/data%20portal/2019/07/01/taxi-dataset-relaunch.html, some trips in November-December 2014 are known to be missing. --Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/07/01/taxi-dataset-relaunch.html for discussion of significant changes in July 2019.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2017/10/24/taxi-data-behind.html for a note on data freshness.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/07/01/taxi-dataset-relaunch.html for discussion of significant changes in July 2019.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/07/01/taxi-dataset-relaunch.html for discussion of significant changes in July 2019.
Taxi trips reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes.
Due to the data reporting process, not all trips are reported but the City believes that most are.
See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html for further discussion of the approach to privacy in this dataset.
See http://dev.cityofchicago.org/open%20data/data%20portal/2019/07/01/taxi-dataset-relaunch.html for discussion of significant changes in July 2019.
Taxi Trips in 2024
A filtered view of the Taxi Trips dataset to produce a small, quasi-random selection of records for testing operations that would be slower with the full dataset.
All trips, starting November 2018, reported by Transportation Network Providers (sometimes called rideshare companies) to the City of Chicago as part of routine reporting required by ordinance.
Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. Fares are rounded to the nearest $2.50 and tips are rounded to the nearest $1.00.
For a discussion of the approach to privacy in this dataset, please see http://dev.cityofchicago.org/open%20data/data%20portal/2019/04/12/tnp-taxi-privacy.html.
This dataset ends with 2022. Please see the Featured Content link below for the dataset that starts in 2023.
All trips, from November 2018 to December 2022, reported by Transportation Network Providers (sometimes called rideshare companies) to the City of Chicago as part of routine reporting required by ordinance.
Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. Fares are rounded to the nearest $2.50 and tips are rounded to the nearest $1.00.
For a discussion of the approach to privacy in this dataset, please see https://data.cityofchicago.org/stories/s/82d7-i4i2.
Taxi trips, starting January 2024, reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. For earlier trips, see the link in the Featured Content section below. Due to the data reporting process, not all trips are reported but the City believes that most are.