8 datasets found
  1. Airline Fight Routes in The US [1993-2024]

    • kaggle.com
    zip
    Updated Jul 13, 2024
    Cite
    Oleksii Martusiuk (2024). Airline Fight Routes in The US [1993-2024] [Dataset]. https://www.kaggle.com/datasets/oleksiimartusiuk/all-airline-fight-routes-in-the-us
    Available download formats: zip (13697874 bytes)
    Dataset updated
    Jul 13, 2024
    Authors
    Oleksii Martusiuk
    License

    http://opendatacommons.org/licenses/dbcl/1.0/

    Area covered
    United States
    Description

    This dataset provides a comprehensive overview of domestic airline routes within the United States. It includes valuable information for analyzing passenger travel patterns, market trends, and airline pricing strategies.

    Data Features:

    • Year
    • Quarter
    • City Market IDs
    • Departure City
    • Arrival City
    • Miles: The distance between the origin and arrival cities in miles.
    • Average Daily Passengers: The average number of passengers flying this route per day.
    • Average Fare: The average fare paid by passengers for this route (consider including currency information).

    Potential Uses:

    • Travel Demand Analysis: Identify popular routes, and understand seasonal variations in passenger traffic.
    • Market Research: Analyze airline competition on specific routes and assess pricing strategies.
    • Route Optimization: Airlines can use this data to evaluate existing routes and identify potential new routes with high passenger demand.
    • Business Intelligence: Businesses can use this data to understand travel patterns relevant to their industry and make informed decisions.

    Data Cleaning and Transformation Considerations:

    • Ensure consistency in city names (consider using the city market ID to group nearby airports).
    • Handle missing values appropriately.
    • Consider converting categorical features to numerical representations for analysis.
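The cleaning considerations above can be sketched in plain Python. This is a minimal sketch rather than the dataset author's method: the column names (`city_market_id_1`, `city_market_id_2`, `passengers`) and the market ID values are hypothetical stand-ins for whatever the CSV actually contains, and rows with a missing passenger count are simply skipped.

```python
from collections import defaultdict


def total_passengers_by_market_pair(rows):
    """Sum average daily passengers per (origin market ID, destination market ID)."""
    totals = defaultdict(float)
    for row in rows:
        raw = (row.get("passengers") or "").strip()
        if not raw:
            continue  # skip rows with a missing passenger count
        # Grouping on market IDs instead of city names avoids spelling variants.
        key = (row["city_market_id_1"], row["city_market_id_2"])
        totals[key] += float(raw)
    return dict(totals)


# Invented sample rows; with the real file these would come from csv.DictReader.
sample = [
    {"city_market_id_1": "10001", "city_market_id_2": "10002", "passengers": "120.5"},
    {"city_market_id_1": "10001", "city_market_id_2": "10002", "passengers": "79.5"},
    {"city_market_id_1": "10001", "city_market_id_2": "10003", "passengers": ""},
]
totals = total_passengers_by_market_pair(sample)
```

Skipping incomplete rows is only one option; imputing a value or flagging the row may suit other analyses better.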
  2. US Flights with COIVID-19(+) TSA Screening Officer

    • kaggle.com
    zip
    Updated Apr 24, 2020
    Cite
    Zac Dannelly (2020). US Flights with COIVID-19(+) TSA Screening Officer [Dataset]. https://www.kaggle.com/dannellyz/us-flights-with-coivid19-tsa-screening-officer
    Available download formats: zip (110976 bytes)
    Dataset updated
    Apr 24, 2020
    Authors
    Zac Dannelly
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    COVID-19(+) Interactions Within Air Travel

    Modeling potential interactions between healthy individuals and those carrying COVID-19, denoted hereafter as (+), has been identified as a key methodology in the effort to predict, combat, and respond to COVID-19. To contribute to this effort within the domain of airline travel, this dataset lets users see all flights from 01MAR to 14APR on which passengers may have come into contact with a COVID-19(+) TSA Screening Agent during the agent's presumed incubation period, the 7 days before that agent went into quarantine.

    Acknowledgements

    Inspiration

    The CORD-19 Research Challenge has been a great inspiration for this effort. Its focus on natural language processing has prompted the need for additional efforts in other statistical machine learning methods, such as those used in the UNCOVER COVID-19 Challenge. With COVID-19 research as a global focal point, I hope that this dataset provides researchers with another set of features to help build models towards finding answers.

    Methodology

    Airline Data Inc. provided airline schedule information for the time period of 01MAR-14APR. This is one of the data products available as a part of their Data Hub. The airline schedule includes information on future and historical airline flights, updated in real time as it is filed by the airlines. This data provides access to origins and destinations, flight times, aircraft types, seats, customized route mapping, and much more. For this work, we focused on getting flight information that includes terminals and carriers in order to determine potential contact between passengers and TSA agents who were, at the time, unknowingly COVID-19(+). Airline Data Inc. additionally provided the T100 data from March and April of last year. The T100 provides information on particular routes (e.g., ORD->JFK) for U.S. domestic and international air service reported by carriers. This dataset includes passenger counts, available seats, load factors, equipment types, cargo, and other operating statistics. These datasets were combined to estimate the number of passengers flying various routes throughout the time period in question. Undoubtedly these numbers are much lower than those of the previous year, but we make the assumption that airline travel declined in relatively equal proportions across the US, making the load factors for last year comparatively accurate. Since the T100 data is only released on a monthly basis, these figures cannot be updated until the coming months.

    The Transportation Security Administration posted publicly on their website a list of all Screening and Baggage Officers who tested positive for COVID-19. This list included the airport they worked in, their last day of work, and their work location with shift information. This data was used to down-select the data from Airline Data Inc. to only include those flights that met the following criteria:

    • Origin airport with COVID-19(+) TSA Officer
    • Flight took off (the flight schedule data shows all potential flights, even those that do not take off)
    • TSA Officer on shift at time of departure
    • TSA Officer working in terminal from which the flight departed
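The four selection criteria, together with the seats-times-load-factor passenger estimate described in the methodology, can be sketched as follows. This is an illustrative sketch under stated assumptions, not the dataset author's code: the `Shift` and `Flight` records, their field names, and the sample values are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Shift:  # hypothetical record for one officer shift
    airport: str
    terminal: str
    start: datetime
    end: datetime


@dataclass
class Flight:  # hypothetical record for one scheduled flight
    origin: str
    terminal: str
    departure: datetime
    took_off: bool
    seats: int


def exposed_flights(flights, shifts):
    """Keep flights that took off from an officer's airport and terminal during a shift."""
    return [
        f for f in flights
        if f.took_off and any(
            s.airport == f.origin
            and s.terminal == f.terminal
            and s.start <= f.departure <= s.end
            for s in shifts
        )
    ]


def estimated_passengers(flight, load_factor):
    """Estimate passengers as scheduled seats times a prior-year load factor."""
    return flight.seats * load_factor


# Invented sample data for illustration only.
shifts = [Shift("JFK", "4", datetime(2020, 3, 10, 5), datetime(2020, 3, 10, 13))]
flights = [
    Flight("JFK", "4", datetime(2020, 3, 10, 9), True, 180),   # meets all criteria
    Flight("JFK", "5", datetime(2020, 3, 10, 9), True, 150),   # different terminal
    Flight("JFK", "4", datetime(2020, 3, 10, 10), False, 160), # never took off
    Flight("JFK", "4", datetime(2020, 3, 10, 20), True, 170),  # after the shift
]
kept = exposed_flights(flights, shifts)
```

With these sample values only the first flight survives the filter, and `estimated_passengers(kept[0], 0.5)` yields 90.0.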

  3. Daily UK flights

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Nov 27, 2025
    Cite
    Office for National Statistics (2025). Daily UK flights [Dataset]. https://www.ons.gov.uk/economy/economicoutputandproductivity/output/datasets/dailyukflights
    Available download formats: xlsx
    Dataset updated
    Nov 27, 2025
    Dataset provided by
    Office for National Statistics http://www.ons.gov.uk/
    License

    Open Government Licence 3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Area covered
    United Kingdom
    Description

    Daily data showing UK flight numbers and rolling seven-day average, including flights to, from, and within the UK. These are official statistics in development. Source: EUROCONTROL.
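A rolling seven-day average of the kind described can be computed as in the sketch below. It assumes a trailing window (each average covers the current day and the six days before it); the description does not say whether the published series is trailing or centred, and the daily counts here are invented.

```python
def rolling_mean(values, window=7):
    """Return the trailing mean of each run of `window` consecutive values."""
    if window <= 0:
        raise ValueError("window must be positive")
    averages = []
    running = 0
    for i, value in enumerate(values):
        running += value
        if i >= window:
            running -= values[i - window]  # drop the value that left the window
        if i >= window - 1:
            averages.append(running / window)
    return averages


# Invented daily flight counts for eight consecutive days.
daily_flights = [700, 707, 693, 714, 686, 490, 504, 707]
averages = rolling_mean(daily_flights)
```

The first average only appears once seven days of data are available, so the output is six entries shorter than the input.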

  4. U.S. Commercial Aviation Industry Metrics

    • kaggle.com
    zip
    Updated Jul 13, 2017
    Cite
    Franklin Bradfield (2017). U.S. Commercial Aviation Industry Metrics [Dataset]. https://www.kaggle.com/shellshock1911/us-commercial-aviation-industry-metrics
    Available download formats: zip (1573798 bytes)
    Dataset updated
    Jul 13, 2017
    Authors
    Franklin Bradfield
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United States
    Description

    Context

    Have you taken a flight in the U.S. in the past 15 years? If so, then you are a part of monthly data that the U.S. Department of Transportation's TranStats service makes available on various metrics for 15 U.S. airlines and 30 major U.S. airports. Their website unfortunately does not include a method for easily downloading and sharing files. Furthermore, the source is built in ASP.NET, so extracting the data is rather cumbersome. To allow easier community access to this rich source of information, I scraped the metrics for every airline / airport combination and stored them in separate CSV files.

    Occasionally, an airline doesn't serve a certain airport, or it didn't serve it for the entire duration that the data collection period covers*. In those cases, the data either doesn't exist or is typically too sparse to be of much use. As such, I've only uploaded complete files for airports that an airline served for the entire uninterrupted duration of the collection period. For these files, there should be 174 time series points for one or more of the nine columns below. I recommend any of the files for American, Delta, or United Airlines for outstanding examples of complete and robust airline data.

    * No data for Atlas Air exists, and Virgin America commenced service in 2007, so no folders for either airline are included.

    Content

    There are 13 airlines that have at least one complete dataset. Each airline's folder includes CSV file(s) for each airport that are complete as defined by the above criteria. I've double-checked the files, but if you find one that violates the criteria, please point it out. The file names have the format "AIRLINE-AIRPORT.csv", where both AIRLINE and AIRPORT are IATA codes. For a full listing of the airlines and airports that the codes correspond to, check out the airline_codes.csv or airport_codes.csv files that are included, or perform a lookup here. Note that the data in each airport file represents metrics for flights that originated at the airport.

    Among the 13 airlines in data.zip, there are a total of 161 individual datasets. There are also two special folders included - airlines_all_airports.csv and airports_all_airlines.csv. The first contains datasets for each airline aggregated over all airports, while the second contains datasets for each airport aggregated over all airlines. To preview a sample dataset, check out all_airlines_all_airports.csv, which contains industry-wide data.

    Each file includes the following metrics for each month from October 2002 to March 2017:

    1. Date (YYYY-MM-DD): All dates are set to the first of the month. The day value is just a placeholder and has no significance.
    2. ASM_Domestic: Available Seat-Miles in thousands (000s). Seats available on domestic flights * Miles flown
    3. ASM_International*: Available Seat-Miles in thousands (000s). Seats available on international flights * Miles flown
    4. Flights_Domestic
    5. Flights_International*
    6. Passengers_Domestic
    7. Passengers_International*
    8. RPM_Domestic: Revenue Passenger-Miles in thousands (000s). Paying passengers on domestic flights * Miles flown
    9. RPM_International*: Revenue Passenger-Miles in thousands (000s). Paying passengers on international flights * Miles flown

    * Frequently contains missing values
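One quantity that can be derived from these columns is the load factor, conventionally defined as RPM divided by ASM. The sketch below is not part of the dataset: it assumes rows have already been parsed into dictionaries keyed by the column names listed above, with missing values represented as `None`, and the numbers are invented.

```python
def load_factor(rpm, asm):
    """Return RPM / ASM, or None when a value is missing or ASM is zero."""
    if rpm is None or asm is None or asm == 0:
        return None
    return rpm / asm


# Invented monthly rows using the column names from the list above.
rows = [
    {"Date": "2002-10-01", "ASM_Domestic": 1000.0, "RPM_Domestic": 750.0},
    {"Date": "2002-11-01", "ASM_Domestic": 1200.0, "RPM_Domestic": None},
]
factors = [load_factor(r["RPM_Domestic"], r["ASM_Domestic"]) for r in rows]
```

Because both columns are reported in the same units (thousands), the ratio needs no rescaling.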

    Acknowledgements

    Thanks to the U.S. Department of Transportation for collecting this data every month and making it publicly available to us all.

    Source: https://www.transtats.bts.gov/Data_Elements.aspx

    Inspiration

    The airline / airport datasets are perfect for practicing and/or testing time series forecasting with classic statistical models such as autoregressive integrated moving average (ARIMA), or modern deep learning techniques such as long short-term memory (LSTM) networks. The datasets typically show evidence of trends, seasonality, and noise, so modeling and accurate forecasting can be challenging, but still more tractable than time series problems possessing more stochastic elements, e.g. stocks, currencies, commodities, etc. The source releases new data each month, so feel free to check your models' performances against new data as it comes out. I will update the files here every 3 to 6 months depending on how things go.
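Before fitting ARIMA or LSTM models, it can help to have a simple baseline to compare against. The sketch below uses a seasonal-naive forecast (each future month is predicted to equal the same month one year earlier), which is a deliberately simpler technique than those named above. The function names and the sample series are hypothetical.

```python
def seasonal_naive_forecast(series, season_length=12, horizon=3):
    """Forecast each future step as the value observed one season earlier."""
    if len(series) < season_length:
        raise ValueError("need at least one full season of history")
    last_season = series[-season_length:]
    return [last_season[i % season_length] for i in range(horizon)]


def mean_absolute_error(actual, predicted):
    """Average absolute difference between paired actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Invented monthly series covering two years (24 points).
history = list(range(1, 25))
forecast = seasonal_naive_forecast(history)
error = mean_absolute_error([14, 15, 16], forecast)
```

A more elaborate model is only worth its complexity if it beats this baseline on data it has not seen.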

    A future plan is to build a SQLite database so a vast array of queries can be run against the data. The data in its current time series format is not conducive to this, so coming up with a workable structure for the tables is the first step towards this goal. If you have any suggestions for how I can improve the data presentation, or anything that you would like me to add, please let me know. Looking forward to seeing the questions that we can answer together!

  5. Global air traffic - number of flights 2004-2025

    • statista.com
    Updated Nov 19, 2025
    Cite
    Statista (2025). Global air traffic - number of flights 2004-2025 [Dataset]. https://www.statista.com/statistics/564769/airline-industry-number-of-flights/
    Dataset updated
    Nov 19, 2025
    Dataset authored and provided by
    Statista http://statista.com/
    Area covered
    Worldwide
    Description

    The number of flights performed globally by the airline industry has increased steadily since the early 2000s and reached **** million in 2019. However, due to the coronavirus pandemic, the number of flights dropped to **** million in 2020. The flight volume increased again in the following years and was forecasted to reach ** million in 2025.

  6. flight delays

    • kaggle.com
    zip
    Updated Aug 14, 2018
    Cite
    MuhammadNadeemFerozi (2018). flight delays [Dataset]. https://www.kaggle.com/mrferozi/flight-delays
    Available download formats: zip (187216 bytes)
    Dataset updated
    Aug 14, 2018
    Authors
    MuhammadNadeemFerozi
    Description

    This dataset was downloaded from the US Department of Transportation website, which holds both scheduled and actual departure and arrival times. Those events were collected and authenticated by US airline carriers responsible for almost 1% of all domestic scheduled passenger revenues. The Office of Airline Information, Bureau of Transportation Statistics (BTS) collected and summarised the complete details.

    URL Source https://www.transportation.gov/aviation

    The dataset contains other information such as origin and destination airports, flight numbers, cancelled and diverted flights, taxi-in and taxi-out times, and time and distance (RITA, 2017).

    The data is available in comma-separated CSV format and is spread over the one hundred available attributes, grouped as follows:

    Table 1. Flight dataset information

    Attribute name               | No. of similar attributes
    Time period                  | 6
    Unique carrier               | 5
    Origin airport               | 9
    Destination airport          | 9
    Departure performance        | 9
    Diversions and cancellation  | 3
    Summary of flight            | 6
    The delay causes             | 5
    Diverted airport information | 45

    Dataset download

    The original downloaded file, in CSV format, contained one hundred variables. Of these, this study used 28 and deleted the rest from the data file. This study downloaded 12 data files covering July 2016 to July 2017. The download took around 45 minutes and was delivered in ZIP format. Each ZIP file was 216 MB in size and contains 502458 records.

    The 28 variables were kept for analysis based on their importance. A brief description of each follows:

    Table 2. Flight dataset variables information

    Field Name          | Type    | Description
    Year                | Integer | Year of the flight
    Month               | Integer | Month of flight
    Day                 | Integer | Day of the flight
    DayOfWeek           | Integer | Day of the week of the flight
    Flight_Date         | text    | Date of the flight
    UniqueCarrier       | text    | Code assigned to each individual airline for analysis
    Tail_Num            | text    | Tail Number of the flight
    FlightNum           | text    | Flight Number
    Origin_Airport      | text    | Origin Airport
    Origin_City_Name    | text    | Origin City Name
    Origin_State        | text    | Origin State
    Scheduled_Departure | Integer | Scheduled Departure
    Departure_Time      | Integer | Departure Time
    Dep_Delay           | Integer | Departure Delay less than 15 minutes
    DepDel15            | Integer | Departure Delay more than 15 minutes
    Dep_Delay_Groups    | Integer | Departure Delay Groups
    Arrival_Time        | Integer | Flight Arrival Time
    Arrival_Delay       | Integer | Flight Arrival Delay
    Arr_Del_morethan15  | Integer | Arrival Delay more than 15 minutes
    Cancelled           | Integer | Flight Cancelled indicator
    Diverted            | Integer | Flight Diverted indicator
    Distance            | Integer | Flight Distance
    DistanceGroup       | Integer | Flight Distance Group
    Carrier_Delay       | Integer | Carrier Delay
    WeatherDelay        | Integer | Delay due to Weather
    NAS_Delay           | Integer | National Air System Delay, in Minutes
    Security_Delay      | Integer | Security Delay, in Minutes
    Late_Aircraft_Delay | Integer | Late Aircraft Delay, in Minutes

    The following new variables were added after pre-processing:

    Field Name     | Type    | Description
    Top_Carriers   | Integer | Top Carrier Indicator
    Top_Origin     | Integer | Top Origin Indicator
    DEPTIME_GROUP1 | text    | Departure Time Group 1
    DEPTIME_GROUP2 | text    | Departure Time Group 2
    DEPTIME_GROUP3 | text    | Departure Time Group 3
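A 15-minute delay indicator such as DepDel15 can be derived from a delay in minutes, as in the sketch below. This is an assumption-laden illustration, not the study's pre-processing code: whether a delay of exactly 15 minutes counts as delayed is not stated in the description (this sketch treats it as delayed), and the records are invented.

```python
from collections import defaultdict


def delay_indicator(delay_minutes, threshold=15):
    """Return 1 if the delay reaches the threshold, 0 if not, None if missing."""
    if delay_minutes is None:
        return None
    return 1 if delay_minutes >= threshold else 0


def delayed_share_by_carrier(records):
    """Share of flights flagged as delayed per carrier, ignoring missing delays."""
    flags = defaultdict(list)
    for carrier, delay in records:
        flag = delay_indicator(delay)
        if flag is not None:
            flags[carrier].append(flag)
    return {carrier: sum(f) / len(f) for carrier, f in flags.items()}


# Invented (carrier code, departure delay in minutes) pairs.
records = [("AA", 20), ("AA", 5), ("DL", 15), ("DL", None)]
shares = delayed_share_by_carrier(records)
```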

  7. Global air traffic - scheduled passengers 2004-2024

    • statista.com
    • abripper.com
    Updated Jun 27, 2025
    Cite
    Statista (2025). Global air traffic - scheduled passengers 2004-2024 [Dataset]. https://www.statista.com/statistics/564717/airline-industry-passenger-traffic-globally/
    Dataset updated
    Jun 27, 2025
    Dataset authored and provided by
    Statista http://statista.com/
    Area covered
    Worldwide
    Description

    In 2023, the estimated number of scheduled passengers boarded by the global airline industry amounted to approximately *** billion people. This represents a significant increase compared to the previous year since the pandemic started, and the positive trend was forecast to continue in 2024, with the scheduled passenger volume reaching just below **** billion travelers.

    Airline passenger traffic

    The number of scheduled passengers handled by the global airline industry has increased in all but one year of the last decade. Scheduled passengers are passengers who have booked a flight with a commercial airline. Passengers on charter flights, for which an entire plane is booked by a private group, are excluded. In 2023, the Asia Pacific region had the highest share of airline passenger traffic, accounting for ********* of the global total.

  8. Flights

    • kaggle.com
    zip
    Updated Sep 26, 2023
    Cite
    Mahoora00135 (2023). Flights [Dataset]. https://www.kaggle.com/datasets/mahoora00135/flights
    Available download formats: zip (10797806 bytes)
    Dataset updated
    Sep 26, 2023
    Authors
    Mahoora00135
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The "flights.csv" dataset contains information about the flights of an airport. It includes departure and arrival times, delays, airline, flight number, flight origin and destination, flight duration, distance, hour and minute of flight, and the exact date and time of each flight. This data can be used in management analysis and strategy and provides useful information about the performance of flights and airline companies. Analysis of the data in this dataset can serve as a basis for the following activities:

    • Analysis of time patterns and trends: by examining aircraft departure and arrival times and how they change over time, patterns and trends in flight behavior can be identified.
    • Analysis of American companies: by viewing information about airlines such as the number of flights, the impact and overall performance, you can compare and analyze the performance of each company.
    • Analysis of delays and service quality: by examining delays and arrival times, one can collect and analyze information about the quality of services provided by the airport and companies.
    • Analysis of flight routes: by checking the origin and destination of flights, distances and flight duration, popular routes and people's choices can be identified and analyzed.
    • Analysis of airport performance: by observing the characteristics of flights and airport performance, it is possible to identify and analyze the strengths and weaknesses of the airport and suggest improvements.

    It provides various tools for data analysis and visualization and can be used as a basis for managerial decisions in the field of aviation industry.
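As one example of the route analysis described above, the most frequent origin and destination pairs can be counted with the standard library. This is a minimal sketch: the column names `origin` and `dest` are assumptions about the CSV header, and the sample rows are invented.

```python
from collections import Counter


def top_routes(rows, n=3):
    """Return the n most frequent (origin, destination) pairs with their counts."""
    counts = Counter((row["origin"], row["dest"]) for row in rows)
    return counts.most_common(n)


# Invented sample rows; with the real file these would come from csv.DictReader.
sample = [
    {"origin": "JFK", "dest": "LAX"},
    {"origin": "EWR", "dest": "ORD"},
    {"origin": "JFK", "dest": "LAX"},
]
busiest = top_routes(sample, n=1)
```

The same counting pattern works for carriers or departure hours by changing the key passed to `Counter`.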

    Airline Company Codes (in order of frequency for this dataset)

    WN -- Southwest Airlines Co.

    DL -- Delta Air Lines Inc.

    AA -- American Airlines Inc.

    UA -- United Air Lines Inc.

    B6 -- JetBlue Airways

    AS -- Alaska Airlines Inc.

    NK -- Spirit Air Lines

    G4 -- Allegiant Air

    F9 -- Frontier Airlines Inc.

    HA -- Hawaiian Airlines Inc.

    SY -- Sun Country Airlines d/b/a MN Airlines

    VX -- Virgin America

Airline Fight Routes in The US [1993-2024]

240,000+ Airline Routes (Cities, Passengers per Day, Average Fare, etc.)