Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset provides a comprehensive overview of domestic airline routes within the United States. It includes valuable information for analyzing passenger travel patterns, market trends, and airline pricing strategies.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Daily data showing UK flight numbers and rolling seven-day average, including flights to, from, and within the UK. These are official statistics in development. Source: EUROCONTROL.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This dataset contains information on air traffic passenger statistics by the airline. It includes information on the airlines, airports, and regions that the flights departed from and arrived at. It also includes information on the type of activity, price category, terminal, boarding area, and number of passengers
Air traffic passenger statistics can be a useful tool for understanding the airline industry and for making travel plans. This dataset from Open Flights contains information on air traffic passenger statistics by airline for 2017. The data includes the number of passengers, the operating airline, the published airline, the geographic region, the activity type code, the price category code, the terminal, the boarding area, and the year and month of the flight
License: Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) - You are free to: - Share - copy and redistribute the material in any medium or format for non-commercial purposes only. - Adapt - remix, transform, and build upon the material for non-commercial purposes only. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - You may not: - Use the material for commercial purposes.
File: Air_Traffic_Passenger_Statistics.csv | Column name | Description | |:--------------------------------|:------------------------------------------------------------------------------| | Activity Period | The date of the activity. (Date) | | Operating Airline | The airline that operated the flight. (String) | | Operating Airline IATA Code | The IATA code of the airline that operated the flight. (String) | | Published Airline | The airline that published the fare for the flight. (String) | | Published Airline IATA Code | The IATA code of the airline that published the fare for the flight. (String) | | GEO Summary | A summary of the geographic region. (String) | | GEO Region | The geographic region. (String) | | Activity Type Code | The type of activity. (String) | | Price Category Code | The price category of the fare. (String) | | Terminal | The terminal of the flight. (String) | | Boarding Area | The boarding area of the flight. (String) | | Passenger Count | The number of passengers on the flight. (Integer) | | Adjusted Activity Type Code | The type of activity, adjusted for missing data. (String) | | Adjusted Passenger Count | The number of passengers on the flight, adjusted for missing data. (Integer) | | Year | The year of the activity. (Integer) | | Month | The month of the activity. (Integer) |
Facebook
TwitterThe number of flights performed globally by the airline industry has increased steadily since the early 2000s and reached **** million in 2019. However, due to the coronavirus pandemic, the number of flights dropped to **** million in 2020. The flight volume increased again in the following years and was forecasted to reach ** million in 2025.
Facebook
TwitterAviation statistics user engagement survey
Thank you very much for all responses to the survey and your interest in DfT Aviation Statistics. All feedback will be taken into consideration when we publish the Aviation Statistics update later this year, alongside which, we will update the background information with details of the feedback and any future development plans.
AVI0101 (TSGB0201): https://assets.publishing.service.gov.uk/media/6753137f21057d0ed56a0415/avi0101.ods">Air traffic at UK airports: 1950 onwards (ODS, 9.93 KB)
AVI0102 (TSGB0202): https://assets.publishing.service.gov.uk/media/6753138a14973821ce2a6d22/avi0102.ods">Air traffic by operation type and airport, UK (ODS, 37.6 KB)
AVI0103 (TSGB0203): https://assets.publishing.service.gov.uk/media/67531395dcabf976e5fb0073/avi0103.ods">Punctuality at selected UK airports (ODS, 41.1 KB)
AVI0105 (TSGB0205): https://assets.publishing.service.gov.uk/media/675313a014973821ce2a6d23/avi0105.ods">International passenger movements at UK airports by last or next country travelled to (ODS, 20.7 KB)
AVI0106 (TSGB0206): https://assets.publishing.service.gov.uk/media/67531f09e40c78cba1fb008d/avi0106.ods">Proportion of transfer passengers at selected UK airports (ODS, 9.52 KB)
AVI0107 (TSGB0207): https://assets.publishing.service.gov.uk/media/67531d7a14973821ce2a6d2d/avi0107.ods">Mode of transport to the airport (ODS, 14.3 KB)
AVI0108 (TSGB0208): https://assets.publishing.service.gov.uk/media/67531f17dcabf976e5fb007f/avi0108.ods">Purpose of travel at selected UK airports (ODS, 15.7 KB)
AVI0109 (TSGB0209): https://assets.publishing.service.gov.uk/media/67531f3b20bcf083762a6d3b/avi0109.ods">Map of UK airports (ODS, 193 KB)
AVI0201 (TSGB0210): https://assets.publishing.service.gov.uk/media/67531f527e5323915d6a042f/avi0201.ods">Main outputs for UK airlines by type of service (ODS, 17.7 KB)
AVI0203 (TSGB0211): https://assets.publishing.service.gov.uk/media/67531f6014973821ce2a6d31/avi0203.ods">Worldwide employment by UK airlines (ODS, <span class="
Facebook
TwitterIn 2023, the estimated number of scheduled passengers boarded by the global airline industry amounted to approximately *** billion people. This represents a significant increase compared to the previous year since the pandemic started and the positive trend was forecast to continue in 2024, with the scheduled passenger volume reaching just below **** billion travelers. Airline passenger traffic The number of scheduled passengers handled by the global airline industry has increased in all but one of the last decade. Scheduled passengers refer to the number of passengers who have booked a flight with a commercial airline. Excluded are passengers on charter flights, whereby an entire plane is booked by a private group. In 2023, the Asia Pacific region had the highest share of airline passenger traffic, accounting for ********* of the global total.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides monthly totals of US airline passengers from 1949 to 1960. This dataset is taken from a built-in dataset of R called AirPassengers. Analysts often employ various statistical techniques, such as decomposition, smoothing, and forecasting models, to analyze patterns, trends, and seasonal fluctuations within the data. Due to its historical nature and consistent temporal granularity, the Air Passengers dataset serves as a valuable resource for researchers, practitioners, and students in the fields of statistics, econometrics, and transportation planning.
Facebook
TwitterAirline on-time performance Have you ever been stuck in an airport because your flight was delayed or canceled and wondered if you could have predicted it if you'd had more data? This is your chance to find out.
The results We had a total of nine entries, and turn out at the poster session at the JSM was great, with plenty of people stopping by to find out why their flights were delayed.
The data The data consists of flight arrival and departure details for all commercial flights within the USA, from October 1987 to April 2008. This is a large dataset: there are nearly 120 million records in total and takes up 1.6 gigabytes of space when compressed and 12 gigabytes when uncompressed.
The challenge The aim of the data expo is to provide a graphical summary of important features of the data set. This is intentionally vague in order to allow different entries to focus on different aspects of the data, but here are a few ideas to get you started:
When is the best time of day/day of week/time of year to fly to minimise delays? Do older planes suffer more delays? How does the number of people flying between different locations change over time? How well does weather predict plane delays? Can you detect cascading failures as delays in one airport create delays in others? Are there critical links in the system? You are also welcome to work with interesting subsets: you might want to compare flight patterns before and after 9/11, or between the pair of cities that you fly between most often, or all flights to and from a major airport like Chicago (ORD). Smaller subsets may also help you to match up the data to other interesting datasets.
Columns | Name|Description| | --- | --- | |year| 1987-2008| |month| 1-12| |day of month| 1-31| |day of week| 1 (Monday) - 7 (Sunday)| |DepTime| actual departure time (minutes)| |CRSDepTime| scheduled departure time (minutes) |ArrTime| actual arrival time (minutes)| |CRSArrTime| scheduled arrival time (minutes)| |UniqueCarrier| unique carrier code| |FlightNum| flight number| |TailNum| plane tail number| |ActualElapsedTime| in minutes| |CRSElapsedTime| in minutes| |AirTime| in minutes| |ArrDelay| arrival delay, in minutes| |DepDelay| departure delay, in minutes| |Origin| origin IATA airport code| |Dest| destination IATA airport code| |Distance| in miles| |TaxiIn| taxi in time, in minutes| |TaxiOut| taxi out time in minutes| |Cancelled| was the flight cancelled?| |CancellationCode| reason for cancellation (A = carrier, B = weather, C = NAS, D = security)| |Diverted| 1 = yes, 0 = no| |CarrierDelay| in minutes| |WeatherDelay| in minutes| |NASDelay| in minutes| |SecurityDelay| in minutes| |LateAircraftDelay| in minutes|
Facebook
TwitterPassengers enplaned and deplaned at Canadian airports, annual.
Facebook
TwitterUpdates are delayed due to technical difficulties. How many people are staying at home? How far are people traveling when they don’t stay home? Which states and counties have more people taking trips? The Bureau of Transportation Statistics (BTS) now provides answers to those questions through our new mobility statistics. The Trips by Distance data and number of people staying home and not staying home are estimated for the Bureau of Transportation Statistics by the Maryland Transportation Institute and Center for Advanced Transportation Technology Laboratory at the University of Maryland. The travel statistics are produced from an anonymized national panel of mobile device data from multiple sources. All data sources used in the creation of the metrics contain no personal information. Data analysis is conducted at the aggregate national, state, and county levels. A weighting procedure expands the sample of millions of mobile devices, so the results are representative of the entire population in a nation, state, or county. To assure confidentiality and support data quality, no data are reported for a county if it has fewer than 50 devices in the sample on any given day. Trips are defined as movements that include a stay of longer than 10 minutes at an anonymized location away from home. Home locations are imputed on a weekly basis. A movement with multiple stays of longer than 10 minutes before returning home is counted as multiple trips. Trips capture travel by all modes of transportation. including driving, rail, transit, and air. The daily travel estimates are from a mobile device data panel from merged multiple data sources that address the geographic and temporal sample variation issues often observed in a single data source. The merged data panel only includes mobile devices whose anonymized location data meet a set of data quality standards, which further ensures the overall data quality and consistency. The data quality standards consider both temporal frequency and spatial accuracy of anonymized location point observations, temporal coverage and representativeness at the device level, spatial representativeness at the sample and county level, etc. A multi-level weighting method that employs both device and trip-level weights expands the sample to the underlying population at the county and state levels, before travel statistics are computed. These data are experimental and may not meet all of our quality standards. Experimental data products are created using new data sources or methodologies that benefit data users in the absence of other relevant products. We are seeking feedback from data users and stakeholders on the quality and usefulness of these new products. Experimental data products that meet our quality standards and demonstrate sufficient user demand may enter regular production if resources permit.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Modeling potential interactions between healthy individuals and those carrying COVID-19, denoted hereafter as (+), has been identified as a key methodology in the effort to predict, combat, and respond to COVID-19. In order to contribute to this effort within the domain of airline travel, this dataset allows users to see all flights during the time period from 01MAR-14APR where airline passengers may have come in contact with a COVID-19(+) TSA Screening Agent during their presumed incubation period, 7 days, before that agent went in quarantine.
The CORD-19 Research Challenge has been a great inspiration for this effort. Its focus on natural language processing has prompted the need for additional efforts in other statistical machine learning methods, such as those used in the UNCOVER COVID-19 Challenge. With COVID-19 research as a global focal point, I hope that this dataset provides researchers with another set of features to help build models towards finding answers.
Airline Data Inc. provided airline schedule information for the time period of 01MAR-14APR. This is one of the data products available as a part of their Data Hub. The airline schedule includes information on future and historical airline flights updated in real-time as it is filed by the airlines. This data provides access to origins and destinations, flight times, aircraft types, seats, customized route mapping, and much more. For this work, we focused on getting flight information to include terminals and carriers in order to determine potential contact of passengers and, at the time, unknowingly COVID-19(+) TSA agents. Airline Data Inc. additionally provided the T100 data from March and April of last year. The T100 provides information on particular routes (ORD->JFK) for U.S. domestic and international air service reported by carriers. This dataset includes passenger counts, available seats, load factors, equipment types, cargo, and other operating statistics. These datasets were combined to estimate the number of passengers flying various routes thought the time period in question. Undoubtedly these numbers are much lower than those of the previous year, but we make the assumption that airline travel declined in a relatively equal proportions across the US, making the load factors for last year comparatively accurate. Since the T100 data is only released on a monthly basis, these figures will not be able to be updated until the coming months.
The Transportation Security Administration posted publicly on their website a list of all Screening and Baggage Officers who tested positive for COVID-19. This list included the airport they worked in, their last day of work, and their work location with shift information. This data was taken and used to down-select the data from Airline Data Inc. to only include those flights that met the following criteria: - Origin airport with COVID-19(+) TSA Officer - Flight took off (the flight schedule data will show all potential flights even those that do not take off) - TSA Officer on shift at time of departure - TSA Officer working in terminal from which the flight departed
Facebook
TwitterAccessible Tables and Improved Quality
As part of the Analysis Function Reproducible Analytical Pipeline Strategy, processes to create all National Travel Survey (NTS) statistics tables have been improved to follow the principles of Reproducible Analytical Pipelines (RAP). This has resulted in improved efficiency and quality of NTS tables and therefore some historical estimates have seen very minor change, at least the fifth decimal place.
All NTS tables have also been redesigned in an accessible format where they can be used by as many people as possible, including people with an impaired vision, motor difficulties, cognitive impairments or learning disabilities and deafness or impaired hearing.
If you wish to provide feedback on these changes then please contact us.
NTS0501: https://assets.publishing.service.gov.uk/media/68a437a4cd7b7dcfaf2b5e88/nts0501.ods">Trips in progress by time of day and day of week - index: England, 2002 onwards (ODS, 65.8 KB)
NTS0502: https://assets.publishing.service.gov.uk/media/68a437a3f49bec79d23d2992/nts0502.ods">Trip start time by trip purpose (Monday to Friday only): England, 2002 onwards (ODS, 145 KB)
NTS0504: https://assets.publishing.service.gov.uk/media/68a437a4246cc964c53d2997/nts0504.ods">Average number of trips by day of the week or month and purpose or main mode: England, 2002 onwards (ODS, 148 KB)
National Travel Survey statistics
Email mailto:national.travelsurvey@dft.gov.uk">national.travelsurvey@dft.gov.uk
To hear more about DfT statistical publications as they are released, follow us on X at https://x.com/dftstats">DfTstats.
Facebook
Twitterhttps://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
The English Travel Chat Dataset is a comprehensive collection of over 12,000 text-based conversations between customers and call center agents. Focused on real-life travel and tourism interactions, this dataset captures the language, tone, and service dynamics essential for building robust conversational AI, chatbots, and NLP solutions for the travel industry in English-speaking markets.
The dataset encompasses a wide range of travel and tourism use cases across both customer-initiated and agent-initiated conversations:
This variety ensures wide applicability in both sales enablement and customer support automation.
Conversations are crafted to reflect the everyday language and nuances of English-speaking travelers:
These linguistic and cultural cues enable the development of context-aware, natural-sounding AI systems.
The dataset captures a variety of interaction types, including:
Facebook
TwitterThe statistics on daily passenger traffic provides some relevant figures concerning daily statistics on inbound and outbound passenger trips at all control points (with breakdown by Hong Kong Residents, Mainland Visitors and Other Visitors).
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Have you taken a flight in the U.S. in the past 15 years? If so, then you are a part of monthly data that the U.S. Department of Transportation's TranStats service makes available on various metrics for 15 U.S. airlines and 30 major U.S airports. Their website unfortunately does not include a method for easily downloading and sharing files. Furthermore, the source is built in ASP.NET, so extracting the data is rather cumbersome. To allow easier community access to this rich source of information, I scraped the metrics for every airline / airport combination and stored them in separate CSV files.
Occasionally, an airline doesn't serve a certain airport, or it didn't serve it for the entire duration that the data collection period covers*. In those cases, the data either doesn't exist or is typically too sparse to be of much use. As such, I've only uploaded complete files for airports that an airline served for the entire uninterrupted duration of the collection period. For these files, there should be 174 time series points for one or more of the nine columns below. I recommend any of the files for American, Delta, or United Airlines for outstanding examples of complete and robust airline data.
* No data for Atlas Air exists, and Virgin America commenced service in 2007, so no folders for either airline are included.
There are 13 airlines that have at least one complete dataset. Each airline's folder includes CSV file(s) for each airport that are complete as defined by the above criteria. I've double-checked the files, but if you find one that violates the criteria, please point it out. The file names have the format "AIRLINE-AIRPORT.csv", where both AIRLINE and AIRPORT are IATA codes. For a full listing of the airlines and airports that the codes correspond to, check out the airline_codes.csv or airport_codes.csv files that are included, or perform a lookup here. Note that the data in each airport file represents metrics for flights that originated at the airport.
Among the 13 airlines in data.zip, there are a total of 161 individual datasets. There are also two special folders included - airlines_all_airports.csv and airports_all_airlines.csv. The first contains datasets for each airline aggregated over all airports, while the second contains datasets for each airport aggregated over all airlines. To preview a sample dataset, check out all_airlines_all_airports.csv, which contains industry-wide data.
Each file includes the following metrics for each month from October 2002 to March 2017:
* Frequently contains missing values
Thanks to the U.S. Department of Transportation for collecting this data every month and making it publicly available to us all.
Source: https://www.transtats.bts.gov/Data_Elements.aspx
The airline / airport datasets are perfect for practicing and/or testing time series forecasting with classic statistical models such as autoregressive integrated moving average (ARIMA), or modern deep learning techniques such as long short-term memory (LSTM) networks. The datasets typically show evidence of trends, seasonality, and noise, so modeling and accurate forecasting can be challenging, but still more tractable than time series problems possessing more stochastic elements, e.g. stocks, currencies, commodities, etc. The source releases new data each month, so feel free to check your models' performances against new data as it comes out. I will update the files here every 3 to 6 months depending on how things go.
A future plan is to build a SQLite database so a vast array of queries can be run against the data. The data in it its current time series format is not conducive for this, so coming up with a workable structure for the tables is the first step towards this goal. If you have any suggestions for how I can improve the data presentation, or anything that you would like me to add, please let me know. Looking forward to seeing the questions that we can answer together!
Facebook
Twitterhttps://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
The Spanish Travel Chat Dataset is a comprehensive collection of over 10,000 text-based conversations between customers and call center agents. Focused on real-life travel and tourism interactions, this dataset captures the language, tone, and service dynamics essential for building robust conversational AI, chatbots, and NLP solutions for the travel industry in Spanish-speaking markets.
The dataset encompasses a wide range of travel and tourism use cases across both customer-initiated and agent-initiated conversations:
This variety ensures wide applicability in both sales enablement and customer support automation.
Conversations are crafted to reflect the everyday language and nuances of Spanish-speaking travelers:
These linguistic and cultural cues enable the development of context-aware, natural-sounding AI systems.
The dataset captures a variety of interaction types, including:
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Total monthly number of passengers arriving to and departing from Heathrow Airport, including both international and domestic flights.
Facebook
Twitterhttps://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
The Finnish Travel Chat Dataset is a comprehensive collection of over 10,000 text-based conversations between customers and call center agents. Focused on real-life travel and tourism interactions, this dataset captures the language, tone, and service dynamics essential for building robust conversational AI, chatbots, and NLP solutions for the travel industry in Finnish-speaking markets.
The dataset encompasses a wide range of travel and tourism use cases across both customer-initiated and agent-initiated conversations:
This variety ensures wide applicability in both sales enablement and customer support automation.
Conversations are crafted to reflect the everyday language and nuances of Finnish-speaking travelers:
These linguistic and cultural cues enable the development of context-aware, natural-sounding AI systems.
The dataset captures a variety of interaction types, including:
Facebook
TwitterAir passenger origin and destination data (passenger numbers, city rank), for transborder journeys, by total outbound and inbound passengers exceeding 4000, by city-pair, annual.
Facebook
TwitterList of the data tables as part of the Immigration system statistics Home Office release. Summary and detailed data tables covering the immigration system, including out-of-country and in-country visas, asylum, detention, and returns.
If you have any feedback, please email MigrationStatsEnquiries@homeoffice.gov.uk.
The Microsoft Excel .xlsx files may not be suitable for users of assistive technology.
If you use assistive technology (such as a screen reader) and need a version of these documents in a more accessible format, please email MigrationStatsEnquiries@homeoffice.gov.uk
Please tell us what format you need. It will help us if you say what assistive technology you use.
Immigration system statistics, year ending September 2025
Immigration system statistics quarterly release
Immigration system statistics user guide
Publishing detailed data tables in migration statistics
Policy and legislative changes affecting migration to the UK: timeline
Immigration statistics data archives
https://assets.publishing.service.gov.uk/media/691afc82e39a085bda43edd8/passenger-arrivals-summary-sep-2025-tables.ods">Passenger arrivals summary tables, year ending September 2025 (ODS, 31.5 KB)
‘Passengers refused entry at the border summary tables’ and ‘Passengers refused entry at the border detailed datasets’ have been discontinued. The latest published versions of these tables are from February 2025 and are available in the ‘Passenger refusals – release discontinued’ section. A similar data series, ‘Refused entry at port and subsequently departed’, is available within the Returns detailed and summary tables.
https://assets.publishing.service.gov.uk/media/691b03595a253e2c40d705b9/electronic-travel-authorisation-datasets-sep-2025.xlsx">Electronic travel authorisation detailed datasets, year ending September 2025 (MS Excel Spreadsheet, 58.6 KB)
ETA_D01: Applications for electronic travel authorisations, by nationality
ETA_D02: Outcomes of applications for electronic travel authorisations, by nationality
https://assets.publishing.service.gov.uk/media/6924812a367485ea116a56bd/visas-summary-sep-2025-tables.ods">Entry clearance visas summary tables, year ending September 2025 (ODS, 53.3 KB)
https://assets.publishing.service.gov.uk/media/691aebbf5a253e2c40d70598/entry-clearance-visa-outcomes-datasets-sep-2025.xlsx">Entry clearance visa applications and outcomes detailed datasets, year ending September 2025 (MS Excel Spreadsheet, 30.2 MB)
Vis_D01: Entry clearance visa applications, by nationality and visa type
Vis_D02: Outcomes of entry clearance visa applications, by nationality, visa type, and outcome
Additional data relating to in country and overse
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset provides a comprehensive overview of domestic airline routes within the United States. It includes valuable information for analyzing passenger travel patterns, market trends, and airline pricing strategies.