6 datasets found
  1. Airline Dataset

    • kaggle.com
    Updated Sep 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sourav Banerjee (2023). Airline Dataset [Dataset]. https://www.kaggle.com/datasets/iamsouravbanerjee/airline-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 26, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sourav Banerjee
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Airline data holds immense importance as it offers insights into the functioning and efficiency of the aviation industry. It provides valuable information about flight routes, schedules, passenger demographics, and preferences, which airlines can leverage to optimize their operations and enhance customer experiences. By analyzing data on delays, cancellations, and on-time performance, airlines can identify trends and implement strategies to improve punctuality and mitigate disruptions. Moreover, regulatory bodies and policymakers rely on this data to ensure safety standards, enforce regulations, and make informed decisions regarding aviation policies. Researchers and analysts use airline data to study market trends, assess environmental impacts, and develop strategies for sustainable growth within the industry. In essence, airline data serves as a foundation for informed decision-making, operational efficiency, and the overall advancement of the aviation sector.

    Content

    This dataset comprises diverse parameters relating to airline operations on a global scale. The dataset prominently incorporates fields such as Passenger ID, First Name, Last Name, Gender, Age, Nationality, Airport Name, Airport Country Code, Country Name, Airport Continent, Continents, Departure Date, Arrival Airport, Pilot Name, and Flight Status. These columns collectively provide comprehensive insights into passenger demographics, travel details, flight routes, crew information, and flight statuses. Researchers and industry experts can leverage this dataset to analyze trends in passenger behavior, optimize travel experiences, evaluate pilot performance, and enhance overall flight operations.

    Dataset Glossary (Column-wise)

    • Passenger ID - Unique identifier for each passenger
    • First Name - First name of the passenger
    • Last Name - Last name of the passenger
    • Gender - Gender of the passenger
    • Age - Age of the passenger
    • Nationality - Nationality of the passenger
    • Airport Name - Name of the airport where the passenger boarded
    • Airport Country Code - Country code of the airport's location
    • Country Name - Name of the country the airport is located in
    • Airport Continent - Continent where the airport is situated
    • Continents - Continents involved in the flight route
    • Departure Date - Date when the flight departed
    • Arrival Airport - Destination airport of the flight
    • Pilot Name - Name of the pilot operating the flight
    • Flight Status - Current status of the flight (e.g., on-time, delayed, canceled)

    Structure of the Dataset

    https://i.imgur.com/cUFuMeU.png" alt="">

    Acknowledgement

    The dataset provided here is a simulated example and was generated using the online platform found at Mockaroo. This web-based tool offers a service that enables the creation of customizable Synthetic datasets that closely resemble real data. It is primarily intended for use by developers, testers, and data experts who require sample data for a range of uses, including testing databases, filling applications with demonstration data, and crafting lifelike illustrations for presentations and tutorials. To explore further details, you can visit their website.

    Cover Photo by: Kevin Woblick on Unsplash

    Thumbnail by: Airplane icons created by Freepik - Flaticon

  2. Annual California Children’s Services Whole Child Model Summary Data

    • data.chhs.ca.gov
    • healthdata.gov
    • +2more
    csv, zip
    Updated Jan 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Health Care Services (2025). Annual California Children’s Services Whole Child Model Summary Data [Dataset]. https://data.chhs.ca.gov/dataset/annual-california-children-s-services-ccs-whole-child-model-wcm-summary-data
    Explore at:
    csv(449884), csv(239234), csv(113317), zip, csv(230661), csv(152116), csv(284404)Available download formats
    Dataset updated
    Jan 30, 2025
    Dataset provided by
    California Department of Health Care Serviceshttp://www.dhcs.ca.gov/
    Authors
    Department of Health Care Services
    Area covered
    California
    Description

    These datasets contain summary data about the Annual California Children’s Services (CCS) Whole Child Model (WCM) program. These summary files are intended to accompany the CCS Power BI Dashboard which is posted on the DHCS internet. The CCS and WCM Programs provide diagnostic and treatment services, case management, and physical and occupational therapy services to children under age 21 with CCS-eligible medical conditions. Examples of CCS-eligible conditions include, but are not limited to, chronic medical conditions such as cystic fibrosis, hemophilia, cerebral palsy, heart disease, cancer, traumatic injuries, and infectious diseases producing major sequelae.

  3. NETFLIX STOCK PRICE HISTORY

    • kaggle.com
    Updated Jul 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adil Shamim (2025). NETFLIX STOCK PRICE HISTORY [Dataset]. https://www.kaggle.com/datasets/adilshamim8/netflix-stock-price-history/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 8, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Adil Shamim
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset offers a comprehensive historical record of Netflix’s stock price movements, capturing the company’s financial journey from its early days to its position as a global streaming giant.

    From its IPO in May 2002, Netflix (Ticker: NFLX) has transformed from a DVD rental service to a powerhouse in on-demand digital content. With its disruptive innovation, strategic shifts, and global expansion, Netflix has seen dramatic shifts in stock prices, reflecting not just market trends but also cultural impact. This dataset provides a window into that evolution.

    What’s Included?

    Each row in this dataset represents daily trading activity on the stock market and includes the following columns:

    • Date – The trading day (from 2002 onward)
    • Open – Stock price when the market opened
    • High – Highest trading price of the day
    • Low – Lowest trading price of the day
    • Close – Final price at market close
    • Adj Close – Closing price adjusted for splits and dividends
    • Volume – Number of shares traded that day

    The data is structured in CSV format and is clean, easy to use, and ready for immediate analysis.

    Why Use This Dataset?

    Whether you're learning data science, building a financial model, or exploring machine learning in the real world, this dataset is a goldmine of insights. Netflix's market history includes:

    • Periods of explosive growth during digital transformation
    • Volatility during market crashes and global events (e.g., 2008, COVID-19)
    • Strategic pivots such as the shift to original content
    • Market reactions to earnings, acquisitions, and subscriber milestones

    This makes the dataset ideal for:

    • Time-series forecasting (ARIMA, Prophet, LSTM)
    • Technical and trend analysis (moving averages, RSI, Bollinger Bands)
    • Predictive modeling with machine learning
    • Investment simulation projects
    • Stock market visualization and storytelling
    • Financial dashboards (Tableau, Power BI, Streamlit, etc.)

    Who Can Use It?

    This dataset is designed for:

    • Aspiring data scientists practicing EDA and modeling
    • Financial analysts and traders exploring trends
    • AI researchers working on time-series models
    • Students building ML projects
    • Developers creating stock visualization tools
    • Kaggle competitors seeking real-world datasets

    Data Source & Credits

    The dataset is derived from publicly available historical stock price data, such as Yahoo Finance, and has been cleaned and organized for educational and research purposes. It is continuously maintained to ensure accuracy.

    Start Exploring

    Netflix’s rise is more than just a business story — it’s a data-driven journey. With this dataset, you can analyze the company’s stock behavior, train models to predict future trends, or simply visualize how tech reshapes the market.

  4. Inventory Management

    • kaggle.com
    Updated May 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fayez1 (2023). Inventory Management [Dataset]. https://www.kaggle.com/datasets/fayez1/inventory-management
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 25, 2023
    Dataset provided by
    Kaggle
    Authors
    Fayez1
    Description

    This dataset can be used for creating an Inventory Dashboard. We can find the: - ABC Inventory Classification - XYZ Classification - Inventory Turnover Ratio - Calculation of Safety Stock - Reorder points - Stock Status Classification - Demand Forecasting on Power BI It is extremely useful for Warehouse/ In-plant Inventory Managers to effectively control the Inventory levels and also maintain the Service Levels.

  5. Riyadh Metro Stations

    • kaggle.com
    Updated Mar 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meshal Alsanari (2025). Riyadh Metro Stations [Dataset]. https://www.kaggle.com/datasets/meshalalsanari/riyadh-metro-stations
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 20, 2025
    Dataset provided by
    Kaggle
    Authors
    Meshal Alsanari
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Area covered
    Riyadh
    Description

    Riyadh Metro Stations Dataset

    Dataset Overview

    This dataset contains information about metro stations in Riyadh, Saudi Arabia. It includes details such as station names, types, ratings, and geographic coordinates. The dataset is valuable for transportation analysis, urban planning, and navigation applications.

    Dataset Contents

    The dataset consists of the following columns:

    Column NameData TypeDescription
    NamestringName of the metro station
    Type_of_UtilitystringType of station (Metro Station)
    Number_of_RatingsfloatTotal number of reviews received (some values may be missing)
    RatingfloatAverage rating score (scale: 0-5, some values may be missing)
    LongitudefloatGeographical longitude coordinate
    LatitudefloatGeographical latitude coordinate

    Potential Use Cases

    • Urban Mobility Analysis: Study metro station distribution and accessibility.
    • Transportation Planning: Analyze station usage based on ratings and reviews.
    • Navigation & Mapping: Enhance public transit applications with station locations.
    • Service Optimization: Identify areas needing better metro services.

    How to Use

    1. Load the dataset into a data analysis tool like Python (pandas), R, or Excel.
    2. Filter or group data based on ratings, locations, or number of reviews.
    3. Use visualization tools like matplotlib, seaborn, or Power BI for insights.
    4. Integrate with GIS software for geospatial mapping.

    License & Acknowledgments

    • Data sourced from publicly available platforms.
    • This dataset is open for non-commercial research and analysis purposes.
    • Proper attribution is required when using this dataset in research or publications.

    Contact Information

    For questions or collaboration, reach out via Kaggle comments or email.

  6. Los Angeles Census Tracts (500 Cities): Local Data for Better Health, 2017...

    • metropolis.demo.socrata.com
    csv, xlsx, xml
    Updated May 12, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Population Health (2018). Los Angeles Census Tracts (500 Cities): Local Data for Better Health, 2017 release for Power BI OData Demo [Dataset]. https://metropolis.demo.socrata.com/Health/Los-Angeles-Census-Tracts-500-Cities-Local-Data-fo/5tyu-tf6k
    Explore at:
    xlsx, xml, csvAvailable download formats
    Dataset updated
    May 12, 2018
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Authors
    Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Population Health
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Area covered
    Los Angeles
    Description

    This is the filtered dataset of LA Census Tracts from the 500 Cities project 2017 release. This dataset includes 2015, 2014 model-based small area estimates for 27 measures of chronic disease related to unhealthy behaviors (5), health outcomes (13), and use of preventive services (9). Data were provided by the Centers for Disease Control and Prevention (CDC), Division of Population Health, Epidemiology and Surveillance Branch. The project was funded by the Robert Wood Johnson Foundation (RWJF) in conjunction with the CDC Foundation. It represents a first-of-its kind effort to release information on a large scale for cities and for small areas within those cities. It includes estimates for the 500 largest US cities and approximately 28,000 census tracts within these cities. These estimates can be used to identify emerging health problems and to inform development and implementation of effective, targeted public health prevention activities. Because the small area model cannot detect effects due to local interventions, users are cautioned against using these estimates for program or policy evaluations. Data sources used to generate these measures include Behavioral Risk Factor Surveillance System (BRFSS) data (2015, 2014), Census Bureau 2010 census population data, and American Community Survey (ACS) 2011-2015, 2010-2014 estimates. Because some questions are only asked every other year in the BRFSS, there are 7 measures from the 2014 BRFSS that are the same in the 2017 release as the previous 2016 release. More information about the methodology can be found at www.cdc.gov/500cities.

  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sourav Banerjee (2023). Airline Dataset [Dataset]. https://www.kaggle.com/datasets/iamsouravbanerjee/airline-dataset
Organization logo

Airline Dataset

Navigating the Skies: Exploring Insights from Synthetic Airline Data

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 26, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sourav Banerjee
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

Airline data holds immense importance as it offers insights into the functioning and efficiency of the aviation industry. It provides valuable information about flight routes, schedules, passenger demographics, and preferences, which airlines can leverage to optimize their operations and enhance customer experiences. By analyzing data on delays, cancellations, and on-time performance, airlines can identify trends and implement strategies to improve punctuality and mitigate disruptions. Moreover, regulatory bodies and policymakers rely on this data to ensure safety standards, enforce regulations, and make informed decisions regarding aviation policies. Researchers and analysts use airline data to study market trends, assess environmental impacts, and develop strategies for sustainable growth within the industry. In essence, airline data serves as a foundation for informed decision-making, operational efficiency, and the overall advancement of the aviation sector.

Content

This dataset comprises diverse parameters relating to airline operations on a global scale. The dataset prominently incorporates fields such as Passenger ID, First Name, Last Name, Gender, Age, Nationality, Airport Name, Airport Country Code, Country Name, Airport Continent, Continents, Departure Date, Arrival Airport, Pilot Name, and Flight Status. These columns collectively provide comprehensive insights into passenger demographics, travel details, flight routes, crew information, and flight statuses. Researchers and industry experts can leverage this dataset to analyze trends in passenger behavior, optimize travel experiences, evaluate pilot performance, and enhance overall flight operations.

Dataset Glossary (Column-wise)

  • Passenger ID - Unique identifier for each passenger
  • First Name - First name of the passenger
  • Last Name - Last name of the passenger
  • Gender - Gender of the passenger
  • Age - Age of the passenger
  • Nationality - Nationality of the passenger
  • Airport Name - Name of the airport where the passenger boarded
  • Airport Country Code - Country code of the airport's location
  • Country Name - Name of the country the airport is located in
  • Airport Continent - Continent where the airport is situated
  • Continents - Continents involved in the flight route
  • Departure Date - Date when the flight departed
  • Arrival Airport - Destination airport of the flight
  • Pilot Name - Name of the pilot operating the flight
  • Flight Status - Current status of the flight (e.g., on-time, delayed, canceled)

Structure of the Dataset

https://i.imgur.com/cUFuMeU.png" alt="">

Acknowledgement

The dataset provided here is a simulated example and was generated using the online platform found at Mockaroo. This web-based tool offers a service that enables the creation of customizable Synthetic datasets that closely resemble real data. It is primarily intended for use by developers, testers, and data experts who require sample data for a range of uses, including testing databases, filling applications with demonstration data, and crafting lifelike illustrations for presentations and tutorials. To explore further details, you can visit their website.

Cover Photo by: Kevin Woblick on Unsplash

Thumbnail by: Airplane icons created by Freepik - Flaticon

Search
Clear search
Close search
Google apps
Main menu