Key Metrics
Project Overview
About the Dataset
Dataset Information
- Dataset: Synthetic Airline Dataset (Kaggle)
- Author: Sourav Banerjee
- Tool: Tableau Public
- Records: 98,619 passenger-flight records
- Target: Flight Status
- Categories: On Time, Delayed, Cancelled
Business Context
Airlines generate vast amounts of passenger and flight data, yet delays and cancellations remain critical operational and customer experience challenges. Understanding the factors behind flight disruptions is essential for improving efficiency, resource allocation, and passenger satisfaction.
Business Question: "What demographic, geographic, and temporal factors are associated with flight delays and cancellations?"
Methodology
STAR Framework
Situation
Airlines need better visibility into disruptions such as delays and cancellations to improve operational planning and customer experience.
Task
Analyze airline data and build an interactive dashboard to identify flight status patterns and key contributing factors.
Action
Cleaned and profiled the data, created calculated fields, standardized continent names, and designed a comprehensive Tableau dashboard.
Result
Delivered an interactive dashboard summarizing flight performance and highlighting patterns by time, geography, airport, age group, gender, and nationality.
Live Dashboard
Interactive Tableau Dashboard
Explore flight performance data interactively. Analyze delays, cancellations, and on-time performance by airport, continent, passenger demographics, and time.
Project Presentation
Presentation Slides
A detailed walkthrough of the analysis, methodology, and key findings.
Findings
Key Insights
Limitations
- The dataset is synthetic and does not represent real airline operations.
- The target variable (Flight Status) is artificially balanced - not typical of real-world distributions.
- Results should be interpreted as analytical patterns, not real operational performance.
- Some categories (e.g., specific nationalities) have very small record counts, which may produce misleading high rates.
Future Improvements
- Add minimum-volume filters for nationalities and airports to avoid sparse-data artifacts.
- Include route-level analysis to identify high-risk origin-destination pairs.
- Incorporate weekday and holiday effects on flight status.
- Build predictive models for flight status classification.
- Evaluate models using accuracy, F1-score, and confusion matrix.