Skip to main content Accessibility help

Prediction of warning level in aircraft accidents using data mining techniques

  • A. B. Arockia Christopher (a1) and S. Appavu alias Balamurugan (a2)


Data mining is a data analysis process which is designed for large amounts of data. It proposes a methodology for evaluating risk and safety and describes the main issues of aircraft accidents. We have a huge amount of knowledge and data collection in aviation companies. This paper focuses on different feature selectwindion techniques applied to the datasets of airline databases to understand and clean the dataset. CFS subset evaluator, consistency subset evaluator, gain ratio feature evaluator, information gain attribute evaluator, OneR attribute evaluator, principal components attribute transformer, ReliefF attribute evaluatoboundar and symmetrical uncertainty attribute evaluator are used in this study in order to reduce the number of initial attributes. The classification algorithms, such as DT, KNN, SVM, NN and NB, are used to predict the warning level of the component as the class attribute. We have explored the use of different classification techniques on aviation components data. For this purpose Weka software tools are used. This study also proves that the principal components attribute with decision tree classifier would perform better than other attributes and techniques on airline data. Accuracy is also very highly improved. This work may be useful for an aviation company to make better predictions. Some safety recommendations are also addressed to airline companies.


Corresponding author


Hide All
1. Arauzo-Azofra, A., Benitez, J.M. and Castro, J.C. Consistency measures for feature selection, J Intell Inf Syst, 2008, 30, pp 273292.
2. Asha, G.K, Manjunath, A.S. and Jayaram, M.A. A comparative study of attribute selection using gain ratio and correlation based feature selection, Int J of Info Tech and Knowledge Management, July-December 2012, 2, pp 271277.
3. Ramana, B.V. Babu, M.S.P. and Venkateswarlu, N.B. A critical study of selected classification algorithms for liver disease diagnosis, Int J Database Management Systems, May 2011, 3, (2).
4. Ienco, D., Pensa, R.G. and Meo, R. Context-based Distance Learning for Categorical Data Clustering, 2009, IDA 2009, LNCS 5772, Springer, Berlin, pp 8394.
5. Recent advances in data mining, Eng Apps of Artificial Intelligence (Editorial), 19, 2006, pp 361362.
6. Guyon, I. and Elisseeff, A. An introduction to variable and feature selection, J Machine Learning Research, 2003, 3, pp 11571182.
7. He, H., Graco, W. and Yao, X. Application of genetic algorithm and k -nearest neighbour method in medical fraud detection, simulated evolution and learning, November 1998, Second Asia-Pacific Conference on Simulated Evolution and Learning, SEAL’ 98, Canberra, Australia, Springer, pp 7481.
8. Jiawei, H. and Kamber, M. Data Mining: Concepts and Techniques, 2001, University of Simon Fraser.
9. Wang, H., Khoshgoftaar, T.M. and Napolitano, A. A comparative study of ensemble feature selection techniques for software defect prediction, 2010, Ninth International Conference on Machine Learning and Applications, 12-14 December 2010, Washington, DC, USA, pp 135140.
10. Shyur, H.J. A quantitative model for aviation safety risk assessment, Computers and Ind Eng, 2007.
11. Han, J. and Kamber, M. Data Mining; Concepts and Techniques, 2000, Morgan Kaufmann Publishers.
12. Bineid, M. and Fielding, J.P. Development of a civil aircraft dispatch reliability prediction methodology, Aircraft Eng and Aerospace Tech, 2003, 75, (6), 2003, pp 588594.
13. Murty, M.N. and Devi, V.S. Pattern Recognition: An Algorithmic Approach, Springer, Ch 4, pp 8697.
14. Grimaldi, M., Cunningham, P. and Kokaram, A. An Evaluation of Alternative Feature Selection Strategies and Ensemble Techniques for Classifying Music, Trinity College Dublin, Ireland.
15. Pizzi, N.J. and Pedrycz, W. Effective classifcation using feature selection and fuzzy, Integration Fuzzy Sets and Systems, 2008.
16. Dessureault, S., Sinuhaji, A. and Coleman, P. Data mining mine safety data, Mining Engineering Littleton, 2007, 59, (8), p 64. 7 pgs.
17. Pulatova, S. Covering (rule-based) Algorithms Lecture Notes in Data Mining, 2006, World Scientific Publishing Co, pp 8797.
18. Solomon, S. Nguyen, H. Liebowitz, J. and Agresti, W. Using data mining to improve traffic safety programs, Ind Management and Data Systems, 2006, 106, (5), pp 621643.
19. Suraj, Z. and Delimata, P. Data mining exploration system for feature selection tasks. In International conference on hybrid information technology, 2006, Ichıt’06, User Manuel of Polyanalyst 5, April 2005, IEEE, Computer Society.
20. Mitchell, T. Machine Learning, 1997, McGraw Hill.
21. Altidor, W., Khoshgoftaar, T.M., Van Hulse, J. and Napolitano, A. Ensemble feature ranking methods for data intensive computing applications, Handbook of Data Intensive Computing, Springer Science + Business Media, LLC 2011, pp 349376.
22. Chen, Y., Li, Y., Cheng, X., Guo, L., Lipmaa, H., Yung, M. and Lin, D. Survey and taxonomy of feature selection algorithms in intrusion detection system, 2006, Inscrypt 2006, LNCS 4318 Springer-Verlag, Berlin, Germany, pp153167.
23. Nazeri, Z. and Jianping, Z. Mining aviation data to understand impacts of severe weather on airspace system performance, International Conference on Information Technology, 2002, IEEE.

Related content

Powered by UNSILO

Prediction of warning level in aircraft accidents using data mining techniques

  • A. B. Arockia Christopher (a1) and S. Appavu alias Balamurugan (a2)


Full text views

Total number of HTML views: 0
Total number of PDF views: 0 *
Loading metrics...

Abstract views

Total abstract views: 0 *
Loading metrics...

* Views captured on Cambridge Core between <date>. This data will be updated every 24 hours.

Usage data cannot currently be displayed.