
An efficient semisupervised feedforward neural network clustering

Published online by Cambridge University Press:  02 December 2014

Roya Asadi*
Affiliation:
Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
Mitra Asadi
Affiliation:
Department of Research, Iranian Blood Transfusion Organization, Tehran, Iran
Sameem Abdul Kareem
Affiliation:
Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
*Reprint requests to: Roya Asadi, Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, 60503, Selangor, Malaysia. E-mail: royaasadi@siswa.um.edu.my

Abstract

We developed an efficient semisupervised feedforward neural network clustering model with one-epoch training and data dimensionality reduction capability to address the low training speed, low accuracy, and high memory complexity of existing clustering methods. During training, a codebook of nonrandom weights is learned directly from the input data. A standard weight vector is extracted from the codebook, and an exclusive threshold for each input instance is calculated from the standard weight vector. The input instances are clustered based on their exclusive thresholds. The model then assigns a class label to each input instance through the training set. The class label of each unlabeled input instance is predicted by considering a linear activation function and the exclusive threshold. Finally, the number of clusters and the density of each cluster are updated. The accuracy of the proposed model was measured through the number of clusters and the quantity of correctly classified nodes, yielding 99.85%, 100%, and 99.91% on the Breast Cancer, Iris, and Spam data sets from the University of California at Irvine Machine Learning Repository, respectively, and superior F-measure results between 98.29% and 100% on the breast cancer data set from the University of Malaya Medical Center used to predict survival time.
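
The abstract outlines the training pipeline only at a high level. As a rough illustration of the data flow it implies (a codebook learned from the input data, a standard weight vector, per-instance exclusive thresholds, threshold-based clustering, and label propagation from the labeled subset), here is a minimal Python sketch. It is not the authors' implementation: the min-max normalized codebook, the feature-wise mean used as the standard weight vector, the dot-product linear activation, the merging tolerance tol, and the majority-vote label assignment are all assumptions introduced for illustration.

import numpy as np

def semisupervised_ffnn_clustering(X, y_partial, tol=1e-3):
    """Hypothetical sketch of the clustering pipeline described in the abstract.

    X         : (n_samples, n_features) input data.
    y_partial : integer labels with -1 marking unlabeled instances.
    tol       : merging tolerance for exclusive thresholds (assumed parameter).
    """
    X = np.asarray(X, dtype=float)

    # 1. Learn a codebook of nonrandom weights directly from the input data
    #    (assumption: a min-max normalized copy of X serves as the codebook).
    span = X.max(axis=0) - X.min(axis=0)
    span[span == 0] = 1.0
    codebook = (X - X.min(axis=0)) / span

    # 2. Extract a standard weight vector from the codebook
    #    (assumption: the feature-wise mean of the codebook).
    standard_w = codebook.mean(axis=0)

    # 3. Exclusive threshold of each instance via a linear activation
    #    (assumption: dot product with the standard weight vector).
    thresholds = codebook @ standard_w

    # 4. Cluster instances whose exclusive thresholds lie within tol of each other.
    order = np.argsort(thresholds)
    clusters = np.full(len(X), -1)
    cid, prev = -1, None
    for i in order:
        if prev is None or thresholds[i] - prev > tol:
            cid += 1
        clusters[i] = cid
        prev = thresholds[i]

    # 5. Propagate class labels from the labeled subset to each cluster
    #    (assumption: majority vote among the cluster's labeled members).
    labels = np.array(y_partial)
    for c in range(cid + 1):
        members = np.where(clusters == c)[0]
        known = labels[members][labels[members] >= 0]
        if known.size:
            majority = np.bincount(known).argmax()
            labels[members] = np.where(labels[members] < 0, majority, labels[members])

    return clusters, labels, thresholds

if __name__ == "__main__":
    # Toy usage: two Gaussian blobs, only a quarter of the points labeled.
    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(0.0, 0.1, (20, 4)), rng.normal(1.0, 0.1, (20, 4))])
    y = np.array([0] * 5 + [-1] * 15 + [1] * 5 + [-1] * 15)
    clusters, labels, _ = semisupervised_ffnn_clustering(X, y, tol=0.05)
    print(clusters)
    print(labels)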

Type: Regular Articles
Copyright: © Cambridge University Press 2014


References

Alippi, C., Piuri, V., & Sami, M. (1995). Sensitivity to errors in artificial neural networks: a behavioral approach. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications 42(6), 358–361.
Andonie, R., & Kovalerchuk, B. (2007). Neural Networks for Data Mining: Constraints and Open Problems. Ellensburg, WA: Central Washington University, Computer Science Department.
Asadi, R., & Kareem, S.A. (2013). Review of feedforward neural network classification preprocessing techniques. Proc. 3rd Int. Conf. Mathematical Sciences (ICMS3), pp. 567–573, Kuala Lumpur, Malaysia.
Asadi, R., & Kareem, S.A. (2014). An unsupervised feedforward neural network model for efficient clustering. Manuscript submitted for publication.
Asadi, R., Sabah Hasan, H., & Abdul Kareem, S. (2013). Review of current online dynamic unsupervised feedforward neural network classification. Proc. Computer Science and Electronics Engineering (CSEE—ISI/Scopus) Conf., Kuala Lumpur, Malaysia.
Asadi, R., Sabah Hasan, H., & Abdul Kareem, S. (2014). Review of current online dynamic unsupervised feedforward neural network classification. International Journal of Artificial Intelligence and Neural Networks 4(2), 12.
Asuncion, A., & Newman, D. (2007). UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science. Accessed at http://www.ics.uci.edu/~mlearn/MLRepository
Bengio, Y., & Zurada, J.M. (2000). Introduction to the Special Issue on neural networks for data mining and knowledge discovery. IEEE Transactions on Neural Networks 11(3), 545–549.
Bengio, Y., Buhmann, J., Embrechts, M., & Zurada, J. (2000). Neural networks for data mining and knowledge discovery [Special Issue]. IEEE Transactions on Neural Networks 11(2).
Bose, N.K., & Liang, P. (1996). Neural Network Fundamentals With Graphs, Algorithms, and Applications. New York: McGraw–Hill.
Bouchachia, A., Gabrys, B., & Sahel, Z. (2007). Overview of some incremental learning algorithms. Proc. Fuzzy Systems Conf. Fuzz-IEEE, pp. 1–16, London, July 23–26.
Camastra, F., & Verri, A. (2005). A novel kernel method for clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(5), 801–805.
Chattopadhyay, M., Pranab, K., & Mazumdar, S. (2011). Principal component analysis and self-organizing map for visual clustering of machine-part cell formation in cellular manufacturing system. Systems Research Forum 5(1), 25–51.
Costa, J.A.F., & Oliveira, R.S. (2007). Cluster analysis using growing neural gas and graph partitioning. Proc. Int. Joint Conf. Neural Networks, Orlando, FL, August 12–17.
Craven, M.W., & Shavlik, J.W. (1997). Using neural networks for data mining. Future Generation Computer Systems 13(2), 211–229.
Daffertshofer, A., Lamoth, C.J.C., Meijer, O.G., & Beek, P.J. (2004). PCA in studying coordination and variability: a tutorial. Clinical Biomechanics 19(4), 415–428.
Dasarathy, B.V. (1990). Nearest Neighbor Pattern Classification Techniques. Los Alamitos, CA: IEEE Computer Society Press.
Demuth, H., Beale, M., & Hagan, M. (2008). Neural Network Toolbox 6: User's Guide. Natick, MA: MathWorks.
Deng, D., & Kasabov, N. (2003). On-line pattern analysis by evolving self-organizing maps. Neurocomputing 51, 87–103.
Fisher, R. (1950). The Use of Multiple Measurements in Taxonomic Problems: Contributions to Mathematical Statistics. New York: Wiley. (Original work published 1936)
Fritzke, B. (1995). A growing neural gas network learns topologies. Advances in Neural Information Processing Systems 7, 625–632.
Fritzke, B. (1997). Some Competitive Learning Methods. Dresden: Dresden University of Technology, Artificial Intelligence Institute.
Furao, S., Ogura, T., & Hasegawa, O. (2007). An enhanced self-organizing incremental neural network for online unsupervised learning. Neural Networks 20(8), 893–903.
Germano, T. (1999). Self-organizing maps. Accessed at http://davis.wpi.edu/~matt/courses/soms
Goebel, M., & Gruenwald, L. (1999). A survey of data mining and knowledge discovery software tools. ACM SIGKDD Explorations Newsletter 1(1), 20–33.
Gui, V., Vasiu, R., & Bojković, Z. (2001). A new operator for image enhancement. Facta Universitatis-Series: Electronics and Energetics 14(1), 109–117.
Hamker, F.H. (2001). Life-long learning cell structures—continuously learning without catastrophic interference. Neural Networks 14(4–5), 551–573.
Han, J., & Kamber, M. (2006). Data Mining, Southeast Asia Edition: Concepts and Techniques. San Francisco, CA: Morgan Kaufmann.
Hazlina, H., Sameem, A., NurAishah, M., & Yip, C. (2004). Back propagation neural network for the prognosis of breast cancer: comparison on different training algorithms. Proc. 2nd Int. Conf. Artificial Intelligence in Engineering & Technology (ICAIET), pp. 445–449.
Hebb, D.O. (1949). The Organization of Behavior: A Neuropsychological Approach. New York: Wiley.
Hinton, G.E. (1989). Deterministic Boltzmann learning performs steepest descent in weight space. Neural Computation 1(1), 143–150.
Hebboul, A., Hacini, M., & Hachouf, F. (2011). An incremental parallel neural network for unsupervised classification. Proc. 7th Int. Workshop on Systems, Signal Processing Systems and Their Applications (WOSSPA), pp. 400–403, Tipaza, Algeria, May 9–11.
Hegland, M. (2003). Data Mining—Challenges, Models, Methods and Algorithms. Canberra, Australia: Australian National University, ANU Data Mining Group.
Hinton, G.E., & Salakhutdinov, R.R. (2006). Reducing the dimensionality of data with neural networks. Science 313(5786), 504.
Honkela, T. (1998). Description of Kohonen's self-organizing map. Accessed at http://www.cis.hut.fi/~tho/thesis
Jacquier, E., Kane, A., & Marcus, A.J. (2003). Geometric or arithmetic mean: a reconsideration. Financial Analysts Journal 59(6), 46–53.
Jean, J.S., & Wang, J. (1994). Weight smoothing to improve network generalization. IEEE Transactions on Neural Networks 5(5), 752–763.
Jolliffe, I. (1986). Principal Component Analysis (pp. 17). New York: Springer.
Jolliffe, I.T. (2002). Principal Component Analysis (pp. 19). New York: Springer–Verlag.
Kamiya, Y., Ishii, T., Furao, S., & Hasegawa, O. (2007). An online semisupervised clustering algorithm based on a self-organizing incremental neural network. Proc. Int. Joint Conf. Neural Networks (IJCNN), pp. 1061–1066.
Kantardzic, M. (2011). Data Mining: Concepts, Models, Methods, and Algorithms. New York: Wiley–Interscience.
Kasabov, N.K. (1998). ECOS: evolving connectionist systems and the ECO learning paradigm. Proc. 5th Int. Conf. Neural Information Processing, ICONIP'98, pp. 123–128.
Kemp, R.A., MacAulay, C., & Palcic, B. (1997). Detection of malignancy associated changes in cervical cell nuclei using feed-forward neural networks. Journal of the European Society for Analytical Cellular Pathology 14(1), 31–40.
Kohonen, T. (1997). Self-Organizing Maps (Springer Series in Information Sciences, Vol. 30, pp. 22–25). Berlin: Springer–Verlag.
Kohonen, T. (2000). Self-Organizing Maps (3rd ed.). Berlin: Springer–Verlag.
Larochelle, H., Mandel, M., Pascanu, R., & Bengio, Y. (2012). Learning algorithms for the classification restricted Boltzmann machine. Journal of Machine Learning Research 13, 643–669.
Linde, Y., Buzo, A., & Gray, R. (1980). An algorithm for vector quantizer design. IEEE Transactions on Communications 28(1), 84–95.
Lindsay, R.S., Funahashi, T., Hanson, R.L., Matsuzawa, Y., Tanaka, S., Tataranni, P.A., et al. (2002). Adiponectin and development of type 2 diabetes in the Pima Indian population. Lancet 360(9326), 57–58.
Martinetz, T.M., Berkovich, S.G., & Schulten, K.J. (1993). Neural-gas network for vector quantization and its application to time-series prediction. IEEE Transactions on Neural Networks 4(4), 558–569.
McClelland, J.L., Thomas, A.G., McCandliss, B.D., & Fiez, J.A. (1999). Understanding failures of learning: Hebbian learning, competition for representational space, and some preliminary experimental data. Progress in Brain Research 121, 75–80.
McCloskey, S. (2000). Neural networks and machine learning, p. 755. Accessed at http://www.cim.mcgill.ca/~scott/RIT/research_project.html
Melek, W.W., & Sadeghian, A. (2009). A theoretic framework for intelligent expert systems in medical encounter evaluation. Expert Systems 26(1), 82–99.
Oh, M., & Park, H.M. (2011). Preprocessing of independent vector analysis using feed-forward network for robust speech recognition. Proc. Neural Information Processing Conf., pp. 366–373.
Özbay, Y., Ceylan, R., & Karlik, B. (2006). A fuzzy clustering neural network architecture for classification of ECG arrhythmias. Computers in Biology and Medicine 36(4), 376–388.
Pavel, B. (2002). Survey of Clustering Data Mining Techniques. San Jose, CA: Accrue Software.
Peng, J.-M., & Lin, Z. (1999). A non-interior continuation method for generalized linear complementarity problems. Mathematical Programming 86(3), 533–563.
Prudent, Y., & Ennaji, A. (2005). An incremental growing neural gas learns topologies. Proc. IEEE Int. Joint Conf. Neural Networks, IJCNN'05, pp. 1211–1216.
Rougier, N., & Boniface, Y. (2011). Dynamic self-organising map. Neurocomputing 74(11), 1840–1847.
Shen, F., Yu, H., Sakurai, K., & Hasegawa, O. (2011). An incremental online semisupervised active learning algorithm based on self-organizing incremental neural network. Neural Computing and Applications 20(7), 1061–1074.
Tong, X., Qi, L., Wu, F., & Zhou, H. (2010). A smoothing method for solving portfolio optimization with CVaR and applications in allocation of generation asset. Applied Mathematics and Computation 216(6), 1723–1740.
Ultsch, A., & Siemon, H.P. (1990). Kohonen's self organizing feature maps for exploratory data analysis. Proc. Int. Neural Networks Conf., pp. 305–308.
Van der Maaten, L.J., Postma, E.O., & Van den Herik, H.J. (2009). Dimensionality reduction: a comparative review. Journal of Machine Learning Research 10(1), 66–71.
Vandesompele, J., De Preter, K., Pattyn, F., Poppe, B., Van Roy, N., De Paepe, A., et al. (2002). Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biology 3(7).
Werbos, P. (1974). Beyond regression: new tools for prediction and analysis in the behavioral sciences. PhD Thesis, Harvard University.
Wolberg, W.H., & Mangasarian, O.L. (1990). Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proceedings of the National Academy of Sciences 87(23), 9193–9196.
Ziegel, E.R. (2002). Statistical inference. Technometrics 44(4).