Open Access Open Access  Restricted Access Subscription or Fee Access

Pattern Classification Using Optimized Machine Learning Techniques

R. Kalaivani, C. Devi Arockia Vanitha, R. Lawrance, M. Lydia Packiam Mettilda

Abstract


Most of the real world problems in engineering, medicine, industry, science and business also involve data classification. Classification is a supervised machine learning technique used to predict group membership for data instances. Pattern classification problems belong to the category of supervised learning. Pattern Classification involves assigning a label to a given input data. Neural Networks are an effective tool in the field of pattern classification, using training and testing data to build a model. Training neural networks in classification problems, especially when biological data are is a very challenging task. The protein superfamily classification problem, which consists of determining the superfamily membership of a given unknown protein sequence, is very important for a biologist for many practical reasons, such as drug discovery, prediction of molecular function and medical diagnosis. The objective of this work is creating a classification model for classifying data using Multilayer feed forward network. It contain two phases. First, classifier model was build for iris plant classification. Second, classifier model was build for protein sequence classification to know the organism of protein and family of the given protein sequence.

Keywords


Data Mining, Pattern Classification, Neural Network, Back Propagation, Iris Plant, Bioinformatics, Protein Sequence

Full Text:

PDF

References


Kalaivani.R, Vanitha,D.A, Lawrance.R. Feed Forward Neural Network Model for Classification of Data, In the Proceedings of National

Conference on Information Science and Engineering 2013 published by Coimbatore Institute of Information Technology (CiiT),PSN College of Engineering and Technology, Melathediyoor, 08.04.2013.

Han, J., Kamber, M. Data Mining: Concepts and Techniques, Second Edition, Morgan Kaufmann publishers.

Gupta, M., Aggarwal,N. Classification techniques analysis,NCCI2010- National Conference on Computational Instrumentation, CSIO Chandigarh, India, 19-20,March 2010.

Chen, Y.P . Bioinformatics Technologies, Springer-Verlag Berlin Heidelberg, 2005.

Kapadia, M.T., Lakhani, A. The integration of the back propagation algorithm into an autonomous robot control system, Dwarkadas j. Sanghvi College of engineering, vile parle, Mumbai.

Naik, A.R., Pathan. S.K. Weather Classification and Forecasting using Back Propagation Feed-forward Neural Network , International Journal of Scientific and Research Publications, Volume 2, Issue 12, December 2012.

Rehman. M. Z., Nawi. N. M. Improving the Accuracy of Gradient Descent Back Propagation Algorithm (GDAM) on Classification Problems, International Journal on New Computer Architectures and Their Applications (IJNCAA) 1(4): 838-847, The Society of Digital Information and Wireless Communications, 2011 (ISSN: 2220-9085).

Dhande.J. D., Dr.Gulhane.S.M. Design of Classifier Using Artificial Neural Network for Patients Survival Analysis, International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 1, Issue 2, November 2012

Swain, M., Dash, S,K.., Dash,S and Mohapatra,A. An approach for iris plant classification using neural network, International Journal on Soft Computing ( IJSC ) Vol.3, No.1, February 2012

Mubark, R.I., Keshk,H.A. and Eladawy,M.I. Different Species Classifier and Hemoglobin Structure Predictor based on DNA Sequences, International Journal of Biology and Biomedical Engineering, Issue 3, Volume 2,2008.

Rao, P.N., Devi ,T.U., Kaladhar ,D., Sridhar ,G. and Rao,A.A . A Probabilistic Neural Network Approach For Protein Super family Classification, Journal of Theoretical and Applied Information Technology, Vol6.No1, pp. 101-105, 2005 - 2009.

Zainuddin,Z. and Kumar,M. Radial Basis Function Neural Networks in Protein Sequence Classification, Malaysian Journal of Mathematical Sciences 2(2), pp.195 – 204 , 2008.

Zhao,X., Huang,D., Cheung,Y., Wang,H. and Huang ,X. A Novel Hybrid GA/SVM System for Protein Sequences Classification, Springer-Verlag Berlin Heidelberg, pp. 11–16, 2004.

Mansoori,E.G., Zolghadri,M.J., Katebi,S.D., Mohabatkar,H., Boostani,R. and Sadreddini,M.H . Generating Fuzzy Rules For Protein Classification, Iranian Journal of Fuzzy Systems, Vol. 5, No. 2, pp. 21-33, 2008.

Wang, D .and Huang, G. Protein Sequence Classification Using Extreme Learning Machine, Proceedings of International Joint Conference on Neural Networks(IJCNN), Montreal, Canada, 2005.

Mohamed, S., Rubin, D. and Marwala, T. Multi-class Protein Sequence Classification Using Fuzzy ARTMAP , IEEE conference on Systems, Man and Cybernetics, Taipei, Taiwan, October, 2006.

http://archive.ics.uci.edu/ml/datasets/Iris

http://neuroph.sourceforge.net/index.html

http://www.uniprot.org/uniprot/?query=hemoglobin&offset=2175&sort=score

http://www.ncbi.nlm.nih.gov/protein/?term=keratin

http://pir.georgetown.edu/cgi-bin/textsearch.pl


Refbacks

  • There are currently no refbacks.