Open Access Open Access  Restricted Access Subscription or Fee Access

Comparative Analysis on Classification Data Mining Techniques through WEKA

Lukeshkumar Barapatre, Anand Sharma, Hemant Barapatre

Abstract


Data mining refers to extraction or mining of information/knowledge from huge amounts of data. Data mining is also called as Knowledge Discovery from Database (KDD). There are number of data mining techniques such as classification and regression; the mining of frequent patterns, associations, and correlations; clustering analysis; and outlier analysis. [1]

Classification is a major technique in data mining which is widely used in various fields.

Classification can be defined as the process of finding a model (or function) that describes and distinguishes data classes or concepts, for the purpose of being able to use the function/model to guess the class of objects whose class label is unknown. The derived model is based on the analysis of a set of training data (i.e., data objects whose class label is known). Classification models predict categorical class labels. Classification may also called as supervised Learning.[2]

In this paper, we are going to discuss the various techniques of classification.Different  kinds of classification techniques  includes if-then rule, decision tree, Neural Network .Bayesian networks, k-nearest neighbor classifier, and  support vector machine(SVM)[3], and the aim of this study is to provide a comprehensive review of different classification techniques in data mining.


Keywords


Classification, Data Mining, Decision Tree, Supervised Learning, WEKA, Pattern Evaluation, Neural Network, KDD

Full Text:

PDF

References


Jiawei Han,”Data Mining: Concepts and Techniques”, Second Edition, Morgan Kaufmann, 2006

Ritu Sharma et al “Comparative Analysis of Classification Techniques in Data Mining Using Different Datasets” International Journal of Computer Science and Mobile Computing, Vol.4 Issue.12, December-2015, pg. 125-134

A Tutorial on Support Vector Machines for Pattern Recognition CHRISTOPHER J.C. BURGES

S.Archana1, Dr. K. Elangovan, “Survey of Classification Techniques in Data Mining”, International Journal of Computer Science and Mobile Applications, Vol.2 Issue. 2, February-2014

S.Neelamegam, Dr.E.Ramaraj, “Classification algorithm in Data mining: An Overview”, International Journal of P2P Network Trends and Technology, Volume 4 Issue 8-Sep 2013

Megha Gupta, Naveen Aggarwal, “CLASSIFICATION TECHNIQUES ANALYSIS”, National Conference on Computational Instrumentation, March 2010.

Dr. Sudhir B. Jagtap, Dr. Kodge B. G, “Census Data Mining and Data Analysis using WEKA”, International Conference in “Emerging Trends in Science, Technology and Management, 2013

Dr.A.Bharathi, E.Deepankumar,” Survey on Classification Techniques in Data Mining”, International Journal on Recent and Innovation Trends in Computing and Communication, Volume. 2 Issue. 7, July 2014

A tutorial on support vector machines for pattern recognition,” Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121-167, 1998 by C. Burges

N. Abirami, T. Kamalakannan, Dr. A. Muthukumaravel, “A Study on Analysis of Various Data mining Classification Techniques on Healthcare Data”, International Journal of Emerging Technology and Advanced Engineering, Volume 3, Issue 7, July 2013

SAGAR S. NIkAM,” A Comparative Study of Classification Techniques in Data Mining Algorithms, ORIENTAL JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, ISSN: 0974-6471, April 2015, Vol. 8, No. (1): Pgs. 13-19

E. K. Onyari and F. M. Ilunga,” Application of MLP Neural Network and M5P Model Tree in Predicting Streamflow: A Case Study of Luvuvhu Catchment”, International Journal of Innovation, Management and Technology, Vol. 4, No. 1, February 2013

M. Thangaraj, C. R. Vijayalakshmi, “Performance Study on Rule-based Classification Techniques across Multiple Database Relations, International Journal of Applied Information Systems (IJAIS) –ISSN: 2249-0868,Foundation of Computer Science FCS, New York, USA Volume 5–No.4, March2013–www.ijais.org

Overview of Decision Trees by H.Hamilton. E. Gurak, L. Findlater W. Olive

Decision tree Lior Rokach Department of Industrial Engineering Tel-Aviv University, Oded Maimon Department of Industrial Engineering Tel-Aviv University maimon@eng.tau.ac.il

http://www.cs.ubbcluj.ro/~gabis/DocDiplome/DT/DecisionTrees.pdf

S. P. Shinde, V.P. Deshmukh, “Implementation of Pattern Recognition Techniques and Overview of Its Applications in Various Areas of Artificial Intelligence, International Journal of Advances in Engineering & Technology, Sept 2011. ©IJAET ISSN: 2231-1963

http://www.brighthubpm.com/project-planning/106000-advantages-of-decision-tree-analysis/

Dr. Neeraj Bhargava, Girja Sharma, Dr. Ritu Bhargava Manish Mathuria, “Decision Tree Analysis on J48 Algorithm for Data Mining, International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 6,June 2013.

G.P. Zhang, “Neural networks for classification: a survey, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) (Volume: 30, Issue: 4, Nov 2000)

Hassan Ramchoun, Mohammed Amine Janati Idrissi, Youssef Ghanou, Mohamed Ettaouil, “Multilayer Perceptron: Architecture Optimization and Training, International Journal of Interactive Multimedia and Artificial Intelligence, Vol. 4, Nº1

Dayana C. Tejera Hernandez, “An Experimental Study of K* Algorithm, I.J. Information Engineering and Electronic Business, 2015, 2, 14-19 Published Online March 2015 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijieeb.2015.02.03

Overview of Bayesian network approaches to model gene-environment interactions and cancer susceptibility Chengwei Su, Angeline Andrew, Margaret Karagas, Mark E. Borsuk

Sayali D. Jadhav, H.P. Channe, “Comparative Study of K-NN, Naive Bayes and Decision Tree Classification Techniques, International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Index Copernicus Value (2013): 6.14 | Impact Factor (2014): 5.611

Data Mining: practical machine learning tools and techniques with java implementation” Morgan Kaufmann Publishers, San Francisco, USA, 2000. http://www.cs.waikato.ac.nz/ml/weka/. By I. H. Witten and E.farnk


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.