
A Survey on Disease Prediction from Healthcare Communities Over Big Data

T. Nagamani, V. Gokul Rajhen, B. Deventheran


Data mining is the process of extracting hidden, interesting patterns from massive databases. The medical domain contains heterogeneous data in the form of text, numbers, and images that can be mined to provide a variety of useful information for physicians. The patterns obtained from medical data can help physicians detect diseases, predict the survivability of patients after a disease, and assess the severity of diseases. The main aim of this paper is to analyse the application of data mining in the medical domain and some of the techniques used in disease prediction. Medical datasets are often characterized by a large number of disease measurements (features) and a comparatively small number of patient records. Many of these measurements are not relevant, which makes feature selection necessary, yet irrelevant and redundant features are difficult to evaluate. In addition, a large number of features can cause memory storage problems when representing the dataset. Different kinds of machine learning algorithms can cope with imprecision and uncertainty in data analysis and can effectively remove noise and erroneous information.


Machine Learning, Accuracy, Datasets, Data Mining, Disease Prediction.







This work is licensed under a Creative Commons Attribution 3.0 License.