Open Access Open Access  Restricted Access Subscription or Fee Access

Analysis of Machine Learning Algorithms in Prediction of Cardiovascular Diseases

Ashwini Biradar, N. Sushmitha, B M Sagar


Heart failure is considered as one among the most fatal diseases in the contemporary world. Diabetes mellitus, hypertension, and dyslipidemia are considered as the observed predictors of cardiovascular disease. Few routine style risk factors include depression, physical inactivity, smoking, alcohol consumption, stress, food habits and obesity which are the major causes for cardiovascular disease. In India, heart failure among people is increasing at an alarming rate because there is lack of proper estimation for the root cause of cardiovascular diseases and the absence of surveillance programme in order to track the occurrence, extensiveness and outcomes of heart failure. Data mining techniques prove to be an efficient approach in predicting the risk of cardiovascular diseases in the data deluge age. In this research study, data mining techniques are applied to get useful information from medical reports of patients. Using machine learning algorithms, the impact of each risk factor on heart disease is predicted. Firstly, the heart disease dataset is collected from the Cleveland Heart Disease database. With the help of the dataset, the attributes significant to the heart attack prediction are extracted. The dataset is split into training and test dataset. Different classification techniques are applied on preprocessed data to measure their accuracy in predicting the risk of heart disease. Two such algorithms are Logistic Regression and Gradient Boosting Algorithm. The objective is to attain high accuracy in the prediction of risk of cardiovascular diseases among patients. In order to prevent the occurrence of the cardiovascular diseases, the prevalence of risk factors should be minimized. Further, early conclusion and treatment can enhance quality and future of individuals who have heart disappointment.


Data Mining; Cardiovascular Diseases; Machine Learning Algorithms; Logistic Regression; Gradient Boosting Algorithm.

Full Text:



Niti Guru, Anil Dahiya, Navin Rajpal, "Decision Support System for Heart Disease Diagnosis Using Neural Network", Delhi Business Review, Vol. 8, No. I (January - June 2007.

M. Hertzong, and B. Pozehl, “Cluster analysis of symptom occurrence to identify subgroups of heart failure patients: A pilot study,” Journal of Cardiovascular Nursing, vol. 25, pp. 273–283, July/August 2010.

M. Panahiazar, V. Taslimitehrani, N. Pereira, and J. Pathak, “Using EHRs and machine learning for heart failure survival analysis,” Studies in health technology and informatics, MedInfo, vol. 216, pp. 40-44, 2015.

K. Kwon, H. Hwang, H. Kang, K. G. Woo, amd K. Shim, “A remote cardiac monitoring system for preventive care in Consumer Electronics (ICCE),” Proc. IEEE, pp. 197-200, January 2013.


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.