Open Access Open Access  Restricted Access Subscription or Fee Access

Outlier Mining of Drug Database in Chemo Informatics

Shipra Raheja, Shweta Taneja, Savneet Kaur

Abstract


The task of outlier detection is to find small groups ofdata objects that are exceptional when compared with rest large amount of data. Recently, the problem of outlier detection in categorical data is defined as an optimization problem. The Clustering and Outlier Analysis are theimportant techniques in data mining Earlier we implemented clustering technique on GENERAL drugs and ANTI HIV (Human Immunodeficiency Virus) drug databases based on descriptors mentioned using WEKA tool and in this paper we have detected the outliers. This method reduces the number of drugs in the GENERAL drug database as well as ANTI HIV database that need not be clinically tested. The outlier drugs obtained as a result should not be tried as they exhibit exceptional behavior.


Keywords


ANTI HIV Drugs, Chemo informatics, Clustering, Outlier mining

Full Text:

PDF

References


“Data Mining Promises To Dig Up New Drugs”, Science Daily, 2009

Ying Liu, Eran Eyal, and Ivet Bahar, “ Analysis of correlated mutations in HIV-1 protease using spectral clustering” Department of Computational Biology, School of Medicine, University of Pittsburgh, PA 15232,1243-1250 USA,2008

Shuang Liu Jiyi Wang Guolin Xing,“The review of outlier mining based on granular computing “Sch. of Math. & Inf. Sci., Zhejiang Normal Univ., Jinhua; Granular Computing,, GrC 2008. IEEE International Conference on Publication Date: 26 28 Aug. 2008

A.M. Mora1, J.J. Merelo1, C. Briones2, F. Mor´an2, 3, and J.L.J.Laredo1 “Clustering and Visualizing HIV Quasispecies Using Kohonen’s Self-Organizing Maps”1 Dept. ATC, U. Granada, Spain IWANN 2007,LNCS 4507, pp. 940–947, 2007.c_Springer-Verlag Berlin Heidelberg 2007

Md. Wasim Aktar and Sidhu Murmu “Chemoinformatics: Principles and Applications”1Pesticide Residue Laboratory, Department of Agricultural Chemicals,2Department of Agricultural Chemistry and Soil Science, Bidhan Chandra Krishi Viswavidyalaya, Mohanpur-741252,Nadia, West Bengal, India

Ray S. Lin, Soo-Yon Rhee, Robert W. Shafer, and Amar K. Das“A Combined Data mining approach for infrequent events: Analyzing HIV mutation changes based on treatment history “, Stanford Medical Informatics and Division of Infectious Diseases, Department of Medicine,Stanford University, 2006

Syahrul N. Junaini, and Huspi S.H., and Mat A.R., and Bolhassan N.A,“ A Framework for Data Mining Visualization using Cluster Analysis, An Experimental Study on HIV-AIDS Datasets” ICoMMS , Langkawi,Malaysia ,2006

Stephen D. Bay and Michael J. Pazzani “Detecting Change in Categorical Data: Mining Contrast Sets” Department of Information and Computer Science, University of California, Irvine, USA,2004

He,Z., Xu, X., Huang, J., Deng, S.,”A Frequent Pattern Discovery Based Method for Outlier Detection” In: Proc. of WAIM’04, LNCS 3129, pp.726-732, 2004

Stefano Zanero, Sergio M. Savaresi “Unsupervised learning techniques for an intrusion detection system” Piazza L. da Vinci, 32; 20133, Milan,Italy.Copyright 2004 ACM 1581138121/03/04 ...$5.00.

Carleton R. Sage, Kevin R. Holme, Nianish Sud and Rudy Potenzone “Providing Cheminformatics solutions to support drug discovery decisions”Lionbioscience, Inc., 9880 Campus Point Dr., San Diego, CA 92121, USA Published: 15th May 2003

Bay, S. D., Schwabacher, M,” Mining Distance Based Outliers in Near Linear Time with Randomization and a Simple Pruning Rule” In: Proc of KDD’03, pp.29-38, 2003

Niko Beerenwinkel, Barbara Schmidt, Hauke Walter, Rolf Kaiser, Thomas Lengauer, Daniel Hoffmann,Klaus Korn, and Joachim Selbig, “Diversity and complexity of HIV-1 drug resistance: A bioinformatics approach to predicting phenotype from genotype”Communicated by Richard M. Karp, International Computer Science Institute, Berkeley, CA, March 26, 2002 (received for review September 1, 2001)

Ramaswamy, S., Rastogi, R., Kyuseok, S.,”Efficient Algorithms for Mining[ Outliers from Large Data Sets.” In: Proc. of SIGMOD’00, pp.93-104,2001

Mapping Factors that Drive Drug Resistance (with a Focus on Resource-Limited Settings): A First Step Towards Better Informed Policy By Alexandra Beith . Drug Resistance Working Group

Community case management for former jail inmates: Its impacts on rearrest, drug use, and HIV risk . Karen Needels, Susanne James-Burdum, John Burghardt1. Publisher-Springer New York, Volume 82, Number 3 / September, 2005


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.