Open Access Open Access  Restricted Access Subscription or Fee Access

An Archetype Match Algorithm Using Parser Based Search & Production Rules

K. K. Senapati, Rahul Verma

Abstract


Pattern matching Algorithm is presented here in two phases. The first phase is preprocessing in which an n-ary tree like Structure constructed for the given text data and Griebach normal form is created using context free grammar. The second phase called as search phase takes as input an n-ary tree structure and Griebach normal form of given context free grammar constructed in phase 1 and outputs those strings that match both the text data and context free grammar. There are two common ways to describe how a given string can be derived from the start symbol of a given grammar. The simplest way is to list the consecutive strings of symbols, beginning with the start symbol and ending with the string, and the rules that have been applied. In computer science, to say that a context-free grammar is in Greibach normal form (GNF) means that all production rules are of the
form:A->cX
or
S->Z
Where A is a non terminal symbol, c is a terminal symbol, X is a (possibly empty) sequence of non terminal symbols not including the start symbol, S is the start symbol, and Z is the null string. The algorithm proposed has the advantage of multiple string searches at random. It finds wide applications in Bioinformatics where one need to contrast and compare long sequences of human genome sequences for the earlier detection of genetic diseases and drug discovery.


Keywords


Context free grammar, Genomics, Griebach normal form, n-ary tree structure, Pattern Matching, Preprocessing.

Full Text:

PDF

References


Abarbanel, R.M., Wieneke, P.R., Mansfield, E., Jaffe D.A., and Brutlag,D.L. (1984), ‘‘Rapid searches for complex patterns in biological molecules,’’ NucleicAcids Research, Vol. 12, No. 1, pp. 263-280.

Aleksander, I. & Hanna F.H., 1976 Automata Theory: An Engineering Approach, Crane,Russak &Company, Inc.

Hawley, D.K., and McClure, W.R. (1983), ‘‘Compilation and analysis of Escherichia coli promoter DNA sequences,’’ Nucleic Acids Research Vol. 11, No. 8, pp. 2237-2255..

IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.3, March 2007

INCOSE, 2000 Systems Engineering Handbook,(Online 20 October,2003)

Luger, G. F., & Stubblefield, W. A., 1998, Artificial Intelligence,Structures and Strategies For Complex Problem Solving, Addison Wesley Longman Inc

Martin, J., 2000, ‘Requirements Mythology: Shattering Myths About Requirements and the Management Thereof’, Proceedings of the 2000 INCOSE Symposium

Feinberg, M. and Carroll, B.J.: Biological markers for endogenous depression in series and parallel, Biological Psychiatry 193-11, 1984.

Feinberg, M. and Carroll. BJ.: Biological and nonbiological depression, Presented at Annual Meeting of the Society of Biological Psychiatry, Los Angeles, May, 1984, Abstract #81.

Feinberg, M, Gillin, J. C., Carroll, B. J.. Greden, J. F., and Zis. A. P.zEEG studies of sleep in the diagnosis of depression, Biological Psychiatry, 17,305-316, 1982.

Heiser. J. F. and Brooks, R. E:Design considerations for a clinical psychopharmacology advisor, Proc. Second Annual Symp. on Computer Applications in Medical Care. New Yorlc IEEE, 1978, 278-285.

Heiser, J. F. and Brooks, R. E.zSome experience with transferring the MYCIN system to a new domain, IEEE Trans. on Pattern Analysis and Machine Intelligence, PAMI-2, No. 5, 477-478, 1980.

Kiloh. L. G., Andrews, G., and Neilson, M.zThe relationship of the syndromes called endogenous and neurotic depression, Brit. J.Psychiatry,121, 183-196, 1972.

Kupfer, D. J., Foster, F. G.. Cable, P., McPartland. R. J., and Ulrich,R.F.:The application of EEG sleep for the differential diagnosis of affective disorders, Am. J. Psychiatry, 135, 69-74, 1978.

Mulsant, B. and Servan-Schreiber, DXnowledge engineering: A daily activity on a hospital ward, Computers in Biomedical Research, 1984.

Spitzer. R. L, Endicoff J. and Robins, E.: Research diagnostic criteria, (2ded.) New York State Department of Mental Hygiene. New York Psychiatric Institute, Biometrics Research Division, 1975.

Spitzer, R. L.: (Ed.)Diagnostic and statistical manual of mental disorders,(3d ed.). Washington, D. C.: American Psychiatric Association, 1980.

Van Melle, W,The EMYCIN Manual, Computer Science Department,Stanford University, Report HPP-81-16, 1981.


Refbacks

  • There are currently no refbacks.