Open Access Open Access  Restricted Access Subscription or Fee Access

Extraction and Visualization of a Molecular Pathway Using Natural Language Processing System

R. Buvaneshwari, K. Lavanya

Abstract


In recent year Natural Language Processing (shortly NLP) is a very attractive method of human–computer interaction. The importance of extracting biomedical information from scientific publications is well recognized. A number of information extraction systems in the biomedical domain. In this paper present extraction, visualization and analysis the present in the biomedical system. We present GENIES, that extracts and structures information about cellular pathways from the biological literature in accordance with a knowledge We implemented GENIES by modifying an existing medical natural language processing system, MedLEE and performed a preliminary evaluation study. Our results demonstrate the value of the underlying techniques for the purpose of acquiring valuable knowledge from biological journals.


Keywords


Natural Language Processing, Information Extraction, MedLee

Full Text:

PDF

References


Shatkay H, Edwards S, Wilbur WJ, Boguski M. Genes, themes and microarrays: using information retrieval for large-scale gene analysis. Proc Int Conf Intell Syst Mol Biol 2000; 8:317–28.

Shatkay H, Wilbur J. Finding themes in Medline documents: probabilistic similarity search. Ismb 2000.

Shatkay H, Wilbur WJ. Finding themes in medline abstracts. In: IEEE Advances in Digital Libraries; 2000; 2000.

Iliopoulos I, Enright AJ, Ouzounis CA. Textquest: document clustering of Medline abstracts for concept discovery in molecular biology. Pac Symp Biocomput 2001:384–95.

Craven M, Kumlien J. Constructing biological knowledge bases by extracting information from text sources. Proc Int Conf Intell Syst Mol Biol 1999:77–86.

Marcotte EM, Xenarios I, Eisenberg D. Mining literature for protein–protein interactions. Bioinformatics 2001; 17(4):359–63.

Joachims T. A statistical learning model of text classification with support vector machines. In: Croft WB, Harper DJ, Kraft DH, Zobel J, editors. SIGIR-01, 24th ACM International Conference on Research and Development in Information Retrieval. New York: ACM Press; 2001.

Joachims T. Transductive inference for text classifcation using support vector machines. In: International Conference on Machine Learning (ICML); 1999; 1999.

Jacquemin C. Spotting and discovering terms through natural language processing. Cambridge, MA: MIT Press; 2001.

Friedman C, Kra P, Yu H, Krauthammer M, Rzhetsky A.GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles. Bioinformatics 2001; 17(Suppl 1):S74–82.

Fukuda K, Tamura A, Tsunoda T, Takagi T. Toward information extraction: identifying protein names from biological papers. Pac Symp Biocomput 1998:707–18.

Tanabe L, Wilbur WJ. Tagging gene and protein names in biomedical text. Bioinformatics 2002; 18(8):1124–32.

Proux D, Rechenmann F, Julliard L, Pillet VV, Jacq B. Detecting gene symbols and names in biological texts: a first step toward pertinent information extraction. Genome Inform Ser Workshop Genome Inform 1998; 9:72–80.

Gaizauskas R, Demetriou G, Humphreys K. Term recognition and classification in biological science journal articles. In: 2nd International Conference on Natural Language Processing (NLP- 2000); 2000 June 4; Patras, Greece; 2000. p. 37–44.

Rindflesch TC, Hunter L, Aronson AR. Mining molecular binding terminology from biomedical text. Proc AMIA Symp 1999;1:127–31.

Kazama J, Makino T, Ohta Y, Tsujii J. Tuning support vector machines for biomedical named entity recognition. In: ACL-02. Philadelphia; 2002.


Refbacks

  • There are currently no refbacks.