Cross Language Representation for Commercial Web Applications in Context of Indian Languages using Phonetic Model

M.L. Dhore, S.K. Dixit, J.B. Karande

Abstract


This paper addresses how to give users access to web applications in their native language. English remains ubiquitous as the mode of communication in higher education, the judiciary, the bureaucracy, and the corporate sector. Most commercial web-based online applications, such as Internet banking and online shopping, have their input and output user interfaces in English. This paper proposes an approach by which a user can enter data in their native language and view output reports in that language, at the small computational cost required for transliteration. The authors describe how to enter data and retrieve information in Marathi, Hindi, and Gujarati using transliteration based on the phonetic basis of Indian languages. The focus is on transliterating input in multiple languages into a common representation using an intermediate phonetic code, while maintaining the master database in English, the language of world business.
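As a rough illustration of what a common phonetic representation might look like, the following minimal Python sketch maps Devanagari (used for Hindi and Marathi) and Gujarati text to one Latin-letter code. It is not the authors' scheme. It assumes only that the two Unicode blocks place corresponding letters at the same offset from the block base, and the code table, the function names `block_offset` and `to_phonetic`, and the choice to keep every inherent vowel are hypothetical simplifications chosen for this example.

```python
# Illustrative sketch only: a tiny, hypothetical phonetic code table.
# Devanagari (U+0900) and Gujarati (U+0A80) place corresponding letters
# at the same offset from the block base, so one offset table serves both.
BLOCK_BASES = {"devanagari": 0x0900, "gujarati": 0x0A80}

# Offsets within a block -> assumed Latin phonetic code (partial table).
CONSONANTS = {0x15: "k", 0x17: "g", 0x1C: "j", 0x24: "t", 0x26: "d",
              0x28: "n", 0x2A: "p", 0x2C: "b", 0x2E: "m", 0x2F: "y",
              0x30: "r", 0x32: "l", 0x35: "v", 0x38: "s", 0x39: "h"}
INDEPENDENT_VOWELS = {0x05: "a", 0x06: "aa", 0x07: "i", 0x08: "ii",
                      0x09: "u", 0x0A: "uu", 0x0F: "e", 0x13: "o"}
VOWEL_SIGNS = {0x3E: "aa", 0x3F: "i", 0x40: "ii", 0x41: "u",
               0x42: "uu", 0x47: "e", 0x4B: "o"}
VIRAMA = 0x4D  # suppresses the inherent vowel of the preceding consonant


def block_offset(ch):
    """Return the offset of ch inside a supported block, or None."""
    cp = ord(ch)
    for base in BLOCK_BASES.values():
        if base <= cp < base + 0x80:
            return cp - base
    return None


def to_phonetic(text):
    """Convert text to the shared phonetic code, keeping inherent vowels."""
    out = []
    pending_schwa = False  # True right after a consonant
    for ch in text:
        off = block_offset(ch)
        if pending_schwa and off in VOWEL_SIGNS:
            # A vowel sign replaces the consonant's inherent "a".
            out.append(VOWEL_SIGNS[off])
            pending_schwa = False
            continue
        if pending_schwa and off == VIRAMA:
            # Virama: the consonant carries no vowel at all.
            pending_schwa = False
            continue
        if pending_schwa:
            out.append("a")
            pending_schwa = False
        if off in CONSONANTS:
            out.append(CONSONANTS[off])
            pending_schwa = True
        elif off in INDEPENDENT_VOWELS:
            out.append(INDEPENDENT_VOWELS[off])
        else:
            out.append(ch)  # unmapped characters pass through unchanged
    if pending_schwa:
        out.append("a")
    return "".join(out)


hindi_name = "\u0928\u093e\u092e"     # "naam" written in Devanagari
gujarati_name = "\u0aa8\u0abe\u0aae"  # the same word written in Gujarati
print(to_phonetic(hindi_name), to_phonetic(gujarati_name))
```

Because both spellings reduce to the same code, a single key of this kind could in principle be matched against English-language master records regardless of the input script. A practical system would need a complete table and rules for conjuncts, nasalization, and schwa deletion, none of which this sketch attempts.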

Keywords


Phonemics, Phonetic Model, Transliteration, Multilingual Dictionary
