Open Access Open Access  Restricted Access Subscription or Fee Access

A Novel Approach for Image Classification using Annotations

Shalini Batra

Abstract


The syntactical exploitation of the data and searching of the images on the Web demands the inclusion of the supplement of some finite structures into the list of standards to enhance the accuracy of the search results. We propose a novel image classification technique, where images tag is used as only means to provide the pertinent information about the image. This tag is used in conjunction with the current Web standards to classify the uploaded images into a finite number of related groups. The categorization process is in two stages: preprocessing stage where data is extracted from the annotation tag of the image and training and classification stage where extracted data is trained into categories and new data is classified based on the trained data set. We have used Random Forest based classifier and IBk for experimentation and result evaluation. Our work relies on simple and extensible keyword-based query language and enables efficient categorization of images.

Keywords


Classification, Annotation, Random Forest, IBK

Full Text:

PDF

References


www.google.com/imghp.

Michael R. Tomkins, ―Lifescape's Picasa aims to be your digital shoebox‖. The Imaging Resource, Published on imaging-resource.com under "Comdex Fall 2002 Show", 2002.

"Google Picasa", Obsessable, 2009.

http://images.search.yahoo.com.

Flickr Web Site, http://www.flickr.com.

Daniel, ―Photo Site a Hit With Bloggers‖. Wired Magazine, 2004.

T. Osman, D. Thakker, G. Schaefer, M. Leroy, A. Fournier, ―Semantic Annotation And Retrieval Of Image Collections‖, The 21st European Conference on Modelling and Simulation, Prague, Czech Republic, pp. 324-329, 2007.

L. Hollink, A. Th. Schreiber, J. Wielemaker, B. Wielinga, ― Semantic Annotation of Image Collections‖, KCAP'03 Workshop on Knowledge Capture and Semantic Annotation, Florida.

M. Jarmasz, S. Szpakowicz, Rogets Thesaurus and Semantic Similarity, Recent Advances in Natural Language Processing (RANLP-03), Borovets, Bulgaria, pp. 212-219, September 10-12, 2003.

C. Fellbaum (ed) WordNet: An Electronic Lexical Database, Bradford Books, 1998.

Marques Oge, Barman Nitish, ―Semi-automatic semantic annotation of images using machine learning techniques‖, International Semantic Web Conference (ISWC), Sanibel Island, FL, 2003 .

N. Elahi, R. Karlsen, S. Akselsen, ― A Context Centric Approach for Semantic Image Annotation and Retrieval‖ Future Computing, Service Computation, Cognitive, Adaptive Content, Patterns, Computation World '09, pp. 665 – 668, 2009.

Eibe Frank, Mark A. Hall, Geoffrey Holmes, Richard Kirkby, and Bernhard Pfahringer. ―WEKA - A machine learning workbench for data mining‖, The Data Mining and Knowledge Discovery Handbook, pp. 1305–1314, Springer, 2005.

G. Fabrizio Sebastiani ―Machine learning in automated text categorization‖ ACM Computing Surveys, 34(1), pp.1–47, 2002.

L. Breiman, ―Random forests‖, Machine Learning, 45, pp. 5–32, 2002.

C. Chen, A. Liaw, L. Breiman, ―Using random forest to learn imbalanced data‖, Technical Report 666, University of California, Berkeley, 2004.

R. R. Korfhage, Information Storage and Retrieval, John Wiley & Sons, 1997.


Refbacks

  • There are currently no refbacks.