Combining textual and visual clusters for semantic image retrieval and auto-annotation


Celebi E., Alpkoca A.

2nd European Workshop on the Integration of Knowledge, Semantics and Digital Media Technology, EWIMT 2005, London, United Kingdom, 30 November - 01 December 2005, vol.2005, pp.219-225 identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 2005
  • Doi Number: 10.1049/ic.2005.0735
  • City: London
  • Country: United Kingdom
  • Page Numbers: pp.219-225
  • Keywords: clustering, Image annotation, image retrieval, semantic, C3M
  • Dokuz Eylül University Affiliated: Yes

Abstract

In this paper, we propose a novel strategy at an abstract level by combining textual and visual clustering results to retrieve images using semantic keywords and auto-annotate images based on similarity with existing keywords. Our main hypothesis is that images that fall in to the same textcluster can be described with common visual features of those images. In this approach, images are first clustered according to their text annotations using C3M clustering technique. The images are also segmented into regions and then clustered based on low-level visual features using k-means clustering algorithm on the image regions. The feature vector of the images is then changed to a dimension equal to the number of visual clusters where each entry of the new feature vector signifies the contribution of the image to that visual cluster. Then a matrix is created for each textual cluster, where each row in the matrix is the new feature vector for the image in that textual cluster. A feature vector is also created for the query image and it is then appended to the matrix for each textual cluster and images in the textual cluster that give the highest coupling coefficient are considered for retrieval and annotations of the images in that textual cluster are considered as candidate annotations for the query image. Experiments have demonstrated that good accuracy of proposal and its high potential of use in annotation of images and for improvement of content based image retrieval.