Multimodal Kernel learning for image retrieval

Yen-Yu Lin*, Chiou Shann Fuh

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

We propose a semi-supervised learning technique to address the problem of fusing multimodal information sources for CBIR. In our approach, user's preferences in the form of reference feedback are treated as labeled data, and the key idea is to devise an on-line scheme to effectively transform the abstract semantics into useful training data for improving the query performance. Specifically, our method can be characterized with the following three advantages: 1) Kernel matrices are used to encode each modality of information so that the fusion can be conveniently carried out via boosting; 2) The base kernel matrices are derived from eigendecomposing the graph Laplacian, and further refined to satisfy a pivotal monotone property that ensures intrinsic structure will be reasonably maintained for each modality; 3) The adopted optimization criterion in boosting is to align with a target kernel matrix accounting for relevance feedback, and the learned multimodal kernel matrix can be used for training, and then for testing with those unlabeled ones in the database. To demonstrate the efficiency of the proposed framework, experimental results on CBIR are provided to illustrate several practical considerations.

Original languageEnglish
Title of host publication2010 International Conference on System Science and Engineering, ICSSE 2010
Pages155-160
Number of pages6
DOIs
StatePublished - 11 Oct 2010
Event2010 International Conference on System Science and Engineering, ICSSE 2010 - Taipei, Taiwan
Duration: 1 Jul 20103 Jul 2010

Publication series

Name2010 International Conference on System Science and Engineering, ICSSE 2010

Conference

Conference2010 International Conference on System Science and Engineering, ICSSE 2010
CountryTaiwan
CityTaipei
Period1/07/103/07/10

Keywords

  • Boosting
  • Image retrieval
  • Kernel fusion

Fingerprint Dive into the research topics of 'Multimodal Kernel learning for image retrieval'. Together they form a unique fingerprint.

  • Cite this

    Lin, Y-Y., & Fuh, C. S. (2010). Multimodal Kernel learning for image retrieval. In 2010 International Conference on System Science and Engineering, ICSSE 2010 (pp. 155-160). [5551790] (2010 International Conference on System Science and Engineering, ICSSE 2010). https://doi.org/10.1109/ICSSE.2010.5551790