Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems

José Costa Pereira; Vasconcelos,N

Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems

dc.contributor.author	José Costa Pereira	en
dc.contributor.author	Vasconcelos,N	en
dc.date.accessioned	2018-01-19T17:31:57Z
dc.date.available	2018-01-19T17:31:57Z
dc.date.issued	2014	en
dc.description.abstract	In query-by-semantic-example image retrieval, images are ranked by similarity of semantic descriptors. These descriptors are obtained by classifying each image with respect to a pre-defined vocabulary of semantic concepts. In this work, we consider the problem of improving the accuracy of semantic descriptors through cross-modal regularization, based on auxiliary text. A cross-modal regularizer, composed of three steps, is proposed. Training images and text are first mapped to a common semantic space. A regularization operator is then learned for each concept in the semantic vocabulary. This is an operator which maps the semantic descriptors of images labeled with that concept to the descriptors of the associated texts. A convex formulation of the learning problem is introduced, enabling the efficient computation of concept-specific regularization operators. The third step is the selection of the most suitable operator for the image to regularize. This is implemented through a quantization of the semantic space, where a regularization operator is associated with each quantization cell. Overall, the proposed regularizer is a non-linear mapping, implemented as a piecewise linear transformation of the semantic image descriptors to regularize. This transformation is a form of cross-modal domain adaptation. It is shown to achieve better performance than recent proposals in the domain adaptation literature, while requiring much simpler optimization.	en
dc.identifier.uri	http://repositorio.inesctec.pt/handle/123456789/7130
dc.identifier.uri	http://dx.doi.org/10.1016/j.cviu.2014.03.003	en
dc.language	eng	en
dc.relation	4529	en
dc.rights	info:eu-repo/semantics/embargoedAccess	en
dc.title	Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems	en
dc.type	article	en
dc.type	Publication	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: P-00M-X2S.pdf
Size:: 2.68 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Non INESC TEC publications - Indexed Articles in Journals