Visible to the public Multi-view Collective Tensor Decomposition for Cross-modal Hashing

TitleMulti-view Collective Tensor Decomposition for Cross-modal Hashing
Publication TypeConference Paper
Year of Publication2018
AuthorsCui, Limeng, Chen, Zhensong, Zhang, Jiawei, He, Lifang, Shi, Yong, Yu, Philip S.
Conference NameProceedings of the 2018 ACM on International Conference on Multimedia Retrieval
Date PublishedJune 2018
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-5046-4
Keywordscross-modal hashing, metric learning, multi-view learning, pubcrawl, resilience, Resiliency, Scalability, Tensor Factorization, work factor metrics

Multimedia data available in various disciplines are usually heterogeneous, containing representations in multi-views, where the cross-modal search techniques become necessary and useful. It is a challenging problem due to the heterogeneity of data with multiple modalities, multi-views in each modality and the diverse data categories. In this paper, we propose a novel multi-view cross-modal hashing method named Multi-view Collective Tensor Decomposition (MCTD) to fuse these data effectively, which can exploit the complementary feature extracted from multi-modality multi-view while simultaneously discovering multiple separated subspaces by leveraging the data categories as supervision information. Our contributions are summarized as follows: 1) we exploit tensor modeling to get better representation of the complementary features and redefine a latent representation space; 2) a block-diagonal loss is proposed to explicitly pursue a more discriminative latent tensor space by exploring supervision information; 3) we propose a new feature projection method to characterize the data and to generate the latent representation for incoming new queries. An optimization algorithm is proposed to solve the objective function designed for MCTD, which works under an iterative updating procedure. Experimental results prove the state-of-the-art precision of MCTD compared with competing methods.

Citation Keycui_multi-view_2018