Paper • Open access

A Neighbourhood Encoding Framework for Deep Mining Heterogeneous Texts in Recipe-image Retrieval


Published under licence by IOP Publishing Ltd
Citation: Changsheng Zhu et al 2021 J. Phys.: Conf. Ser. 1813 012029. DOI: 10.1088/1742-6596/1813/1/012029


Abstract

Cross-modal retrieval typically bridges the semantic gap between modalities by mapping them into a shared subspace. However, existing methods rarely consider that data within a single modality may itself be heterogeneous when multimodal data are mapped into the shared subspace. In addition, most existing methods focus on semantic associations between modalities, while few consider the semantic associations within a single modality. To address these two deficiencies, we propose a Neighbourhood Encoding (NE) framework that mines the semantic associations among data of the same modality and alleviates data heterogeneity by improving the semantic representation of each modality. To verify the effectiveness of the proposed framework, we instantiate it with two types of recurrent neural networks. Experiments show that the instantiated approaches outperform existing state-of-the-art methods in both text-to-image and image-to-text retrieval.
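To make the idea concrete, the sketch below illustrates one plausible reading of the abstract: each item's embedding is enriched with its intra-modal neighbours through a recurrent network before text and image are projected into a shared space trained with a standard bidirectional ranking loss. This is a minimal illustration under assumptions; the paper's actual feature extractors, neighbourhood construction, fusion scheme, and loss are not specified in the abstract, and all module names and hyperparameters here are hypothetical.

import torch
import torch.nn as nn
import torch.nn.functional as F

class NeighbourhoodEncoder(nn.Module):
    """Hypothetical sketch: enrich an item's embedding with its k intra-modal
    neighbours by running them through a GRU and fusing the final hidden state
    with the item's own embedding (one way to "encode the neighbourhood")."""
    def __init__(self, dim: int):
        super().__init__()
        self.gru = nn.GRU(dim, dim, batch_first=True)
        self.fuse = nn.Linear(2 * dim, dim)

    def forward(self, item: torch.Tensor, neighbours: torch.Tensor) -> torch.Tensor:
        # item:       (batch, dim)    embedding of the query item
        # neighbours: (batch, k, dim) embeddings of its k same-modality neighbours
        _, h = self.gru(neighbours)                     # h: (1, batch, dim)
        fused = torch.cat([item, h.squeeze(0)], dim=-1)
        return F.normalize(self.fuse(fused), dim=-1)    # unit-norm shared-space vector

def triplet_ranking_loss(txt: torch.Tensor, img: torch.Tensor, margin: float = 0.2) -> torch.Tensor:
    """Standard bidirectional max-margin loss over a batch of paired embeddings
    (a common choice for cross-modal retrieval; not necessarily the paper's loss)."""
    sim = txt @ img.t()                                 # (batch, batch) similarities
    pos = sim.diag().unsqueeze(1)                       # matched-pair similarities
    cost_t2i = (margin + sim - pos).clamp(min=0)        # text-to-image violations
    cost_i2t = (margin + sim - pos.t()).clamp(min=0)    # image-to-text violations
    mask = torch.eye(sim.size(0), dtype=torch.bool)
    return cost_t2i.masked_fill(mask, 0).mean() + cost_i2t.masked_fill(mask, 0).mean()

if __name__ == "__main__":
    dim, k, batch = 128, 5, 8
    text_ne, image_ne = NeighbourhoodEncoder(dim), NeighbourhoodEncoder(dim)
    txt, txt_nb = torch.randn(batch, dim), torch.randn(batch, k, dim)   # dummy text features
    img, img_nb = torch.randn(batch, dim), torch.randn(batch, k, dim)   # dummy image features
    loss = triplet_ranking_loss(text_ne(txt, txt_nb), image_ne(img, img_nb))
    print(loss.item())

In practice the neighbours would be retrieved within each modality (e.g. by nearest-neighbour search over precomputed features) before being encoded, so that the shared-space projection of each item also reflects its intra-modal semantic context.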


Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
