Login Paper Search My Schedule Paper Index Help

My ICIP 2021 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.
  1. Create a login based on your email (takes less than one minute)
  2. Perform 'Paper Search'
  3. Select papers that you desire to save in your personalized schedule
  4. Click on 'My Schedule' to see the current list of selected papers
  5. Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)

Paper Detail

Paper IDARS-10.11
Paper Title SEMANTIC PRESERVING GENERATIVE ADVERSARIAL NETWORK FOR CROSS-MODAL HASHING
Authors Fei Wu, Xiaokai Luo, Qinghua Huang, Pengfei Wei, Ying Sun, Nanjing University of Posts and Telecommunications, China; Xiwei Dong, Jiujiang University, China; Zhiyong Wu, Nanjing University of Posts and Telecommunications, China
SessionARS-10: Image and Video Analysis and Synthesis
LocationArea H
Session Time:Monday, 20 September, 15:30 - 17:00
Presentation Time:Monday, 20 September, 15:30 - 17:00
Presentation Poster
Topic Image and Video Analysis, Synthesis, and Retrieval: Image & Video Storage and Retrieval
IEEE Xplore Open Preview  Click here to view in IEEE Xplore
Abstract Cross-modal hashing has achieved significant progress in recent years. However, how to effectively learn more discriminative hash codes of each modality and simultaneous alleviate the loss of modality information is still a challenging problem. Focusing on this problem, in this paper, we propose a novel cross-modal hashing approach named Semantic Preserving Generative Adversarial Network (SPGAN). The overall network architecture consists of two sub-networks, i.e., a semantic preserving generative adversarial network module and a discriminative hashing module. The generator maps text features into the image feature space. And the discriminator judges whether the feature representations are real image features or generated image features. The adversarial learning process can effectively reduce modality difference and preserve information of the image modality as much as possible. The discriminative hashing module projects the real and generated image features into a Hamming space to obtain hash codes, and explores semantic similarities for enhancing the discriminant ability of hash codes. Experiments on two widely used datasets demonstrate that SPGAN can outperform state-of-the-art related works.