IEEE ICIP 2021 || Anchorage, Alaska, USA || 19-22 September 2021

My ICIP 2021 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.

Create a login based on your email (takes less than one minute)
Perform 'Paper Search'
Select papers that you desire to save in your personalized schedule
Click on 'My Schedule' to see the current list of selected papers
Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)

Paper Detail

Paper ID

ARS-7.10

Paper Title

LISTEN TO THE PIXELS

Authors

Sanjoy Chowdhury, IIIT, Hyderabad, India; Subhrajyoti Dasgupta, Sudip Das, Ujjwal Bhattacharya, Indian Statistical Institute, Kolkata, India

Session

ARS-7: Image and Video Interpretation and Understanding 2

Location

Area H

Session Time:

Wednesday, 22 September, 08:00 - 09:30

Presentation Time:

Wednesday, 22 September, 08:00 - 09:30

Presentation

Poster

Topic

Image and Video Analysis, Synthesis, and Retrieval: Image & Video Interpretation and Understanding

IEEE Xplore Open Preview

Click here to view in IEEE Xplore

Abstract

Performing sound source separation and visual object segmentation jointly in naturally occurring videos is a notoriously difficult task, especially in the absence of annotated data. In this study, we leverage the concurrency between audio and visual modalities in an attempt to solve the joint audio-visual segmentation problem in a self-supervised manner. Human beings interact with the physical world through a few sensory systems such as vision, auditory, movement, etc. The usefulness of the interplay of such systems lies in the concept of degeneracy. It tells us that the cross-modal signals can educate each other without the presence of an external supervisor. In this work, we efficiently exploit this fact that learning from one modality inherently helps to find patterns in others by introducing a novel audio-visual fusion technique. Also, to the best of our knowledge, we are the first to address the partially occluded sound source segmentation task. Our study shows that the proposed model significantly outperforms existing state-of-the-art methods in both visual and audio source separation tasks.

2021 IEEE International Conference on Image Processing

19-22 September 2021 • Anchorage, Alaska, USA

Imaging Without Borders

2021 IEEE International Conference on Image Processing

19-22 September 2021 • Anchorage, Alaska, USA

My ICIP 2021 Schedule

Paper Detail