Paper ID | MLR-APPL-IVASR-6.9 | ||
Paper Title | ENHANCE VIA DECOUPLING: IMPROVING MULTI-LABEL CLASSIFIERS WITH VARIATIONAL FEATURE AUGMENTATION | ||
Authors | Ming Chen, Guijin Wang, Tsinghua University, China; Jing-Hao Xue, University College London, United Kingdom; Zijian Ding, Li Sun, Tsinghua University, China | ||
Session | MLR-APPL-IVASR-6: Machine learning for image and video analysis, synthesis, and retrieval 6 | ||
Location | Area D | ||
Session Time | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation Time | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation | Poster | ||
Topic | Applications of Machine Learning: Machine learning for image & video analysis, synthesis, and retrieval | ||
Abstract | Multi-label classification remains challenging because of inherent label imbalance, which causes modern deep models to over-fit minor categories. To tackle this issue, we propose Variational Feature Augmentation (VFA), a novel method for enhancing deep neural networks for multi-label classification. Our method decouples the feature vectors extracted by the backbone network into multiple low-dimensional spaces via a newly proposed Variational Feature Decoupling Module. The decoupled feature vectors are then recombined with a shuffle operation and a Feature Augmentation Layer to enrich the minor co-occurrence relations, mitigating the label imbalance. Unlike most other methods, VFA neither modifies the network architecture nor introduces extra computation cost in the inference phase. We conduct comprehensive experiments on four benchmarks across two visual multi-label classification tasks, pedestrian attribute recognition and multi-label image recognition, and the results demonstrate the effectiveness and generality of the proposed VFA. |
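
The abstract describes splitting backbone features into several low-dimensional parts and recombining them with a shuffle. The sketch below illustrates one possible reading of that recombination step only; it is not the authors' implementation. The function name `shuffle_recombine`, the equal-width split, and the independent per-group permutation across the batch are all assumptions, and the Variational Feature Decoupling Module and Feature Augmentation Layer are omitted because the abstract does not specify them.

```python
import numpy as np


def shuffle_recombine(features, num_groups, rng):
    """Hypothetical helper: split features into groups and shuffle each group across the batch.

    features: array of shape (batch, dim), standing in for backbone feature vectors.
    num_groups: number of equal-width low-dimensional parts (an assumption).
    rng: a numpy.random.Generator used to draw the permutations.
    """
    batch, dim = features.shape
    if dim % num_groups != 0:
        raise ValueError("feature dimension must be divisible by num_groups")
    # Split along the feature axis into num_groups chunks of equal width.
    chunks = np.split(features, num_groups, axis=1)
    # Permute each chunk independently across the batch, so a recombined vector
    # can mix parts that originally came from different samples.
    mixed = [chunk[rng.permutation(batch)] for chunk in chunks]
    # Concatenate the chunks back into full-width feature vectors.
    return np.concatenate(mixed, axis=1)


rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 6))  # toy batch: 4 samples, 6-dimensional features
augmented = shuffle_recombine(feats, num_groups=3, rng=rng)
```

In this toy version the output has the same shape as the input, and each feature column keeps the same set of values, only reassigned to different samples. How labels are assigned to the recombined vectors is left open here, since the abstract does not say.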