Paper ID | TEC-5.6 | ||
Paper Title | A TWO-STAGE AUTOENCODER FOR VISUAL ANOMALY DETECTION | ||
Authors | Yezhou Zhu, Jianzhu Wang, Jing Zhang, Qingyong Li, Beijing Jiaotong University, China | ||
Session | TEC-5: Image and Video Processing 1 | ||
Location | Area G | ||
Session Time: | Monday, 20 September, 13:30 - 15:00 | ||
Presentation Time: | Monday, 20 September, 13:30 - 15:00 | ||
Presentation | Poster | ||
Topic | Image and Video Processing: Formation and reconstruction | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | Deep convolutional autoencoder (DCAE) is usually optimized to minimize the difference between the input and the reconstruction, and the reconstruction error has been widely used as an indicator for visual anomaly detection. However, DCAE sometimes can reconstruct anomalies very well and thus may yield misdetections. To tackle this issue, we propose a novel non-symmetrical DCAE, which is trained in a two-stage manner. Specifically, a single RotNet is first trained to serve as encoder. Then, discriminative representations generated by the frozen encoder are used to train two parallel decoders for image reconstruction. Finally, the reconstruction errors obtained by the two decoders are combined as the anomaly score. Massive experiments on three public datasets and one practical industrial dataset demonstrate the superiority of the proposed method among existing reconstruction based methods. |