Paper ID | SS-IVC-DL.4 | ||
Paper Title | RATE-DISTORTION-OPTIMIZATION FOR DEEP IMAGE COMPRESSION | ||
Authors | Michael Schaefer, Sophie Pientka, Jonathan Pfaff, Heiko Schwarz, Detlev Marpe, Thomas Wiegand, Fraunhofer Heinrich-Hertz-Institute, Germany | ||
Session | SS-IVC-DL: Special Session: Optimized Image and Video Coding Using Deep Learning | ||
Location | Area B | ||
Session Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation | Poster | ||
Topic | Special Sessions: Optimized image and video coding schemes using deep learning | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | Given the capabilities of massive GPU hardware, there has been a surge of using artificial neural networks (ANN) for still image compression. These compression systems usually consist of convolutional layers and can be considered as non-linear transform coding. Notably, these ANNs are based on an end-to-end approach where the encoder determines a compressed version of the image as features. In contrast to this, existing image and video codecs employ a block-based architecture with signal-dependent encoder optimizations. A basic requirement for designing such optimizations is estimating the impact of the quantization error on the resulting bitrate and distortion. As for non-linear, multi-layered neural networks, this is a difficult problem. This paper presents a performant auto-encoder architecture for still image compression, which represents the compressed features at multiple scales. Then, we demonstrate how an algorithm, which tests multiple feature candidates, can reduce the Lagrangian cost and optimize compression efficiency. The algorithm avoids multiple network executions by pre-estimating the impact of the quantization on the distortion by a higher-order polynomial. |