Paper ID | MLR-APPL-IVASR-6.5 | ||
Paper Title | TWO-STAGE SEAMLESS TEXT ERASING ON REAL-WORLD SCENE IMAGES | ||
Authors | Benjamin Conrad, University of Amsterdam, Netherlands; Pei-I Chen, Jumio AI Labs, Canada | ||
Session | MLR-APPL-IVASR-6: Machine learning for image and video analysis, synthesis, and retrieval 6 | ||
Location | Area D | ||
Session Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation | Poster | ||
Topic | Applications of Machine Learning: Machine learning for image & video analysis, synthesis, and retrieval | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | Erasing text from images is a common image-editing task in film industry and shared media. Existing text-erasing models either tend to produce artifacts or fail to remove all the text in real-world images. In this paper, we follow a two-stage text erasing framework that first masks the text by segmentation, and then inpaints the masked region to create a text-erased image. Our proposed text mask generator is designed to accurately cover text, which combined with inpainting, can produce reliable text-erased results. In the inpainting model, we propose a Multiscale Gradient Reconstruction Loss to generate sharp realistic-looking images. Our model achieves state-of-the-art results on both synthetic and real world data in both quantitative and qualitative measures. |