Paper ID | MLR-APPL-IP-7.9 | ||
Paper Title | DEFORMATION-INVARIANT NETWORKS FOR HANDWRITTEN TEXT RECOGNITION | ||
Authors | George Retsinas, National Technical University of Athens, Greece; Giorgos Sfikas, Christophoros Nikou, University of Ioannina, Greece; Petros Maragos, National Technical University of Athens, Greece | ||
Session | MLR-APPL-IP-7: Machine learning for image processing 7 | ||
Location | Area E | ||
Session Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation Time: | Wednesday, 22 September, 08:00 - 09:30 | ||
Presentation | Poster | ||
Topic | Applications of Machine Learning: Machine learning for image processing | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | Image deformations under simple geometric restrictions are crucial for Handwriting Text Recognition (HTR), since different writing styles can be viewed as simple geometrical deformations of the same textual elements. In this respect, the usefulness of including deformation invariance to an HTR system is indisputable. We explore different existing strategies for ensuring deformation invariance, including spatial transformers and deformable convolutions, under the context of text recognition, as well as introduce a new deformation-based algorithm, inspired by adversarial learning, which aims to reduce character output uncertainty during evaluation time. The resulting HTR system is shown to achieve state-of-the-art performance on the IAM and RIMES datasets. |