UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers

Huang, Liufeng; Chen, Bangdong; Liu, Chongyu; Zhou, Weiying; Peng, Dezhi; Wu, Yaqiang; Li, Hui; ... Jin, Lianwen; + view all (2023) EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers. In: Fink, Gernot A and Jain, Rajiv and Kise, Koichi and Zanibbi, Richard, (eds.) Proceedings of the 17th International Conference on Document Analysis and Recognition. (pp. pp. 470-485). Springer: Cham, Switzerland. Green open access

[thumbnail of 2023_ICDAR_paper_8652.pdf]
Preview
Text
2023_ICDAR_paper_8652.pdf - Accepted Version

Download (11MB) | Preview

Abstract

Handwritten text erasure on examination papers is an important new research topic with high practical value due to its ability to restore examination papers and collect questions that are answered incorrectly for review, thereby improving educational efficiency. However, to the best of our knowledge, there is no publicly available dataset for handwritten text erasure on examination papers. To facilitate the development of this field, we build a real-world dataset called SCUT-EnsExam (short for EnsExam). The dataset consists of 545 examination paper images, each of which has been carefully annotated to provide a visually reasonable erasure target. With EnsExam, we propose an end-to-end model, which introduces a soft stroke mask to erase the handwritten text precisely. Furthermore, we propose a simple yet effective loss called stroke normalization (SN) loss to alleviate the imbalance between text and non-text regions. Extensive numerical experiments shows that our proposed method outperforms previous state-of-the-art methods on EnsExam. In addition, quantitative experiments on scene text removal benchmark, SCUT-EnsText, demonstrate the generalizability of our method. The EnsExam will be made available at https://github.com/SCUT-DLVCLab/SCUT-EnsExam.

Type: Proceedings paper
Title: EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers
Event: The 17th International Conference on Document Analysis and Recognition
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-031-41682-8_29
Publisher version: https://doi.org/10.1007/978-3-031-41682-8_29
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Examination papers restoration · Handwritten text erasure · Generative adversarial network · Dense erasure.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Mathematics
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10174246
Downloads since deposit
341Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item