UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation

Huang, Jiarong; Peng, Dezhi; Li, Hongliang; Ni, Hao; Jin, Lianwen; (2023) SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation. In: Fink, Gernot A and Jain, Rajiv and Kise, Koichi and Zanibbi, Richard, (eds.) Document Analysis and Recognition - ICDAR 2023. (pp. pp. 332-349). Springer: Cham, Switzerland. Green open access

[thumbnail of 2023_ICDAR_paper_8156.pdf]
Preview
Text
2023_ICDAR_paper_8156.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Handwritten Chinese text recognition (HCTR) is still a challenging and unsolved problem. The existing recognition methods are mainly categorized into two: explicit vs implicit segmentation-based methods. Explicit segmentation recognition methods use explicit character location information to train the recognizers. However, the widely used weakly supervised training strategy based on pseudo-label makes it difficult to get effective supervised training for difficult character samples. In contrast, the implicit segmentation recognition method use all transcript annotations for supervised training, but it is prone to misalignment problem due to the lack of explicit supervised information of character positions. To take advantage of the complementary nature of explicit and implicit segmentation approaches, we propose a new method, SegCTC, which better integrates these two approaches into a unified to be a more powerful recognizer. Specifically, we designed a hybrid Segmentation-based and Segmentation-free Feature Fusion Module (S FFM) to better fuse the features of both explicit and implicit segmentation-based recognition branches. Moreover, a co-transcription strategy is also proposed to better combine the predictions from different branches. Experiments on four widely used benchmarks including CASIA-HWDB, ICDAR2013, SCUT-HCCDoc and MTHv2 show that our method achieves state-of-the-art performance for the HCTR task under different scenarios.

Type: Proceedings paper
Title: SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation
Event: The 17th International Conference on Document Analysis and Recognition
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-031-41685-9_21
Publisher version: https://doi.org/10.1007/978-3-031-41685-9_21
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Handwritten Chinese text recognition, Branch feature fusion, Co-transcription
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Mathematics
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10174245
Downloads since deposit
299Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item