Huang, Jiarong;
Peng, Dezhi;
Li, Hongliang;
Ni, Hao;
Jin, Lianwen;
(2023)
SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation.
In: Fink, Gernot A and Jain, Rajiv and Kise, Koichi and Zanibbi, Richard, (eds.)
Document Analysis and Recognition - ICDAR 2023.
(pp. pp. 332-349).
Springer: Cham, Switzerland.
Preview |
Text
2023_ICDAR_paper_8156.pdf - Accepted Version Download (1MB) | Preview |
Abstract
Handwritten Chinese text recognition (HCTR) is still a challenging and unsolved problem. The existing recognition methods are mainly categorized into two: explicit vs implicit segmentation-based methods. Explicit segmentation recognition methods use explicit character location information to train the recognizers. However, the widely used weakly supervised training strategy based on pseudo-label makes it difficult to get effective supervised training for difficult character samples. In contrast, the implicit segmentation recognition method use all transcript annotations for supervised training, but it is prone to misalignment problem due to the lack of explicit supervised information of character positions. To take advantage of the complementary nature of explicit and implicit segmentation approaches, we propose a new method, SegCTC, which better integrates these two approaches into a unified to be a more powerful recognizer. Specifically, we designed a hybrid Segmentation-based and Segmentation-free Feature Fusion Module (S FFM) to better fuse the features of both explicit and implicit segmentation-based recognition branches. Moreover, a co-transcription strategy is also proposed to better combine the predictions from different branches. Experiments on four widely used benchmarks including CASIA-HWDB, ICDAR2013, SCUT-HCCDoc and MTHv2 show that our method achieves state-of-the-art performance for the HCTR task under different scenarios.
Type: | Proceedings paper |
---|---|
Title: | SegCTC: Offline Handwritten Chinese Text Recognition via Better Fusion between Explicit and Implicit Segmentation |
Event: | The 17th International Conference on Document Analysis and Recognition |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1007/978-3-031-41685-9_21 |
Publisher version: | https://doi.org/10.1007/978-3-031-41685-9_21 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Handwritten Chinese text recognition, Branch feature fusion, Co-transcription |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Mathematics |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10174245 |
Archive Staff Only
![]() |
View Item |