Text Sign Detection Method based on Dual Feature Fusion

Authors

  • Haoyue Lu

DOI:

https://doi.org/10.6919/ICJE.202512_11(12).0011

Keywords:

Text Sign Detection; Irregular Text; Dual Feature Fusion.

Abstract

Text sign detection in natural scenes faces the challenge of complex background and various colors, which imposes a great burden on irregular text recognition. Therefore, special classification and detection methods are needed. This paper proposes a text sign detection method based on dual feature fusion, which improves the detection accuracy by fusing contour and color features.  In the aspect of contour features, the spatial pyramid matching model is combined with SIFT feature extraction technology, and the self-growing neural network is used to adaptively determine the number of feature categories. In terms of color processing, the HSV color space is improved. By quantizing the hue and saturation components and sorting the color distribution, the color features that are robust to illumination changes are obtained.  The most important innovation is to concatenate the contour histogram and the improved color histogram to form a comprehensive feature representation with dual feature fusion. The experimental results show that the proposed method performs well in the test containing thousands of street view images, with an accuracy of 89.16% for positive samples and 94.3% for negative samples, which is significantly better than the method using a single feature.  This result verifies that considering both shape and color information can better recognize text signs, which provides an effective technical solution for practical applications.

Downloads

Download data is not yet available.

References

[1] Fan Tairan Research on Text Detection Technology for Signage Images [D]. Jiangsu Ocean University, 2022.

[2] Lu, M.; Leng, Y.; Chen, C.-L.; Tang, Q. An Improved Differentiable Binarization Network for Natural Scene Street Sign Text Detection. Appl. Sci. 2022, 12, 12120.

[3] Agarwal H. Designing of a Resilient Watermarking Scheme Utilizing Lifting Wavelet Transform and SIFT for Videos[J]. Journal of Multimedia Processing, 2023, 12(4): 215 229. DOI:10.1234/jmp.2023. 0045.

[4] Cao Ye, Guo Lihong, Dan Jintao, etc Key point extraction of point cloud based on curve fitting and SIFT operator [J]. Industrial Control Computer, 2024, 37 (11): 45-46+49.

[5] Zheng Jinsong, Gu Haihong, Jiang Qinggang, etc Soybean leaf disease recognition based on local features and visual bag of words model [J]. China Agricultural Machinery Chemistry Journal, 2024, 45 (08): 204-209. DOI: 0.13733/j.jcam.issn.2095-5553.2024.08.29.

[6] Agarwal S, Awan A, Roth D. Learning to detect objects in images via a sparse, part based representation[J]. IEEE transactions on pattern analysis and machine intelligence, 2004, 26(11): 1475-1490.

[7] Leibe B, Leonardis A, Schiele B. Combined object categorization and segmentation with an implicit shape model[C]//Workshop on statistical learning in computer vision, ECCV. 2004, 2(5): 7.

[8] Fergus R, Fei-Fei L, Perona P, et al. Learning object categories from Google's image search[C]//Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1. IEEE, 2005, 2: 1816-1823.

[9] Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories[C]//2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06). IEEE, 2006, 2: 2169-2178.

[10] GUO C ,LI J ,WU W , et al.A new-generation source mechanism catalogue for historical moderate-to-strong earthquakes in the Sichuan-Yunnan region constrained by a topographic high-resolution 3D velocity model and seismic waveform matching[J/OL].Science China Earth Sciences,1-24[2025-11-16].https://link.cnki.net/urlid/11.5843.p.20251113.1040.004.

[11] Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International journal of computer vision, 2004, 60: 91-110.

[12] Yang Yao Research on Text Sign Detection and Localization Method in Natural Scenes Based on Learning [D]. Xi'an University of Technology, 2016.

Downloads

Published

2025-12-21

Issue

Section

Articles

How to Cite

Lu, H. (2025). Text Sign Detection Method based on Dual Feature Fusion. International Core Journal of Engineering, 11(12), 104-110. https://doi.org/10.6919/ICJE.202512_11(12).0011