A learning-based method to detect and segment text from scene images

Research paper by Ren-jie Jiang, Fei-hu Qi, Li Xu, Guo-rong Wu, Kai-hua Zhu

Indexed on: 01 Apr '07
Published on: 01 Apr '07
Published in: Journal of Zhejiang University SCIENCE A


This paper proposes a learning-based method for text detection and text segmentation in natural scene images. First, the input image is decomposed into multiple connected components (CCs) by the Niblack clustering algorithm. Then all the CCs, both text and non-text, are verified on their text features by a two-stage classification module: an attentional cascade classifier discards most non-text CCs, and an SVM further verifies the remaining CCs. All accepted CCs are output to form a text-only binary image. Experiments on many images from different scenes showed satisfactory performance of the proposed method.
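
The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the decomposition step resembles the commonly used Niblack local threshold (local mean plus k times local standard deviation, with k = -0.2 chosen here for dark text on a light background). The two verification stages are replaced by hand-written stand-in rules: the function names, the geometric features (area, aspect ratio, bounding-box fill) and all thresholds are hypothetical, since the abstract specifies neither the cascade's features nor the SVM's inputs.

```python
from collections import deque
from math import sqrt


def niblack_binarize(img, radius=2, k=-0.2):
    """Mark a pixel as foreground (1) if it is darker than mean + k * std
    of its local window. radius and k are assumed values, not the paper's."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            win = [img[j][i]
                   for j in range(max(0, y - radius), min(h, y + radius + 1))
                   for i in range(max(0, x - radius), min(w, x + radius + 1))]
            mean = sum(win) / len(win)
            std = sqrt(sum((v - mean) ** 2 for v in win) / len(win))
            out[y][x] = 1 if img[y][x] < mean + k * std else 0
    return out


def connected_components(binary):
    """Group 4-connected foreground pixels; returns a list of (y, x) lists."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue, pixels = deque([(y, x)]), []
                while queue:
                    cy, cx = queue.popleft()
                    pixels.append((cy, cx))
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                                   (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                comps.append(pixels)
    return comps


def bounding_box(pixels):
    """Return (height, width) of the component's bounding box."""
    ys = [p[0] for p in pixels]
    xs = [p[1] for p in pixels]
    return max(ys) - min(ys) + 1, max(xs) - min(xs) + 1


def cascade_accepts(pixels, min_area=3, max_aspect=10.0):
    """Stage 1 stand-in: cheap rules tried in order, rejecting early.
    The real method uses a learned attentional cascade."""
    if len(pixels) < min_area:
        return False
    height, width = bounding_box(pixels)
    return max(height, width) / min(height, width) <= max_aspect


def second_stage_accepts(pixels, min_fill=0.2):
    """Stage 2 stand-in: a fixed fill-ratio rule in place of a trained SVM."""
    height, width = bounding_box(pixels)
    return len(pixels) / (height * width) >= min_fill


def segment_text(img, radius=2, k=-0.2):
    """Binarize, split into CCs, keep only CCs accepted by both stages."""
    out = [[0] * len(img[0]) for _ in img]
    for comp in connected_components(niblack_binarize(img, radius, k)):
        if cascade_accepts(comp) and second_stage_accepts(comp):
            for y, x in comp:
                out[y][x] = 1
    return out
```

The ordering mirrors the idea in the abstract: the first stage is cheap and removes most candidates, so the more expensive second stage only sees the survivors. In a real system both stages would be trained on labeled text and non-text CCs rather than hard-coded.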