Automatic detection and recognition of Korean text in outdoor signboard images

作者:Park Jonghyun*; Lee Gueesang; Kim Euichul; Lim Junsik; Kim Soohyung; Yang Hyungjeong; Lee Myunghun; Hwang Seongtaek
来源:Pattern Recognition Letters, 2010, 31(12): 1728-1739.
DOI:10.1016/j.patrec.2010.05.024

摘要

In this paper, an automatic translation system for Korean signboard images is described. The system includes detection and extraction of text for the recognition and translation of shop names into English. It deals with impediments caused by different font styles and font sizes, as well as illumination changes and noise effects. Firstly, the text region is extracted by an edge-histogram, and the text is binarized by clustering. Secondly, the extracted text is divided into individual characters, which are recognized by using a minimum distance classifier. A shape-based statistical feature is adopted, which is adequate for Korean character recognition, and candidates of the recognition results are generated for each character. The final translation step incorporates the database of shop names, to obtain the most probable result from the list of candidates. The system has been implemented in a mobile phone and is demonstrated to show acceptable performance.

  • 出版日期2010-9-1