Automatic Feature Extraction and Text Recognition From Scanned Topographic Maps

Pezeshk Aria<sup>*</sup>; Tutwiler Richard L

doi:10.1109/TGRS.2011.2157697

Abstract

A system for automatic extraction of various feature layers and recognition of the text content of scanned topographic maps is presented. Linear features, which often intersect the text, are first extracted using a novel line representation method and a set of directional morphological operations. Other graphical objects are then removed in several stages to obtain a text-only image. A custom defect model is subsequently used to create an artificial training set for a Hidden Markov Model-based character recognition engine. Finally, the recovered text is recognized using this multifont, segmentation-free optical character recognition (OCR) engine. Extensive testing is conducted to assess the performance of the different stages of the proposed system. Furthermore, the custom OCR is shown to achieve a 94% recognition rate for the extracted text, thereby outperforming a commercial OCR engine used as a benchmark.
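The idea of separating linear features from text with directional morphological operations can be illustrated with a minimal sketch. This is not the paper's line representation method, which the abstract does not detail; it is a generic binary opening with straight-line structuring elements in four directions, and the names `directional_opening`, `extract_lines`, and the toy `image` are hypothetical. Long straight runs survive the opening and are treated as line features; short strokes do not and remain in the text-only residue.

```python
# Illustrative sketch only: a generic directional opening, not the paper's method.
# Directions as (row step, column step): horizontal, vertical, two diagonals.
DIRECTIONS = {"h": (0, 1), "v": (1, 0), "d1": (1, 1), "d2": (1, -1)}


def directional_opening(img, direction, length):
    """Binary opening of img by a straight segment of `length` pixels.

    A pixel is kept only if it lies on some run of `length` foreground
    pixels along the given direction (union of all fitting segments).
    """
    rows, cols = len(img), len(img[0])
    dr, dc = DIRECTIONS[direction]
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            cells = [(r + i * dr, c + i * dc) for i in range(length)]
            fits = all(
                0 <= y < rows and 0 <= x < cols and img[y][x] == 1
                for y, x in cells
            )
            if fits:
                for y, x in cells:
                    out[y][x] = 1
    return out


def extract_lines(img, length):
    """Union of the openings over all directions: keeps long straight runs."""
    rows, cols = len(img), len(img[0])
    out = [[0] * cols for _ in range(rows)]
    for d in DIRECTIONS:
        opened = directional_opening(img, d, length)
        for r in range(rows):
            for c in range(cols):
                out[r][c] |= opened[r][c]
    return out


# Toy binary map: one long horizontal line and a small blob standing in for text.
image = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0, 0],
]
lines = extract_lines(image, length=5)
# Remove the detected line pixels to leave a text-only residue.
text_only = [[a & (1 - b) for a, b in zip(ra, rb)] for ra, rb in zip(image, lines)]
```

In this toy case the structuring-element length (5 pixels, an assumed value) is longer than any stroke of the blob, so only the horizontal line is captured. On real maps, text that touches a line would need additional handling, which the abstract attributes to the later removal stages.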

Publication date: 2011-12

