HMM word graph based keyword spotting in handwritten document images

Toselli Alejandro Hector<sup>*</sup>; Vidal Enrique; Romero Veronica; Frinken Volkmar

doi:10.1016/j.ins.2016.07.063

摘要

Line-level keyword spotting (KWS) is presented on the basis of frame-level word posterior probabilities. These posteriors are obtained using word graphs derived from the recognition process of a full-fledged handwritten text recognizer based on hidden Markov models and N-gram language models. This approach has several advantages. First, since it uses a holistic, segmentation-free technology, it does not require any kind of word or character segmentation. Second, the use of language models allows the context of each spotted word to be taken into account, thereby considerably increasing KWS accuracy. And third, the proposed KWS scores are based on true posterior probabilities, taking into account all (or most) possible word segmentations of the input image. These scores are properly bounded and normalized. This mathematically clean formulation lends itself to smooth, threshold-based keyword queries which, in turn, permit comfortable trade-offs between search precision and recall. Experiments are carried out on several historic collections of handwritten text images, as well as a well-known data set of modern English handwritten text. According to the empirical results, the proposed approach achieves KWS results comparable to those obtained with the recently-introduced "BLSTM neural networks KWS" approach and clearly outperform the popular, state-of-the-art "Filler HMM" KWS method. Overall, the results clearly support all the above-claimed advantages of the proposed approach.

出版日期2016-11-20

全文

访问全文

收藏分享被引(46) 浏览

更新时间：2024-05-02 02:13

HMM word graph based keyword spotting in handwritten document images

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友