Extraction of Text Touching Graphics using SURF

Sheraz Ahmed, Marcus Liwicki, Andreas Dengel

In: 10th IAPR International Workshop on Document Analysis Systems (DAS-2012), March 27-29, Gold Coast, Queensland, Australia, pages 349-353, IEEE, 2012.


In this paper, we propose a novel part-based method for extracting text that touches graphic components. Speeded Up Robust Features (SURF) are used to localize the text components and distinguish them from graphics. We introduce several post-processing steps to obtain the final text detection. We have tested our method on a publicly available data set of architectural floor plans and on real geographical maps. On the floor plans, we located more than 95% of the text components that had not previously been identified as text because they were touching graphic components.

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence