Publication
IAMonDo-Database: an Online Handwritten Document Database with Non-Uniform Contents
Emanuel Indermühle; Marcus Liwicki; Horst Bunke
In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems. IAPR International Workshop on Document Analysis Systems (DAS-10), June 9-11, Boston, MA, USA, Pages 97-104, 2010.
Abstract
In this paper we present a new database of online handwritten
documents with different contents such as text, drawings,
diagrams, formulas, tables, lists, and markings. It was
designed to serve as a standard dataset for the development,
training, testing and comparison of methods in the field of
handwritten document analysis. The database can serve as
a basis for layout analysis and for various segmentation and
recognition tasks that use either online or only offline information.
It comprises 1,000 documents produced by approximately
200 writers, with a total of 329,849 online strokes. Few
constraints were imposed on the writers when creating the
documents. Nonetheless, the database has a stable distribution
of the different content types. A software tool was
developed to allow easy access to the documents which are
stored in InkML. In this paper we also present two experiments
which show the challenge this database poses. They
may serve as references for further research in this area.
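
Since the documents are stored in InkML, a reader may find it useful to see how online strokes can be read from such a file. The following is a minimal sketch, not the software tool mentioned in the abstract. It assumes the standard InkML namespace, traces written as plain absolute decimal values, and X and Y as the first two channels of each sample; the function name `parse_traces` and the sample data are hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard W3C InkML namespace, in ElementTree's "{uri}tag" notation.
INKML_NS = "{http://www.w3.org/2003/InkML}"


def parse_traces(inkml_text):
    """Return one list of (x, y) points per <trace> element (hypothetical helper).

    Assumption: samples are separated by commas, channel values by
    whitespace, and the first two channels are X and Y. InkML's
    difference-encoding prefixes are not handled here.
    """
    root = ET.fromstring(inkml_text)
    strokes = []
    for trace in root.iter(INKML_NS + "trace"):
        points = []
        for sample in (trace.text or "").split(","):
            values = sample.split()
            if len(values) >= 2:
                # Any further channels (e.g. time, pressure) are ignored.
                points.append((float(values[0]), float(values[1])))
        strokes.append(points)
    return strokes


# Made-up sample with two strokes, for illustration only.
SAMPLE = """<ink xmlns="http://www.w3.org/2003/InkML">
  <trace>10 0, 9 14, 8 28</trace>
  <trace>130 155, 144 159</trace>
</ink>"""

strokes = parse_traces(SAMPLE)
print(len(strokes), [len(s) for s in strokes])
```

Discarding the point order and timing from such strokes, for example by rendering them to an image, gives the offline view of the same document.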