Skip to main content Skip to main navigation


Enriching Multiword Terms in Wiktionary with Pronunciation Information

Lenka Bajčetić; Thierry Declerck; Gilles Sérasset
In: Archna Bhatia; Kilian Evang; Marcos Garcia; Voula Giouli; Lifeng Han; Shiva Taslimipoor (Hrsg.). Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023). Workshop on Multiword Expressions (MWE-2023), located at The 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), May 6, Dubrovnik, Croatia, Pages 65-72, ISBN 978-1-959429-59-3, Association for Computational Linguistics (ACL), 209 N. Eighth Street, Stroudsburg, PA 18360, USA, 5/2023.


We report on work in progress dealing with the automated generation of pronunciation information for English multiword terms (MWTs) in Wiktionary, combining information available for their single components. We describe the issues we were encountering, the building of an evaluation dataset, and our teaming with the DBnary resource maintainer. Our approach shows potential for automatically adding morphosyntactic and semantic information to the components of such MWTs.