Project | IMPRESS

Duration: 08/01/2020 - 01/31/2024

Improving Embeddings with Semantic Knowledge

Research Topics

Application fields

Other

Virtually all NLP systems today use vector representations of words, also known as word embeddings. Similarly, systems that process language together with vision or other sensory modalities employ multimodal embeddings. While embeddings do capture some form of semantic relatedness, its exact nature remains unclear. This lack of precise semantic information can affect downstream tasks.

The goals of IMPRESS are to investigate the integration of semantic and commonsense knowledge into linguistic and multimodal embeddings and its impact on selected downstream tasks. IMPRESS will also develop open-source software and lexical resources, focusing on video activity recognition as a practical testbed. Furthermore, while there is a growing body of NLP research on languages other than English, most research on multimodal embeddings is still done on English. IMPRESS will therefore consider a multilingual extension of the developed methods to handle French, German and English.

Partners

1. DFKI
2. INRIA

Contact Person

Dipl.-Inf. Bernd Kiefer

Bernd.Kiefer@dfki.de
Phone: +49 681 85775 5332

Keyfacts

Publications

Multilingual coreference resolution: Adapt and Generate
Tatiana Anikina; Natalia Skachkova; Anna Mokhova
In: Zdeněk Žabokrtský; Maciej Ogrodniczuk (eds.). Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution. Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC-2023), located at EMNLP 2023, December 6-7, Singapore, Singapore, Pages 19-33, Association for Computational Linguistics, 12/2023.

AutoQIR: Auto-Encoding Questions with Retrieval Augmented Decoding for Unsupervised Passage Retrieval and Zero-shot Question Generation
Stalin Varanasi; Muhammad Umer Butt; Günter Neumann
In: Large Language Models for Natural Language Processing. International Conference on Recent Advances in Natural Language Processing (RANLP-2023), located at RANLP, September 4-6, Varna, Bulgaria, Pages 1171-1179, ISBN 978-954-452-092-2, INCOMA Ltd. Shoumen, Bulgaria, 9/2023.

Find-2-Find: Multitask Learning for Anaphora Resolution and Object Localization
Cennet Oguz; Pascal Denis; Emmanuel Vincent; Simon Ostermann; Josef van Genabith
In: Houda Bouamor; Juan Pino; Kalika Bali (eds.). Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, Pages 8099-8110, Association for Computational Linguistics, 2023.

Funding Authorities

BMBF - Federal Ministry of Education and Research

01IS20076
