Project | TAKE

Duration: 01/01/2009 - 12/31/2011

Technologies for Advanced Knowledge Extraction

The project TAKE aims to adapt, develop and utilize a range of language and knowledge technologies for the gradual automatic extraction of knowledge from the World Wide Web. Rule-based and statistical methods for language processing will be combined for systematically extending a body of formalized knowledge.

The central technology for this endeavor is semantically driven advanced information extraction, especially relation extraction, i.e., the detection of instances of semantic relations in large volumes of texts. Such relevant relations may belong to several classes such as facts, definitions, events, citations and opinions.

In TAKE, information extraction is not viewed as a pragmatic shortcut to getting at least something out of natural language texts but rather as a method for gradually approaching the unsolved problem of text understanding in a systematic and controlled way.

Existing bodies of formalized linguistic knowledge such as lexicons, morphologies and grammars will be utilized as well as tools for statistical processing.

The developed methods, architectures and systems will be tested and demonstrated in two knowledge domains:

scientific/technological literature in a selected field of research, i.e., language technology, and
general biographical texts.

TAKE is funded under contract 01IW08003.

Keyfacts

Involved research areas

Head

Prof. Dr. Hans Uszkoreit

Website

http://take.dfki.de

Publications

All publications

The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries
Ulrich Schäfer; Bernd Kiefer; Christian Spurk; Jörg Steffen; Rui Wang; Benjamin Weitz; Magdalena Wolska
In: LIBER quarterly, Vol. 22, No. 4, Pages 285-309, Association of European Research Libraries, 2/2013.
Domain Adaptive Relation Extraction for Semantic Web
Feiyu Xu; Hans Uszkoreit; Hong Li; Peter Adolphs; Xiwen Cheng
In: Hermann Friedrich; Hans-Joachim Grallert; Wolfgang Wahlster; Stefan Wess; Thomas Widenka (Hrsg.). Theseus-Buch. Chapter X, Springer, 2013.
A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology
Ulrich Schäfer; Christian Spurk; Jörg Steffen
In: Proceedings of the 24th International Conference on Computational Linguistics. International Conference on Computational Linguistics (COLING-2012), December 10-14, Mumbai, India, Pages 1059-1070, ICCL, 12/2012.

Project | TAKE

Technologies for Advanced Knowledge Extraction

Keyfacts

Involved research areas

Head

Website

Publications

The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

Domain Adaptive Relation Extraction for Semantic Web

A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology

Funding Authorities

BMBF - Federal Ministry of Education and Research

Share project:

Keyfacts

Involved research areas

Head

Website

The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

Domain Adaptive Relation Extraction for Semantic Web

A Fully Coreference-annotated Corpus of Scholarly Papers from the ACL Anthology

Funding Authorities

BMBF - Federal Ministry of Education and Research