 |
NEWS
ITEM

|
1.12.2008
ICLT Lecture
on Developing Combined Taggers
The
next talk in the Icelandic Centre for Language Technology (ICLT)
seminar series will be given at Reykjavik University, Kringlan 1, room
K5, Tuesday December 2nd, and starts at 12:00. The speakers are Verena
Henrich and Timo Reuter from University of Applied Sciences Darmstadt,
Germany. The title of their talk is CombiTagger:
A System for Developing Combined Taggers.
The talk will be given
in English.
The main task of part-of-speech (PoS) tagging is to assign the
appropriate morphosyntactic category to each word in a sentence. A
combination of different PoS taggers usually results in higher tagging
accuracy than obtained by the use of only a single tagger. We present a
new language and tagset independent system, CombiTagger, which combines
automatically the output of several taggers. The system, which is open
source, provides algorithms for simple and weighted voting, but it is
extensible so that other combination algorithms can be added easily. We
demonstrate the functionality of CombiTagger by using it for combined
tagging of English and Icelandic text.
Verena Henrich and Timo Reuter are MSc students in Computer Science at
University of Applied Sciences Darmstadt, Germany. During Spring and
Fall semesters 2008 they have been exchange students at Reykjavik
University, where they are currently working on their
MSc-thesis.
|
|
|