ACL RD-TEC 1.0 Summarization of C02-1021
Paper Title:
(SEMI-)AUTOMATIC DETECTION OF ERRORS IN POS-TAGGED CORPORA
Authors: Pavel Květoň and Karel Oliva
Primarily assigned technology terms:
Other assigned terms:
- adverb
- ambiguous word
- annotated corpora
- annotators
- approach
- bigram
- case
- corpora
- data sparseness
- data sparseness problem
- error rate
- fact
- feature
- finite verb
- german syntax
- grammatical relation
- human annotators
- implementation
- knowledge
- language resources
- linguistic
- linguistic knowledge
- method
- morphological features
- n-grams
- negra
- nouns
- orthography
- part-of-speech
- phonological rules
- preposition
- preposition stranding
- prepositions
- presupposition
- procedure
- process
- relation
- sentence
- sentences
- source text
- sparseness problem
- syntax
- tagged corpora
- tagged corpus
- tagging scheme
- tags
- tagset
- technique
- text
- training
- training data
- trigram
- understanding
- verb
- word
- words