ACL RD-TEC 1.0 Summarization of P01-1035
Paper Title:
SERIAL COMBINATION OF RULES AND STATISTICS: A CASE STUDY IN CZECH TAGGING
Authors: Jan Hajic, Pavel Krbec, Pavel Kveton, Karel Oliva, and Vladimir Petkevic
Primarily assigned technology terms:
- analyzer
- automaton
- bracketing
- classifier
- classifier combination
- classifiers
- cross-validation
- disambiguation
- error analysis
- error reduction
- finite state
- finite state automaton
- finite-state automaton
- hidden markov
- hidden markov models
- hmm tagger
- hmms
- learning
- likelihood estimate
- linear interpolation
- machine learning
- maximum likelihood
- maximum-entropy
- morphological analyzer
- morphological processor
- morphology
- np bracketing
- parsing
- partial disambiguation
- processor
- programming language
- reading
- rule development
- rule-based approach
- rule-based system
- rule-writing
- sense disambiguation
- smoothing
- state automaton
- statistical learning
- statistical system
- statistical tagger
- statistical tagging
- tagger
- taggers
- tagging
- weighting
- word sense disambiguation
Other assigned terms:
- adjective
- adverb
- ambiguity
- ambiguous word
- annotation
- annotator
- approach
- brazilian portuguese
- case
- data set
- dependency treebank
- dictionary
- error rate
- evaluation method
- f-measure
- fact
- finite verb
- free word order
- french
- grammars
- grammatical agreement
- implementation
- inflection
- input text
- input word form
- interpolation
- interpolation coefficients
- interpretation
- knowledge
- language model
- lexemes
- lexical information
- lexical model
- likelihood
- linguistic
- main verb
- manual annotation
- markov models
- maximum likelihood estimate
- meaning
- method
- morphological ambiguity
- n-gram
- paragraph
- part of speech
- part-of-speech
- prague dependency treebank
- precision
- preposition
- prepositions
- probability
- probability distributions
- pronoun
- punctuation
- relative error reduction
- relative frequency
- sentence
- sentences
- sentential context
- sparse data
- sparse data problem
- statistics
- symbols
- syntactic knowledge
- syntax
- tag sequence
- tags
- tagset
- test data
- test data set
- testing corpus
- text
- tokens
- training
- training data
- training data set
- treebank
- trigram
- verb
- word
- word form
- word order
- word sense
- words