ACL RD-TEC 1.0 Summarization of C96-2151
Paper Title:
HANDLING SPARSE DATA BY SUCCESSIVE ABSTRACTION
HANDLING SPARSE DATA BY SUCCESSIVE ABSTRACTION
Primarily assigned technology terms:
- back-off smoothing
- baum-welch reestimation
- dynamic programming
- dynamic programming technique
- internet
- iterative reestimation
- language processing
- likelihood estimation
- linear interpolation
- natural language processing
- parameter estimation
- parameter setting
- parameter-estimation
- part-of-speech tagging
- pos tagging
- processing
- programming technique
- reestimation
- search
- smoothing
- statistical tagging
- statistical techniques
- suffix tree
- tagger
- taggers
- tagging
- tile
- trigram tagger
Other assigned terms:
- bias
- bigram
- case
- conditional probabilities
- conditional probability
- context size
- contextual information
- corpora
- distribution
- entropy
- error rate
- estimation
- experimental results
- fact
- finite set
- good-turing estimation
- grammar
- grammar rules
- input string
- input text
- interpolation
- language model
- language models
- lattice
- lexicon
- likelihood
- linear combination
- linear order
- linguistic
- linguistic structure
- measure
- method
- morphological structure
- n-gram
- n-grams
- natural language
- part-of-speech
- part-of-speech tagging task
- part-of-speech tags
- probabilities
- probability
- probability distribution
- probability distributions
- probability estimate
- probability estimates
- procedure
- process
- relative frequency
- semantic
- sparse data
- standard deviation
- statistical data
- statistical language model
- statistics
- suffix
- suffixes
- susanne corpus
- symbol
- tag set
- tagging task
- tags
- technique
- test corpora
- test corpus
- text
- training
- training corpus
- training data
- training set
- training text
- tree
- trigram
- uniform distribution
- unigram
- untagged text
- word
- word string
- words