ACL RD-TEC 1.0 Summarization of W96-0113
Paper Title:
A RE-ESTIMATION METHOD FOR STOCHASTIC LANGUAGE MODELING FROM AMBIGUOUS OBSERVATIONS
A RE-ESTIMATION METHOD FOR STOCHASTIC LANGUAGE MODELING FROM AMBIGUOUS OBSERVATIONS
Primarily assigned technology terms:
- algorithm
- analyzer
- baum-welch algorithm
- baum-welch reestimation
- bigram estimation
- bigram model training
- error correcting
- estimation algorithm
- estimation method
- evaluation process
- forward-backward algorithm
- hmms
- language modeling
- language processing
- learning
- likelihood training
- matching
- maximum likelihood
- maximum likelihood training
- model estimation
- model training
- modeling
- morphological analyzer
- natural language processing
- optimization
- part-of-speech assignment
- processing
- re-estimation
- re-estimation algorithm
- recognition
- reestimation
- reestimation algorithm
- rule-based tagger
- search
- segmentation
- smoothing
- smoothing method
- speech modeling
- speech recognition
- spelling
- stochastic language modeling
- stochastic tagger
- stochastic tagging
- synchronization
- tag assignment
- tagger
- taggers
- tagging
- tagging system
- weighting
- word extraction
- word prediction
- word segmentation
Other assigned terms:
- adjective
- ambiguity
- ambiguous word
- approach
- bigram
- bigram model
- character sequence
- characters
- corpora
- dictionary
- edr corpus
- estimation
- experimental results
- fact
- implementation
- interpolation
- japanese corpus
- japanese language
- japanese sentences
- japanese text
- language corpora
- language model
- language models
- language processing applications
- lattice
- lattice structure
- likelihood
- method
- model parameters
- modeling language
- morpheme
- morphemes
- n-gram
- n-gram language model
- n-gram model
- natural language
- noise
- pairs of words
- part-of-speech
- precision
- probabilities
- probability
- procedure
- process
- search problem
- segmentation ambiguity
- sentence
- sentences
- stochastic language model
- stochastic model
- symbol
- symbols
- tag model
- tag sequence
- tagged corpus
- tagged text
- tagging accuracy
- tagging problem
- tags
- technique
- technology
- test data
- text
- training
- training data
- training material
- trigram
- trigram model
- unigram
- untagged corpora
- untagged corpus
- verb
- word
- word boundaries
- word sequence
- words