ACL RD-TEC 1.0 Summarization of N06-1001
Paper Title:
CAPITALIZING MACHINE TRANSLATION
Authors: Wei Wang and Kevin Knight and Daniel Marcu
Primarily assigned technology terms:
- algorithm
- ambiguity resolution
- bayesian approach
- capitalization
- conditional random field
- conditional random fields
- crfs
- decoder
- decoding
- dictating
- end-to-end machine translation
- error rate reduction
- extractor
- information technology
- kneser-ney smoothing
- language modeling
- learning
- lexical ambiguity resolution
- linear search
- machine translation
- markov model
- maximum entropy
- model training
- modeling
- mt system
- mt systems
- parallelization
- parameter estimation
- phrase aligner
- phrase alignment
- postprocessing
- rate reduction
- search
- segmentation
- sequence labeling
- smoothing
- smt system
- statistical mt
- syntax-based machine translation
- tagger
- translation model training
- word aligner
- word alignment
- word translation
Other assigned terms:
- alignment information
- ambiguity
- approach
- bilingual corpus
- bitext
- bleu
- bleu scores
- case
- case information
- chunks
- corpora
- corpus size
- data sets
- derivation
- development set
- distribution
- entropy
- entropy markov model
- error rate
- estimation
- experimental results
- fact
- feature
- feature value
- feature weight
- feature weights
- frame
- french
- gaussian prior
- heuristic
- heuristics
- knowledge
- labeling
- language information
- language model
- language models
- language pair
- language pairs
- large training
- lexical ambiguity
- log-likelihood
- machine translation output
- monolingual capitalization model
- n-gram
- names
- performance comparison
- phrase
- precision
- probabilistic model
- probabilities
- probability
- process
- punctuation
- punctuation mark
- punctuation marks
- resolution problem
- right-hand side
- sentence
- sentence pair
- sentences
- source language
- source sentence
- surface form
- tag sequence
- tags
- target language
- target language model
- target sentence
- target vocabulary
- technology
- term
- terms
- text
- training
- training corpus
- training data
- training set
- translation model
- translation models
- translation output
- translation probability
- translation rule
- translation table
- translations
- trigram
- trigram language model
- vocabulary
- weight vector
- word
- word alignments
- word pair
- words