ACL RD-TEC 1.0 Summarization of I05-3002
Paper Title:
USING WORD-PAIR IDENTIFIER TO IMPROVE CHINESE INPUT SYSTEM
USING WORD-PAIR IDENTIFIER TO IMPROVE CHINESE INPUT SYSTEM
Primarily assigned technology terms:
- algorithm
- auto-generating word-pair
- backoff smoothing
- character recognition
- chinese language processing
- chinese speech recognition
- database
- disambiguation
- editing
- error analysis
- error reduction
- homophone disambiguation
- language processing
- learning
- matching
- maximum matching
- nlp
- nlp system
- optical character recognition
- post-processing
- processing
- recognition
- recognition systems
- search
- search engine
- segmentation
- semi-automatic learning
- smoothing
- speech recognition
- speech recognition systems
- stw conversion
- syllable segmentation
- syllable-to-word conversion
- syllable-word segmentation
- translator
- tuning
- unknown word extraction
- word extraction
- word segmentation
- word-segmentation
- wp identifier
Other assigned terms:
- approach
- backoff
- bigram
- bigram model
- case
- characters
- chinese characters
- chinese language
- chinese sentence
- chinese word
- chinese words
- co-occurrence
- contextual information
- dictionaries
- dictionary
- error reduction rate
- experimental results
- fact
- generation
- handwriting
- homonym
- knowledge
- language model
- language models
- linguistic
- linguistic approach
- mappings
- method
- moe-mandarin dictionary
- online dictionary
- open test
- pinyin
- precision
- probabilities
- process
- segmentation problem
- semantic
- sentence
- sentences
- statistical approach
- statistical language model
- syllables
- syntax
- technique
- test set
- testing corpus
- training
- training corpus
- translations
- trigram
- understanding
- user
- web corpus
- word
- word frequencies
- words