ACL RD-TEC 1.0 Summarization of W96-0107
Paper Title:
AUTOMATIC EXTRACTION OF WORD SEQUENCE CORRESPONDENCES IN PARALLEL CORPORA
AUTOMATIC EXTRACTION OF WORD SEQUENCE CORRESPONDENCES IN PARALLEL CORPORA
Authors: Mihoko Kitamura and Yuji Matsumoto
Primarily assigned technology terms:
- algorithm
- automatic extraction
- em algorithm
- iterative algorithm
- iterative method
- language translation
- machine translation
- machine translation systems
- matching
- morphological analysis
- mt systems
- pair extraction
- pattern extraction
- preprocessing
- ranking
- recognizer
- similarity calculation
- statistical method
- structural matching
- structural translation
- translation memory
- translation memory \
- translation pair extraction
- translation systems
- weighting
- word extraction
Other assigned terms:
- approach
- bilingual corpora
- bilingual dictionary
- case
- co-occurrence
- co-occurrence frequency
- co-occurrences
- coefficient
- collocation
- compounds
- computational overhead
- content words
- corpora
- dice
- dice coefficient
- dictionaries
- dictionary
- english corpus
- english translation
- fact
- french
- french word
- heuristics
- high quality translation
- human inspection
- implementation
- japanese compound
- japanese corpus
- japanese translation
- japanese word
- language corpus
- measure
- method
- mutual information
- noun phrases
- nouns
- parallel corpora
- parallel corpus
- parallel texts
- precision
- probabilistic measure
- probabilities
- procedure
- process
- sentence
- sentence level
- sentences
- similarity measure
- source language
- statistical approach
- statistics
- target language
- technical terms
- terms
- translation candidates
- translation dictionary
- translation pair
- translation pairs
- translation pattern
- word
- word level
- word sequence
- word sequences
- word similarity
- words