ACL RD-TEC 1.0 Summarization of W03-1508

Paper Title:
TRANSLITERATION OF PROPER NAMES IN CROSS-LINGUAL INFORMATION RETRIEVAL

Authors: Paola Virga and Sanjeev Khudanpur

Primarily assigned technology terms:

Other assigned terms:

  • approach
  • arabic text
  • back-transliteration
  • break
  • case
  • character error rate
  • character sequence
  • characters
  • chinese characters
  • chinese text
  • clusters
  • document
  • document collection
  • document frequency
  • document model
  • domain-specific terminology
  • edit distance
  • english text
  • english translation
  • english translations
  • error rate
  • fact
  • foreign language
  • function words
  • ibm model
  • index
  • intrinsic testing
  • knowledge
  • language model
  • language models
  • large corpus
  • lexicon
  • likelihood
  • linguistic
  • linguistic knowledge
  • mapping
  • maps
  • markov models
  • mean average precision
  • measure
  • method
  • multi-lingual text
  • n-grams
  • name transliteration procedure
  • named entity
  • named entity corpus
  • named-entity
  • names
  • noisy channel
  • orthography
  • parallel corpus
  • phoneme
  • phoneme sequence
  • phonemes
  • phonemic representation
  • precision
  • probability
  • procedure
  • process
  • pronunciation
  • proper names
  • queries
  • query
  • recipe
  • retrieval performance
  • retrieval task
  • seed
  • sentence
  • sentences
  • source-channel model
  • specialized terminology
  • speech synthesis literature
  • statistical approach
  • statistical language model
  • statistical model
  • statistical significance
  • statistical translation model
  • statistics
  • symbol
  • symbols
  • system description
  • target language
  • tdt corpus
  • technical terms
  • technique
  • term
  • terms
  • test set
  • text
  • tokens
  • tone
  • toolkit
  • topics
  • training
  • training corpus
  • training data
  • training set
  • transformation
  • transformation rules
  • translation lexicon
  • translation model
  • translation models
  • translation output
  • translations
  • transliteration performance
  • transliteration procedure
  • trigram
  • trigram language model
  • trigram model
  • vocabulary
  • word
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***