ACL RD-TEC 1.0 Summarization of W04-3208

Paper Title:
MINING VERY-NON-PARALLEL CORPORA: PARALLEL SENTENCE AND LEXICON EXTRACTION VIA BOOTSTRAPPING AND E

Authors: Pascale Fung and Percy Cheung

Other assigned terms:

  • approach
  • bilingual corpora
  • bilingual lexicon
  • bilingual sentence
  • case
  • comparable corpora
  • comparable document
  • convergence
  • corpora
  • cosine similarity
  • document
  • document set
  • english sentence
  • estimation
  • experimental results
  • fact
  • ibm model
  • language pairs
  • lexical information
  • lexicon
  • measures
  • method
  • model parameters
  • monolingual corpora
  • parallel corpus
  • parallel sentence
  • paraphrase
  • paraphrases
  • phrase
  • precision
  • process
  • sentence
  • sentence pair
  • sentence similarity
  • sentences
  • similarity measures
  • similarity scores
  • tdt corpus
  • translation candidates
  • translations
  • word
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***