ACL RD-TEC 1.0 Summarization of P03-1051

Paper Title:
LANGUAGE MODEL BASED ARABIC WORD SEGMENTATION

Authors: Young-Suk Lee, Kishore Papineni, Salim Roukos, Ossama Emam, and Hany Hassan

Other assigned terms:

  • acquisition technique
  • adjective
  • adverb
  • affixes
  • ambiguity
  • ambiguity problem
  • arabic treebank
  • bigram
  • case
  • contextual information
  • corpora
  • corpus size
  • correlation
  • derivation
  • dictionary
  • dutch
  • error rate
  • estimation
  • evaluations
  • exact match
  • experimental results
  • f-score
  • foreign words
  • implementation
  • inflected forms
  • inflectional morphology
  • input text
  • interpolation
  • knowledge
  • language model
  • language model score
  • language processing applications
  • lemma
  • likelihood
  • linguistic
  • linguistic resources
  • meaning
  • meanings
  • method
  • minimum description length
  • model parameters
  • model probability
  • morpheme
  • morphemes
  • natural language
  • natural language processing applications
  • orthography
  • part of speech
  • part-of-speech
  • part-of-speech information
  • parts-of-speech
  • prefixes and suffixes
  • prepositions
  • probabilities
  • probability
  • process
  • pronouns
  • proper noun
  • punctuation
  • russian
  • seed
  • segmentation accuracy
  • segmentation ambiguity
  • segmented corpus
  • segments
  • semantic
  • semitic languages
  • sentence
  • stem
  • stems
  • suffix
  • suffixes
  • target languages
  • technique
  • test corpus
  • test set
  • text
  • text corpora
  • tokens
  • toolkit
  • training
  • training corpus
  • translations
  • treebank
  • trigram
  • trigram language model
  • trigram model
  • unigram
  • verb
  • vocabulary
  • vocabulary size
  • word
  • word corpus
  • word error rate
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
