ACL RD-TEC 1.0 Summarization of P96-1041

Paper Title:
An Empirical Study of Smoothing Techniques for Language Modeling

Authors: Stanley F. Chen and Joshua Goodman

Other assigned terms:

  • acoustic signal
  • bayesian framework
  • bigram
  • bigram model
  • brown corpus
  • case
  • chunks
  • concept
  • corpora
  • data consortium
  • data sets
  • distribution
  • entropy
  • estimation
  • good-turing estimation
  • implementation
  • interpolation
  • knowledge
  • language model
  • large corpus
  • large training
  • likelihood
  • linguistic
  • linguistic data
  • linguistic data consortium
  • maximum likelihood estimate
  • measure
  • method
  • methodology
  • n-gram
  • n-gram models
  • n-grams
  • parameter settings
  • parameter values
  • part-of-speech
  • performance evaluation
  • perplexity
  • phrase
  • phrase attachment
  • prepositional phrase
  • prepositional phrase attachment
  • probabilities
  • probability
  • recursion
  • segments
  • sentence
  • sentences
  • set size
  • signal
  • standard deviation
  • technique
  • term
  • terms
  • test data
  • text
  • tokens
  • training
  • training data
  • training set
  • training set size
  • treebank
  • trigram
  • trigram model
  • uniform distribution
  • unigram
  • unigram model
  • vocabulary
  • word
  • words

This page last edited on 10 May 2017.
