ACL RD-TEC 1.0 Summarization of W99-0908

Paper Title:
Text Classification by Bootstrapping with Keywords, EM and Shrinkage

Authors: Andrew McCallum and Kamal Nigam

Other assigned terms:

  • anchors
  • approach
  • class hierarchy
  • class probability
  • classification accuracy
  • classification error
  • classification hierarchy
  • computer science research
  • data set
  • dictionary
  • disambiguation task
  • disk
  • distribution
  • document
  • document frequency
  • estimation
  • experimental results
  • feature
  • feature space
  • generation
  • generative model
  • implementation
  • intelligence
  • interpolation
  • keyword
  • knowledge
  • labeling
  • large training
  • leaf
  • likelihood
  • local maxima
  • method
  • prior probability
  • probabilistic model
  • probabilities
  • probability
  • probability estimate
  • probability estimates
  • process
  • random sample
  • seed
  • segments
  • sparse data
  • sparse data problem
  • technique
  • term
  • test set
  • text
  • text documents
  • theoretical framework
  • topics
  • training
  • training data
  • training documents
  • training examples
  • uniform distribution
  • unigram
  • unigram model
  • unlabeled examples
  • vocabulary
  • web page
  • web pages
  • word
  • word distribution
  • word features
  • word sense
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
