ACL RD-TEC 1.0 Summarization of P95-1026
Paper Title:
UNSUPERVISED WORD SENSE DISAMBIGUATION RIVALING SUPERVISED METHODS
UNSUPERVISED WORD SENSE DISAMBIGUATION RIVALING SUPERVISED METHODS
Primarily assigned technology terms:
- algorithm
- bootstrapping
- classification
- classification algorithm
- classifier
- classifiers
- clustering
- co-occurrence analysis
- decision list algorithm
- disambiguation
- em algorithm
- error correction
- error rate reduction
- final state
- hierarchical clustering
- human language
- induction
- iterative training
- learning
- learning algorithm
- learning algorithms
- list algorithm
- machine translation
- manufacturing
- matching
- modeling
- non-compositional interpretation
- partitioning
- probabilistic disambiguation
- pruning
- rate reduction
- relative distance
- sense disambiguation
- sense induction
- sense tagger
- sense tagging
- sense-disambiguation
- simulated annealing
- smoothing
- statistical sense-disambiguation
- supervised classification
- supervised learning
- supervised sense tagger
- supervised training
- tagger
- tagging
- training algorithm
- training procedure
- unsupervised algorithm
- unsupervised learning
- unsupervised learning algorithm
- unsupervised training
- word sense disambiguation
- word-sense disambiguation
Other assigned terms:
- anchor
- approach
- bag of words
- bilingual corpora
- bilingual lexicons
- case
- cluster
- clusters
- co-occurrence
- co-occurrence information
- co-occurrence statistics
- collocate
- collocation
- collocational information
- concept
- content words
- context window
- convergence
- corpora
- corpus frequency
- dictionary
- dictionary definition
- dictionary definitions
- dictionary entries
- discourse
- discourse information
- disk
- distribution
- distributional information
- distributional similarity
- document
- error rate
- fact
- feature
- function word
- function words
- hard constraint
- hypothesis
- inflected forms
- interpolation
- interpretation
- keyword
- language models
- large corpus
- lexical choice
- likelihood
- local context
- log-likelihood
- log-likelihood ratio
- measures
- method
- monolingual corpora
- monolingual corpus
- noise
- parts of speech
- poisson distribution
- polysemous word
- predicate-argument
- probabilities
- probability
- probability distributions
- procedure
- process
- relation
- seed
- seed words
- sense distinctions
- senses of a word
- sentences
- statistics
- stems
- syntactic relationship
- tags
- target word
- terms
- text
- text corpora
- tokens
- training
- training corpus
- training data
- training examples
- training set
- trigram
- untagged corpus
- verb-object pair
- word
- word classes
- word sense
- word senses
- word sequence
- wordnet
- words