ACL RD-TEC 1.0 Summarization of W04-2406
Paper Title:
WORD SENSE DISCRIMINATION BY CLUSTERING CONTEXTS IN VECTOR AND SIMILARITY SPACES
WORD SENSE DISCRIMINATION BY CLUSTERING CONTEXTS IN VECTOR AND SIMILARITY SPACES
Authors: Amruta Purandare and Ted Pedersen
Primarily assigned technology terms:
- agglomerative clustering
- algorithm
- classifiers
- clustering
- clustering algorithm
- clustering method
- comparative analysis
- context group discrimination
- decomposition
- disambiguation
- exact matching
- feature collection
- feature selection
- group discrimination
- hybrid clustering
- hybrid method
- indexing
- k-means
- latent semantic analysis
- latent semantic indexing
- link clustering
- matching
- maximal matching
- partitioning
- search
- semantic analysis
- semantic indexing
- sense disambiguation
- sense discrimination
- singular value decomposition
- style clustering
- unsupervised clustering
- unsupervised technique
- vector representation
- word sense disambiguation
- word sense discrimination
Other assigned terms:
- approach
- baseline clustering
- bigram
- british national corpus
- case
- cluster
- clustering space
- clusters
- co-occurrence
- co-occurrence information
- co-occurrence matrix
- co-occurrences
- concept
- concepts
- confusion matrix
- content words
- context vector
- context vectors
- convergence
- corpora
- data sets
- dictionary
- dictionary definitions
- dimensionality
- estimation
- exact match
- experimental results
- f-measure
- fact
- feature
- feature set
- feature sets
- feature space
- feature vectors
- frequency counts
- graph theory
- hypothesis
- index
- large corpora
- large corpus
- latent semantic
- likelihood
- likelihood ratio
- log-likelihood
- log-likelihood ratio
- mapping
- measure
- method
- nouns
- pairs of words
- part of speech
- parts of speech
- precision
- process
- reordering
- representations
- search strategy
- second order context
- semantic
- semantic space
- sense-tagged corpora
- sentences
- similarity matrix
- sparse data
- speech information
- style
- tagged corpora
- tagged corpus
- tags
- target word
- technique
- test data
- test set
- text
- theory
- training
- training and test data
- training corpus
- training data
- vector space
- word
- word co-occurrence
- word level
- word sense
- words