ACL RD-TEC 1.0 Summarization of W02-2009
Paper Title:
CROSS-DATASET CLUSTERING: REVEALING CORRESPONDING THEMES ACROSS MULTIPLE CORPORA
CROSS-DATASET CLUSTERING: REVEALING CORRESPONDING THEMES ACROSS MULTIPLE CORPORA
Authors: Ido Dagan and Zvika Marx and Eli Shamir
Primarily assigned technology terms:
Other assigned terms:
- approach
- case
- characters
- cluster
- clustering paradigm
- clustering procedure
- clusters
- co-occurrence
- co-occurrence statistics
- cognitive
- content words
- convergence
- corpora
- data sets
- distribution
- feature
- feature set
- feature vector
- feature vectors
- function words
- geometric mean
- grounding
- joint probability
- keyword
- kullback-leibler divergence
- meaning
- measure
- measures
- method
- names
- precision
- probabilities
- probability
- probability distribution
- procedure
- process
- relation
- right-hand side
- sentence
- statistics
- tags
- term
- terms
- text
- tokens
- topics
- word
- word co-occurrence
- words