ACL RD-TEC 1.0 Summarization of C02-1045
Paper Title:
A METHOD OF CLUSTER-BASED INDEXING OF TEXTUAL DATA
A METHOD OF CLUSTER-BASED INDEXING OF TEXTUAL DATA
Primarily assigned technology terms:
- analyzer
- artificial intelligence
- categorization
- cluster-based indexing
- clustering
- clustering method
- conceptual indexing
- discounting method
- document clustering
- entropy calculation
- graph partitioning
- indexing
- information retrieval
- information retrieval systems
- language modeling
- latent semantic indexing
- micro-clustering
- modeling
- morphological analyzer
- naive bayes
- partitioning
- preprocessing
- probabilistic language modeling
- retrieval systems
- semantic indexing
- simultaneous clustering
- spectral graph partitioning
- supervised text categorization
- support vector machine
- term indexing
- text categorization
- text-based information retrieval
- transformation-based indexing
- unsupervised text categorization
Other assigned terms:
- case
- cluster
- cluster evaluation
- clusters
- co-occurrence
- co-occurrence matrix
- co-occurrences
- coefficient
- community
- compact representation
- contingency table
- data set
- distribution
- document
- document sets
- entropy
- estimation
- events
- experimental results
- feature
- feature space
- generation
- generation process
- graphical representation
- implementation
- information retrieval research
- information space
- information theory
- intelligence
- interpretation
- japanese text
- joint distribution
- latent semantic
- linguistic
- method
- mutual information
- noise
- nouns
- occurrence probability
- part-of-speech
- part-of-speech tags
- precision
- probabilistic models
- probabilities
- probability
- procedure
- process
- semantic
- support vector
- tags
- term
- terms
- test set
- text
- text collection
- textual information
- theory
- topics
- training
- training documents
- training set
- words