ACL RD-TEC 1.0 Summarization of W98-1214
Paper Title:
CHOOSING A DISTANCE METRIC FOR AUTOMATIC WORD CATEGORIZATION
CHOOSING A DISTANCE METRIC FOR AUTOMATIC WORD CATEGORIZATION
Authors: Emin Erkan Korkmaz and Gokturk Ucoluk
Primarily assigned technology terms:
- algorithm
- approximation
- automatic word categorization
- bottom-up algorithm
- categorization
- clustering
- computational natural language learning
- distance function
- fuzzy clustering
- fuzzy set
- genetic algorithm
- genetic algorithms
- greedy algorithm
- k-means
- knowledge bases
- language acquisition
- language learning
- language processing
- learning
- linguistic categorization
- machine translation
- measuring
- natural language learning
- natural language processing
- neural network
- nlp
- processing
- search
- statistical methods
- statistical natural language processing
- statistical nlp
- tile
- top-down approach
- unsupervised algorithm
- weighting
- word categorization
- word categorization process
- word clustering
Other assigned terms:
- adjective
- approach
- bigram
- case
- cluster
- clusters
- coefficient
- concept
- convergence
- corpora
- correlation
- correlation coefficient
- distance metric
- distribution
- english corpus
- entropy
- fact
- fuzzy logic
- grammar
- incremental approach
- knowledge
- language corpora
- language models
- lexicon
- linguist
- linguistic
- linguistic similarity
- logic
- measure
- method
- mutual information
- n-gram
- n-gram model
- n-gram models
- natural language
- natural language corpora
- natural language sentences
- nouns
- probabilities
- probability
- procedure
- process
- rank correlation
- run-time
- search space
- sentences
- similarity function
- singular noun
- spearman rank correlation
- statistic
- statistical knowledge
- statistical natural language
- statistics
- stochastic model
- sublanguage
- text
- theory
- tree
- unigram
- verb
- verb classes
- word
- word pair
- words