ACL RD-TEC 1.0 Summarization of C04-1115
Paper Title:
FEATURE WEIGHTING FOR CO-OCCURRENCE-BASED CLASSIFICATION OF WORDS
Authors: Viktor Pekar, Michael Krkoska, and Steffen Staab
Primarily assigned technology terms:
- acquisition process
- algorithm
- automatic construction
- bayes algorithm
- bayes classifier
- categorization
- classification
- classifier
- classifiers
- co-occurrence-based classification
- computing
- cross-validation
- document categorization
- document retrieval
- entity classification
- feature selection
- feature weighting
- global feature weighting
- information extraction
- information retrieval
- k-nn
- knn
- learning
- learning techniques
- lexical acquisition
- machine learning
- machine learning techniques
- naive bayes
- naive bayes classifier
- naive bayes classifiers
- named entity classification
- nearest neighbors
- nlp
- optimization
- parameter optimization
- parsing
- pre-processing
- processing
- scoring
- scoring function
- statistical nlp
- ten-fold cross-validation
- text categorization
- text classification
- text retrieval
- training process
- weighting
- weighting method
- word classification
Other assigned terms:
- analogy
- approach
- background knowledge
- bilingual lexicons
- binomial distribution
- british national corpus
- categorization task
- class membership
- classification task
- classification tasks
- cluster
- co-occurrence
- co-occurrences
- conditional probabilities
- conditional probability
- context words
- distribution
- distributional similarity
- document
- empirical evaluation
- entropy
- estimation
- evaluation measure
- evaluation method
- extraction patterns
- fact
- feature
- feature vector
- hyponyms
- independence model
- information gain
- information theory
- jensen-shannon divergence
- knowledge
- meaning
- measure
- measures
- method
- mutual information
- named entity
- natural language
- nouns
- polysemy
- positive and negative examples
- precision
- probabilities
- probability
- probability distributions
- procedure
- process
- representations
- russian
- semantic
- semantic classes
- similarity measure
- similarity metric
- similarity metrics
- statistical significance
- synonymy
- technique
- technologies
- term
- terms
- test data
- text
- theory
- training
- training data
- training set
- word
- word meaning
- wordnet
- words