ACL RD-TEC 1.0 Summarization of N04-4002
Paper Title:
MMR-BASED FEATURE SELECTION FOR TEXT CATEGORIZATION
MMR-BASED FEATURE SELECTION FOR TEXT CATEGORIZATION
Authors: Changki Lee and Gary Geunbae Lee
Primarily assigned technology terms:
- algorithm
- approximation
- categorization
- chi-square test
- classification
- computer science
- computing
- cross validation
- document retrieval
- feature elimination
- feature selection
- feature selection method
- feature subset selection
- greedy algorithm
- greedy feature selection
- indexing
- learning
- learning algorithm
- learning algorithms
- learning techniques
- machine learning
- machine learning algorithms
- machine learning techniques
- maximum entropy
- mmr-based feature selection
- naive bayes
- search
- search engines
- selection method
- statistical classification
- summarization
- support vector machine
- term selection
- text categorization
- tuning
- validation
Other assigned terms:
- 10-fold cross validation
- classification accuracy
- classification task
- co-occurrence
- conditional independence
- conditional probability
- data set
- data sets
- dimensionality
- distribution
- document
- empirical results
- entropy
- experimental results
- feature
- feature set
- feature space
- information gain
- knowledge
- knowledge base
- kullback-leibler divergence
- linear combination
- measure
- measures
- method
- mmr-based feature
- predictive information
- probability
- query
- running time
- semantic
- semantic categories
- support vector
- term
- terms
- text
- text collection
- text documents
- training
- training examples
- user
- vocabulary
- word