ACL RD-TEC 1.0 Summarization of C04-1112
Paper Title:
A LEMMA-BASED APPROACH TO A MAXIMUM ENTROPY WORD SENSE DISAMBIGUATION SYSTEM FOR DUTCH
A LEMMA-BASED APPROACH TO A MAXIMUM ENTROPY WORD SENSE DISAMBIGUATION SYSTEM FOR DUTCH
Primarily assigned technology terms:
- algorithm
- automaton
- classification
- classification algorithm
- classifier
- classifiers
- clustering
- computing
- corpus preparation
- database
- dictionary lookup
- disambiguation
- encoding
- english wsd
- entropy classification
- entropy classifier
- error rate reduction
- feature selection
- finite state
- finite state automaton
- groningen
- grouping
- language processing
- learning
- learning algorithm
- learning algorithms
- lemmatization
- lemmatizer
- linguistic processing
- machine learning
- machine learning algorithm
- matching
- maximum entropy
- maximum entropy classifier
- maximum entropy model
- memory-based learning
- modeling
- morphology
- naive bayes
- nlp
- optimization
- parameter optimization
- porter stemmer
- pos tagging
- preprocessing
- processing
- rate reduction
- semantic disambiguation
- sense disambiguation
- sense disambiguation system
- single classifier
- smoothing
- state automaton
- statistical classification
- statistical language processing
- stemmer
- supervised word sense disambiguation
- tagger
- tagging
- voting
- voting scheme
- word sense disambiguation
Other assigned terms:
- ambiguity
- ambiguous word
- ambiguous wordform
- ambiguous words
- approach
- case
- classification accuracy
- classification task
- clusters
- compact representation
- context features
- context size
- context words
- convergence
- corpora
- data set
- data sparseness
- dictionaries
- dictionary
- dictionary entry
- disambiguation system
- distribution
- dutch
- entropy
- entropy models
- error rate
- estimation
- events
- fact
- feature
- feature vectors
- feature weights
- gaussian prior
- hypothesis
- inflected form
- inflected forms
- knowledge
- labeled training data
- lemma
- lexical database
- likelihood
- linguistic
- linguistic features
- linguistic information
- linguistic knowledge
- local context
- maximum entropy models
- meaning
- method
- ontology
- pos information
- pos tag
- probabilities
- probability
- probability distribution
- probability distributions
- procedure
- semantic
- senseval-2 data set
- sentence
- sentences
- sources of information
- statistical model
- stem
- suffix
- tag set
- tags
- technique
- test data
- tokens
- training
- training corpus
- training data
- training material
- uniform distribution
- verb
- word
- word sense
- word senses
- wordform
- wordform model
- wordform-based model
- wordnet
- words
- wsd model