ACL RD-TEC 1.0 Summarization of W04-3202
Paper Title:
ACTIVE LEARNING AND THE TOTAL COST OF ANNOTATION
ACTIVE LEARNING AND THE TOTAL COST OF ANNOTATION
Authors: Jason Baldridge and Miles Osborne
Primarily assigned technology terms:
- 10-fold crossvalidation
- active learning
- algorithm
- approximation
- co-training
- corpus building
- crossvalidation
- hpsg parse
- hpsg parse selection
- human language
- identification
- inner product
- language processing
- learner
- learning
- learning algorithm
- machine learning
- machine learning algorithm
- measuring
- modeling
- natural language processing
- nlp
- normalization
- parse selection
- parser
- parsers
- perceptron
- processing
- random sampling
- random selection
- sampling
- sequential sampling
- treebanking
- uncertainty sampling
Other assigned terms:
- ambiguity
- annotation
- annotator
- annotators
- appointment scheduling
- approach
- bias
- case
- conditional probability
- correlation
- dependency graphs
- derivation
- derivation trees
- derivations
- distribution
- entropy
- estimation
- exact match
- fact
- feature
- feature set
- feature sets
- feature vector
- forest
- grammar
- grammars
- hpsg
- hpsg grammar
- human annotator
- information gain
- labeling
- log-linear model
- log-linear models
- measure
- measures
- method
- natural language
- ngram
- noise
- normalization factor
- parameter settings
- parse
- parse forest
- parse selection model
- phrase
- phrase structure
- probabilities
- probability
- rank correlation
- redwoods treebank
- seed
- selection accuracy
- selection model
- semantic
- sentence
- sentences
- technologies
- terms
- test set
- training
- training data
- training material
- training set
- tree
- treebank
- treebank project
- trees
- words