ACL RD-TEC 1.0 Summarization of J04-3001
Paper Title:
SAMPLE SELECTION FOR STATISTICAL PARSING
SAMPLE SELECTION FOR STATISTICAL PARSING
Primarily assigned technology terms:
- active learning
- algorithm
- approximation
- bagging
- batch selection
- binary classification
- binary classifier
- boosting
- bracketing
- categorization
- chunking
- classification
- classification process
- classifier
- classifiers
- co-training
- collins parser
- committee-based selection
- computational linguistics
- computer science
- computing
- cross-validation
- cutoff
- data reduction
- deterministic shift-reduce parser
- disambiguation
- dynamic programming
- dynamic programming technique
- em algorithm
- expectation-maximization
- expectation-maximization-based induction
- heuristic search
- history-based learning
- identification
- induction
- induction algorithm
- interactive learning
- iterative procedure
- iterative reestimation
- language processing
- learner
- learning
- learning algorithm
- learning algorithms
- learning framework
- learning method
- learning system
- learning technique
- learning techniques
- machine translation
- measuring
- natural language processing
- noun identification
- noun phrase chunking
- parser
- parser induction
- parsers
- parsing
- part-of-speech tagging
- phrase chunking
- predictor
- processing
- programming technique
- random sampling
- random selection
- reestimation
- reranking
- sample selection
- sampling
- scoring
- search
- selection algorithm
- selection learning
- selection process
- semantic parsing
- sense disambiguation
- set disambiguation
- shift-reduce parser
- spelling
- statistical parsers
- statistical parsing
- supervised learning
- supervised training
- syntactic analysis
- syntactic learning
- syntactic parsing
- table lookup
- tagging
- terminology
- text categorization
- unsupervised learning
- word sense disambiguation
Other assigned terms:
- 10-fold cross-validation
- ambiguity
- anchors
- annotated corpora
- annotated training corpora
- annotated training set
- annotation
- annotator
- annotators
- approach
- association for computational linguistics
- back-off model
- backoff
- base noun
- base noun phrase
- bias
- case
- chomsky normal form
- classification error
- classification task
- classification tasks
- clusters
- co-occurrence
- co-occurrence statistics
- co-occurrences
- complex sentence
- computational models
- concept
- corpora
- correlation
- density function
- derivation
- development set
- distribution
- entropy
- estimation
- evaluation function
- evaluation strategy
- f-score
- frequency counts
- generation
- gold standard
- grammar
- grammar rules
- grammars
- heuristic
- human annotator
- human involvement
- hypotheses
- hypothesis
- information theory
- knowledge
- kullback-leibler distance
- labeling
- leaf
- learning problem
- learning rate
- lexical items
- likelihood
- linguistic
- linguistics
- mapping
- measure
- measures
- method
- model parameters
- model performance
- natural language
- nonterminal
- normal form
- noun phrase
- nouns
- parse
- parse tree
- parser performance
- parsing accuracy
- parsing model
- parsing models
- parsing problem
- part-of-speech
- part-of-speech tags
- pcfg
- penn treebank
- phrase
- pp-attachment
- pp-attachment ambiguity
- ppattachment
- precision
- preposition
- prepositional phrase
- prepositional phrases
- prepositional-phrase attachment
- probabilities
- probability
- probability distribution
- probability distributions
- procedure
- process
- processing time
- scoring scheme
- search space
- seed
- semantic
- semantic representation
- sentence
- sentences
- standard deviation
- statistic
- statistical parsing model
- statistical significance
- statistics
- stems
- substring
- subtrees
- syntactic structure
- tags
- technique
- terms
- test data
- test set
- text
- theory
- training
- training corpora
- training corpus
- training data
- training example
- training examples
- training set
- training size
- tree
- treebank
- trees
- uniform distribution
- verb
- vocabulary
- word
- word pair
- word sense
- words
- wsj treebank