ACL RD-TEC 1.0 Summarization of P04-1057
Paper Title:
ERROR MINING FOR WIDE-COVERAGE GRAMMAR ENGINEERING
ERROR MINING FOR WIDE-COVERAGE GRAMMAR ENGINEERING
Primarily assigned technology terms:
- automaton
- binomial test
- cd-rom
- computing
- disambiguation
- efficient parsing
- electronic dictionary
- error mining
- feature selection
- finite automata
- grammar engineering
- linguistic processing
- maximum entropy
- maximum entropy model
- mining
- nominalization
- parser
- parsing
- processing
- reading
- spelling
- statistical feature selection
- tokenization
- tokenizer
Other assigned terms:
- adjective
- approach
- array
- automata
- break
- case
- characters
- cluster
- corpora
- corpus size
- data structure
- dependency relations
- dictionaries
- dictionary
- distribution
- dutch
- ellipsis
- entropy
- events
- fact
- feature
- finite automaton
- finite set
- frame
- french
- frequency cut-off
- genitive marker
- gold standard
- grammar
- grammar rules
- grammars
- heuristic
- heuristics
- hpsg
- idiom
- idiomatic expression
- idiomatic expressions
- implementation
- inflection
- large corpora
- large corpus
- lexical entries
- lexicon
- linguistic
- mapping
- maps
- meaning
- method
- modifier
- n-gram
- n-grams
- named entities
- nouns
- parse
- parsing accuracy
- part-of-speech
- part-of-speech tag
- phrasal verb
- phrase
- pp modifier
- preposition
- priori
- probability
- procedure
- proper name
- punctuation
- punctuation marks
- runtime
- sentence
- sentence boundary
- sentences
- sentential subject
- statistical model
- subcorpus
- suffix
- suffixes
- syntactic construction
- syntactic constructions
- syntactic feature
- syntax
- technique
- television
- terms
- text
- tokens
- tree-bank
- trees
- twente nieuws corpus
- unannotated corpora
- valency
- verb
- word
- word order
- word sequence
- word sequences
- words