ACL RD-TEC 1.0 Summarization of H90-1055
Paper Title:
DEDUCING LINGUISTIC STRUCTURE FROM THE STATISTICS OF LARGE CORPORA
Authors: Eric Brill and David Magerman and Mitchell Marcus and Beatrice Santorini
Primarily assigned technology terms:
- algorithm
- ambiguity resolution
- approximation
- bracketing
- computing
- constituent boundary parsing
- discovery system
- distributional analysis
- feature discovery
- grammatical analysis
- language processing
- lexical ambiguity resolution
- natural language processing
- parser
- parsing
- parsing algorithm
- part-of-speech tagging
- processing
- tagger
- tagging
Other assigned terms:
- ambiguity
- annotation
- annotator
- annotators
- approach
- benchmark
- bigram
- brown corpus
- case
- clusters
- co-occurrence
- constituent boundary
- corpora
- distribution
- english text
- error rate
- fact
- feature
- genre
- gold standard
- grammar
- grammatical category
- grammatical structure
- hypothesis
- implementation
- large corpora
- large corpus
- lexical ambiguity
- lexical items
- likelihood
- linguistic
- linguistic structure
- mapping
- measure
- measures
- method
- mutual information
- n-gram
- natural language
- nouns
- pairs of words
- part of speech
- part of speech tags
- part-of-speech
- part-of-speech annotation
- part-of-speech tag
- part-of-speech tags
- parts of speech
- penn treebank
- penn treebank project
- priori
- pronoun
- pronouns
- relation
- semantic
- sentence
- sentence structure
- sentences
- similarity measure
- speech tag
- statistic
- statistics
- symbol
- syntax
- tag set
- tags
- tagset
- terms
- test corpus
- text
- theoretic measure
- tokens
- transitive closure
- treebank
- treebank project
- word
- word classes
- words