ACL RD-TEC 1.0 Summarization of W01-1605
Paper Title:
BUILDING A DISCOURSE-TAGGED CORPUS IN THE FRAMEWORK OF RHETORICAL STRUCTURE THEORY
Authors: Lynn Carlson and Daniel Marcu and Mary Ellen Okurowski
Primarily assigned technology terms:
- bracketing
- computing
- content analysis
- data annotation
- discourse annotation
- discourse annotation process
- discourse parser
- document analysis
- hands-on training
- identification
- information retrieval
- language analysis
- language engineering
- language generation
- language processing
- learning
- linguistic analysis
- linking
- machine translation
- measuring
- natural language generation
- natural language processing
- parser
- processing
- reporting
- segmentation
- summarization
- summarization systems
- syntactic analysis
- syntactic bracketing
- taggers
- tagging
- tagging process
- text segmentation
- text summarization
- tree traversal
- validation
- validation process
- visualization
Other assigned terms:
- annotated corpus
- annotation
- annotation process
- annotation task
- annotator
- annotators
- approach
- binary features
- chunks
- clusters
- coefficient
- cognitive
- community-wide use
- concept
- corpora
- data consortium
- discourse
- discourse structure
- discourse tree
- discourse unit
- discourse units
- document
- document structure
- elementary discourse unit
- frame
- generation
- heuristics
- hierarchical structure
- human judgment
- hypotheses
- inter-annotator agreement
- kappa
- kappa coefficient
- knowledge
- linguistic
- linguistic annotation
- linguistic data
- linguistic data consortium
- linguistic phenomena
- main verb
- maps
- meaning
- measure
- measures
- method
- methodology
- natural language
- nucleus
- penn treebank
- process
- quality assurance
- relation
- representations
- rhetorical information
- rhetorical relation
- rhetorical relations
- rhetorical structure
- rhetorical structure theory
- root node
- segments
- semantic
- semantic content
- sentence
- sentence level
- sentences
- statistical models
- statistics
- style
- sub-tree
- syntactic form
- syntax
- tagged corpus
- technologies
- term
- terms
- text
- text segments
- theoretical framework
- theory
- topics
- training
- tree
- tree structure
- treebank
- trees
- verb
- web site
- words