ACL RD-TEC 1.0 Summarization of H91-1068
Paper Title:
FAST TEXT PROCESSING FOR INFORMATION RETRIEVAL
Authors: Tomek Strzalkowski and Barbara Vauthey
Primarily assigned technology terms:
- approximate parsing
- automated indexing
- backtracking
- classification
- clustering
- co-occurrence analysis
- computer science
- document retrieval
- english parser
- extraction procedure
- identification
- indexing
- information retrieval
- learner
- parser
- parsing
- pattern-matching
- processing
- prolog
- quintus prolog
- rule selection
- sentence processing
- statistical analysis
- statistical methods
- stochastic tagger
- syntactic processing
- tagger
- tagging
- term classification
- term clustering
- terminology
- text indexing
- text parser
- text processing
- text processing system
- tokenization
- top-down parser
- word clustering
Other assigned terms:
- adjective
- adjunct
- alphabet
- ambiguity
- approach
- case
- characters
- chinese characters
- co-occurrence
- co-occurrences
- conditional probability
- contemporary english
- context-free grammar
- corpora
- correlation
- determiner
- dictionaries
- dictionary
- distribution
- document
- document collections
- grammar
- grammars
- grammatical relation
- implementation
- information theory
- key-word
- large corpora
- ldoce
- linguistic
- logical structure
- main verb
- meaning
- measure
- measures
- method
- morphological features
- mutual information
- natural language
- noise
- nonterminal
- nonterminals
- noun phrase
- noun phrases
- orthography
- parse
- parse structure
- parse tree
- part of speech
- partial parse
- penn treebank
- penn treebank tagset
- phrase
- phrase attachment
- precision
- predeterminer
- preposition
- probabilities
- probability
- procedure
- process
- pronoun
- proper name
- quantitative information
- queries
- relation
- right-hand side
- semantic
- senses of a word
- sentence
- sentences
- similarity measure
- statistical data
- stem
- structural ambiguity
- structure of the sentence
- style
- subclass
- sublanguage
- symbols
- syntactic category
- syntactic phrase
- syntactic structures
- syntax
- tagged text
- tags
- tagset
- technical terminology
- term
- terms
- text
- theory
- tokens
- tree
- treebank
- verb
- vocabulary
- word
- word co-occurrence
- word corpus
- word frequencies
- word senses
- words