ACL RD-TEC 1.0 Summarization of M91-1035
Paper Title:
DATA EXTRACTION AS TEXT CATEGORIZATION: AN EXPERIMENT WITH THE MUC-3 CORPUS
Primarily assigned technology terms:
- algorithm
- bayesian classification
- binary categorization
- categorization
- category assignment
- classification
- data extraction
- estimation method
- extraction system
- extraction systems
- feature selection
- feature selection algorithm
- indexing
- language processing
- learning
- learning algorithms
- learning technique
- machine learning
- machine learning algorithms
- modeling
- nlp
- nlp systems
- pattern recognition
- predictor
- processing
- reasoning
- recognition
- regression
- reporting
- retrieval systems
- scoring
- scoring program
- screening
- selection algorithm
- slot filling
- statistical categorization
- statistical estimation
- statistical methods
- template generation
- text categorization
- text extraction
- text processing
- text retrieval
Other assigned terms:
- break
- case
- characters
- composition
- concept
- concepts
- conditional probability
- data set
- dimensionality
- distribution
- document
- document content
- domain knowledge
- estimation
- evaluation measures
- events
- extraction process
- fact
- feature
- feature set
- feature sets
- feature vectors
- finite set
- function words
- generation
- human intervention
- information gain
- information measure
- information theory
- knowledge
- knowledge base
- linguistic
- mapping
- mappings
- measure
- measures
- message
- method
- muc corpus
- muc-3
- muc-3 corpus
- muc-3 testset
- mutual information
- names
- noise
- parameter settings
- precision
- prior probability
- probabilities
- probability
- probability estimates
- procedure
- process
- punctuation
- relation
- semantic
- slot
- statistical model
- substring
- technique
- term
- text
- theory
- training
- training corpus
- training data
- training documents
- training set
- typographical errors
- understanding
- word
- word features
- word types
- words