ACL RD-TEC 1.0 Summarization of W97-0115
Paper Title:
STATISTICAL ACQUISITION OF TERMINOLOGY DICTIONARY
STATISTICAL ACQUISITION OF TERMINOLOGY DICTIONARY
Authors: Huang Xuan-jing and Wu Li-de and Wang Wen-xin
Primarily assigned technology terms:
- automatic extraction
- automatic indexing
- automatic recognition
- categorization
- chi-square test
- chinese information processing
- collocation extraction
- computational linguistics
- computer science
- computing
- document categorization
- identification
- indexing
- information processing
- information retrieval
- langnage processing
- language processing
- likelihood estimation
- matching
- name recognition
- natural language processing
- part of speech tagging
- partial syntactic analysis
- pattern matching
- phrase extraction
- phrase generation
- processing
- random sampling
- recognition
- sampling
- segmentation
- semantic analysis
- speech tagging
- statistical acquisition
- statistical method
- statistical methods
- syntactic analysis
- tagging
- terminology
- terminology phrase extraction
- text processing
- word extraction
Other assigned terms:
- approach
- binomial distribution
- characters
- chinese characters
- chinese text
- chinese word
- co-occurrence
- co-occurrence frequency
- coefficient
- collocation
- compound words
- compounds
- corpora
- correlation
- correlation coefficient
- data sparseness
- dice
- dice coefficient
- dictionaries
- dictionary
- distribution
- document
- domain corpus
- estimation
- experimental results
- fact
- function words
- generation
- heuristic
- heuristic rules
- human intervention
- hypothesis
- implementation
- index
- keyword
- knowledge
- knowledge base
- language processing applications
- likelihood
- linguistics
- meaning
- meanings
- measure
- method
- mutual information
- n-grams
- names
- natural language
- natural language processing applications
- nouns
- null hypothesis
- part of speech
- phrase
- precision
- probability
- procedure
- proper names
- queries
- seed
- seed words
- semantic
- statistic
- sublanguage
- syntactic relations
- technical terminology
- technology
- terminology \
- terminology phrase
- text
- user
- vocabulary
- word
- word frequency
- word pair
- word sequences
- words