ACL RD-TEC 1.0 Summarization of C02-1096
Paper Title:
WORDFORM- AND CLASS-BASED PREDICTION OF THE COMPONENTS OF GERMAN NOMINAL COMPOUNDS IN AN AAC SYSTEM
WORDFORM- AND CLASS-BASED PREDICTION OF THE COMPONENTS OF GERMAN NOMINAL COMPOUNDS IN AN AAC SYSTEM
Authors: Marco Baroni and Johannes Matiasek and Harald Trost
Primarily assigned technology terms:
- aac word prediction
- algorithm
- analyzer
- bigram training
- clustering
- clustering algorithm
- compound prediction
- grouping
- hardware
- language modeling
- language processing
- linking
- modeling
- morphological analyzer
- n-gram language modeling
- natural language processing
- noun prediction
- prediction system
- predictor
- processing
- splitting
- word prediction
- word-formation
Other assigned terms:
- apa corpus
- approach
- baseline model
- bigram
- characters
- class-based model
- cluster
- co-occurrences
- compounding
- compounds
- corpus frequency
- correlation
- culture
- data sparseness
- dutch
- fact
- hapax legomena
- interpolation
- labeling
- language processing applications
- lexical association
- lexical unit
- lexicon
- measure
- measures
- modifier
- mutual information
- n-gram
- names
- natural language
- natural language processing applications
- nouns
- pairs of words
- parse
- perplexity
- polysemy
- pos bigram
- prediction model
- prediction task
- probabilities
- probability
- procedure
- proper names
- semantic
- semantic classes
- statistics
- suffix
- suffixes
- symbol
- syntactic properties
- target word
- term
- terms
- test corpus
- test set
- text
- token frequency
- tokens
- toolkit
- training
- training corpus
- unigram
- user
- vocabulary
- word
- word types
- wordform
- words