ACL RD-TEC 1.0 Summarization of C02-1096

Paper Title:
WORDFORM- AND CLASS-BASED PREDICTION OF THE COMPONENTS OF GERMAN NOMINAL COMPOUNDS IN AN AAC SYSTEM

Authors: Marco Baroni and Johannes Matiasek and Harald Trost

Other assigned terms:

  • apa corpus
  • approach
  • baseline model
  • bigram
  • characters
  • class-based model
  • cluster
  • co-occurrences
  • compounding
  • compounds
  • corpus frequency
  • correlation
  • culture
  • data sparseness
  • dutch
  • fact
  • hapax legomena
  • interpolation
  • labeling
  • language processing applications
  • lexical association
  • lexical unit
  • lexicon
  • measure
  • measures
  • modifier
  • mutual information
  • n-gram
  • names
  • natural language
  • natural language processing applications
  • nouns
  • pairs of words
  • parse
  • perplexity
  • polysemy
  • pos bigram
  • prediction model
  • prediction task
  • probabilities
  • probability
  • procedure
  • proper names
  • semantic
  • semantic classes
  • statistics
  • suffix
  • suffixes
  • symbol
  • syntactic properties
  • target word
  • term
  • terms
  • test corpus
  • test set
  • text
  • token frequency
  • tokens
  • toolkit
  • training
  • training corpus
  • unigram
  • user
  • vocabulary
  • word
  • word types
  • wordform
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***