ACL RD-TEC 1.0 Summarization of W06-0121

Paper Title:
CHINESE WORD SEGMENTATION WITH MAXIMUM ENTROPY AND N-GRAM LANGUAGE MODEL

Authors: Xinhao Wang, Xiaojun Lin, Dianhai Yu, Hao Tian, and Xihong Wu

Other assigned terms:

  • ambiguity
  • approach
  • association for computational linguistics
  • backoff
  • bigram
  • bigram language model
  • bigram model
  • characters
  • chinese language
  • chinese word
  • classification task
  • corpora
  • dictionaries
  • dictionary
  • entropy
  • language model
  • lexical tree
  • linguistics
  • method
  • n-gram
  • n-gram language model
  • names
  • natural language
  • organization names
  • process
  • processing strategy
  • punctuation
  • relation
  • segmentation bakeoff
  • sentence
  • substring
  • subtree
  • suffix
  • suffixes
  • system description
  • test data
  • test set
  • toolkit
  • training
  • training data
  • training set
  • tree
  • unigram
  • vocabulary
  • word
  • word boundaries
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
