ACL RD-TEC 1.0 Summarization of W06-0118

Paper Title:
VOTING BETWEEN DICTIONARY-BASED AND SUBWORD TAGGING MODELS FOR CHINESE WORD SEGMENTATION

Authors: Dong Song and Anoop Sarkar

Other assigned terms:

  • ambiguity
  • approach
  • association for computational linguistics
  • bigram
  • characters
  • chinese language
  • chinese word
  • corpora
  • crf model
  • data set
  • dictionary
  • dictionary entries
  • experimental results
  • external knowledge
  • f-measure
  • f-score
  • feature
  • feature sets
  • gold test set
  • input text
  • knowledge
  • lattice
  • lattices
  • lexicon
  • linguistics
  • mapping
  • meaning
  • method
  • names
  • organization names
  • out-of-vocabulary rate
  • part-of-speech
  • part-of-speech information
  • person names
  • precision
  • procedure
  • process
  • segmentation bakeoff
  • segmentation lattice
  • sentence
  • statistical sequence
  • system description
  • system performance
  • tags
  • test data
  • test set
  • text
  • training
  • training corpora
  • training corpus
  • training data
  • training data set
  • training material
  • training set
  • understanding
  • unigram
  • upuc corpora
  • word
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***