ACL RD-TEC 1.0 Summarization of W04-1119

Paper Title:
A SEMI-SUPERVISED APPROACH TO BUILD ANNOTATED CORPUS FOR CHINESE NAMED ENTITY RECOGNITION

Authors: Xiaoshan Fang and Jianfeng Gao and Huanye Sheng

Other assigned terms:

  • abbreviation
  • annotated corpus
  • annotated training corpora
  • annotation
  • approach
  • backoff
  • characters
  • chinese characters
  • chinese corpus
  • chinese language
  • chinese sentence
  • chinese text
  • chinese word
  • chinese words
  • context model
  • context models
  • corpora
  • data sets
  • data sparseness
  • dictionary
  • estimation
  • evaluations
  • f-measure
  • fact
  • generation
  • generative probability
  • interpolation
  • knowledge
  • lexicon
  • likelihood
  • linguistic
  • linguistic knowledge
  • location name
  • markov models
  • method
  • named entities
  • named entity
  • names
  • parametric model
  • part-of-speech
  • part-of-speech tags
  • person names
  • precision
  • probabilities
  • probability
  • process
  • schema
  • seed
  • semi-supervised approach
  • sentence
  • sparse data
  • sparse data problem
  • syntactic structure
  • tags
  • terms
  • test set
  • text
  • text corpus
  • training
  • training corpora
  • training corpus
  • training data
  • trigram
  • trigram model
  • word
  • word boundaries
  • word sequence
  • word type
  • word types
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***