ACL RD-TEC 1.0 Summarization of A00-1024

Paper Title:
CATEGORIZING UNKNOWN WORDS: USING DECISION TREES TO IDENTIFY NAMES AND MISSPELLINGS

Other assigned terms:

  • abbreviation
  • abbreviations
  • approach
  • binary feature
  • call center
  • capitalization information
  • case
  • case information
  • character sequence
  • characters
  • checker
  • concept
  • confidence measure
  • confusion matrix
  • corpora
  • corpus frequency
  • data set
  • data sets
  • determiners
  • dictionary
  • edit distance
  • f-score
  • feature
  • foreign words
  • genre
  • heuristic
  • information sources
  • knowledge
  • language resources
  • leaf
  • lexicon
  • linguistic
  • linguistic resources
  • measure
  • measures
  • modular architecture
  • morphological variant
  • multicomponent architecture
  • names
  • natural language
  • noise
  • nouns
  • orthographic similarity
  • part of speech
  • part-of-speech
  • parts of speech
  • portability
  • pos information
  • precision
  • predictive information
  • procedure
  • process
  • pronouns
  • proper name
  • proper names
  • punctuation
  • recognition module
  • sentences
  • spelling error
  • system architecture
  • tag set
  • tags
  • tagset
  • technique
  • term
  • terms
  • test corpus
  • test data
  • test data set
  • text
  • training
  • training and test data
  • training corpus
  • training data
  • transcript
  • transcripts
  • tree
  • trees
  • word
  • word corpus
  • word level
  • words
  • world knowledge
  • writing system

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***