ACL RD-TEC 1.0 Summarization of P06-3002
Paper Title:
UNSUPERVISED PART-OF-SPEECH TAGGING EMPLOYING EFFICIENT GRAPH CLUSTERING
UNSUPERVISED PART-OF-SPEECH TAGGING EMPLOYING EFFICIENT GRAPH CLUSTERING
Primarily assigned technology terms:
- algorithm
- capitalization
- classifier
- classifiers
- clustering
- clustering algorithm
- computational linguistics
- computing
- graph clustering
- graph construction
- graph representation
- graph-clustering
- hidden markov
- hidden markov model
- lexicon construction
- markov model
- measuring
- morphology
- nlp
- part-of-speech tagging
- partitioning
- pos tagger
- pos tagging
- pos-tagging
- re-estimation
- tagger
- taggers
- tagging
- tagging system
- unsupervised method
- unsupervised tagging
- unsupervised tagging system
- viterbi
Other assigned terms:
- affix
- ambiguity
- ambiguous words
- approach
- association for computational linguistics
- case
- cluster
- clusters
- co-occurrence
- computational complexity
- context similarity
- corpora
- cosine similarity
- distribution
- distributional similarity
- english corpus
- entropy
- evaluation methodology
- feature
- feature vectors
- frequency counts
- gold standard
- implementation
- knowledge
- large corpora
- lexicon
- linguistics
- log-likelihood
- measure
- method
- methodology
- morphological component
- morphological features
- mutual information
- nlp applications
- nouns
- part-of-speech
- penn treebank
- perplexity
- pos information
- prefixes and suffixes
- prepositions
- probabilities
- probability
- procedure
- semantic
- semantic class
- sentences
- similarity scores
- similarity threshold
- statistics
- suffix
- suffixes
- syntactic categories
- syntactic category
- tag perplexity
- tagger model
- tagging performance
- tags
- tagset
- target word
- term
- text
- tokens
- training
- training material
- transition probability
- treebank
- trigram
- trigram model
- undirected graph
- unlabeled corpus
- web corpus
- word
- word classes
- words