ACL RD-TEC 1.0 Summarization of J94-4003

Paper Title:
WORD SENSE DISAMBIGUATION USING A SECOND LANGUAGE MONOLINGUAL CORPUS

Authors: Ido Dagan and Alon Itai

Primarily assigned technology terms:

Other assigned terms:

  • adjective
  • aligned corpus
  • ambiguity
  • ambiguous word
  • ambiguous words
  • approach
  • association for computational linguistics
  • benchmark
  • bilingual corpora
  • bilingual corpus
  • bilingual lexicon
  • case
  • cluster
  • clusters
  • co-occurrence
  • co-occurrence information
  • co-occurrence statistics
  • co-occurrences
  • coefficient
  • compounds
  • concept
  • concepts
  • conditional probabilities
  • content words
  • context vector
  • context vectors
  • context words
  • corpora
  • corpus size
  • correlation
  • data sets
  • data sparseness
  • data structure
  • decision theory
  • device
  • dictionary
  • disambiguation model
  • discourse
  • distance metric
  • distribution
  • domain-specific terminology
  • electronic form
  • english corpus
  • english sentence
  • english translation
  • estimation
  • evaluation method
  • evaluations
  • events
  • experimental results
  • experimental setting
  • fact
  • finite set
  • formalisms
  • frame
  • french
  • french sentence
  • frequency counts
  • function words
  • generation
  • grammars
  • hansard corpus
  • head noun
  • hebrew corpus
  • heuristics
  • hypotheses
  • hypothesis
  • implementation
  • index
  • information sources
  • interpolation
  • interpretation
  • knowledge
  • language corpora
  • language corpus
  • language information
  • language model
  • language models
  • language pairs
  • language processing tasks
  • large corpora
  • large corpus
  • lexical ambiguity
  • lexical co-occurrence
  • lexical items
  • lexical relation
  • lexical relations
  • lexicon
  • likelihood
  • linguistic
  • linguistic data
  • linguistic information
  • linguistic model
  • linguistic phenomena
  • linguistic structures
  • linguistic theories
  • linguistics
  • local context
  • manual tagging
  • manual translation
  • mapping
  • mappings
  • maps
  • meaning
  • measure
  • method
  • monolingual corpora
  • monolingual corpus
  • multinomial distribution
  • multinomial model
  • n-gram
  • n-gram model
  • n-gram models
  • names
  • natural language
  • natural language processing tasks
  • noise
  • normal distribution
  • nouns
  • null hypothesis
  • opinion
  • paragraphs
  • parallel bilingual corpus
  • parse
  • parse time
  • parsed corpus
  • part of speech
  • parts of speech
  • performance evaluation
  • phrase
  • phrase attachment
  • possible translation
  • pragmatic knowledge
  • precision
  • preposition
  • prepositional phrase
  • prepositional phrase attachment
  • prepositional phrases
  • prepositions
  • priori
  • probabilistic model
  • probabilities
  • probability
  • procedure
  • process
  • processing tasks
  • pronoun
  • psycholinguistic research
  • quantitative information
  • relation
  • relative clauses
  • relative frequency
  • representations
  • research and development
  • right-hand side
  • semantic
  • semantic categories
  • semantic constraint
  • semantic knowledge
  • sense disambiguation model
  • sense distinction
  • sense distinctions
  • senses of a word
  • sentence
  • sentences
  • similarity between words
  • slot
  • slot grammars
  • source language
  • source language sentence
  • source language word
  • source sentence
  • sources of information
  • statistical data
  • statistical decision theory
  • statistical information
  • statistical model
  • statistical models
  • statistical significance
  • statistics
  • subcategorization frame
  • subcategorization frames
  • syntactic behavior
  • syntactic categories
  • syntactic context
  • syntactic function
  • syntactic relation
  • syntactic relations
  • syntactic rules
  • syntactic theory
  • tagged corpus
  • target language
  • target language corpus
  • target language model
  • target languages
  • target word
  • technology
  • term
  • terms
  • test data
  • test set
  • testing set
  • text
  • theories
  • theory
  • thesaurus
  • tokens
  • topics
  • training
  • training corpora
  • training corpus
  • training examples
  • training phase
  • transfer model
  • transformation
  • transformation rules
  • translation probabilities
  • translations
  • understanding
  • untagged corpus
  • verb
  • word
  • word classes
  • word co-occurrence
  • word frequencies
  • word frequency
  • word level
  • word order
  • word sense
  • word senses
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***