ACL RD-TEC 1.0 Summarization of P05-1077

Paper Title:
RANDOMIZED ALGORITHMS AND NLP: USING LOCALITY SENSITIVE HASH FUNCTIONS FOR HIGH SPEED NOUN CLUSTERING

Authors: Deepak Ravichandran and Patrick Pantel and Eduard Hovy

Other assigned terms:

  • approach
  • association for computational linguistics
  • beam
  • case
  • cluster
  • co-occurrence
  • co-occurrence statistics
  • community
  • context window
  • context words
  • corpora
  • cosine distance
  • cosine similarity
  • data set
  • dimensionality
  • disk
  • distribution
  • document
  • document set
  • error rate
  • feature
  • feature set
  • feature vectors
  • gaussian distribution
  • generation
  • gold standard
  • gold standard test
  • grammatical features
  • hamming distance
  • implementation
  • index
  • inferences
  • knowledge
  • linear time
  • linguistics
  • meaning
  • measure
  • method
  • mutual information
  • newspaper corpus
  • nlp community
  • noun phrase
  • noun similarity
  • nouns
  • parse
  • permutation
  • phrase
  • pointwise mutual information
  • probability
  • process
  • processing time
  • running time
  • search parameter
  • search time
  • seed
  • semantic
  • sentence
  • similarity matrix
  • similarity scores
  • statistics
  • tags
  • technique
  • test collection
  • test set
  • text
  • theorem
  • theory
  • time complexity
  • transformation
  • vector space
  • web corpus
  • web pages
  • word
  • word sense
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***