ACL RD-TEC 1.0 Summarization of P05-1077
Paper Title:
RANDOMIZED ALGORITHMS AND NLP: USING LOCALITY SENSITIVE HASH FUNCTIONS FOR HIGH SPEED NOUN CLUSTERING
RANDOMIZED ALGORITHMS AND NLP: USING LOCALITY SENSITIVE HASH FUNCTIONS FOR HIGH SPEED NOUN CLUSTERING
Authors: Deepak Ravichandran and Patrick Pantel and Eduard Hovy
Primarily assigned technology terms:
- algorithm
- beam search
- clustering
- computational linguistics
- computing
- dependency parser
- dimensionality reduction
- disambiguation
- distance calculation
- distance search
- fast search
- feature representation
- hamming distance algorithm
- high performance computing
- indexing
- internet
- language analysis
- linking
- mining
- minipar
- nearest neighbors
- nlp
- noun clustering
- parser
- parsing
- processing
- processor
- question answering
- randomization
- randomized algorithms
- reverse indexing
- sampling
- search
- search algorithm
- searching
- sense disambiguation
- similarity calculation
- word sense disambiguation
Other assigned terms:
- approach
- association for computational linguistics
- beam
- case
- cluster
- co-occurrence
- co-occurrence statistics
- community
- context window
- context words
- corpora
- cosine distance
- cosine similarity
- data set
- dimensionality
- disk
- distribution
- document
- document set
- error rate
- feature
- feature set
- feature vectors
- gaussian distribution
- generation
- gold standard
- gold standard test
- grammatical features
- hamming distance
- implementation
- index
- inferences
- knowledge
- linear time
- linguistics
- meaning
- measure
- method
- mutual information
- newspaper corpus
- nlp community
- noun phrase
- noun similarity
- nouns
- parse
- permutation
- phrase
- pointwise mutual information
- probability
- process
- processing time
- running time
- search parameter
- search time
- seed
- semantic
- sentence
- similarity matrix
- similarity scores
- statistics
- tags
- technique
- test collection
- test set
- text
- theorem
- theory
- time complexity
- transformation
- vector space
- web corpus
- web pages
- word
- word sense
- words