Yan, Xin, Li, Xue and Song, Daqei (2004). A Correlation Analysis on LSA and HAL Semantic Space Models. In: Y. Fu, J. Han and J. Zhang, Computational and Information Science. First International Symposium, CIS 2004, Shanghai, China, (711-717). 16-18 December, 2004.

Author Yan, Xin
Li, Xue
Song, Daqei
Title of paper A Correlation Analysis on LSA and HAL Semantic Space Models
Conference name First International Symposium, CIS 2004
Conference location Shanghai, China
Conference dates 16-18 December, 2004
Proceedings title Computational and Information Science
Journal name Computational and Information Science, Proceedings
Place of Publication Berlin
Publisher Springer-Verlag
Publication Year 2004
Sub-type Fully published paper
DOI 10.1007/b104566
ISBN 3-540-24127-2
ISSN 0302-9743
Editor Y. Fu
J. Han
J. Zhang
Volume 3314/2005
Start page 711
End page 717
Total pages 7
Collection year 2004
Language eng
Abstract/Summary In this paper, we compare a well-known semantic space model, Latent Semantic Analysis (LSA), with another model, Hyperspace Analogue to Language (HAL), which is widely used in different areas, especially in automatic query refinement. We conduct this comparative analysis to prove our hypothesis that, with respect to the ability to extract lexical information from a corpus of text, LSA is quite similar to HAL. We regard HAL and LSA as black boxes. Through a Pearson's correlation analysis of the outputs of these two black boxes, we conclude that LSA correlates highly with HAL, and thus there is a justification that LSA and HAL can potentially play a similar role in facilitating automatic query refinement. This paper evaluates LSA in a new application area and contributes an effective way to compare different semantic space models.
Subjects E1
280103 Information Storage, Retrieval and Management
700100 Computer Software and Services
Keyword Correlation Analysis
Hyperspace Analogue to Language
Latent Semantic Indexing
Automatic Query Refinement
Q-Index Code E1

