Bounded coordinate system indexing for real-time video clip search

Huang, Zi, Shen, Heng Tao, Shao, Jie, Zhou, Xiaofang and Cui, Bin (2009) Bounded coordinate system indexing for real-time video clip search. ACM Transactions on Information Systems, 27 3: 17.1-17.33. doi:10.1145/1508850.1508855

Attached Files (Some files may be inaccessible until you login with your UQ eSpace credentials)
Name Description MIMEType Size Downloads

Author Huang, Zi
Shen, Heng Tao
Shao, Jie
Zhou, Xiaofang
Cui, Bin
Title Bounded coordinate system indexing for real-time video clip search
Journal name ACM Transactions on Information Systems   Check publisher's open access policy
ISSN 1046-8188
1558-2868
Publication date 2009-05-01
Sub-type Article (original research)
DOI 10.1145/1508850.1508855
Volume 27
Issue 3
Start page 17.1
End page 17.33
Total pages 33
Editor Jamie Callan
Place of publication New York, NY, United States
Publisher Association for Computing Machinery
Collection year 2010
Language eng
Subject 080699 Information Systems not elsewhere classified
890399 Information Services not elsewhere classified
C1
Abstract Recently, video clips have become very popular online. The massive influx of video clips has created an urgent need for video search engines to facilitate retrieving relevant clips. Different from traditional long videos, a video clip is a short video often expressing a moment of significance. Due to the high complexity of video data, efficient video clip search from large databases turns out to be very challenging. We propose a novel video clip representation model called the Bounded Coordinate System (BCS), which is the first single representative capturing the dominating content and content—changing trends of a video clip. It summarizes a video clip by a coordinate system, where each of its coordinate axes is identified by principal component analysis (PCA) and bounded by the range of data projections along the axis. The similarity measure of BCS considers the operations of translation, rotation, and scaling for coordinate system matching. Particularly, rotation and scaling reflect the difference of content tendencies. Compared with the quadratic time complexity of existing methods, the time complexity of measuring BCS similarity is linear. The compact video representation together with its linear similarity measure makes real-time search from video clip collections feasible. To further improve the retrieval efficiency for large video databases, a two-dimensional transformation method called Bidistance Transformation (BDT) is introduced to utilize a pair of optimal reference points with respect to bidirectional axes in BCS. Our extensive performance study on a large database of more than 30,000 video clips demonstrates that BCS achieves very high search accuracy according to human judgment. This indicates that content tendencies are important in determining the meanings of video clips and confirms that BCS can capture the inherent moment of video clip to some extent that better resembles human perception. In addition, BDT outperforms existing indexing methods greatly. Integration of the BCS model and BDT indexing can achieve real-time search from large video clip databases.
Keyword Design
Compact signatures
Q-Index Code C1
Q-Index Status Confirmed Code
Institutional Status UQ
Additional Notes Article number 17

 
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 18 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 40 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Thu, 03 Sep 2009, 17:56:44 EST by Mr Andrew Martlew on behalf of School of Information Technol and Elec Engineering