Web wrapper validation

Pek, E., Li, X. and Liu, Y. (2003). Web wrapper validation. In: X. Zhou, Y. Zhang and M. Orlowska, Proceedings of the 5th Asia-Pacific Web Conference (APWeb 2003). 5th Asia-Pacific Web Conference, Xian, China, (388-393). 23-25 April 2003. doi:10.1007/3-540-36901-5_40


Author Pek, E.; Li, X.; Liu, Y.
Title of paper Web wrapper validation
Conference name 5th Asia-Pacific Web Conference
Conference location Xian, China
Conference dates 23-25 April 2003
Proceedings title Proceedings of the 5th Asia-Pacific Web Conference (APWeb 2003)
Journal name Web Technologies and Applications
Place of Publication Berlin, Germany
Publisher Springer Verlag
Publication Year 2003
Sub-type Fully published paper
DOI 10.1007/3-540-36901-5_40
ISBN 978-3-540-02354-8
ISSN 0302-9743
Editor X. Zhou; Y. Zhang; M. Orlowska
Volume 2642
Start page 388
End page 393
Total pages 6
Collection year 2003
Language eng
Abstract/Summary A web wrapper extracts data from HTML documents. The accuracy and quality of the information a web wrapper extracts depend on the structure of the HTML document. If an HTML document changes, the web wrapper may or may not continue to function correctly. This paper presents an Adjacency-Weight method that can be used in the web wrapper extraction process or in a wrapper self-maintenance mechanism to validate web wrappers. The algorithm and data structures are illustrated with intuitive examples.
Subjects E1
280112 Information Systems Development Methodologies
700103 Information processing services
Keyword Web wrapper; Wrapper validation
Q-Index Code E1

 
Citation counts: Cited 2 times in Thomson Reuters Web of Science
Created: Fri, 24 Aug 2007, 10:21:37 EST