A data structure for representing multi-version texts online

Schmidt, Desmond and Colomb, Robert (2009) A data structure for representing multi-version texts online. International Journal of Human Computer Studies, 67(6): 497-514. doi:10.1016/j.ijhcs.2009.02.001


Author Schmidt, Desmond
Colomb, Robert
Title A data structure for representing multi-version texts online
Journal name International Journal of Human Computer Studies
ISSN 1071-5819
1095-9300
Publication date 2009-06-01
Sub-type Critical review of research, literature review, critical commentary
DOI 10.1016/j.ijhcs.2009.02.001
Volume 67
Issue 6
Start page 497
End page 514
Total pages 18
Place of publication London, United Kingdom
Publisher Academic Press
Language eng
Abstract The digitisation of cultural heritage and linguistics texts has long been troubled by the problem of how to represent overlapping structures arising from different markup perspectives ('overlapping hierarchies') or from different versions of the same work ('textual variation'). These two problems can be reduced to one by observing that every case of overlapping hierarchies is also a case of textual variation. Overlapping textual structures can be accurately modelled either as a minimally redundant directed graph, or, more practically, as an ordered list of pairs, each containing a set of versions and a fragment of text or data. This 'pairs-list' representation is provably equivalent to the graph representation. It can record texts consisting of thousands of versions or perspectives without becoming overloaded with data, and the most common operations on variant text, e.g. comparison between two versions, can be performed in linear time. This representation also separates variation or other overlapping structures from the document content, leading to a simplification of markup suitable for wiki-like web applications.
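The 'pairs-list' described in the abstract can be illustrated with a minimal sketch. It assumes each pair is a (set of version names, text fragment) tuple held in an ordered Python list. The names `Pair`, `read_version`, `compare`, and the sample data are hypothetical and chosen for illustration only; this is not the authors' implementation.

```python
from typing import FrozenSet, List, Tuple

# Hypothetical pair type: the set of versions sharing a fragment, plus the fragment.
Pair = Tuple[FrozenSet[str], str]

# Illustrative two-version text: "A" and "B" differ in a single word.
pairs: List[Pair] = [
    (frozenset({"A", "B"}), "The quick "),
    (frozenset({"A"}), "brown"),
    (frozenset({"B"}), "red"),
    (frozenset({"A", "B"}), " fox"),
]


def read_version(pairs: List[Pair], version: str) -> str:
    """Rebuild one version by concatenating, in order, the fragments it belongs to."""
    return "".join(text for versions, text in pairs if version in versions)


def compare(pairs: List[Pair], a: str, b: str) -> Tuple[List[str], List[str], List[str]]:
    """Classify fragments as shared, only in a, or only in b, in one pass over the list."""
    shared: List[str] = []
    only_a: List[str] = []
    only_b: List[str] = []
    for versions, text in pairs:
        in_a, in_b = a in versions, b in versions
        if in_a and in_b:
            shared.append(text)
        elif in_a:
            only_a.append(text)
        elif in_b:
            only_b.append(text)
        # Fragments belonging to neither version are skipped.
    return shared, only_a, only_b


print(read_version(pairs, "A"))
print(read_version(pairs, "B"))
print(compare(pairs, "A", "B"))
```

In this sketch both operations visit each pair once, so their cost grows linearly with the length of the list, which is in the spirit of the linear-time comparison the abstract mentions. Text shared by several versions is stored only once, in a single pair.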
Keyword Electronic editions
Markup
Overlapping hierarchies
Textual variation
Q-Index Code C1
Q-Index Status Provisional Code
Institutional Status UQ

Document type: Journal Article
Collection: School of Information Technology and Electrical Engineering Publications
 
Citation counts: Thomson Reuters Web of Science: cited 13 times
Scopus: cited 19 times
Created: Wed, 27 Nov 2013, 14:42:54 EST by System User on behalf of School of Information Technol and Elec Engineering