Efficient profiling for estimation of query result quality
Yeganeh, Naiem K., Sadiq, Shazia, Sharaf, Mohamed A. and Deng, Ke (2011). Efficient profiling for estimation of query result quality. In: Proceedings of the 16th International Conference on Information Quality (ICIQ-11). 16th International Conference on Information Quality (ICIQ-11), Adelaide, Australia, (415-426). 18-20 November 2011.
(Some files may be inaccessible until you login with your UQ eSpace credentials)
The issue of Data Quality (DQ) is of increasing importance as individuals as well as corporations are relying on multiple, often external sources of data to make decisions. Data quality profiles consist of statistical measurements about the quality of data sets. Query systems can use DQ profiles as a form of metadata to estimate the quality of a query result set. Traditional DQ profiling provides an estimate on the overall quality of a data set or data source, but quality of a query result can be remarkably different from the overall quality of the data set because conditions within the query typically select a subset of the data. In this paper we propose an efficient conditional DQ profiling method which can estimate the quality of a result set for a given query with guaranteed user definable level of accuracy.