Compression distance can discriminate animals by genetic profile, build relationship matrices and estimate breeding values

Hudson, Nicholas J., Porto-Neto, Laercio, Kijas, James W. and Reverter, Antonio (2015) Compression distance can discriminate animals by genetic profile, build relationship matrices and estimate breeding values. Genetics Selection Evolution, 47 1: . doi:10.1186/s12711-015-0158-9


Author Hudson, Nicholas J.
Porto-Neto, Laercio
Kijas, James W.
Reverter, Antonio
Title Compression distance can discriminate animals by genetic profile, build relationship matrices and estimate breeding values
Journal name Genetics Selection Evolution   Check publisher's open access policy
ISSN 1297-9686
Publication date 2015-10-13
Sub-type Article (original research)
DOI 10.1186/s12711-015-0158-9
Open Access Status DOI
Volume 47
Issue 1
Total pages 14
Place of publication London, United Kingdom
Publisher BioMed Central
Collection year 2016
Language eng
Formatted abstract
Background
Genetic relatedness is currently estimated by a combination of traditional pedigree-based approaches (i.e. numerator relationship matrices, NRM) and, given the recent availability of molecular information, using marker genotypes (via genomic relationship matrices, GRM). To date, GRM are computed by genome-wide pair-wise SNP (single nucleotide polymorphism) correlations.

Results
We describe a new estimate of genetic relatedness using the concept of normalised compression distance (NCD) that is borrowed from Information Theory. Analogous to GRM, the resultant compression relationship matrix (CRM) exploits numerical patterns in genome-wide allele order and proportion, which are known to vary systematically with relatedness. We explored properties of the CRM in two industry cattle datasets by analysing the genetic basis of yearling weight, a phenotype of moderate heritability. In both Brahman (Bos indicus) and Tropical Composite (Bos taurus by Bos indicus) populations, the clustering inferred by NCD was comparable to that based on SNP correlations using standard principal component analysis approaches. One of the versions of the CRM modestly increased the amount of explained genetic variance, slightly reduced the ‘missing heritability’ and tended to improve the prediction accuracy of breeding values in both populations when compared to both NRM and GRM. Finally, a sliding window-based application of the compression approach on these populations identified genomic regions influenced by introgression of taurine haplotypes.

Conclusions
For these two bovine populations, CRM reduced the missing heritability and increased the amount of explained genetic variation for a moderately heritable complex trait. Given that NCD can sensitively discriminate closely related individuals, we foresee CRM having possible value for estimating breeding values in highly inbred populations.
Q-Index Code C1
Q-Index Status Provisional Code
Institutional Status Non-UQ

Document type: Journal Article
Sub-type: Article (original research)
Collections: Non HERDC
School of Agriculture and Food Sciences
School of Chemistry and Molecular Biosciences
 
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in Thomson Reuters Web of Science Article
Scopus Citation Count Cited 0 times in Scopus Article
Google Scholar Search Google Scholar
Created: Mon, 16 May 2016, 13:32:34 EST by System User on behalf of Learning and Research Services (UQ Library)