Rare variants in transcript and potential regulatory regions explain a small percentage of the missing heritability of complex traits in cattle

Gonzalez-Recio, Oscar, Daetwyler, Hans D., MacLeod, Iona M., Pryce, Jennie E., Bowman, Phil J., Hayes, Ben J. and Goddard, Michael E. (2015) Rare variants in transcript and potential regulatory regions explain a small percentage of the missing heritability of complex traits in cattle. PLoS One, 10 12: . doi:10.1371/journal.pone.0143945


Author Gonzalez-Recio, Oscar
Daetwyler, Hans D.
MacLeod, Iona M.
Pryce, Jennie E.
Bowman, Phil J.
Hayes, Ben J.
Goddard, Michael E.
Title Rare variants in transcript and potential regulatory regions explain a small percentage of the missing heritability of complex traits in cattle
Journal name PLoS One   Check publisher's open access policy
ISSN 1932-6203
Publication date 2015-12-07
Sub-type Article (original research)
DOI 10.1371/journal.pone.0143945
Open Access Status DOI
Volume 10
Issue 12
Total pages 16
Place of publication San Francisco, CA, United States
Publisher Public Library of Science
Language eng
Formatted abstract
The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”–the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.
Q-Index Code C1
Q-Index Status Provisional Code
Institutional Status Non-UQ
Additional Notes Article number e0143945

Document type: Journal Article
Sub-type: Article (original research)
Collection: Queensland Alliance for Agriculture and Food Innovation
 
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in Thomson Reuters Web of Science Article
Scopus Citation Count Cited 0 times in Scopus Article
Google Scholar Search Google Scholar
Created: Thu, 07 Jul 2016, 12:17:48 EST by Anthony Yeates on behalf of Learning and Research Services (UQ Library)