LASSO with cross-validation for genomic selection

Usai, M. Graziano, Goddard, Mike E. and Hayes, Ben J. (2009) LASSO with cross-validation for genomic selection. Genetics Research, 91 6: 427-436. doi:10.1017/S0016672309990334

Author Usai, M. Graziano
Goddard, Mike E.
Hayes, Ben J.
Title LASSO with cross-validation for genomic selection
Journal name Genetics Research   Check publisher's open access policy
ISSN 0016-6723
Publication date 2009-12
Sub-type Article (original research)
DOI 10.1017/S0016672309990334
Open Access Status Not yet assessed
Volume 91
Issue 6
Start page 427
End page 436
Total pages 10
Place of publication Cambridge, United Kingdom
Publisher Cambridge University Press
Language eng
Abstract We used a least absolute shrinkage and selection operator (LASSO) approach to estimate marker effects for genomic selection. The least angle regression (LARS) algorithm and cross-validation were used to define the best subset of markers to include in the model. The LASSO-LARS approach was tested on two data sets: a simulated data set with 5865 individuals and 6000 Single Nucleotide Polymorphisms (SNPs); and a mouse data set with 1885 individuals genotyped for 10 656 SNPs and phenotyped for a number of quantitative traits. In the simulated data, three approaches were used to split the reference population into training and validation subsets for cross-validation: random splitting across the whole population; random sampling of validation set from the last generation only, either within or across families. The highest accuracy was obtained by random splitting across the whole population. The accuracy of genomic estimated breeding values (GEBVs) in the candidate population obtained by LASSO-LARS was 089 with 156 explanatory SNPs. This value was higher than those obtained by Best Linear Unbiased Prediction (BLUP) and a Bayesian method (BayesA), which were 0.75 and 0.84, respectively. In the mouse data, 1600 individuals were randomly allocated to the reference population. The GEBVs for the remaining 285 individuals estimated by LASSO-LARS were more accurate than those obtained by BLUP and BayesA for weight at six weeks and slightly lower for growth rate and body length. It was concluded that LASSO-LARS approach is a good alternative method to estimate marker effects for genomic selection, particularly when the cost of genotyping can be reduced by using a limited subset of markers.
Q-Index Code C1
Q-Index Status Provisional Code
Institutional Status Non-UQ

Document type: Journal Article
Sub-type: Article (original research)
Collection: Queensland Alliance for Agriculture and Food Innovation
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 43 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 50 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Fri, 05 Aug 2016, 10:12:53 EST by System User on behalf of Learning and Research Services (UQ Library)