ORIGINAL ARTICLE
Using the genomic relationship matrix to predict the accuracy
- f genomic selection
M.E. Goddard1,2, B.J. Hayes2 & T.H.E. Meuwissen3
1 Department of Agriculture and Food Systems, University of Melbourne, Melbourne, Vic., Australia 2 Biosciences Research Division, Victorian Department of Primary Industries, Bundoora, Vic., Australia 3 Norwegian University of Life Sciences, A ˚ s, Norway
Introduction The matrix of relationships among a group of indi- viduals can be used to predict their breeding values, to manage inbreeding and in genetic conservation. This relationship matrix can be calculated from the pedigree, but it is also possible to calculate the rela- tionship matrix from genotypes at genetic markers such as single-nucleotide polymorphisms (SNPs). Elements of the genomic relationship matrix are esti- mates of the realized proportion of the genome that two individuals share, whereas the pedigree-derived relationship matrix is the expectation of this propor-
- tion. This genomic relationship matrix can be used
in genomic selection to estimate breeding values. Genomic selection refers to the use of a large num- ber of genetic markers, such as SNPs, covering the whole genome to predict the genetic value of indi- viduals (Meuwissen et al. 2001). The individuals might be people whose genetic risk of developing a complex disease is being predicted, or they might be domestic animals or plants in which estimates of
Keywords Genomic selection; relationship matrix. Correspondence
- M. Goddard, Biosciences Research Division,
Victorian Department of Agriculture, 1 Park Drive, Bundoora, Vic. 3083, Australia. Tel: +61 39032 7091; Fax: +61 39032 7158; E-mail: mike.goddard@dpi.vic.gov.au Received: 6 February 2011; accepted: 18 August 2011
Summary Estimated breeding values (EBVs) using data from genetic markers can be predicted using a genomic relationship matrix, derived from animal’s genotypes, and best linear unbiased prediction. However, if the accuracy
- f the EBVs is calculated in the usual manner (from the inverse element
- f the coefficient matrix), it is likely to be overestimated owing to sam-
pling errors in elements of the genomic relationship matrix. We show here that the correct accuracy can be obtained by regressing the rela- tionship matrix towards the pedigree relationship matrix so that it is an unbiased estimate of the relationships at the QTL controlling the trait. This method shows how the accuracy increases as the number of mark- ers used increases because the regression coefficient (of genomic rela- tionship towards pedigree relationship) increases. We also present a deterministic method for predicting the accuracy of such genomic EBVs before data on individual animals are collected. This method estimates the proportion of genetic variance explained by the markers, which is equal to the regression coefficient described above, and the accuracy with which marker effects are estimated. The latter depends on the vari- ance in relationship between pairs of animals, which equals the mean linkage disequilibrium over all pairs of loci. The theory was validated using simulated data and data on fat concentration in the milk of Hol- stein cattle.
- J. Anim. Breed. Genet. ISSN 0931-2668
ª 2011 Blackwell Verlag GmbH • J. Anim. Breed. Genet. 128 (2011) 409–421 doi:10.1111/j.1439-0388.2011.00964.x