Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds |
| |
Authors: | Corrado Dimauro Massimo Cellesi Giustino Gaspa Paolo Ajmone-Marsan Roberto Steri Gabriele Marras Nicolò PP Macciotta |
| |
Affiliation: | 1.Dipartimento di Agraria, Sezione Scienze Zootecniche, Università di Sassari, Sassari, 07100, Italy;2.Istituto di Zootecnica, Università Cattolica del Sacro Cuore, Piacenza, 29100, Italy;3.Consiglio per la Ricerca e la Sperimentazione in Agricoltura, via Salaria 31, Monterotondo, 00015, Italy |
| |
Abstract: | BackgroundThe objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used.MethodsData consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content.ResultsIn the single-breed approach, the accuracy of imputation using partial least squares regression was around 90 and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than computing time required by Beagle. Using the partial least squares regression method in the multi-breed resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip.ConclusionsResults of the present work suggested that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available. |
| |
Keywords: | |
|
|