July 13, 2022
A Bayesian genomic prediction model combining pedigree and incomplete genotyped individuals
Quantitative trait variation may display a complex genetic architecture, which is the underlying basis of a phenotype, with few genes with large effects and several genes with small effects. The genes can show additive, dominance or/and epistatic effects and different types of interaction with the environment. Advances in plant genomics –and, more recently, in DNA sequencing of many species– coupled with the availability of statistical and biometric methods for analyzing genetic and phenotypic data with friendly software, have made it feasible to map and dissect complex quantitative trait variation. Genomic selection (GS) and prediction based on genome-wide single nucleotide polymorphism genotyping, pedigree and phenotypic data are very powerful tools for capturing small genetic effects dispersed over the genome; this allows predicting an individual’s phenotype. New methods and tools are continuously being developed to integrate GS into genetics research. One of the key issues with GS is the fact that there are usually many more individuals that have been phenotyped and have pedigree data than individuals with marker data. When attempting to make predictions, only lines with marker, pedigree and phenotypic data can be used, thus leaving out lines with pedigree and phenotypic data that have missing marker data. This project aims to develop a novel approach that uses Bayesian statistics to combine genotypic data with pedigree and phenotypic data and therefore use all available data. This project will lead to developing new algorithms and software for GS based on advanced statistical methods that will cause a paradigm shift in this field.