TY - THES T1 - Integration of hyperspectral, genomic, and agronomic data for early prediction of biomass yield in hybrid rye (Secale cereale L.) A1 - Galán,Rodrigo José Y1 - 2021/12/15 N2 - Currently, the combination of a growing bioenergy demand and the need to diversify the dominant cultivation of energy maize opens a highly attractive scenario for alternative biomass crops. Rye (Secale cereale L.) stands out for its vigorous growth and increased tolerance to abiotic and biotic stressors. In Germany, less than a quarter of the total harvest is used for food production. Consequently, rye arises as a source of renewables with a reduced bioenergy-food tradeoff, emerging biomass as a new breeding objective. However, rye breeding is mainly driven by grain yield while biomass is destructively evaluated in later selection stages by expensive and time-consuming methods. The overall motivation of this research was to investigate the prospects of combining hyperspectral, genomic, and agronomic data for unlocking the potential of hybrid rye as a dual-purpose crop to meet the increasing demand for renewable sources of energy affordably. A specific aim was to predict the biomass yield as precisely as possible at an early selection stage. For this, a panel of 404 elite rye lines was genotyped and evaluated as testcrosses for grain yield and a subset of 274 genotypes additionally for biomass. Field trials were conducted at four locations in Germany in two years (eight environments). Hyperspectral fingerprints consisted of 400 discrete narrow bands (from 410 to 993 nm) and were collected in two points of time after heading for all hybrids in each site by an uncrewed aerial vehicle. In a first study, population parameters were estimated for different agronomic traits and a total of 23 vegetation indices. Dry matter yield showed significant genetic variation and was stronger correlated with plant height (r_g=0.86) than with grain yield (r_g=0.64) and individual vegetation indices (r_g: =<|0.35|). A multiple linear regression model based on plant height, grain yield, and a subset of vegetation indices surpassed the prediction ability for dry matter yield of models based only on agronomic traits by about 6 %. In a second study, whole-spectrum data was used to indirectly estimate dry matter yield. For this, single-kernel models based on hyperspectral reflectance-derived (HBLUP) and genomic (GBLUP) relationship matrices, a multi-kernel model combining both matrices, and a bivariate model fitted also with plant height as a secondary trait, were considered. HBLUP yielded superior predictive power than the models based on vegetation indices previously tested. The phenotypic correlations between individual wavelengths and dry matter yield were generally significant (p < 0.05) but low (r_p: =< |0.29|). Across environments and training set sizes, the bivariate model yielded the highest prediction abilities (0.56 – 0.75). All models profited from larger training populations. However, if larger training sets cannot be afforded, HBLUP emerged as a promising approach given its higher prediction power on reduced calibration populations compared to the well-established GBLUP. Before its incorporation into prediction models, filtering the hyperspectral data available by the least absolute shrinkage and selection operator (Lasso) was worthwhile to deal with data dimensionally. In a third study, the effects of trait heritability, as well as genetic and environmental relatedness on the prediction ability of GBLUP and HBLUP for biomass-related traits were compared. While the prediction ability of GBLUP (0.14 - 0.28) was largely affected by genetic relatedness and trait heritability, HBLUP was significantly more accurate (0.41 - 0.61) across weakly connected datasets. In this context, dry matter yield could be better predicted (up to 20 %) by a bivariate model. Nevertheless, due to environmental variances, genomic and reflectance-enabled predictions were strongly dependant on a sufficient environmental relationship between data used for model training and validation. In summary, to affordably breed rye as a double-purpose crop to meet the increasing bioenergy demands, the early prediction of biomass across selection cycles is crucial. Hyperspectral imaging has proven to be a suitable tool to select high-yielding biomass genotypes across weakly linked populations. Due to the synergetic effect of combining hyperspectral, genomic, and agronomic traits, higher prediction abilities can be obtained by integrating these data sources into bivariate models. KW - Pflanzenzüchtung KW - Roggen KW - Biomasse KW - Genomik CY - Hohenheim PB - Kommunikations-, Informations- und Medienzentrum der Universität Hohenheim AD - Garbenstr. 15, 70593 Stuttgart UR - http://opus.uni-hohenheim.de/volltexte/2021/1973 ER -