DNA sequence and shape are predictive for meiotic crossovers throughout the plant kingdom |
| |
Authors: | Sevgin Demirci Sander A Peters Dick de Ridder Aalt DJ van Dijk |
| |
Institution: | 1. Business Unit Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Wageningen, the Netherlands;2. Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands;3. Biometris, Wageningen University and Research, Wageningen, the Netherlands |
| |
Abstract: | A better understanding of genomic features influencing the location of meiotic crossovers (CO s) in plant species is both of fundamental importance and of practical relevance for plant breeding. Using CO positions with sufficiently high resolution from four plant species Arabidopsis thaliana , Solanum lycopersicum (tomato), Zea mays (maize) and Oryza sativa (rice)] we have trained machine‐learning models to predict the susceptibility to CO formation. Our results show that CO occurrence within various plant genomes can be predicted by DNA sequence and shape features. Several features related to genome content and to genomic accessibility were consistently either positively or negatively related to CO s in all four species. Other features were found as predictive only in specific species. Gene annotation‐related features were especially predictive for maize, whereas in tomato and Arabidopsis propeller twist and helical twist (DNA shape features) and AT /TA dinucleotides were found to be the most important. In rice, high roll (another DNA shape feature) and low CA dinucleotide frequency in particular were found to be associated with CO occurrence. The accuracy of our models was sufficient for Arabidopsis and rice (area under receiver operating characteristic curve, AUROC > 0.5), and was high for tomato and maize (AUROC ? 0.5), demonstrating that DNA sequence and shape are predictive for meiotic CO s throughout the plant kingdom. |
| |
Keywords: | meiotic recombination crossover machine learning prediction genome accessibility DNA shape tomato
Arabidopsis thaliana
maize rice |
|
|