OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy |
| |
Authors: | David M. Emms Steven Kelly |
| |
Affiliation: | Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX1 3RB UK |
| |
Abstract: | Identifying homology relationships between sequences is fundamental to biological research. Here we provide a novel orthogroup inference algorithm called OrthoFinder that solves a previously undetected gene length bias in orthogroup inference, resulting in significant improvements in accuracy. Using real benchmark datasets we demonstrate that OrthoFinder is more accurate than other orthogroup inference methods by between 8 % and 33 %. Furthermore, we demonstrate the utility of OrthoFinder by providing a complete classification of transcription factor gene families in plants revealing 6.9 million previously unobserved relationships.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-015-0721-2) contains supplementary material, which is available to authorized users. |
| |
Keywords: | |
|
|