Recursive organizer (ROR): an analytic framework for sequence-based association analysis |
| |
Authors: | Lue Ping Zhao Xin Huang |
| |
Affiliation: | 1. Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Mailstop M2-B500, P.O. Box 19024, Seattle, WA, 98109-1024, USA 2. School of Public Health, University of Washington, Seattle, WA, USA 3. AbbVie, Inc., Global Pharmaceutical Research and Development, Dept. R43P, Bldg. AP9A-LL, 1 North Waukegan Road, North Chicago, IL, 60064-6124, USA
|
| |
Abstract: | The advent of next-generation sequencing technologies affords the ability to sequence thousands of subjects cost-effectively, and is revolutionizing the landscape of genetic research. With the evolving genotyping/sequencing technologies, it is not unrealistic to expect that we will soon obtain a pair of diploidic fully phased genome sequences from each subject in the near future. Here, in light of this potential, we propose an analytic framework called, recursive organizer (ROR), which recursively groups sequence variants based upon sequence similarities and their empirical disease associations, into fewer and potentially more interpretable super sequence variants (SSV). As an illustration, we applied ROR to assess an association between HLA-DRB1 and type 1 diabetes (T1D), discovering SSVs of HLA-DRB1 with sequence data from the Wellcome Trust Case Control Consortium. Specifically, ROR reduces 36 observed unique HLA-DRB1 sequences into 8 SSVs that empirically associate with T1D, a fourfold reduction of sequence complexity. Using HLA-DRB1 data from Type 1 Diabetes Genetics Consortium as cases and data from Fred Hutchinson Cancer Research Center as controls, we are able to validate associations of these SSVs with T1D. Further, SSVs consist of nine nucleotides, and each associates with its corresponding amino acids. Detailed examination of these selected amino acids reveals their potential functional roles in protein structures and possible implication to the mechanism of T1D. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|