Spatio-temporal dynamics of intra-host variability in SARS-CoV-2 genomes |
| |
Authors: | Ankit K Pathak,Gyan Prakash Mishra,Bharathram Uppili,Safal Walia,Saman Fatihi,Tahseen Abbas,Sofia Banu,Arup Ghosh,Amol Kanampalliwar,Atimukta Jha,Sana Fatma,Shifu Aggarwal,Mahesh Shanker Dhar,Robin Marwal,Venkatraman Srinivasan Radhakrishnan,Kalaiarasan Ponnusamy,Sandhya Kabra,Partha Rakshit,Rahul C Bhoyar,Abhinav Jain,Mohit Kumar Divakar,Mohamed Imran,Mohammed Faruq,Divya Tej Sowpati,Lipi Thukral,Sunil K Raghav,Mitali Mukerji |
| |
Affiliation: | CSIR - Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India;Institute of Life Sciences (ILS), Bhubaneswar, Odisha, India;Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India;CSIR – Centre for Cellular and Molecular Biology (CSIR-CCMB), Hyderabad, Telangana, India;Biotechnology Division, National Centre for Disease Control (NCDC), New Delhi, India;Indian Institute of Technology (IIT), Jodhpur, India |
| |
Abstract: | During the course of the COVID-19 pandemic, large-scale genome sequencing of SARS-CoV-2 has been useful in tracking its spread and in identifying variants of concern (VOC). Viral and host factors could contribute to variability within a host that can be captured in next-generation sequencing reads as intra-host single nucleotide variations (iSNVs). Analysing 1347 samples collected till June 2020, we recorded 16 410 iSNV sites throughout the SARS-CoV-2 genome. We found ∼42% of the iSNV sites to be reported as SNVs by 30 September 2020 in consensus sequences submitted to GISAID, which increased to ∼80% by 30th June 2021. Following this, analysis of another set of 1774 samples sequenced in India between November 2020 and May 2021 revealed that majority of the Delta (B.1.617.2) and Kappa (B.1.617.1) lineage-defining variations appeared as iSNVs before getting fixed in the population. Besides, mutations in RdRp as well as RNA-editing by APOBEC and ADAR deaminases seem to contribute to the differential prevalence of iSNVs in hosts. We also observe hyper-variability at functionally critical residues in Spike protein that could alter the antigenicity and may contribute to immune escape. Thus, tracking and functional annotation of iSNVs in ongoing genome surveillance programs could be important for early identification of potential variants of concern and actionable interventions. |
| |
Keywords: | |
|
|