首页 | 本学科首页   官方微博 | 高级检索  
     


Intrapatient Variation of the Respiratory Syncytial Virus Attachment Protein Gene
Authors:Charles N. Agoti  Jean L. Mbisa  Ann Bett  Graham F. Medley  D. James Nokes  Patricia A. Cane
Affiliation:Kenya Medical Research Institute, Centre for Geographic Medicine Research-Coast, Kilifi, Kenya,1. Health Protection Agency, Centre for Infections, Colindale, London, United Kingdom,2. Department of Biological Sciences, University of Warwick, Coventry, United Kingdom3.
Abstract:Intrapatient variability of the attachment (G) protein gene of respiratory syncytial virus (RSV) was examined using both population and single-genome sequencing. Samples from three patients infected with a group B virus variant which has a 60-nucleotide duplication in the G protein gene were examined. These samples were chosen because occasional mixed sequence bases were observed. In a minority of RSV genomes from these patients considerable variability was found, including point mutations, insertions, and deletions. Of particular note, the deletion of the exact portion of the gene which had been duplicated in some isolates was observed in viral RNAs from two patients.Human respiratory syncytial virus (RSV) is a major cause of lower respiratory tract infection in infants and vulnerable adults (3, 9) and is unusual in that it can repeatedly reinfect individuals (5, 6). RSV isolates are classified into two groups, A and B, and the attachment (G) protein, a target for neutralizing antibodies, is the most variable of the viral proteins, showing considerable genetic and antigenic variability both within and between the groups (7, 8). The G protein is able to accommodate drastic changes, which have been observed both in culture during the selection of monoclonal antibody escape mutants (4, 12, 13) and in vivo with the emergence of new variants, including a group B strain with a duplication of 60 nucleotides (17). This strain with a 60-nucleotide duplication was first reported from Buenos Aires in 1999 (17) and then was subsequently detected in samples from 1998 in Madrid (16). The strain then became the dominant group B strain worldwide, indicating a selective advantage for this variant (16, 18). Thus, major genetic changes can be introduced into the G gene sequence while the virus replicates in its natural host, which can then be selected under favorable epidemiological conditions.Previous investigations of the genetic diversity of RSV exploited direct sequencing of PCR-amplified products (2), which represent the population average of the in vivo variants. Such sequences are derived from multiple copies of cDNA and represent the dominant sequence, and they thus do not allow detection of minority populations below about 20% prevalence (11). Information on intrapatient viral diversity during infections may therefore be missed, knowledge of which could be important in the overall understanding of the genetic diversity of this virus. We report here the analysis of individual RSV RNA molecules derived by single-genome amplification (SGA) and sequencing from clinical samples using a methodology developed for the analysis of HIV genomes (11, 14).RSV-positive samples were collected from infants admitted to Kilifi District Hospital, Kenya (10). Viral RNA extraction and cDNA synthesis were carried out as previously described (15). For population sequencing, a nested PCR was carried out on the cDNA using primers that amplified the ectodomain-coding part of the G protein gene, with the PCR product being directly sequenced. In the 2007-2008 RSV epidemic in Kilifi, group B viruses were predominant. By population sequencing of ∼100 group B samples, all were found to have the 60-nucleotide duplication observed in the Buenos Aires variant (data not shown). However, in some specimens there were some mixed bases at some positions, so the variability at the level of the single cDNA molecule was further investigated.Three samples that gave occasional mixed signals in the sequence chromatograms were further analyzed by SGA and sequencing. For SGA the cDNAs were serially diluted 3-fold up to 1:6,361. Ten nested PCRs were carried out on each dilution using Platinum high-fidelity PCR Supermix (Invitrogen) (containing Taq polymerase together with the proofreading enzyme Pyrococcus species GB-D polymerase). Based on the Poisson distribution, it has been shown that for a sample dilution yielding approximately 30% positive PCRs there is an 80% likelihood that each PCR is derived from a single cDNA molecule (11). For each of the identified endpoint dilutions, the cDNA was amplified in 80 separate nested PCRs using the high-fidelity enzyme and the positive reaction products sequenced. The nomenclature for the sequences reported in this paper is place of isolation (Kenya [Ken])/year of isolation/strain number. For SGA sequences an additional Roman number is given.The predicted length derived by population sequencing of the G proteins of the three samples examined by SGA was 310 amino acids, showing a 6-nucleotide deletion and a changed stop codon relative to the Buenos Aires strain (Fig. (Fig.1).1). The dominant sequences represented 60 to 88% of the sequences derived by SGA. The differences were due to point mutations, duplications, and deletions, as summarized in Table Table1;1; the consequences of these changes for the predicted length of the G protein are shown in Table Table22.Open in a separate windowFIG. 1.Nucleotide sequence alignment of part of the G protein gene (from nucleotide 400) of the sample 2 population sequence (Ken/08/80900) and a minority sequence (Ken/08/80900/ii), with the sequences of prototype group B strain CH18537 (accession number M17213) and Buenos Aires strain BA/3833/99B (accession number AY333362). This shows the duplication of 60 nucleotides in the Kenyan and Buenos Aires viruses relative to CH18537 and the loss of the same 60 nucleotides in the Kenyan minority sequence. Termination codons are underlined.

TABLE 1.

Diversity in the SGA-derived sequences
StrainTotal no. of SGA-derived sequencesNo. of:
VariantsSubstitutionsInsertionsDeletions
Ken/07/8039517311a0
Ken/08/80900123101b
Ken/08/80767156304c
Open in a separate windowaThe insertion involved an 8-nucleotide duplication.bThe deletion was of 60 nucleotides.cThe deletions ranged from 1 to 104 nucleotides, including one of 60 nucleotides.

TABLE 2.

Summary of nucleotide and predicted amino acid differences observed in single-genome amplification sequences
StrainSequence no.Mutation type (nucleotide position)Effect on predicted polypeptide
Ken/07/80395Ken/07/80395/iiSubstitution (264)None
Ken/07/80395/iiiDuplication (599-606)Frameshift with a subsequent premature stop codon
Ken/08/80900Ken/08/80900/iiDeletion (792-851)Reduction of length by 20 amino acids
Ken/08/80900/iiiSubstitution (539)Isoleucine-to-threonine change
Ken/08/80767Ken/08/80767/iiSubstitution (592)Immediate premature stop codon
Ken/08/80767/iiiSubstitution (233)Threonine-to-serine change
Ken/08/80767/viSubstitution (664)Immediate premature stop codon
Ken/08/80767/vDeletion (599)Frameshift with a subsequent premature stop codon
Ken/08/80767/viDeletion (599)Frameshift with a subsequent premature stop codon
Deletion (792-851)No effect due to earlier frameshift; otherwise this would cause a 20-amino-acid deletion
Ken/08/80767/viiDeletion (836-939)Frameshift, with no subsequent stop codon within the region sequenced
Open in a separate windowFor sample 1 (Ken/07/80395), 17 separate PCR products from SGA were sequenced. Sequence Ken/07/80395/i was the dominant sequence, occurring in 15/17 (88%) of the products. A synonymous mutation (T-C at 264) was seen in Ken/07/80395/ii, while Ken/07/80395/iii had an 8-nucleotide duplication starting at position 599, which resulted in a frameshift with a subsequent premature stop codon.For sample 2 (Ken/08/80900), 12 separate SGA PCR products were sequenced, with the dominant sequence (Ken/08/80900/i) occurring in 9/12 (75%) of the products. One minority sequence (Ken/08/80900/ii) had a 60-nucleotide deletion starting at position 792. This deletion was identical to the duplication that characterizes this variant (Fig. (Fig.1)1) (16). It resulted in a reduction in the length of the predicted polypeptide by the encoded 20 amino acids. Finally, a nonsynonymous mutation was seen at position 539 (T-C), causing amino acid change I175T.For sample 3 (Ken/08/80767), 15 SGA PCR products were sequenced, and the dominant sequence (Ken/08/80767/i) was seen in 9/15 (60%). The minority sequence Ken/08/80767/vi had a single adenosine deletion at nucleotide 599 together with the 60-nucleotide deletion identical to that observed for sample 2. In addition, sequence Ken/08/80767/vii had a 104-base deletion starting at position 836, which resulted in a frameshift with no subsequent stop codon in the region sequenced. Other changes for this sample are shown in Table Table22.The RSV G gene has previously been shown to be accumulating amino acid changes and associated antigenic changes in its variable regions in isolates collected over time (2, 20, 21). Sequence changes have also been observed after monoclonal antibody selection (4, 12, 13, 19), and these included point mutations and frameshifts due to insertions or deletions. In addition, analysis of molecular clones derived from cultured RSV showed that replication of the G gene, including in vitro, is prone to errors, again small insertions or deletions (1). This study reports changes similar to those described above detected directly within patients but also much more drastic changes such as the large deletions.The samples examined by SGA in this study were originally selected because of mixed bases observed by population sequencing, so the results reported here may not be applicable to all RSV infections, and further studies will be required to ascertain how far these observations are generally applicable. The possibility of artifacts should be considered. However, a high-fidelity enzyme mix was used in these experiments, and the error rate in SGA analysis of HIV-1 has been found to be low (0.011%) and without major deletions and insertions(11). The possibility of PCR contamination or mixed infections contributing to the detection of the variant genomes with the exact 60-nucleotide deletion in the same position as the original duplication can be excluded because first, all the samples tested during the period in question had the duplication, and second, the rest of the sequences of the molecules matched most closely the “parent” population sequence (Fig. (Fig.11).The results reported here confirm and extend the observations that the RSV G gene is highly prone to errors during replication, including within the patient, and so provides a pool of variants that can be subject to selection during transmission. The detection of minority genomes that showed a 60-nucleotide deletion at the exact position as the original duplication observed in the Buenos Aires strains was a surprising finding. The mechanisms for such a deletion may lie in the stem-loop structure of the viral RNA sequence that is duplicated, as described by Trento et al. (17). Such deletions provide the potential for emergence of viruses with the “normal” protein length (i.e., without the 20-amino-acid insertion), which, as such strains have been largely replaced in recent years, may be able to overcome the community immunity.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号