首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
以GenBank公开的甲型流感病毒亚型的血凝素(hemagglutinin,HA)核苷酸序列为材料,从简单重复序列(simple se-quence repeat,SSR)分布的分析角度出发,分析了来自于亚洲、非洲、北美洲、南美洲、欧洲、大洋洲的49个地区的76株甲流病毒的HA片段。分析表明:所分析序列的SSRs的分布都很相似,其中单碱基重复的相对丰度值和相对密度值均高于其它五种碱基重复的相对丰度值和相对密度值;甲流病毒HA片段的SSRs与HIV-1[16]基因中的SSRs相比,前者的相对丰度值和相对密度值高于后者。这些结果表明甲流病毒基因中的SSRs可能与甲流病毒的快速变异相关。  相似文献   

2.
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.  相似文献   

3.
Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2–7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.  相似文献   

4.
Cête d׳Ivoire continues to have the highest HIV-1 prevalence rate in West Africa, although the infection number is in constant decline. The external envelope protein of the viruses is a likely site of selection, and responsible for receptor binding and entry into host cells, and therefore constitutes an ideal region with which to investigate the evolutionary processes acting on HIV-1. In this study, we analyse 189 envelope glycoprotein V3 loop region sequences of viruse isolates from 1995 to 2009, from HIV-1 untreated patients living in Cête d׳Ivoire, to decipher the temporal relationship between disease diversity, divergence and selection. Our analyses show that the nonsynonymous and synonymous ratio (dN/dS) was lower than 1 for viral populations analysed within 15 years, which showed the sequences did not undergo adequate immune pressure. The phylogenetic tree of the sequences analysed demonstrated distinctly long internal branches and short external branches, suggesting that only a small number of viruses infected the new host cell at each transmission. In addition to identifying sites under purifying selection, we also identified neutral sites that can cause false positive inference of selection. These sites presented form a resource for future studies of selection pressures acting on HIV-1 enν gene in Cête d׳Ivoire and other West African countries.  相似文献   

5.
Simple sequence repeats (SSRs) or microsatellites are one of the most popular sources of genetic markers and play a significant role in gene function and genome organization. We identified SSRs in the genome of Ganoderma lucidum and analyzed their frequency and distribution in different genomic regions. We also compared the SSRs in G. lucidum with six other Agaricomycetes genomes: Coprinopsis cinerea, Laccaria bicolor, Phanerochaete chrysosporium, Postia placenta, Schizophyllum commune and Serpula lacrymans. Based on our search criteria, the total number of SSRs found ranged from 1206 to 6104 and covered from 0.04% to 0.15% of the fungal genomes. The SSR abundance was not correlated with the genome size, and mono- to tri-nucleotide repeats outnumbered other SSR categories in all of the species examined. In G. lucidum, a repertoire of 2674 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. The highest SSR relative abundance was found in introns (108 SSRs/Mb), followed by intergenic regions (84 SSRs/Mb). A total of 684 SSRs were found in the protein-coding sequences (CDSs) of 588 gene models, with 81.4% of them being tri- or hexa-nucleotides. After scanning for InterPro domains, 280 of these genes were successfully annotated, and 215 of them could be assigned to Gene Ontology (GO) terms. SSRs were also identified in 28 bioactive compound synthesis-related gene models, including one 3-hydroxy-3-methylglutaryl-CoA reductase (HMGR), three polysaccharide biosynthesis genes and 24 cytochrome P450 monooxygenases (CYPs). Primers were designed for the identified SSR loci, providing the basis for the future development of SSR markers of this medicinal fungus.  相似文献   

6.
7.
8.
9.
10.
Recombination is a major force for generating human immunodeficiency virus type 1 (HIV-1) diversity and produces numerous recombinants circulating in the human population. We previously established a cell-based system using green fluorescent protein gene (gfp) as a reporter to study the mechanisms of HIV-1 recombination. We now report an improved system capable of detecting recombination using authentic viral sequences. Frameshift mutations were introduced into the gag gene so that parental viruses do not express full-length Gag; however, recombination can generate a progeny virus that expresses a functional Gag. We demonstrate that this Gag reconstitution assay can be used to detect recombination between two group M HIV-1 variants of the same or of different subtypes. Using both gfp and gag assays, we found that, similar to group M viruses, group O viruses also recombine frequently. When recombination between a group M virus and a group O virus was examined, we found three distinct barriers for intergroup recombination. First, similar to recombination within group M viruses, intergroup recombination is affected by the identity of the dimerization initiation signal (DIS); variants with the same DIS recombined at a higher rate than those with different DIS. Second, using the gfp recombination assay, we showed that intergroup recombination occurs much less frequently than intragroup recombination, even though the gfp target sequence is identical in all viruses. Finally, Gag reconstitution between variants from different groups is further reduced compared with green fluorescent protein, indicating that sequence divergence interferes with recombination efficiency in the gag gene. Compared with identical sequences, we estimate that recombination rates are reduced by 3-fold and by 10- to 13-fold when the target regions in gag contain 91% and 72-73% sequence identities, respectively. These results show that there are at least three distinct mechanisms preventing exchange of genetic information between divergent HIV-1 variants from different groups.  相似文献   

11.
Ouyang Q  Zhao X  Feng H  Tian Y  Li D  Li M  Tan Z 《Gene》2012,499(1):37-40
The presence, locations and composition of simple sequence repeats (SSRs) in Herpes simplex virus type 1 (HSV-1) genome were extracted and analyzed by using the software Imperfect Microsatellite Extractor (IMEx). There were 663 mon-, 502 di-, 184 tri-, 20 tetra-, 4 penta- and 4 hexanucleotide SSRs that were observed in different distribution between coding and noncoding regions in the HSV-1 genome. G/C, GC/CG, and (GGC)(n) were predominant in mononucleotide, dinucletide, trinucleotide repeats respectively. Indeed, the results showed that GC content in simple sequence repeats was notably higher than that in entire HSV-1 genome. Our data might be helpful for studying the pathogenesis, genome structure and evolution of HSV-1.  相似文献   

12.
The abundance and inherent potential for variations in simple sequence repeats (SSRs) or microsatellites resulted in valuable source for genetic markers in eukaryotes. We describe the organization and abundance of SSRs in fungus Fusarium graminearum (causative agent for Fusarium head blight or head scab of wheat). We identified 1705 SSRs of various nucleotide repeat motifs in the sequence database of F. graminearum. It is observed that mononucleotide repeats (62%) were most abundant followed by di- (20%) and trinucleotide repeats (14%). It is noted that tetra-, penta- and hexanucleotide repeats accounted for only 4% of SSRs. The estimated frequency of Class I SSRs (perfect repeats ≥20 nucleotides) was one SSR per 124.5 kb, whereas the frequency of Class II (perfect repeats >10 nucleotides and ≫20 nucleotides) was one SSR per 25.6 kb. The dynamics of SSRs will be a powerful tool for taxonomic, phylogenetic, genome mapping and population genetic studies as SSR based markers show high levels of allelic variation, codominant inheritance and ease of analysis.  相似文献   

13.

Background

The giant panda (Ailuropoda melanoleuca) is a critically endangered species endemic to China. Microsatellites have been preferred as the most popular molecular markers and proven effective in estimating population size, paternity test, genetic diversity for the critically endangered species. The availability of the giant panda complete genome sequences provided the opportunity to carry out genome-wide scans for all types of microsatellites markers, which now opens the way for the analysis and development of microsatellites in giant panda.

Results

By screening the whole genome sequence of giant panda in silico mining, we identified microsatellites in the genome of giant panda and analyzed their frequency and distribution in different genomic regions. Based on our search criteria, a repertoire of 855,058 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. A total of 160 primer pairs were designed to screen for polymorphic microsatellites using the selected tetranucleotide microsatellite sequences. The 51 novel polymorphic tetranucleotide microsatellite loci were discovered based on genotyping blood DNA from 22 captive giant pandas in this study. Finally, a total of 15 markers, which showed good polymorphism, stability, and repetition in faecal samples, were used to establish the novel microsatellite marker system for giant panda. Meanwhile, a genotyping database for Chengdu captive giant pandas (n = 57) were set up using this standardized system. What’s more, a universal individual identification method was established and the genetic diversity were analysed in this study as the applications of this marker system.

Conclusion

The microsatellite abundance and diversity were characterized in giant panda genomes. A total of 154,677 tetranucleotide microsatellites were identified and 15 of them were discovered as the polymorphic and stable loci. The individual identification method and the genetic diversity analysis method in this study provided adequate material for the future study of giant panda.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1268-z) contains supplementary material, which is available to authorized users.  相似文献   

14.
In this study, the chloroplast (cp) genome sequences from three early diverged leptosporangiate ferns were completed and analyzed in order to understand the evolution of the genome of the fern lineages. The complete cp genome sequence of Osmunda cinnamomea (Osmundales) was 142,812 base pairs (bp). The cp genome structure was similar to that of eusporangiate ferns. The gene/intron losses that frequently occurred in the cp genome of leptosporangiate ferns were not found in the cp genome of O. cinnamomea. In addition, putative RNA editing sites in the cp genome were rare in O. cinnamomea, even though the sites were frequently predicted to be present in leptosporangiate ferns. The complete cp genome sequence of Diplopterygium glaucum (Gleicheniales) was 151,007 bp and has a 9.7 kb inversion between the trnL-CAA and trnV-GCA genes when compared to O. cinnamomea. Several repeated sequences were detected around the inversion break points. The complete cp genome sequence of Lygodium japonicum (Schizaeales) was 157,142 bp and a deletion of the rpoC1 intron was detected. This intron loss was shared by all of the studied species of the genus Lygodium. The GC contents and the effective numbers of co-dons (ENCs) in ferns varied significantly when compared to seed plants. The ENC values of the early diverged leptosporangiate ferns showed intermediate levels between eusporangiate and core leptosporangiate ferns. However, our phylogenetic tree based on all of the cp gene sequences clearly indicated that the cp genome similarity between O. cinnamomea (Osmundales) and eusporangiate ferns are symplesiomorphies, rather than synapomorphies. Therefore, our data is in agreement with the view that Osmundales is a distinct early diverged lineage in the leptosporangiate ferns.  相似文献   

15.
The 2 465 177 bp genome of Sulfolobus islandicus LAL14/1, host of the model rudivirus SIRV2, was sequenced. Exhaustive comparative genomic analysis of S. islandicus LAL14/1 and the nine other completely sequenced S. islandicus strains isolated from Iceland, Russia and USA revealed a highly syntenic common core genome of approximately 2 Mb and a long hyperplastic region containing most of the strain-specific genes. In LAL14/1, the latter region is enriched in insertion sequences, CRISPR (clustered regularly interspaced short palindromic repeats), glycosyl transferase genes, toxin–antitoxin genes and MITE (miniature inverted-repeat transposable elements). The tRNA genes of LAL14/1 are preferential targets for the integration of mobile elements but clusters of atypical genes (CAG) are also integrated elsewhere in the genome. LAL14/1 carries five CRISPR loci with 10 per cent of spacers matching perfectly or imperfectly the genomes of archaeal viruses and plasmids found in the Icelandic hot springs. Strikingly, the CRISPR_2 region of LAL14/1 carries an unusually long 1.9 kb spacer interspersed between two repeat regions and displays a high similarity to pING1-like conjugative plasmids. Finally, we have developed a genetic system for S. islandicus LAL14/1 and created ΔpyrEF and ΔCRISPR_1 mutants using double cross-over and pop-in/pop-out approaches, respectively. Thus, LAL14/1 is a promising model to study virus–host interactions and the CRISPR/Cas defence mechanism in Archaea.  相似文献   

16.

Background

Although Mycobacterium tuberculosis isolates are consisted of several different lineages and the epidemiology analyses are usually assessed relative to a particular reference genome, M. tuberculosis H37Rv, which might introduce some biased results. Those analyses are essentially based genome sequence information of M. tuberculosis and could be performed in sillico in theory, with whole genome sequence (WGS) data available in the databases and obtained by next generation sequencers (NGSs). As an approach to establish higher resolution methods for such analyses, whole genome sequences of the M. tuberculosis complexes (MTBCs) strains available on databases were aligned to construct virtual reference genome sequences called the consensus sequence (CS), and evaluated its feasibility in in sillico epidemiological analyses.

Results

The consensus sequence (CS) was successfully constructed and utilized to perform phylogenetic analysis, evaluation of read mapping efficacy, which is crucial for detecting single nucleotide polymorphisms (SNPs), and various MTBC typing methods virtually including spoligotyping, VNTR, Long sequence polymorphism and Beijing typing. SNPs detected based on CS, in comparison with H37Rv, were utilized in concatemer-based phylogenetic analysis to determine their reliability relative to a phylogenetic tree based on whole genome alignment as the gold standard. Statistical comparison of phylogenic trees based on CS with that of H37Rv indicated the former showed always better results that that of later. SNP detection and concatenation with CS was advantageous because the frequency of crucial SNPs distinguishing among strain lineages was higher than those of H37Rv. The number of SNPs detected was lower with the consensus than with the H37Rv sequence, resulting in a significant reduction in computational time. Performance of each virtual typing was satisfactory and accorded with those published when those are available.

Conclusions

These results indicated that virtual CS constructed from genome sequence data is an ideal approach as a reference for MTBC studies.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1368-9) contains supplementary material, which is available to authorized users.  相似文献   

17.
To estimate the phylogeny and molecular evolution of a single-copy gene encoding plastid acetyl-CoA carboxylase (Acc1) within the StH genome species, two Acc1 homoeologous sequences were isolated from nearly all the sampled StH genome species and were analyzed with those from 35 diploid taxa representing 19 basic genomes in Triticeae. Sequence diversity patterns and genealogical analysis suggested that (1) the StH genome species from the same areas or neighboring geographic regions are closely related to each other; (2) the Acc1 gene sequences of the StH genome species from North America and Eurasia are evolutionarily distinct; (3) Dasypyrum has contributed to the nuclear genome of Elymus repens and Elymus mutabilis; (4) the StH genome polyploids have higher levels of sequence diversity in the H genome homoeolog than the St genome homoeolog; and (5) the Acc1 sequence may evolve faster in the polyploid species than in the diploids. Our result provides some insight on evolutionary dynamics of duplicate Acc1 gene, the polyploidy speciation and phylogeny of the StH genome species.  相似文献   

18.
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.  相似文献   

19.
Background and Aims Banana genomes harbour numerous copies of viral sequences derived from banana streak viruses (BSVs) – dsDNA viruses belonging to the family Caulimoviridae. These viral integrants (eBSVs) are mostly defective, probably as a result of ‘pseudogenization’ driven by host genome evolution. However, some can give rise to infection by releasing a functional viral genome following abiotic stresses. These distinct infective eBSVs correspond to the three main widespread BSV species (BSOLV, BSGFV and BSIMV), fully described within the Musa balbisiana B genomes of the seedy diploid ‘Pisang Klutuk Wulung’ (PKW).Methods We characterize eBSV distribution among a Musa sampling including seedy BB diploids and interspecific hybrids with Musa acuminata exhibiting different levels of ploidy for the B genome (ABB, AAB, AB). We used representative samples of the two areas of sympatry between M. acuminata and M. balbisiana species representing the native area of the most widely cultivated AAB cultivars (in India and in East Asia, ranging from the Philippines to New Guinea). Seventy-seven accessions were characterized using eBSV-related PCR markers and Southern hybridization approaches. We coded both sets of results to create a common dissimilarity matrix with which to interpret eBSV distribution.Key Results We propose a Musa phylogeny driven by the M. balbisiana genome based on a dendrogram resulting from a joint neighbour-joining analysis of the three BSV species, showing for the first time lineages between BB and ABB/AAB hybrids. eBSVs appear to be relevant phylogenetic markers that can illustrate the M. balbisiana phylogeography story.Conclusion The theoretical implications of this study for further elucidation of the historical and geographical process of Musa domestication are numerous. Discovery of banana plants with B genome non-infective for eBSV opens the way to the introduction of new genitors in programmes of genetic banana improvement.  相似文献   

20.
Recently, it was found that 80% of sexual HIV-1 transmissions are established by a single virion/viral genome. To investigate whether the transmitted/founder (T/F) viruses have specific biological properties favoring sexual transmission, we inoculated human cervical tissue explants with isogenic HIV-1 viruses encoding Env sequences from T/F and control reference (C/R) HIV-1 variants as well as with full length T/F HIV-1 and compared their replication efficiencies, T cell depletion, and the activation status of infected cells. We found that all the HIV-1 variants were capable of transmitting infection to cervical tissue ex vivo and in this system preferentially replicate in activated CD4 T cells and deplete these cells. There was no difference in the biological properties of T/F and C/R HIV-1 variants as evaluated in ex vivo cervical tissue.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号