Similar literature
 20 similar documents found (search time: 468 ms)
1.
2.

Background

Homoeologous sequences pose a particular challenge when bacterial artificial chromosome (BAC) contigs are to be established for specific regions of an allopolyploid genome. Single nucleotide polymorphisms (SNPs) differentiating between homoeologous genomes (intergenomic SNPs) may represent a suitable screening tool for such purposes, since they not only identify homoeologous sequences but also differentiate between them.

Results

Sequence alignments between Brassica rapa (AA) and Brassica oleracea (CC) sequences mapping to corresponding regions on chromosomes A1 and C1, respectively, were used to identify single nucleotide polymorphisms between the A and C genomes. A large fraction of these polymorphisms was also present in Brassica napus (AACC), an allopolyploid species that originated from hybridisation of A and C genome species. Intergenomic SNPs mapping throughout homoeologous chromosome segments spanning approximately one Mbp each were included in Illumina’s GoldenGate® Genotyping Assay and used to screen multidimensional pools of a Brassica napus bacterial artificial chromosome library with tenfold genome coverage. Based on the results of 50 SNP assays, a BAC contig for the Brassica napus A subgenome was established that spanned the entire region of interest. The C subgenome region was represented in three BAC contigs.
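
As a rough illustration of how candidate intergenomic SNPs can be read off a pairwise alignment (this is not the authors' pipeline), the sketch below reports alignment columns where the A-genome and C-genome sequences carry different bases and neither has a gap. The function name and the toy sequences are hypothetical.

```python
def intergenomic_snps(a_seq: str, c_seq: str) -> list[tuple[int, str, str]]:
    """Return (alignment column, A allele, C allele) for each substitution column."""
    if len(a_seq) != len(c_seq):
        raise ValueError("aligned sequences must have equal length")
    snps = []
    for pos, (a, c) in enumerate(zip(a_seq.upper(), c_seq.upper())):
        # Skip gap columns (indels) and ambiguous bases; keep plain substitutions.
        if a in "ACGT" and c in "ACGT" and a != c:
            snps.append((pos, a, c))
    return snps


# Toy aligned fragments, made up for illustration only.
a_aln = "ATGC-TACGT"
c_aln = "ATGCGTTCGA"
print(intergenomic_snps(a_aln, c_aln))  # [(6, 'A', 'T'), (9, 'T', 'A')]
```

Columns are 0-based alignment positions; mapping them back to chromosome coordinates would require the alignment offsets, which are not modelled here.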

Conclusions

This proof-of-concept study shows that sequence resources of diploid progenitor genomes can be used to deduce intergenomic SNPs suitable for multiplex polymerase chain reaction (PCR)-based screening of multidimensional BAC pools of a polyploid organism. Owing to their high abundance and ease of identification, intergenomic SNPs represent a versatile tool to establish BAC contigs for homoeologous regions of a polyploid genome.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-560) contains supplementary material, which is available to authorized users.

3.

Background

Although Mycobacterium tuberculosis isolates comprise several different lineages, epidemiological analyses are usually performed relative to a single reference genome, M. tuberculosis H37Rv, which might bias the results. These analyses are essentially based on genome sequence information of M. tuberculosis and could, in theory, be performed in silico using whole genome sequence (WGS) data available in databases and obtained by next generation sequencers (NGSs). As an approach to establishing higher-resolution methods for such analyses, whole genome sequences of M. tuberculosis complex (MTBC) strains available in databases were aligned to construct a virtual reference genome sequence, called the consensus sequence (CS), and its feasibility for in silico epidemiological analyses was evaluated.
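
The abstract does not say how the consensus was derived from the alignment. A minimal sketch, assuming a simple per-column majority vote over already-aligned sequences (the function name and the toy input are hypothetical), could look like this:

```python
from collections import Counter


def consensus(aligned: list[str]) -> str:
    """Majority-vote consensus over equal-length aligned sequences."""
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("aligned sequences must have equal length")
    out = []
    for column in zip(*aligned):
        # most_common(1) yields the most frequent symbol in this column;
        # ties go to the symbol encountered first (an arbitrary choice here).
        out.append(Counter(column).most_common(1)[0][0])
    return "".join(out)


# Toy alignment, made up for illustration only.
print(consensus(["ACGT", "ACGA", "TCGA"]))  # ACGA
```

A real genome-scale consensus would also need a policy for gaps and ambiguous bases, which this sketch treats like any other symbol.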

Results

The consensus sequence (CS) was successfully constructed and used to perform phylogenetic analysis, evaluation of read mapping efficacy (which is crucial for detecting single nucleotide polymorphisms, SNPs), and virtual versions of various MTBC typing methods, including spoligotyping, VNTR, long sequence polymorphism and Beijing typing. SNPs detected based on the CS, in comparison with those based on H37Rv, were used in concatemer-based phylogenetic analysis to determine their reliability relative to a phylogenetic tree based on whole genome alignment as the gold standard. Statistical comparison of the phylogenetic trees based on the CS with those based on H37Rv indicated that the former always gave better results than the latter. SNP detection and concatenation with the CS was advantageous because the frequency of crucial SNPs distinguishing among strain lineages was higher than with H37Rv. The number of SNPs detected was lower with the consensus than with the H37Rv sequence, resulting in a significant reduction in computational time. The performance of each virtual typing method was satisfactory and agreed with published results where these were available.

Conclusions

These results indicate that a virtual CS constructed from genome sequence data is an ideal reference for MTBC studies.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1368-9) contains supplementary material, which is available to authorized users.

4.

Background

High-yielding cultivars of rice (Oryza sativa L.) have been developed in Japan from crosses between overseas indica and domestic japonica cultivars. Recently, next-generation sequencing technology and high-throughput genotyping systems have shown many single-nucleotide polymorphisms (SNPs) that are proving useful for detailed analysis of genome composition. These SNPs can be used in genome-wide association studies to detect candidate genome regions associated with economically important traits. In this study, we used a custom SNP set to identify introgressed chromosomal regions in a set of high-yielding Japanese rice cultivars, and we performed an association study to identify genome regions associated with yield.

Results

An informative set of 1152 SNPs was established by screening 14 high-yielding or primary ancestral cultivars for 5760 validated SNPs. Analysis of the population structure of high-yielding cultivars showed three genome types: japonica-type, indica-type and a mixture of the two. SNP allele frequencies showed several regions derived predominantly from one of the two parental genome types. Distinct regions skewed for the presence of parental alleles were observed on chromosomes 1, 2, 7, 8, 11 and 12 (indica) and on chromosomes 1, 2 and 6 (japonica). A possible relationship between these introgressed regions and six yield traits (blast susceptibility, heading date, length of unhusked seeds, number of panicles, surface area of unhusked seeds and 1000-grain weight) was detected in eight genome regions dominated by alleles of one parental origin. Two of these regions were near Ghd7, a heading date locus, and Pi-ta, a blast resistance locus. The allele types (i.e., japonica or indica) of significant SNPs coincided with those previously reported for candidate genes Ghd7 and Pi-ta.

Conclusions

Introgression breeding is an established strategy for the accumulation of QTLs and genes controlling high yield. Our custom SNP set is an effective tool for the identification of introgressed genome regions from a particular genetic background. This study demonstrates that changes in genome structure occurred during artificial selection for high yield, and provides information on several genomic regions associated with yield performance.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-346) contains supplementary material, which is available to authorized users.

5.

Background

Single nucleotide polymorphisms (SNPs) are the most common type of genetic variation. Identification of large numbers of SNPs is helpful for genetic diversity analysis, map-based cloning, genome-wide association analyses and marker-assisted breeding. Recently, identifying genome-wide SNPs in allopolyploid Brassica napus (rapeseed, canola) by resequencing many accessions has become feasible, due to the availability of reference genomes of Brassica rapa (2n = AA) and Brassica oleracea (2n = CC), which are the progenitor species of B. napus (2n = AACC). Although many SNPs in B. napus have been released, the objective in the present study was to produce a larger, more informative set of SNPs for large-scale and efficient genotypic screening. Hence, short-read genome sequencing was conducted on ten elite B. napus accessions for SNP discovery. A subset of these SNPs was randomly selected for sequence validation and for genotyping efficiency testing using the Illumina GoldenGate assay.

Results

A total of 892,536 bi-allelic SNPs were discovered throughout the B. napus genome. A total of 36,458 putative amino acid variants were located in 13,552 protein-coding genes, which were predicted to have enriched binding and catalytic activity as a result. Using the GoldenGate genotyping platform, 94 of 96 SNPs sampled could effectively distinguish genotypes of 130 lines from two mapping populations, with an average call rate of 92%.

Conclusions

Despite the polyploid nature of B. napus, nearly 900,000 simple SNPs were identified by whole genome resequencing. These SNPs were predicted to be effective in high-throughput genotyping assays (51% polymorphic SNPs, 92% average call rate using the GoldenGate assay, leading to an estimated >450 000 useful SNPs). Hence, the development of a much larger genotyping array of informative SNPs is feasible. SNPs identified in this study to cause non-synonymous amino acid substitutions can also be utilized to directly identify causal genes in association studies.
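
The ">450 000" estimate is consistent with multiplying the total SNP count by the polymorphic fraction. The abstract does not spell out the formula, so the calculation below is an assumption about how the figure was reached:

```python
# Figures taken from the abstract; the formula itself is an assumption.
total_snps = 892_536          # bi-allelic SNPs discovered
polymorphic_fraction = 0.51   # share of tested SNPs that were polymorphic

estimated_useful = total_snps * polymorphic_fraction
print(round(estimated_useful))  # 455193
```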

6.

Background and Aims

The Hawaiian silversword alliance (Asteraceae) is one of the best examples of a plant adaptive radiation, exhibiting extensive morphological and ecological diversity. No research within this group has addressed the role of geographical isolation, independent of ecological adaptation, in contributing to taxonomic diversity. The aims of this study were to examine genetic differentiation among subspecies of Dubautia laxa (Asteraceae) and to determine whether allopatric or sympatric populations and subspecies form distinct genetic clusters, in order to understand better the role of geography in diversification within the alliance.

Methods

Dubautia laxa is a widespread member of the Hawaiian silversword alliance, occurring on four of the five major islands of the Hawaiian archipelago, with four subspecies recognized on the basis of morphological, ecological and geographical variation. Nuclear microsatellites and plastid DNA sequence data were examined. Data were analysed using maximum-likelihood and Bayesian phylogenetic methodologies to identify unique evolutionary lineages.

Key Results

Plastid DNA sequence data resolved two highly divergent lineages, recognized as the Laxa and Hirsuta groups, that are more similar to other members of the Hawaiian silversword alliance than they are to each other. The Laxa group is basal to the young island species of Dubautia, whereas the Hirsuta group forms a clade with the old island lineages of Dubautia and with Argyroxiphium. The divergence between the plastid groups is supported by Bayesian microsatellite clustering analyses, but the degree of nuclear differentiation is not as great. Clear genetic differentiation is only observed between allopatric populations, both within and among islands.

Conclusions

These results indicate that geographical separation has aided diversification in D. laxa, whereas ecologically associated morphological differences are not associated with neutral genetic differentiation. This suggests that, despite the stunning ecological adaptation observed, geography has also played an important role in the Hawaiian silversword alliance plant adaptive radiation.

7.

Background

With the price of next generation sequencing steadily decreasing, bacterial genome assembly is now accessible to a wide range of researchers. It is therefore necessary to understand the best methods for generating a genome assembly, specifically, which combination of sequencing and bioinformatics strategies results in the most accurate assemblies. Here, we sequence three E. coli strains on the Illumina MiSeq, Life Technologies Ion Torrent PGM, and Pacific Biosciences RS. We then perform genome assemblies on all three datasets alone or in combination to determine the best methods for the assembly of bacterial genomes.

Results

Three E. coli strains – BL21(DE3), Bal225, and DH5α – were sequenced to a depth of 100× on the MiSeq and Ion Torrent machines and to at least 125× on the PacBio RS. Four assembly methods were examined and compared. The previously published BL21(DE3) genome [GenBank:AM946981.2] allowed us to evaluate the accuracy of each of the BL21(DE3) assemblies. BL21(DE3) PacBio-only assemblies resulted in a 90% reduction in contigs versus short-read-only assemblies, while N50 values increased by over 7-fold. Strikingly, the number of SNPs in PacBio-only assemblies was less than half that seen with short-read assemblies (~20 SNPs vs. ~50 SNPs), and indels also saw dramatic reductions (~2 indels >5 bp in PacBio-only assemblies vs. ~12 for short-read-only assemblies). Assemblies that used a mixture of PacBio and short-read data generally fell in between these two extremes. Use of PacBio sequencing reads also allowed us to call covalent base modifications for the three strains. Each of the strains used here had a known covalent base modification genotype, which was confirmed by PacBio sequencing.
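
For readers unfamiliar with the N50 statistic quoted above, the sketch below computes it under the usual definition: the length of the contig at which the running total of contig lengths, sorted from longest to shortest, first reaches half of the assembly size. The contig lengths are toy values, not data from the study.

```python
def n50(lengths: list[int]) -> int:
    """Contig length at which the sorted cumulative sum reaches half the total."""
    if not lengths:
        raise ValueError("no contigs")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable: the running sum always reaches the total")


# Toy contig lengths, made up for illustration only.
print(n50([100, 80, 50, 30, 20, 10]))  # 80
```

Fewer, longer contigs push N50 up, which is why a large drop in contig count and a rise in N50 tend to appear together.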

Conclusion

Using data generated solely from the Pacific Biosciences RS, we were able to generate the most complete and accurate de novo assemblies of E. coli strains. We found that the addition of other sequencing technology data offered no improvements over use of PacBio data alone. In addition, the sequencing data from the PacBio RS allowed for sensitive and specific calling of covalent base modifications.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-14-675) contains supplementary material, which is available to authorized users.

8.

Background

The mechanism of high-altitude adaptation has been studied in certain mammals. However, in avian species like the ground tit Pseudopodoces humilis, the adaptation mechanism remains unclear. The phylogeny of the ground tit is also controversial.

Results

Using next generation sequencing technology, we generated and assembled a draft genome sequence of the ground tit. The assembly contained 1.04 Gb of sequence that covered 95.4% of the whole genome and had higher N50 values, at the level of both scaffolds and contigs, than other sequenced avian genomes. About 1.7 million SNPs were detected, 16,998 protein-coding genes were predicted and 7% of the genome was identified as repeat sequences. Comparisons between the ground tit genome and other avian genomes revealed a conserved genome structure and confirmed the phylogeny of ground tit as not belonging to the Corvidae family. Gene family expansion and positively selected gene analysis revealed genes that were related to cardiac function. Our findings contribute to our understanding of the adaptation of this species to extreme environmental living conditions.

Conclusions

Our data and analysis contribute to the study of avian evolutionary history and provide new insights into the adaptation mechanisms to extreme conditions in animals.

9.

Background

A large single nucleotide polymorphism (SNP) dataset was used to analyze genome-wide diversity in a diverse collection of watermelon cultivars representing the genetic diversity of globally cultivated watermelon. The marker density required for conducting successful association mapping depends on the extent of linkage disequilibrium (LD) within a population. Use of genotyping by sequencing reveals large numbers of SNPs that in turn generate opportunities in genome-wide association mapping and marker-assisted selection, even in crops such as watermelon for which few genomic resources are available. In this paper, we used genome-wide genetic diversity to study LD, selective sweeps, and pairwise FST distributions among worldwide cultivated watermelons to track signals of domestication.

Results

We examined 183 Citrullus lanatus var. lanatus accessions representing domesticated watermelon and generated a set of 11,485 SNP markers using genotyping by sequencing. With a diverse panel of worldwide cultivated watermelons, we identified a set of 5,254 SNPs with a minor allele frequency of ≥ 0.05, distributed across the genome. All ancestries were traced to Africa and an admixture of various ancestries constituted secondary gene pools across various continents. A sliding window analysis using pairwise FST values was used to resolve selective sweeps. We identified strong selection on chromosomes 3 and 9 that might have contributed to the domestication process. Pairwise analysis of adjacent SNPs within a chromosome as well as within a haplotype allowed us to estimate genome-wide LD decay. LD was also detected within individual genes on various chromosomes. Principal component and ancestry analyses were used to account for population structure in a genome-wide association study. We further mapped important genes for soluble solid content using a mixed linear model.
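
The minor allele frequency (MAF) cut-off mentioned above can be sketched as follows. This assumes biallelic SNPs in diploid accessions with genotypes coded as alternate-allele counts (0, 1 or 2) and `None` for missing calls; the function names, the coding scheme and the toy genotypes are illustrative assumptions, not the authors' software.

```python
from typing import Optional


def minor_allele_frequency(genotypes: list[Optional[int]]) -> float:
    """MAF of one biallelic SNP from diploid alt-allele counts (None = missing)."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    alt_freq = sum(called) / (2 * len(called))
    return min(alt_freq, 1 - alt_freq)


def filter_by_maf(snps: dict[str, list[Optional[int]]],
                  threshold: float = 0.05) -> list[str]:
    """Names of SNPs whose MAF is at least the threshold."""
    return [name for name, g in snps.items()
            if minor_allele_frequency(g) >= threshold]


# Toy genotype table, made up for illustration only.
toy = {
    "snp_a": [0, 0, 1, 2, None],   # MAF 0.375, kept
    "snp_b": [0] * 20 + [1],       # MAF about 0.024, dropped
    "snp_c": [2, 2, 2, 2],         # monomorphic, dropped
}
print(filter_by_maf(toy))  # ['snp_a']
```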

Conclusions

Information concerning the SNP resources, population structure, and LD developed in this study will help in identifying agronomically important candidate genes from the genomic regions underlying selection and for mapping quantitative trait loci using a genome-wide association study in sweet watermelon.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-767) contains supplementary material, which is available to authorized users.

10.

Background

Retrotransposons have been extensively studied in plants and animals and have been shown to have an impact on human genome dynamics and evolution. Their ability to move within genomes gives retrotransposons the potential to cause genome instability.

Methods

We examined a polymorphically inserted AluYa5, an evolutionarily young Alu element, in the progesterone receptor gene to determine the effects of Alu insertion on the molecular environment. We used mono-allelic insertion cell lines, which carry both Alu-present and Alu-absent alleles. To determine epigenetic changes and gene expression, we performed restriction enzyme digestion, pyrosequencing, and chromatin immunoprecipitation.

Results

We observed that the polymorphic insertion of an evolutionarily young Alu increases levels of DNA methylation in the surrounding genomic area and generates inactive histone tail modifications. Consequently, the Alu insertion deleteriously inactivates expression of the neighboring gene.

Conclusion

The mono-allelic Alu insertion cell line clearly showed that polymorphically inserted repetitive elements inactivate expression of the neighboring gene and bring about aberrant epigenetic changes.

11.
12.
13.

Background

Brucellosis is an important zoonotic disease that affects both humans and animals. We sequenced the full genome and characterised the genetic diversity of two Brucella melitensis isolates from Malaysia and the Philippines. In addition, we performed a comparative whole-genome single nucleotide polymorphism (SNP) analysis of B. melitensis strains collected from around the world, to investigate the potential origin and the history of the global spread of B. melitensis.

Results

Single sequencing runs of each genome resulted in draft genome sequences of MY1483/09 and Phil1136/12, which covered 99.85% and 99.92% of the complete genome sequences, respectively. The B. melitensis genome sequences, and two B. abortus strains used as the outgroup strains, yielded a total of 13,728 SNP sites. Phylogenetic analysis using whole-genome SNPs and geographical distribution of the isolates revealed spatial clustering of the B. melitensis isolates into five genotypes, I, II, III, IV and V. The Mediterranean strains, identified as genotype I, occupied the basal node of the phylogenetic tree, suggesting that B. melitensis may have originated from the Mediterranean regions. All of the Asian B. melitensis strains clustered into genotype II with the SEA strains, including the two isolates sequenced in this study, forming a distinct clade denoted here as genotype IId. Genotypes III, IV and V of B. melitensis demonstrated a restricted geographical distribution, with genotype III representing the African lineage, genotype IV representing the European lineage and genotype V representing the American lineage.

Conclusion

We showed that SNPs retrieved from the B. melitensis draft full genomes were sufficient to resolve the interspecies relationships between B. melitensis strains and to discriminate between the vaccine and endemic strains. Phylogeographic reconstruction of the history of B. melitensis global spread at a finer scale by using whole-genome SNP analyses supported the origin of all B. melitensis strains from the Mediterranean region. The possible global distribution of B. melitensis following the ancient trade routes was also consistent with the whole-genome SNP phylogeny. Whole-genome SNP phylogenetic analysis is hence a powerful tool for intraspecies discrimination of closely related species.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1294-x) contains supplementary material, which is available to authorized users.

14.
15.

Background and Aims

Genome duplication is a central process in plant evolution and contributes to patterns of variation in genome size within and among lineages. Studies that combine cytogeography with genome size measurements contribute to our basic knowledge of cytotype distributions and their associations with variation in genome size.

Methods

Ploidy and genome size were assessed with direct chromosome counts and flow cytometry for 78 populations within the Claytonia perfoliata complex, comprised of three diploid taxa with numerous polyploids that range to the decaploid level. The relationship between genome size and temperature and precipitation was investigated within and across cytotypes to test for associations between environmental factors and nuclear DNA content.

Key Results

A euploid series (n = 6) of diploids to octoploids was documented through chromosome counts, and decaploids were suggested by flow cytometry. Increased variation in genome size among populations was found at higher ploidy levels, potentially associated with differential contributions of diploid parental genomes, variation in rates of genomic loss or gain, or undetected hybridization. Several accessions were detected with atypical genome sizes, including a diploid population of C. parviflora ssp. grandiflora with an 18 % smaller genome than typical, and hexaploids of C. perfoliata and C. parviflora with genomes 30 % larger than typical. There was a slight but significant association of larger genome sizes with colder winter temperature across the C. perfoliata complex as a whole, and a strong association between lower winter temperatures and large genome size for tetraploid C. parviflora.

Conclusions

The C. perfoliata complex is characterized by polyploids ranging from tetraploid to decaploid, with large magnitude variation in genome size at higher ploidy levels, associated in part with environmental variation in temperature.

16.

Background

Previous genome-wide association analyses identified QTL regions in the X chromosome for percentage of normal sperm and scrotal circumference in Brahman and Tropical Composite cattle. These traits are important to study because they are indicators of male fertility and are correlated with female sexual precocity and reproductive longevity. The aim was to investigate candidate genes in these regions and to identify putative causative mutations that influence these traits. In addition, we tested the identified mutations for female fertility and growth traits.

Results

Using a combination of bioinformatics and molecular assay technology, twelve non-synonymous SNPs in eleven genes were genotyped in a cattle population. Three and nine SNPs explained more than 1% of the additive genetic variance for percentage of normal sperm and scrotal circumference, respectively. The SNPs that had a major influence on percentage of normal sperm mapped to the LOC100138021 and TAF7L genes, and those for scrotal circumference mapped to the TEX11 and AR genes. One SNP in TEX11 explained ~13% of the additive genetic variance for scrotal circumference at 12 months. The tested SNPs were also associated with weight measurements, but not with female fertility traits.

Conclusions

The strong association of SNPs located in X chromosome genes with male fertility traits validates the QTL. The implicated genes are good candidates for use in genetic evaluation, without detrimentally influencing female fertility traits.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1595-0) contains supplementary material, which is available to authorized users.

17.

Background

Kelp (Saccharina japonica) has been intensively cultured in China for almost a century. Its genetic improvement is comparable with that of rice. However, the development of molecular tools for kelp has been extremely limited, as has knowledge of its genes, genetics and genomics. Kelp has an alternating life cycle in which the sporophyte generation alternates with the gametophyte generation. The gametophytes of kelp can be cloned and crossed. Due to these characteristics, kelp may serve as a reference for the biological and genetic studies of Volvox, mosses and ferns.

Results

We constructed a high-density single nucleotide polymorphism (SNP) linkage map for kelp by restriction site associated DNA (RAD) sequencing. In total, 4,994 SNP-containing physical (tag-defined) RAD loci were mapped on 31 linkage groups. The map spanned a total genetic distance of 1,782.75 cM, covering 98.66% of the expected length (1,806.94 cM). The RAD tags (85 bp) were extended to 400–500 bp using the MiSeq method, which makes it easier to develop SNP chips and to move SNP genotyping to a high-throughput track. The number of linkage groups agreed with that documented by cytological methods. In addition, we identified a set of microsatellites (99 in total) from the extended RAD tags. A gametophyte sex-determining locus was mapped on linkage group 2 in a window about 9.0 cM in width, which was 2.66 cM up to marker_40567 and 6.42 cM down to marker_23595.
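
The coverage figure and the width of the sex-locus window follow directly from the numbers quoted above. A quick check, using only values from the abstract (variable names are illustrative):

```python
observed_cm = 1782.75   # total genetic distance of the map
expected_cm = 1806.94   # expected genome length

coverage_pct = 100 * observed_cm / expected_cm
print(f"{coverage_pct:.2f}%")  # 98.66%

# Window around the sex-determining locus on linkage group 2:
# distance to marker_40567 plus distance to marker_23595.
window_cm = round(2.66 + 6.42, 2)
print(window_cm)  # 9.08
```

The window sums to 9.08 cM, which matches the "about 9.0 cM" stated in the abstract.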

Conclusions

A high-density SNP linkage map was constructed for kelp, an intensively cultured brown alga in China. The RAD tags were also extended so that a SNP chip could be developed. In addition, a set of microsatellites was identified among the mapped loci, and a gametophyte sex-determining locus was mapped. This map will facilitate genetic studies of kelp, including, for example, the evaluation of germplasm and the deciphering of the genetic bases of economic traits.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1371-1) contains supplementary material, which is available to authorized users.

18.

Background

Ziziphus Mill. (jujube), the most valued genus of Rhamnaceae, comprises a number of economically and ecologically important species such as Z. jujuba Mill., Z. acidojujuba Cheng et Liu and Z. mauritiana Lam. Single nucleotide polymorphism (SNP) markers and a high-density genetic map are of great benefit for improving the crop, mapping quantitative trait loci (QTL) and analyzing genome structure. However, such a high-density map is still absent in the genus Ziziphus and even in the family Rhamnaceae. The recently developed restriction-site associated DNA (RAD) marker has proven to be very powerful in genetic map construction. The objective of this study was to construct a high-density linkage map using the RAD tags generated by next generation sequencing.

Results

An interspecific F1 population and their parents (Z. jujuba Mill. ‘JMS2’ × Z. acidojujuba Cheng et Liu ‘Xing 16’) were genotyped using a mapping-by-sequencing approach, to generate RAD-based SNP markers. A total of 42,784 putative high quality SNPs were identified between the parents and 2,872 high-quality RAD markers were grouped in genetic maps. Of the 2,872 RAD markers, 1,307 were linked to the female genetic map, 1,336 to the male map, and 2,748 to the integrated map spanning 913.87 centi-morgans (cM) with an average marker interval of 0.34 cM. The integrated map contained 12 linkage groups (LGs), consistent with the haploid chromosome number of the two parents.

Conclusion

We generated the first high-density genetic linkage map for jujube, containing 2,748 RAD markers, and also developed a large number of SNPs. The map provides a useful tool for both marker-assisted breeding and a variety of genome investigations in jujube, such as sequence assembly, gene localization, QTL detection and genome structure comparison.

19.
BMC Genomics, 2014, 15(1)

Background

Sugarcane is the source of sugar in all tropical and subtropical countries and is becoming increasingly important for bio-based fuels. However, its large (10 Gb), polyploid, complex genome has hindered genome based breeding efforts. Here we release the largest and most diverse set of sugarcane genome sequences to date, as part of an on-going initiative to provide a sugarcane genomic information resource, with the ultimate goal of producing a gold standard genome.

Results

Three hundred and seventeen chiefly euchromatic BACs were sequenced. A reference set of one thousand four hundred manually-annotated protein-coding genes was generated. A small RNA collection and an RNA-seq library were used to explore expression patterns and the sRNA landscape. In the sucrose and starch metabolism pathway, 16 non-redundant enzyme-encoding genes were identified. One of the sucrose pathway genes, sucrose-6-phosphate phosphohydrolase, is duplicated in sugarcane and sorghum, but not in rice and maize. A diversity analysis of the s6pp duplication region revealed haplotype-structured sequence composition. Examination of hom(e)ologous loci indicates variation in both sequence structure and the sRNA landscape. A synteny analysis shows that the sugarcane genome has expanded relative to the sorghum genome, largely due to the presence of transposable elements and uncharacterized intergenic and intronic sequences.

Conclusion

This release of sugarcane genomic sequences will advance our understanding of sugarcane genetics and contribute to the development of molecular tools for breeding purposes and gene discovery.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-540) contains supplementary material, which is available to authorized users.

20.

Background

Chlamydia pecorum is an important pathogen of domesticated livestock including sheep, cattle and pigs. This pathogen is also a key factor in the decline of the koala in Australia. We sequenced the genomes of three koala C. pecorum strains, isolated from the urogenital tracts and conjunctiva of diseased koalas. The genome of the C. pecorum VR629 (IPA) strain, isolated from a sheep with polyarthritis, was also sequenced.

Results

Comparisons of the draft C. pecorum genomes against the complete genomes of livestock C. pecorum isolates revealed that these strains have a conserved gene content and order, sharing a nucleotide sequence similarity > 98%. Single nucleotide polymorphisms (SNPs) appear to be key factors in understanding the adaptive process. Two regions of the chromosome were found to be accumulating a large number of SNPs within the koala strains. These regions include the Chlamydia plasticity zone, which contains two cytotoxin genes (toxA and toxB), and a 77 kbp region that codes for putative type III effector proteins. In one koala strain (MC/MarsBar), the toxB gene was truncated by a premature stop codon, whereas it was full-length in IPTaLE and DBDeUG. Another five pseudogenes were also identified, two unique to the urogenital strains C. pecorum MC/MarsBar and C. pecorum DBDeUG, respectively, while three were unique to the koala C. pecorum conjunctival isolate IPTaLE. An examination of the distribution of these pseudogenes in C. pecorum strains from a variety of koala populations, alongside a number of sheep and cattle C. pecorum positive samples from Australian livestock, confirmed the presence of four predicted pseudogenes in koala C. pecorum clinical samples. Consistent with our genomics analyses, none of these pseudogenes were observed in the livestock C. pecorum samples examined. Interestingly, three SNPs resulting in pseudogenes identified in the IPTaLE isolate were not found in any other C. pecorum strain analysed, raising questions over the origin of these point mutations.

Conclusions

The genomic data revealed that variation between C. pecorum strains was mainly due to the accumulation of SNPs, some of which cause gene inactivation. The identification of these genetic differences will provide the basis for further studies to understand the biology and evolution of this important animal pathogen.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-667) contains supplementary material, which is available to authorized users.
