Similar literature
 20 similar articles found (search time: 31 ms)
1.
We present herein the first complete genome sequence of a thermophilic Bacillus-related species, Geobacillus kaustophilus HTA426, which is composed of a 3.54 Mb chromosome and a 47.9 kb plasmid, along with a comparative analysis with five mesophilic bacillar genomes. Upon orthologous grouping of the six sequenced bacillar genomes, it was found that 1257 common orthologous groups composed of 1308 genes (37%) are shared by all the bacilli, whereas 839 genes (24%) in the G. kaustophilus genome were found to be unique to that species. Among the 839 unique genes, we found the first prokaryotic sperm protamine P1 homolog, polyamine synthase, polyamine ABC transporter and RNA methylase; these may contribute to thermophily by stabilizing the nucleic acids. Contrasting results were obtained from the principal component analysis (PCA) of the amino acid composition and synonymous codon usage for highlighting the thermophilic signature of the G. kaustophilus genome. Only in the PCA of the amino acid composition were the Bacillus-related species located near, but distinguishable from, the borderline separating thermophiles from mesophiles on the second principal axis. Further analysis revealed some asymmetric amino acid substitutions between the thermophiles and the mesophiles, which are possibly associated with the thermoadaptation of the organism.

2.

Key message

The Co-x anthracnose R gene of common bean was fine-mapped into a 58 kb region at one end of chromosome 1, where no canonical NB-LRR-encoding genes are present in the G19833 genome sequence.

Abstract

Anthracnose, caused by the phytopathogenic fungus Colletotrichum lindemuthianum, is one of the most damaging diseases of common bean, Phaseolus vulgaris. Various resistance (R) genes, named Co-, conferring race-specific resistance to different strains of C. lindemuthianum have been identified. The Andean cultivar JaloEEP558 was reported to carry Co-x on chromosome 1, conferring resistance to the highly virulent strain 100. To fine map Co-x, 181 recombinant inbred lines (RILs) derived from the cross between JaloEEP558 and BAT93 were genotyped with polymerase chain reaction (PCR)-based markers developed using the genome sequence of the Andean genotype G19833. Analysis of RILs carrying key recombination events positioned Co-x at one end of chromosome 1 to a 58 kb region of the G19833 genome sequence. Annotation of this target region revealed eight genes: three phosphoinositide-specific phospholipases C (PI-PLC), one zinc finger protein and four kinases, suggesting that Co-x is not a classical nucleotide-binding leucine-rich repeat (NB-LRR)-encoding gene. In addition, we identified and characterized the seven members of the common bean PI-PLC gene family, distributed into two clusters located at the ends of chromosomes 1 and 8. Co-x is not a member of the Co-1 allelic series since these two genes are separated by at least 190 kb. Comparative analysis between soybean and common bean revealed that the Co-x syntenic region, located at one end of Glycine max chromosome 18, carries Rhg1, a major QTL contributing to soybean cyst nematode resistance. The PCR-based markers generated in this study should be useful in marker-assisted selection for pyramiding Co-x with other R genes.

3.
S100 proteins are calcium-binding proteins that exist only in vertebrates and constitute a large protein family. The origin and evolution of the S100 family in vertebrate lineages remain a challenge. Here, we examined the synteny conservation of mammalian S100A genes by analysing the sequence of available vertebrate S100 genes in databases. Five previously unknown S100A gene members were identified by chromosome mapping analysis. Mammalian S100A genes are duplicated and clustered on a single chromosome, while two S100A gene clusters are found on separate chromosomes in teleost fish, suggesting that S100A genes existed in fish before the fish-specific genome duplication took place. During speciation, tandem gene duplication events within the cluster of S100A genes of a given chromosome have probably led to the multiple members of the S100A gene family. These duplicated genes have been retained in the genome either by neofunctionalisation and/or subfunctionalisation or have evolved into non-coding sequences. However, other S100 genes are also present in vertebrate genomes, i.e. S100P, S100B, S100G and S100Z, which exist as single-copy genes distributed on different chromosomes, suggesting that they could have evolved from an ancestor different from that of the S100A genes.

4.
A highly lethal hemorrhagic disease associated with infection by elephant endotheliotropic herpesvirus (EEHV) poses a severe threat to Asian elephant husbandry. We have used high-throughput methods to sequence the genomes of the two genotypes that are involved in most fatalities, namely, EEHV1A and EEHV1B (species Elephantid herpesvirus 1, genus Proboscivirus, subfamily Betaherpesvirinae, family Herpesviridae). The sequences were determined from postmortem tissue samples, despite the data containing tiny proportions of viral reads among reads from a host for which the genome sequence was not available. The EEHV1A genome is 180,421 bp in size and consists of a unique sequence (174,601 bp) flanked by a terminal direct repeat (2,910 bp). The genome contains 116 predicted protein-coding genes, of which six are fragmented, and seven paralogous gene families are present. The EEHV1B genome is very similar to that of EEHV1A in structure, size, and gene layout. Half of the EEHV1A genes lack orthologs in other members of subfamily Betaherpesvirinae, such as human cytomegalovirus (genus Cytomegalovirus) and human herpesvirus 6A (genus Roseolovirus). Notable among these are 23 genes encoding type 3 membrane proteins containing seven transmembrane domains (the 7TM family) and seven genes encoding related type 2 membrane proteins (the EE50 family). The EE50 family appears to be under intense evolutionary selection, as it is highly diverged between the two genotypes, exhibits evidence of sequence duplications or deletions, and contains several fragmented genes. The availability of the genome sequences will facilitate future research on the epidemiology, pathogenesis, diagnosis, and treatment of EEHV-associated disease.

5.
Bacterial cold water disease (BCWD) causes significant economic loss in salmonid aquaculture. Using microsatellite markers in a genome scan, we previously detected significant and suggestive QTL affecting phenotypic variation in survival following challenge with Flavobacterium psychrophilum, the causative agent of BCWD in rainbow trout. In this study, we performed selective genotyping of SNPs from restriction-site associated DNA (RAD) sequence data from two pedigreed families (2009070 and 2009196) to validate the major QTL from the previous work and to detect new QTL. The use of RAD SNPs in the genome scans increased the number of mapped markers from ~300 to ~5,000 per family. The significant QTL detected in the microsatellites scan on chromosome Omy8 in family 2009070 was validated explaining up to 58% of the phenotypic variance in that family, and in addition, a second QTL was also detected on Omy8. Two novel QTL on Omy11 and 14 were also detected, and the previously suggestive QTL on Omy1, 7 and 25 were also validated in family 2009070. In family 2009196, the microsatellite significant QTL on Omy6 and 12 were validated and a new QTL on Omy8 was detected, but none of the previously detected suggestive QTL were validated. The two Omy8 QTL from family 2009070 and the Omy12 QTL from family 2009196 were found to be co-localized with handling and confinement stress response QTL that our group has previously identified in a separate pedigreed family. With the currently available data we cannot determine if the co-localized QTL are the result of genes with pleiotropic effects or a mere physical proximity on the same chromosome segment. The genetic markers linked to BCWD resistance QTL were used to query the scaffolds of the rainbow trout reference genome assembly and the QTL-positive scaffold sequences were found to include 100 positional candidate genes. 
Several of the candidate genes located on or near the two Omy8 QTL detected in family 2009070 suggest potential linkages between stress response and the regulation of immune response in rainbow trout.

6.
The complete genomic sequence of an intracellular bacterial pathogen, Mycoplasma penetrans HF-2 strain, was determined. The HF-2 genome consists of a 1 358 633 bp single circular chromosome containing 1038 predicted coding sequences (CDSs), one set of rRNA genes and 30 tRNA genes. Among the 1038 CDSs, 264 predicted proteins are common to the Mycoplasmataceae sequenced thus far and 463 are M. penetrans specific. The genome contains the two-component system but lacks the essential cellular gene, uridine kinase. The relatively large genome of M. penetrans HF-2 among mycoplasma species may be accounted for by both its rich core proteome and the presence of a number of paralog families corresponding to 25.4% of all CDSs. The largest paralog family is the p35 family, which encodes surface lipoproteins including the major antigen, P35. A total of 44 genes for p35 and p35 homologs were identified and 30 of them form one large cluster in the chromosome. The genetic tree of p35 paralogs suggests the occurrence of dynamic chromosomal rearrangement in paralog formation during evolution. Thus, M. penetrans HF-2 may have acquired diverse repertoires of antigenic variation-related genes to allow its persistent infection in humans.

7.

Key message

qFS07.1 controlling fiber strength was fine-mapped to a 62.6-kb region containing four annotated genes. RT-qPCR and sequence of candidate genes identified an LRR RLK gene as the most likely candidate.

Abstract

Fiber strength is an important component of cotton fiber quality and is associated with other properties, such as fiber maturity, fineness, and length. Stable QTL qFS07.1, controlling fiber strength, had been identified on chromosome 7 in an upland cotton recombinant inbred line (RIL) population from a cross (CCRI35 × Yumian1) described in our previous studies. To fine-map qFS07.1, an F2 population with 2484 individual plants from a cross between recombinant line RIL014 and CCRI35 was established. A total of 1518 SSR primer pairs, including 1062 designed from chromosome 1 of the Gossypium raimondii genome and 456 from chromosome 1 of the G. arboreum genome (corresponding to the QTL region), were used to fine-map qFS07.1, and qFS07.1 was mapped into a 62.6-kb genome region which contained four annotated genes on chromosome A07 of G. hirsutum. RT-qPCR and comparative analysis of candidate genes revealed a leucine-rich repeat protein kinase (LRR RLK) family protein to be a promising candidate gene for qFS07.1. Fine mapping and identification of the candidate gene for qFS07.1 will play a vital role in marker-assisted selection (MAS) and the study of the mechanism of cotton fiber development.

8.
Gloeostereum incarnatum has edible and medicinal value and was first cultivated and domesticated in China. We sequenced the G. incarnatum monokaryotic strain GiC-126 on an Illumina HiSeq X Ten system and obtained a 34.52-Mb genome assembly sequence that encoded 16,895 predicted genes. We combined the GiC-126 genome with the published genome of G. incarnatum strain CCMJ2665 to construct a genetic linkage map (GiC-126 genome) that had 10 linkage groups (LGs), and the 15 assembly sequences of CCMJ2665 were integrated into 8 LGs. We identified 1912 simple sequence repeat (SSR) loci and detected 700 genes containing 768 SSRs in the genome; 65 and 100 of them were annotated with gene ontology (GO) terms and KEGG pathways, respectively. Carbohydrate-active enzymes (CAZymes) were identified in 20 fungal genomes and annotated; among them, 144 CAZymes were annotated in the GiC-126 genome. The A mating-type locus (MAT-A) of G. incarnatum was located on scaffold885 at 38.9 cM of LG1 and was flanked by two homeodomain (HD1) genes, mip and beta-fg. Fourteen segregation distortion markers were detected in the genetic linkage map, all of which were skewed toward the parent GiC-126. They formed three segregation distortion regions (SDR1–SDR3), and 22 predicted genes were found in scaffold1920, where three segregation distortion markers were located in SDR1. In this study, we corrected and updated the genomic information of G. incarnatum. Our results will provide a theoretical basis for fine gene mapping, functional gene cloning, and follow-up genetic breeding of G. incarnatum.

9.
10.
Blumea balsamifera (L.) DC., a medicinal plant with high economic value in the Asteraceae family, is widely distributed in China and Southeast Asia. However, studies on its population structure or phylogenetic relationships with related species are rare owing to the lack of genome information. In this study, through high-throughput sequencing, we found that the chloroplast genome of B. balsamifera was 151,170 bp in length, with a pair of inverted repeat regions (IRa and IRb) each comprising 24,982 bp, a large single-copy (LSC) region comprising 82,740 bp, and a small single-copy (SSC) region comprising 18,466 bp. A total of 130 genes were identified in the chloroplast genome of B. balsamifera, including 85 protein-coding, 37 transfer RNA, and 8 ribosomal RNA genes; furthermore, sequence analysis identified 53 simple sequence repeats. Whole chloroplast genome comparison indicated that the inverted repeat (IR) regions were more conserved than the LSC and SSC regions. Phylogenetic analysis showed that B. balsamifera is closely related to Pluchea indica. In conclusion, the chloroplast genome of B. balsamifera is helpful for species identification and for analysis of the genetic diversity and evolution in the genus Blumea and family Asteraceae.
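The region lengths reported for the B. balsamifera chloroplast genome can be cross-checked with simple arithmetic, since a quadripartite chloroplast genome is the sum of the LSC region, the SSC region, and two inverted repeat copies. The sketch below is illustrative only: the function name `chloroplast_total_length` is hypothetical, and it assumes that the reported 24,982 bp is the length of each IR copy rather than of the pair.

```python
def chloroplast_total_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) as reported in the abstract.
total = chloroplast_total_length(lsc=82_740, ssc=18_466, ir=24_982)
print(total)  # 151170
```

Under that assumption the regions add up to the reported genome length of 151,170 bp.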

11.
The complete genome sequence of Nocardia nova SH22a was determined in light of this strain's remarkable ability to degrade rubber and gutta-percha (GP). The genome consists of a circular chromosome of 8,348,532 bp with a G+C content of 67.77% and 7,583 predicted protein-encoding genes. Functions were assigned to 72.45% of the coding sequences. Among them, a large number of genes probably involved in the metabolism of xenobiotics and poorly degradable compounds, as well as genes that participate in the synthesis of polyketide- and/or nonribosomal peptide-type secondary metabolites, were detected. Based on in silico analyses and experimental studies, such as transposon mutagenesis and directed gene deletion studies, the pathways of rubber and GP degradation were proposed and the relationship between both pathways was unraveled. The genes involved include, inter alia, genes participating in cell envelope synthesis (long-chain-fatty-acid–AMP ligase and arabinofuranosyltransferase), β-oxidation (α-methylacyl-coenzyme A [α-methylacyl-CoA] racemase), propionate catabolism (acyl-CoA carboxylase), gluconeogenesis (phosphoenolpyruvate carboxykinase), and transmembrane substrate uptake (Mce [mammalian cell entry] transporter). This study not only improves our insights into the mechanism of microbial degradation of rubber and GP but also expands our knowledge of the genus Nocardia regarding metabolic diversity.

12.

Background

Rhizobium leguminosarum is an α-proteobacterial N2-fixing symbiont of legumes that has been the subject of more than a thousand publications. Genes for the symbiotic interaction with plants are well studied, but the adaptations that allow survival and growth in the soil environment are poorly understood. We have sequenced the genome of R. leguminosarum biovar viciae strain 3841.

Results

The 7.75 Mb genome comprises a circular chromosome and six circular plasmids, with 61% G+C overall. All three rRNA operons and 52 tRNA genes are on the chromosome; essential protein-encoding genes are largely chromosomal, but most functional classes occur on plasmids as well. Of the 7,263 protein-encoding genes, 2,056 had orthologs in each of three related genomes (Agrobacterium tumefaciens, Sinorhizobium meliloti, and Mesorhizobium loti), and these genes were over-represented in the chromosome and had above average G+C. Most supported the rRNA-based phylogeny, confirming A. tumefaciens to be the closest among these relatives, but 347 genes were incompatible with this phylogeny; these were scattered throughout the genome but were over-represented on the plasmids. An unexpectedly large number of genes were shared by all three rhizobia but were missing from A. tumefaciens.

Conclusion

Overall, the genome can be considered to have two main components: a 'core', which is higher in G+C, is mostly chromosomal, is shared with related organisms, and has a consistent phylogeny; and an 'accessory' component, which is sporadic in distribution, lower in G+C, and located on the plasmids and chromosomal islands. The accessory genome has a different nucleotide composition from the core despite a long history of coexistence.

13.
14.
Glycine latifolia (Benth.) Newell & Hymowitz (2n = 40), one of the 27 wild perennial relatives of soybean, possesses genetic diversity and agronomically favorable traits that are lacking in soybean. Here, we report the 939‐Mb draft genome assembly of G. latifolia (PI 559298) using exclusively linked‐reads sequenced from a single Chromium library. We organized scaffolds into 20 chromosome‐scale pseudomolecules utilizing two genetic maps and the Glycine max (L.) Merr. genome sequence. High copy numbers of putative 91‐bp centromere‐specific tandem repeats were observed in consecutive blocks within predicted pericentromeric regions on several pseudomolecules. No 92‐bp putative centromeric repeats, which are abundant in G. max, were detected in G. latifolia or Glycine tomentella. Annotation of the assembled genome and subsequent filtering yielded a high confidence gene set of 54 475 protein‐coding loci. In comparative analysis with five legume species, genes related to defense responses were significantly overrepresented in Glycine‐specific orthologous gene families. A total of 304 putative nucleotide‐binding site (NBS)‐leucine‐rich‐repeat (LRR) genes were identified in this genome assembly. Unlike in other legume species, we observed a scarcity of TIR‐NBS‐LRR genes in G. latifolia. The G. latifolia genome was also predicted to contain genes encoding 367 LRR‐receptor‐like kinases, a family of proteins involved in basal defense responses and responses to abiotic stress. The genome sequence and annotation of G. latifolia provide a valuable source of alternative alleles and novel genes to facilitate soybean improvement. This study also highlights the efficacy and cost‐effectiveness of the application of Chromium linked‐reads in diploid plant genome de novo assembly.

15.
Sequence Analysis of the Genome of an Oil-Bearing Tree, Jatropha curcas L.   Cited by: 2 (self-citations: 0; citations by others: 2)
DNA Research, 2011, 18(1): 65-76
The whole genome of Jatropha curcas was sequenced, using a combination of the conventional Sanger method and new-generation multiplex sequencing methods. Total length of the non-redundant sequences thus obtained was 285 858 490 bp consisting of 120 586 contigs and 29 831 singlets. They accounted for ∼95% of the gene-containing regions, with an average G + C content of 34.3%. A total of 40 929 complete and partial structures of protein-encoding genes have been deduced. Comparison with genes of other plant species indicated that 1529 (4%) of the putative protein-encoding genes are specific to the Euphorbiaceae family. A high degree of microsynteny was observed with the genome of castor bean and, to a lesser extent, with those of soybean and Arabidopsis thaliana. In parallel with genome sequencing, cDNAs derived from leaf and callus tissues were subjected to pyrosequencing, and a total of 21 225 unigene data have been generated. Polymorphism analysis using microsatellite markers developed from the genomic sequence data obtained was performed with 12 J. curcas lines collected from various parts of the world to estimate their genetic diversity. The genomic sequence and accompanying information presented here are expected to serve as valuable resources for the acceleration of fundamental and applied research with J. curcas, especially in the fields of environment-related research such as biofuel production. Further information on the genomic sequences and DNA markers is available at http://www.kazusa.or.jp/jatropha/.

16.
17.
Gene, 1996, 174(1): 43-50
The fungus Phycomyces blakesleeanus has a relatively small genome, 30 megabases (Mb), with a low guanine and cytosine (G+C) content, 35%; the coding sequences cloned to date all have a G+C content of about 50%. In order to investigate the organization of the genome of this fungus, we have cloned and sequenced 251 DNA fragments. One hundred and twenty-six clones were obtained by digestion with MspI (target sequence 5′-CCGG-3′) and 125 random clones were obtained by sonication. The average length of sequence obtained was about 200 base pairs (bp) and the total length was about 50 kilobases (kb). The G+C content is not homogeneous throughout the genome: sequences obtained after digestion with MspI have an average of 5% more G+C content than the random fragments, and are enriched in coding sequences. Fourteen MspI fragments show similarities to known proteins and 21 encode ribosomal RNA (rRNA). By contrast, only three of the random fragments are similar to known proteins and only one to an rRNA. We conclude that the Phycomyces genome is composed of G+C-rich genes surrounded by G+C-poor areas. Two clones have similarities to the transposase of the transposon Tc1 from Caenorhabditis elegans. This result suggests the presence of a high copy number of a Tc1-like transposable element in the Phycomyces genome. Another clone was similar to the transposon Tx1 from Xenopus laevis. A novel repetitive nt sequence has been characterized; about 5% of the total genome is a repetition of either of two consensus sequences of 31 bp named PrA1 and PrA2.
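The two quantities this abstract relies on, G+C content and the location of MspI target sites (5′-CCGG-3′), are straightforward to compute from a sequence string. The following is a minimal sketch and not the authors' method: the function names `gc_content` and `mspi_sites` are hypothetical, the example sequence is invented for illustration, and the code only locates the recognition sequence without modelling the cut position within it.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def mspi_sites(seq: str) -> list[int]:
    """0-based start positions of every CCGG (MspI target) occurrence."""
    seq = seq.upper()
    sites = []
    pos = seq.find("CCGG")
    while pos != -1:
        sites.append(pos)
        # Restart one base further on so overlapping matches are not skipped.
        pos = seq.find("CCGG", pos + 1)
    return sites


example = "ATATCCGGATTTGCGCCCGGTTAA"  # made-up sequence, not from the paper
print(gc_content(example))   # 0.5
print(mspi_sites(example))   # [4, 16]
```

Applying `gc_content` separately to MspI-derived and randomly sheared fragments would give the kind of comparison described above, where the MspI fragments average about 5% more G+C.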

18.
Scrub typhus (‘Tsutsugamushi’ disease in Japanese) is a mite-borne infectious disease. The causative agent is Orientia tsutsugamushi, an obligate intracellular bacterium belonging to the family Rickettsiaceae of the subdivision alpha-Proteobacteria. In this study, we determined the complete genome sequence of O. tsutsugamushi strain Ikeda, which comprises a single chromosome of 2 008 987 bp and contains 1967 protein coding sequences (CDSs). The chromosome is much larger than those of other members of Rickettsiaceae, and 46.7% of the sequence was occupied by repetitive sequences derived from an integrative and conjugative element, 10 types of transposable elements, and seven types of short repeats of unknown origins. The massive amplification and degradation of these elements have generated a huge number of repeated genes (1196 CDSs, categorized into 85 families), many of which are pseudogenes (766 CDSs), and also induced intensive genome shuffling. By comparing the gene content with those of other members of the family Rickettsiaceae, we identified the core gene set of the family and found that, while much more extensive gene loss has taken place among the housekeeping genes of Orientia than those of Rickettsia, O. tsutsugamushi has acquired a large number of foreign genes. The O. tsutsugamushi genome sequence is thus a prominent example of the high plasticity of bacterial genomes, and provides the genetic basis for a better understanding of the biology of O. tsutsugamushi and the pathogenesis of ‘Tsutsugamushi’ disease.
Key words: Orientia tsutsugamushi, genome sequencing, obligate intracellular bacterium, repetitive sequence, IS element, integrative and conjugative element, gene amplification, genome reduction

19.
20.
This study investigated the potential anti-fungal mechanisms of sodium dehydroacetate (SD) against Geotrichum citri-aurantii. The results showed that the cell wall integrity of G. citri-aurantii was not affected, whereas the membrane permeability of G. citri-aurantii mycelia was visibly altered by SD. Dramatic morphological changes of the mycelia, such as loss of cytoplasm, plasmolysis, and dissolution of intracellular substances, were observed by scanning electron microscopy and transmission electron microscopy analyses, indicating that the mycelium is severely damaged by the SD treatment. Furthermore, SD apparently induced a decrease in the intracellular ATP content before 30 min of exposure. An increase in the activity of the Na+/K+-ATPase was also observed, indicating that Na+ ions might enter the cell and thus disturb the energy supply. Taken together, this study’s findings suggest that the anti-fungal activity of SD against G. citri-aurantii can be attributed to the disruption of cell membrane permeability and energy metabolism.
