Similar articles
 Found 20 similar articles (search time: 46 ms)
1.
2.
Non-LTR retrotransposons (LINEs) as ubiquitous components of plant genomes   (Total citations: 9; self-citations: 0; citations by others: 9)
During the course of work aimed at isolating a rice gene from Oryza australiensis by PCR, the oligonucleotide primers used were found to generate a fragment that showed sequence homology to the endonuclease (EN) region of the maize non-LTR retrotransposon (LINE) Cin4. We carried out further PCRs using oligonucleotide primers that hybridized to these sequences, and found that they amplified several fragments, each with homology to the EN regions, from Oryza sativa cv. Nipponbare as well as O. australiensis. We mapped the approximate locations of two rice LINE homologues by screening clones in a YAC library made from a rice (O. sativa) genome, and found that each homologue was present in a low copy number apparently at nonspecific regions on rice chromosomes. We then carried out PCR using degenerate oligonucleotide primers which hybridized to the rice LINE homologues and Cin4 to ascertain whether LINE homologues are present in a variety of members of the plant kingdom, including angiosperms, gymnosperms, bracken, horsetail and liverwort. Cloning and nucleotide sequencing revealed that 53 clones obtained from 27 out of 33 plant species contained LINE homologues. In addition to these homologues, we identified four homologues with EN regions in the Arabidopsis thaliana genome by a computer search of databases. The nucleotide sequences of almost all the LINE homologues were greatly diverged, but the derived amino acid sequences were well conserved, and all contained glutamic acid and tyrosine residues at almost the same relative positions as in the active site regions of AP (apurinic/apyrimidinic)-endonucleases. The EN regions in the LINE homologues from closely related plant species show a closer phylogenetic relationship, indicating that sequence divergence during vertical transmission has been a major influence upon the evolution of plant LINEs. Received: 13 July 1998 / Accepted: 13 October 1998

3.
The CHORI-212 bacterial artificial chromosome (BAC) library was constructed by cloning EcoRI/EcoRI partially digested DNA into the pTARBAC2.1 vector. The library has an average insert size of 161 kb, and provides 10.6-fold coverage of the channel catfish haploid genome. Screening of 32 genes using overgo or cDNA probes indicated that this library had a good representation of the genome as all tested genes existed in the library. We previously reported sequencing of approximately 25,000 BAC ends that generated 20,366 high-quality BAC end sequences (BES) and identified a large number of sequences similar to known genes using BLASTX searches. In this work, particular attention was given to identification of BAC mate pairs with known genes from both ends. When identified, comparative genome analysis was conducted to determine syntenic regions of the catfish genome with the genomes of zebrafish and Tetraodon. Of the 141 mate pairs with known genes from channel catfish, conserved syntenies were identified in 34 (24.1%), with 30 conserved in the zebrafish genome and 14 conserved in the Tetraodon genome. Additional analysis of three of the 34 conserved syntenic groups by direct sequencing indicated conserved gene contents in all three species. This indicates that comparative genome analysis may provide shortcuts to genome analysis in catfish, especially for short genomic regions once the conserved syntenies are identified. Shaolin Wang and Peng Xu contributed equally to the article.
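
The mate-pair comparison described in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the reference annotation format, the function names, the gene names, and the 1 Mb distance window are all hypothetical choices made for illustration. A mate pair is counted as a conserved synteny when the genes hit by its two BAC ends lie on the same reference chromosome within the assumed window.

```python
from typing import Dict, List, Tuple

# Hypothetical reference annotation: gene name -> (chromosome, start position in bp).
Position = Tuple[str, int]


def is_conserved_synteny(gene_a: str, gene_b: str,
                         reference: Dict[str, Position],
                         max_gap_bp: int = 1_000_000) -> bool:
    """True if both genes are on one reference chromosome within max_gap_bp.

    max_gap_bp is an assumed threshold, not a value taken from the abstract.
    """
    if gene_a not in reference or gene_b not in reference:
        return False
    chrom_a, pos_a = reference[gene_a]
    chrom_b, pos_b = reference[gene_b]
    return chrom_a == chrom_b and abs(pos_a - pos_b) <= max_gap_bp


def conserved_fraction(mate_pairs: List[Tuple[str, str]],
                       reference: Dict[str, Position]) -> float:
    """Fraction of mate pairs whose two end genes are syntenic in the reference."""
    if not mate_pairs:
        return 0.0
    hits = sum(is_conserved_synteny(a, b, reference) for a, b in mate_pairs)
    return hits / len(mate_pairs)


# Toy data with made-up gene names and coordinates.
toy_reference = {"geneA": ("chr1", 100_000), "geneB": ("chr1", 250_000),
                 "geneC": ("chr2", 50_000)}
toy_pairs = [("geneA", "geneB"), ("geneA", "geneC")]
toy_fraction = conserved_fraction(toy_pairs, toy_reference)

# The proportion reported in the abstract: 34 of 141 mate pairs.
reported_percent = round(34 / 141 * 100, 1)
```

With the toy data, one of the two pairs passes the window test; the last line reproduces the 24.1% figure quoted in the abstract from its own counts.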

4.
Evolution of NIN-Like Proteins in Arabidopsis, Rice, and Lotus japonicus   (Total citations: 1; self-citations: 0; citations by others: 1)
Genetic studies in Lotus japonicus and pea have identified Nin as a core symbiotic gene required for establishing symbiosis between legumes and nitrogen fixing bacteria collectively called Rhizobium. Sequencing of additional Lotus cDNAs combined with analysis of genome sequences from Arabidopsis and rice reveals that Nin homologues in all three species constitute small gene families. In total, the Arabidopsis and rice genomes encode nine and three NIN-like proteins (NLPs), respectively. We present here a bioinformatics analysis and prediction of NLP evolution. On a genome scale we show that in Arabidopsis, this family has evolved through segmental duplication rather than through tandem amplification. Alignment of all predicted NLP protein sequences shows a composition with six conserved modules. In addition, Lotus and pea NLPs contain segments that might characterize NIN proteins of legumes and be of importance for their function in symbiosis. The most conserved region in NLPs, the RWP-RK domain, has secondary structure predictions consistent with DNA binding properties. This motif is shared by several other small proteins in both Arabidopsis and rice. In rice, the RWP-RK domain sequences have diversified significantly more than in Arabidopsis. Database searches reveal that, apart from its presence in Arabidopsis and rice, the motif is also found in the alga Chlamydomonas and in the slime mold Dictyostelium discoideum. Thus, the origin of this putative DNA binding region seems to predate the fungus–plant divide. Reviewing Editor: Professor David Guttman

5.
The genomes of virtually all free-living archaeons encode one or more deduced protein-serine/threonine/tyrosine kinases belonging to the so-called eukaryotic protein kinase superfamily. However, the distribution of their cognate protein-serine/threonine phosphatases displays a mosaic pattern. Thermoplasma volcanium is unique among the Archaea inasmuch as it is the sole archaeon whose complement of deduced phosphoprotein phosphatases includes a member of the PPM-family of protein-serine/threonine phosphatases, a family that originated in the Eucarya. A recombinant version of this protein, TvnPPM, exhibited protein-tyrosine phosphatase activity in addition to its predicted protein-serine/threonine phosphatase activity in vitro. TvnPPM is the fourth member of the PPM-family shown to exhibit such dual-specific capability, suggesting that the ancestral versions of this enzyme exhibited broad substrate specificity. Unlike most other archaeons, the genome of T. volcanium lacks open reading frames encoding stereotypical protein-tyrosine phosphatases. Hence, the dual-specificity of TvnPPM may account for its seemingly aberrant presence in an archaeon.

6.
Cryptomonads, small biflagellate algae, contain four different genomes. In addition to the nucleus, mitochondrion, and chloroplast, there is a fourth DNA-containing organelle, the nucleomorph. Nucleomorphs result from the successive reduction of the nucleus of an engulfed phototrophic eukaryotic endosymbiont by a secondary eukaryotic host cell. By sequencing the chloroplast genome and the nucleomorph chromosomes, we identified a groEL homologue in the genome of the chloroplast and a related cpn60 in one of the nucleomorph chromosomes. The nucleomorph-encoded Cpn60 and the chloroplast-encoded GroEL correspond in each case to one of the two divergent GroEL homologues in the cyanobacterium Synechocystis sp. PCC6803. The coexistence of divergent groEL/cpn60 genes in different genomes in one cell offers insights into gene transfer from evolving chloroplasts to cell nuclei and convergent gene evolution in chlorophyll a/b versus chlorophyll a/c/phycobilin eukaryotic lineages. Received: 24 April 1998 / Accepted: 12 June 1998

7.

Background  

Public databases now contain a multitude of complete bacterial genomes, including several genomes of the same species. The available data offer new opportunities to address questions about bacterial genome evolution, a task that requires reliable fine comparison data of closely related genomes. Recent analyses have shown, using pairwise whole genome alignments, that it is possible to segment bacterial genomes into a common conserved backbone and strain-specific sequences called loops.

8.
The heat shock protein 70 kDa sequences (HSP70) are of great importance as molecular chaperones in protein folding and transport. They are abundant under conditions of cellular stress. They are highly conserved in all domains of life: Archaea, eubacteria, eukaryotes, and organelles (mitochondria, chloroplasts). A multiple alignment of a large collection of these sequences was obtained employing our symmetric-iterative ITERALIGN program (Brocchieri and Karlin 1998). Assessments of conservation are interpreted in evolutionary terms and with respect to functional implications. Many archaeal sequences (methanogens and halophiles) tend to align best with the Gram-positive sequences. These two groups also miss a signature segment [about 25 amino acids (aa) long] present in all other HSP70 species (Gupta and Golding 1993). We observed a second signature sequence of about 4 aa absent from all eukaryotic homologues, significantly aligned in all prokaryotic sequences. Consensus sequences were developed for eight groups [Archaea, Gram-positive, proteobacterial Gram-negative, singular bacteria, mitochondria, plastids, eukaryotic endoplasmic reticulum (ER) isoforms, eukaryotic cytoplasmic isoforms]. All group consensus comparisons tend to summarize better the alignments than do the individual sequence comparisons. The global individual consensus "matches" 87% with the consensus of consensuses sequence. A functional analysis of the global consensus identifies a (new) highly significant mixed charge cluster proximal to the carboxyl terminus of the sequence highlighting the hypercharge run EEDKKRRER (one-letter aa code used). The individual Archaea and Gram-positive sequences contain a corresponding significant mixed charge cluster in the location of the charge cluster of the consensus sequence. In contrast, the four Gram-negative proteobacterial sequences of the alignment do not have a charge cluster (even at the 5% significance level).
All eukaryotic HSP70 sequences have the analogous charge cluster. Strikingly, several of the eukaryotic isoforms show multiple mixed charged clusters. These clusters were interpreted with supporting data related to HSP70 activity in facilitating chaperone, transport, and secretion function. We observed that the consensus contains only a single tryptophan residue and a single conserved cysteine. This is interpreted with respect to the target rule for disaggregating misfolded proteins. The mitochondrial HSP70 connections to bacterial HSP70 are analyzed, suggesting a polyphyletic split of Trypanosoma and Leishmania protist mitochondrial (Mt) homologues separated from Mt-animal/fungal/plant homologues. Moreover, the HSP70 sequences from the amitochondrial Entamoeba histolytica and Trichomonas vaginalis species were analyzed. The E. histolytica HSP70 is most similar to the higher eukaryotic cytoplasmic sequences, with significantly weaker alignments to ER sequences and much diminished matching to all eubacterial, mitochondrial, and chloroplast sequences. This appears to be at variance with the hypothesis that E. histolytica rather recently lost its mitochondrial organelle. T. vaginalis contains two HSP70 sequences, one Mt-like and the second similar to eukaryotic cytoplasmic sequences suggesting two diverse origins. Received: 29 January 1998 / Accepted: 14 May 1998

9.
Thirty-two genome sequences of various Vibrionaceae members are compared, with emphasis on what makes V. cholerae unique. As few as 1,000 gene families are conserved across all the Vibrionaceae genomes analysed; this fraction roughly doubles for gene families conserved within the species V. cholerae. Of these, approximately 200 gene families that cluster on various locations of the genome are not found in other sequenced Vibrionaceae; these are possibly unique to the V. cholerae species. By comparing gene family content of the analysed genomes, the relatedness to a particular species is identified for two unspeciated genomes. Conversely, two genomes presumably belonging to the same species have suspiciously dissimilar gene family content. We are able to identify a number of genes that are conserved in, and unique to, V. cholerae. Some of these genes may be crucial to the niche adaptation of this species.

10.
11.
The nucleotide sequences of three independent fragments (designated no. 3, 4, and 9; each 15–20 kb in size) of the genome of alkaliphilic Bacillus sp. C-125 cloned in a λ phage vector have been determined. Thirteen putative open reading frames (ORFs) were identified in sequenced fragment no. 3 and 11 ORFs were identified in no. 4. Twenty ORFs were also identified in fragment no. 9. All putative ORFs were analyzed in comparison with the BSORF database and non-redundant protein databases. The functions of 5 ORFs in fragment no. 3 and 3 ORFs in fragment no. 4 were suggested by their significant similarities to known proteins in the database. Among the 20 ORFs in fragment no. 9, the functions of 11 ORFs were similarly suggested. Most of the annotated ORFs in the DNA fragments of the genome of alkaliphilic Bacillus sp. C-125 were conserved in the Bacillus subtilis genome. The organization of ORFs in the genome of strain C-125 was found to differ from the order of genes in the chromosome of B. subtilis, although some gene clusters (ydh, yqi, yer, and yts) were conserved as operon units the same as in B. subtilis. Received: April 17, 1998 / Accepted: June 23, 1998

12.
Characterization of Repetitive DNA Elements in Arabidopsis   (Total citations: 1; self-citations: 0; citations by others: 1)
We have applied computational methods to the available database and identified several families of repetitive DNA elements in the Arabidopsis thaliana genome. While some of the elements have features expected of either miniature inverted-repeat transposable elements (MITEs) or retrotransposons, the most abundant class of repetitive elements, the AthE1 family, is structurally related to neither. The AthE1 family members are defined by conserved 5′ and 3′ sequences, but these terminal sequences do not represent either inverted or direct repeats. AthE1 family members with greater than 98% identity are easily identified on different Arabidopsis chromosomes. Similar to nonautonomous DNA-based transposon families, the AthE1 family contains members in which the conserved terminal domains flank unrelated sequences. The primary utility of characterizing repetitive sequences is in defining, at least in part, the evolutionary architecture of specific Arabidopsis loci. The repetitive elements described here make up approximately 1% of the available Arabidopsis thaliana genomic sequence. Received: 13 October 1998 / Accepted: 30 December 1998

13.
14.
Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome. Received: 10 March 1996 / Accepted: 20 May 1997

15.
16.
Makarova KS, Mironov AA, Gelfand MS. Genome Biology 2001, 2(4): research0013.1-research00138

17.
Horizontal gene transfer (HGT), a process through which genomes acquire sequences from distantly related organisms, is believed to be a major source of genetic diversity in bacteria. A central question concerning the impact of HGT on bacterial genome evolution is the proportion of horizontally transferred sequences within genomes. This issue, however, remains unresolved because the various methods developed to detect potential HGT events identify different sets of genes. The present-day consensus is that phylogenetic analysis of individual genes is still the most objective and accurate approach for determining the occurrence and directionality of HGT. Here we present a genome-scale phylogenetic analysis of protein-encoding genes from five closely related Chlamydia, identifying a reliable set of sequences that have arisen via HGT since the divergence of the Chlamydia lineage. To our knowledge, this is the first systematic phylogenetic inference-based attempt to establish a reliable set of acquired genes in a bacterial genome. Although Chlamydia are obligate intracellular parasites of higher eukaryotes, and thus suspected to be isolated from HGT more than the free-living species, our results show that their diversification has involved the introduction of foreign sequences into their genome. Furthermore, we also identified a complete set of genes that have undergone deletion, duplication, or rearrangement during this evolutionary period leading to the radiation of Chlamydia species. Our analysis may provide a deeper insight into how these medically important pathogens emerged and evolved from a common ancestor.

18.
DNA fingerprints and end sequences from bacterial artificial chromosomes (BACs) from two new libraries were generated to improve the first generation integrated physical and genetic map of the rainbow trout (Oncorhynchus mykiss) genome. The current version of the physical map is composed of 167,989 clones of which 158,670 are assembled into contigs and 9,319 are singletons. The number of contigs was reduced from 4,173 to 3,220. End sequencing of clones from the new libraries generated a total of 11,958 high quality sequence reads. The end sequences were used to develop 238 new microsatellites of which 42 were added to the genetic map. Conserved synteny between the rainbow trout genome and model fish genomes was analyzed using 188,443 BAC end sequence (BES) reads. The fractions of BES reads with significant BLASTN hits against the zebrafish, medaka, and stickleback genomes were 8.8%, 9.7%, and 10.5%, respectively, while the fractions of significant BLASTX hits against the zebrafish, medaka, and stickleback protein databases were 6.2%, 5.8%, and 5.5%, respectively. The overall number of unique regions of conserved synteny identified through grouping of the rainbow trout BES into fingerprinting contigs was 2,259, 2,229, and 2,203 for stickleback, medaka, and zebrafish, respectively. These numbers are approximately three to five times greater than those we have previously identified using BAC paired ends. Clustering of the conserved synteny analysis results by linkage groups as derived from the integrated physical and genetic map revealed that despite the low sequence homology, large blocks of macrosynteny are conserved between chromosome arms of rainbow trout and the model fish species.
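
The map statistics quoted in this abstract are internally consistent, which a few lines of arithmetic can confirm. The sketch below uses only the numbers stated above; the variable names are illustrative, and the relative contig reduction is a derived figure that the abstract itself does not report.

```python
# Counts taken directly from the abstract.
clones_in_contigs = 158_670
singletons = 9_319
total_clones_reported = 167_989

contigs_before = 4_173
contigs_after = 3_220

# Clones in contigs plus singletons should equal the reported map size.
total_clones_computed = clones_in_contigs + singletons

# Derived value (not stated in the abstract): relative drop in contig number.
contigs_removed = contigs_before - contigs_after
relative_reduction_percent = round(contigs_removed / contigs_before * 100, 1)

print(total_clones_computed == total_clones_reported)
print(contigs_removed, relative_reduction_percent)
```

Fewer contigs for a larger clone set is what one would expect when new fingerprints bridge gaps between previously separate contigs.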

19.
piggyBac is a short inverted-repeat-type DNA transposable element originally isolated from the genome of the moth Trichoplusia ni. It is currently the gene vector of choice for the transformation of various insect species. A few sequences with similarity to piggyBac have previously been identified from organisms such as humans (Looper), the pufferfish Takifugu rubripes (Pigibaku), Xenopus (Tx), Daphnia (Pokey), and the Oriental fruit fly Bactrocera dorsalis. We have now identified 50 piggyBac-like sequences from publicly available genome sequences and expressed sequence tags (ESTs). This survey allows the first comparative examination of the distinctive piggyBac transposase, suggesting that it might contain a highly divergent DDD domain, comparable to the widespread DDE domain found in many DNA transposases and retroviral integrases, which consists of two absolutely conserved aspartic acids separated by about 70 amino acids with a highly conserved glutamic acid about 35 amino acids further away. Many piggyBac-like sequences were found in the genomes of a phylogenetically diverse range of organisms including fungi, plants, insects, crustaceans, urochordates, amphibians, fishes and mammals. Also, several instances of "domestication" of the piggyBac transposase sequence by the host genome for cellular functions were identified. Novel members of the piggyBac family may be useful in genetic engineering of many organisms. Electronic Supplementary Material: Supplementary material is available in the online version of this article.
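
The DDE spacing described in this abstract (two aspartic acids about 70 residues apart, followed by a glutamic acid about 35 residues further on) can be turned into a crude sequence scan. This is a minimal sketch, not the method used in the survey: the tolerance windows of 60 to 80 and 25 to 45 residues are assumptions chosen here to stand in for "about", and the function name is hypothetical. A pattern this loose would produce many chance hits on real proteins, so it serves only to make the spacing concrete.

```python
import re
from typing import List, Tuple

# D, then 60-80 residues, then D, then 25-45 residues, then E.
# The window widths are assumed tolerances around the ~70 and ~35 spacings.
DDE_LIKE = re.compile(r"D.{60,80}D.{25,45}E")


def find_dde_like(protein: str) -> List[Tuple[int, int]]:
    """Return (start, end) spans of non-overlapping DDE-like spacings.

    The input is a one-letter amino acid string; case is ignored.
    """
    return [m.span() for m in DDE_LIKE.finditer(protein.upper())]


# Synthetic sequence with the exact 70 / 35 spacing from the abstract.
synthetic = "M" + "D" + "A" * 70 + "D" + "A" * 35 + "E" + "K"
spans = find_dde_like(synthetic)

# A sequence with no acidic residues yields no hits.
no_hits = find_dde_like("A" * 200)
```

Replacing the final `E` in the pattern with `D` would give the analogous scan for the DDD arrangement suggested for piggyBac, under the same assumed tolerances.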

20.
The ubiquitous glyoxalase system, which is composed of two enzymes, removes cellular cytotoxic methylglyoxal (MG). In an effort to identify critical residues conserved in the evolution of the first enzyme in this system, glyoxalase I (GlxI), as well as the structural implications of sequence alterations in this enzyme, a search of the National Center for Biotechnology Information (NCBI) database of unfinished genomes was undertaken. Eleven putative GlxI sequences from pathogenic organisms were identified and analyses of these sequences in relation to the known and previously identified GlxI enzymes were performed. Several of these sequences show a very high similarity to the Escherichia coli GlxI sequence, most notably the 79% identity of the sequence identified from Yersinia pestis, the causative agent of bubonic plague. In addition to the conservation of residues critical to binding the catalytic metal in all of the proposed GlxI enzymes, four regions in the Homo sapiens GlxI enzyme are absent in all of the bacterial GlxI sequences, with the exception of Pseudomonas putida. Removal of these regions may alter the active-site conformation of the bacterial enzymes in relation to that of the H. sapiens enzyme. These differences may be targeted for the development of inhibitors selective to the bacterial enzymes. Received: 13 October 1999 / Accepted: 17 January 2000
