首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In hypotrichous ciliates, macronuclear chromosomes are gene‐sized, and micronuclear genes contain short, noncoding internal eliminated segments (IESs) as well as macronuclear‐destined segments (MDSs). In the present study, we characterized the complete macronuclear gene and two to three types of micronuclear actin genes of two urostylid species, i.e. Pseudokeronopsis rubra and Uroleptopsis citrina. Our results show that (1) the gain/loss of IES happens frequently in the subclass Hypotrichia (formerly Stichotrichia), and high fragmentation of germline genes does not imply for gene scrambling; and (2) the micronuclear actin gene is scrambled in the order Sporadotrichida but nonscrambled in the orders Urostylida and Stichotrichida, indicating the independent evolution of MIC‐actin gene patterns in different orders of hypotrichs; (3) locations of MDS–IES junctions of micronuclear actin gene in coding regions are conserved among closely related species.  相似文献   

2.
ABSTRACT Internal eliminated segments (IESs) are sequences that interrupt coding and noncoding regions of germline (micronuclear) genes of ciliated protozoa. IESs are flanked by short, unique repeat sequences, which are presumably required for precise IES excision during macronuclear development. Coding and noncoding segments of genes separated by IESs are called macronuclear-destined segments, or MDSs. We have compiled the characteristics of 89 individual IESs in 12 micronuclear genes in the Oxytricha and Stylonychia genera to define the IES phenomenon precisely, a first step in determining the origin, function and significance of IESs. Although all 89 IESs among the 12 different genes are AT-rich, they show no other similarity in sequence, length, position or number. Two main types of IESs are present. IESs that separate scrambled MDSs are significantly shorter and more frequent and have longer flanking repeat sequences than IESs that intervene between nonscrambled MDSs. A comparison of the nonscrambled gene encoding β-telomere binding protein in three species of hypotrichs shows that even in the same gene IESs are not conserved in sequence, length, position, or number from species to species. A comparison of IESs in the scrambled gene encoding actin I in the three species shows that the evolutionary behavior of IESs in a scrambled gene may be more constrained. However, IESs in the scrambled actin I gene have shifted along the DNA molecule during evolution. In total, the various studies show that IESs are hypermutable in sequence and length. They insert, excise, and shift along DNA molecules more or less randomly during evolution, with no discernible function or consequences.  相似文献   

3.
The micronuclear versions of genes in stichotrichous ciliates are interrupted by multiple, short, non-coding DNA segments called internal eliminated segments, or IESs. IESs divide a gene into macronuclear destined segments, or MDSs. In some micronuclear genes MDSs are in a scrambled disorder. During development of a micronucleus into a macronucleus after cell mating the IESs are excised from micronuclear genes and the MDSs are spliced in the sequentially correct order. Pairs of short repeat sequences in the ends of MDSs undergo homologous recombination to excise IESs and splice MDSs. However, the repeat sequences are too short to guide unambiguously their own alignment in preparation for recombination. Based on experiments by others on the distantly related ciliate, Paramecium, we propose a molecular model of template-guided recombination to explain the excision of the 100,000-150,000 IESs and splicing of MDSs, including unscrambling, in the genome of stichotrichous ciliates. The model solves the problem of correct pairing of pointers, precisely identifies MDS-IES junctions, and provides for irreversible recombination.  相似文献   

4.
The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in the evolution, whereas the NTS vary in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that have arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compared it with already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidences of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).  相似文献   

5.
L A Klobutcher  C L Jahn  D M Prescott 《Cell》1984,36(4):1045-1055
During the life cycle of the hypotrichous ciliate Oxytricha nova, a macronucleus containing short, gene-sized DNA molecules is produced from a copy of the chromosomal micronuclear genome. In order to characterize the process of macronuclear development, we have isolated and determined the DNA sequence of a particular macronuclear gene and its micronuclear precursor. The results of this analysis indicate that macronuclear telomeric sequences (5'C4A4(3') repeats) are not present at the ends of the gene in its micronuclear chromosomal location and must be added during development. In addition, the micronuclear copy of the gene contains three short blocks of sequence that must be removed during development, implying the involvement of a nucleic acid-splicing process in generating mature macronuclear genes.  相似文献   

6.
We describe the complete nucleotide sequence of the human ?-globin gene including 387 nucleotides of 5′ flanking sequence and 301 nucleotides of 3′ flanking sequence. The arrangement of coding, noncoding and intervening sequences in this gene is entirely consistent with its identification as the embryonic β-like globin gene.  相似文献   

7.
Genes for tRNALys5 from Drosophila melanogaster.   总被引:2,自引:1,他引:1       下载免费PDF全文
The sequences of two cloned genes from Drosophila which hybridize with tRNALys5 are reported. One gene, in plasmid pDt39, has a sequence which corresponds to the sequence of tRNA. The other gene, in pDt59R, differs in three nucleotides pairs. Both plasmids are transcribed in vitro with extracts of Drosophila Kc cells to give full-sized tRNA precursors with four additional nucleotides at the 5'-end as well as truncated molecules containing 35 nucleotides. This premature termination occurs in a block of four T residues within the mature coding region. Sequences flanking the tRNA genes show little in common except for the blocks of five or more T-residues beyond the 3'-end of the gene. pDt39 hybridizes to 84AB on the polytene chromosomes of Drosophila and pDt59R hybridizes to 29A.  相似文献   

8.
ABSTRACT: BACKGROUND: Gene prediction algorithms (or gene callers) are an essential tool for analyzing shotgun nucleic acid sequence data. Gene prediction is a ubiquitous step in sequence analysis pipelines; it reduces the volume of data by identifying the most likely reading frame for a fragment, permitting the out-of-frame translations to be ignored. In this study we evaluate five widely used ab initio gene-calling algorithms--FragGeneScan, MetaGeneAnnotator, MetaGeneMark, Orphelia, and Prodigal--for accuracy on short (75-1000 bp) fragments containing sequence error from previously published artificial data and "real" metagenomic datasets. RESULTS: While gene prediction tools have similar accuracies predicting genes on error-free fragments, in the presence of sequencing errors considerable differences between tools become evident. For error-containing short reads, FragGeneScan finds more prokaryotic coding regions than does MetaGeneAnnotator, MetaGeneMark, Orphelia, or Prodigal. This improved detection of genes in error-containing fragments, however, comes at the cost of much lower (50%) specificity and overprediction of genes in noncoding regions. CONCLUSIONS: Ab initio gene callers offer a significant reduction in the computational burden of annotating individual nucleic acid reads and are used in many metagenomic annotation systems. For predicting reading frames on raw reads, we find the hidden Markov model approach in FragGeneScan is more sensitive than other gene prediction tools, while Prodigal, MGA, and MGM are better suited for higher-quality sequences such as assembled contigs.  相似文献   

9.
The primary structure of the putative regulatory region of a gene of the Na+,K+-ATPase multigene family in the human genome has been determined. This region includes the first exon with all of the untranslatable sequence of mRNA and a dozen nucleotides, coding for the first four amino acids of the hypothetic precursor of the alpha+-subunit. The entire region comprises over 1400 bp. The possible role of specific nucleotide blocks within this region in comparison with other genes is discussed.  相似文献   

10.
11.
12.
ABSTRACT The micronuclear version of the gene encoding β-telomere binding protein (β-TBP) in Oxytricha nova has been sequenced and compared to the macronuclear β-TBP gene, previously described. The micronuclear gene contains three AT-rich internal eliminated sequences (IES) of 37, 40, and 43 bp and four macronuclear destined sequences (MDS). The IES interrupt the gene once near the 5′ end of the coding region and twice in the 3′ trailer downstream from the TGA stop codon. The sequences of the micronuclear and macronuclear genes are colinear. Thus, the micronuclear β-TBP gene is not scrambled, which contrasts with the highly scrambled state among the 14 MDS in the micronuclear α;-TBP gene.  相似文献   

13.
From the stirodont Arbacia lixula we determined the sequence of 5,127 nucleotides of mitochondrial DNA (mtDNA) encompassing 18 tRNAs, two complete coding genes, parts of three other coding genes, and part of the 12S ribosomal RNA (rRNA). The sequence confirms that the organization of mtDNA is conserved within echinoids. Furthermore, it underlines the following peculiar features of sea urchin mtDNA: the clustering of tRNAs, the short noncoding regulatory sequence, and the separation by the ND1 and ND2 genes of the two rRNA genes. Comparison with the orthologous sequences from the camarodont species Paracentrotus lividus and Strongylocentrotus purpuratus revealed that (1) echinoids have an extra piece on the amino terminus of the ND5 gene that is probably the remnant of an old leucine tRNA gene; (2) third-position codon nucleotide usage has diverged between A. lixula and the camarodont species to a significant extent, implying different directional mutational pressures; and (3) the stirodont-camarodont divergence occurred twice as long ago as did the P. lividus-S. purpuratus divergence.  相似文献   

14.
15.
Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.  相似文献   

16.
Purifying and directional selection in overlapping prokaryotic genes   总被引:4,自引:0,他引:4  
In overlapping genes, the same DNA sequence codes for two proteins using different reading frames. Analysis of overlapping genes can help in understanding the mode of evolution of a coding region from noncoding DNA. We identified 71 pairs of convergent genes, with overlapping 3' ends longer than 15 nucleotides, that are conserved in at least two prokaryotic genomes. Among the overlap regions, we observed a statistically significant bias towards the 123:132 phase (i.e. the second codon base in one gene facing the degenerate third position in the second gene). This phase ensures the least mutual constraint on nonconservative amino acid replacements in both overlapping coding sequences. The excess of this phase is compatible with directional (positive) selection acting on the overlapping coding regions. This could be a general evolutionary mode for genes emerging from noncoding sequences, in which the protein sequence has not been subject to selection.  相似文献   

17.
F Baralle 《Cell》1977,12(4):1085-1095
The 5' noncoding regions of human alpha-and beta-globin mRNAs, 37 and 50 nucleotides in length, have been sequenced. A variation of the "plus and minus" gel technique described by Brownlee and Cartwright (1977) was used, and the results were cross-checked by the Maxam and Gilbert (1977) procedure. These studies completed the knowledge of all the noncoding region sequences of both mRNAs, and it was then possible to calculate their exact size. Human alpha-and beta-globin mRNAs are 575 and 626 nucleotides in length, excluding the poly(A). Furthermore, because the coding and 3' noncoding regions of the latter were known from previous studies (Marotta et al., 1977; Proudfoot, 1977), the primary structure of human beta-globin mRNA is now complete except for six ambiguities in the coding region. The human and rabbit 5' noncoding region sequences are about 80% homologous. This suggests that they are under a moderate selective pressure.  相似文献   

18.
The mouse metallothionein II (MT-II) gene is located approximately 6 kilobases upstream of the MT-I gene. A comparison of the sequences of mouse MT-I and MT-II genes (as well as those of other mammals) reveals that the coding regions are highly conserved even at "silent" positions but that the noncoding regions and introns are extremely divergent between primates and rodents. There are four blocks of conserved sequences in the promoters of mouse MT-I, mouse MT-II, and human MT-IIA genes; one includes the TATAAA sequence, and another has been implicated in regulation by heavy metals. Mouse MT-I and MT-II mRNAs are induced to approximately the same extent in vivo in response to cadmium, dexamethasone, or lipopolysaccharide. Mouse MT-I and MT-II genes are regulated by metals but not by glucocorticoids after transfection into HeLa cells.  相似文献   

19.
We have cloned and analyzed a cDNA containing the coding sequence for human hemopexin. We have first identified, by immunological screening of 30.000 colonies of a liver cDNA library in the expression vector pEX1, a clone carrying an insert 1170 base pairs long that shows 100% homology with a known human hemopexin peptide. The complete sequence coding for hemopexin was isolated from a liver cDNA library in the vector pAT218. The DNA insert of 1523 base pairs shows an open reading frame coding for 439 amino acids, a 3' noncoding region of 159 nucleotides long, followed by a poly(A) tail. The insert spans the entire coding region and from which the primary structure of the protein was deduced. By computer assisted analysis of the amino acid sequence, it was possible to recognize a core unit, of about 45 amino acids, which is repeated 8 or possibly even 10 fold along the polypeptide chain. This feature suggests that the gene might have evolved through a series of duplications. This characteristic, together with prediction of secondary structure, suggest a rough model for the tridimensional folding that allows some speculations on the function of hemopexin. Blot hybridization of total RNA from human liver with nick translated hemopexin cDNA detected a message of about 1600 nucleotides. Southern blot experiments to identify the hemopexin gene (s) suggest that it is not a large multi-gene family, but that there is only one or at most a few genes in the human genome.  相似文献   

20.
One of the major goals of comparative genomics is to understand the evolutionary history of each nucleotide in the human genome sequence, and the degree to which it is under selective pressure. Ascertainment of selective constraint at nucleotide resolution is particularly important for predicting the functional significance of human genetic variation and for analyzing the sequence substructure of cis-regulatory sequences and other functional elements. Current methods for analysis of sequence conservation are focused on delineation of conserved regions comprising tens or even hundreds of consecutive nucleotides. We therefore developed a novel computational approach designed specifically for scoring evolutionary conservation at individual base-pair resolution. Our approach estimates the rate at which each nucleotide position is evolving, computes the probability of neutrality given this rate estimate, and summarizes the result in a Sequence CONservation Evaluation (SCONE) score. We computed SCONE scores in a continuous fashion across 1% of the human genome for which high-quality sequence information from up to 23 genomes are available. We show that SCONE scores are clearly correlated with the allele frequency of human polymorphisms in both coding and noncoding regions. We find that the majority of noncoding conserved nucleotides lie outside of longer conserved elements predicted by other conservation analyses, and are experiencing ongoing selection in modern humans as evident from the allele frequency spectrum of human polymorphism. We also applied SCONE to analyze the distribution of conserved nucleotides within functional regions. These regions are markedly enriched in individually conserved positions and short (<15 bp) conserved “chunks.” Our results collectively suggest that the majority of functionally important noncoding conserved positions are highly fragmented and reside outside of canonically defined long conserved noncoding sequences. A small subset of these fragmented positions may be identified with high confidence.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号