Similar articles
 20 similar articles found (search time: 15 ms)
1.
We sought to evaluate the extent of the contribution of transposable elements (TEs) to human microRNA (miRNA) genes along with the evolutionary dynamics of TE-derived human miRNAs. We found 55 experimentally characterized human miRNA genes that are derived from TEs, and these TE-derived miRNAs have the potential to regulate thousands of human genes. Sequence comparisons revealed that TE-derived human miRNAs are less conserved, on average, than non-TE-derived miRNAs. However, there are 18 TE-derived miRNAs that are relatively conserved, and 14 of these are related to the ancient L2 and MIR families. Comparison of miRNA vs. mRNA expression patterns for TE-derived miRNAs and their putative target genes showed numerous cases of anti-correlated expression that are consistent with regulation via mRNA degradation. In addition to the known human miRNAs that we show to be derived from TE sequences, we predict an additional 85 novel TE-derived miRNA genes. TE sequences are typically disregarded in genomic surveys for miRNA genes and target sites; this is a mistake. Our results indicate that TEs provide a natural mechanism for the origination of miRNAs that can contribute to regulatory divergence between species, as well as a rich source for the discovery of as yet unknown miRNA genes.

2.
It has been hypothesised that the massive accumulation of L1 transposable elements on the X chromosome is due to their function in X inactivation, and that the accumulation of Alu elements near genes is adaptive. We tested the possible selective advantage of these two transposable element (TE) families with a novel method, interruption analysis. In mammalian genomes, a large number of TEs interrupt other TEs due to the high overall abundance and age of repeats, and these interruptions can be used to test whether TEs are selectively neutral. Interruptions of TEs that are beneficial for the host are expected to be deleterious and underrepresented compared with interruptions of neutral ones. We found that L1 elements in the regions of the X chromosome that contain the majority of the inactivated genes are significantly less frequently interrupted than on the autosomes, while L1s near genes that escape inactivation are interrupted with higher frequency, supporting the hypothesis that L1s on the X chromosome play a role in its inactivation. In addition, we show that TEs are less frequently interrupted in introns than in intergenic regions, probably due to selection against the expansion of introns, but the insertion pattern of Alus is comparable to other repeats.

3.
Long interspersed nuclear elements (LINEs) comprise about 21% of the human genome (of which L1 is most abundant) and are preferentially accumulated in AT-rich regions, as well as the X and Y chromosomes. Most knowledge of L1 distribution in mammals is restricted to human and mouse. Here we report the first investigation of L1 distribution in the genomes of a wide variety of eutherian mammals, including species in the two basal clades, Afrotheria and Xenarthra. Our results show L1 accumulation on the X of all eutherian mammals, an observation consistent with an ancestral involvement of these elements in the X-inactivation process (the Lyon repeat hypothesis). Surprisingly, conspicuous accumulation of L1 in AT-rich regions of the genome was not observed in any species outside of Euarchontoglires (represented by human, mouse and rabbit). Although several features were common to most species investigated, our comprehensive survey shows that the patterns observed in human and mouse are, in many aspects, far from typical for all mammals. We discuss these findings with reference to models that have previously been proposed to explain the AT distribution bias of L1 in human and mouse, and how this relates to the evolution of these elements in other eutherian genomes. Paul D. Waters and Gauthier Dobigny contributed equally to this work.

4.
5.
Genomes hold a treasure trove of protein fossils: fragments of formerly protein-coding DNA, which mainly come from transposable elements (TEs) or host genes. These fossils reveal ancient evolution of TEs and genomes, and many fossils have been exapted to perform diverse functions important for the host’s fitness. However, old and highly degraded fossils are hard to identify, standard methods (e.g. BLAST) are not optimized for this task, and few Paleozoic protein fossils have been found. Here, a recently optimized method is used to find protein fossils in vertebrate genomes. It finds Paleozoic fossils predating the amphibian/amniote divergence from most major TE categories, including virus-related Polinton and Gypsy elements. It finds 10 fossils in the human genome (eight from TEs and two from host genes) that predate the last common ancestor of all jawed vertebrates, probably from the Ordovician period. It also finds types of transposon and retrotransposon not found in human before. These fossils have extreme sequence conservation, indicating exaptation: some have evidence of gene-regulatory function, and they tend to lie nearest to developmental genes. Some ancient fossils suggest “genome tectonics,” where two fragments of one TE have drifted apart by up to megabases, possibly explaining gene deserts and large introns. This paints a picture of great TE diversity in our aquatic ancestors, with patchy TE inheritance by later vertebrates, producing new genes and regulatory elements on the way. Host-gene fossils too have contributed anciently conserved DNA segments. This paves the way to further studies of ancient protein fossils.

6.
Technological advances in the 1970s encouraged the mapping of homologous gene loci in different mammalian species, including mouse and man. One hundred eighty-five homologous loci have now been mapped in these two species. Conservation of linkage is sufficient to identify substantial segments of the two genomes that have been left intact since their divergence from a common ancestor. The recognition of these conserved segments allows experimental manipulation of mouse chromosomes or chromosomal regions to produce models of human chromosomal anomalies of medical importance. Comparative gene mapping has been extended beyond mouse and man, and the genomes of some species, including domestic cattle, appear to be more highly conserved relative to humans than that of the mouse. Such species may be particularly useful in providing models of human chromosomal anomalies that cannot be duplicated in laboratory mice.

7.
Concerted evolution of human amylase genes.   Total citations: 10, self-citations: 4, citations by others: 10
Cosmid clones containing 250 kilobases of genomic DNA from the human amylase gene cluster have been isolated. These clones contain seven distinct amylase genes which appear to comprise the complete multigene family. By sequence comparison with the cDNAs, we have identified two pancreatic amylase genes and three salivary amylase genes. Two truncated pseudogenes were also recovered. Intergenic distances of 17 to 22 kilobases separate the amylase gene copies. Within the past 10 million years, duplications, gene conversions, and unequal crossover events have resulted in a very high level of sequence similarity among human amylase gene copies. To identify sequence elements involved in tissue-specific expression and hormonal regulation, the promoter regions of the human amylase genes were sequenced and compared with those of the corresponding mouse genes. The promoters of the human and mouse pancreatic amylase genes are highly homologous between nucleotide -160 and the cap site. Two sequence elements thought to influence pancreas-specific expression of the rodent genes are present in the human genes. In contrast, similarity in the 5' flanking sequences of the salivary amylase genes is limited to several short sequence elements whose positions and orientations differ in the two species. Some of these sequence elements are also associated with other parotid-specific genes and may be involved in their tissue-specific expression. A glucocorticoid response element and a general enhancer element are closely associated in several of the amylase promoters.

8.
Throughout evolution, eukaryotic genomes have been invaded by transposable elements (TEs). Little is known about the factors leading to genomic proliferation of TEs, their preferred integration sites and the molecular mechanisms underlying their insertion. We analyzed hundreds of thousands of nested TEs in the human genome, i.e. insertions of TEs into existing ones. We first discovered that most TEs insert within specific ‘hotspots’ along the targeted TE. In particular, retrotransposed Alu elements contain a non-canonical single nucleotide hotspot for insertion of other Alu sequences. We next devised a method for identification of integration sequence motifs of inserted TEs that are conserved within the targeted TEs. This method revealed novel sequence motifs characterizing insertions of various important TE families: Alu, hAT, ERV1 and MaLR. Finally, we performed a global assessment to determine the extent to which young TEs tend to nest within older transposed elements and identified a 4-fold higher tendency of TEs to insert into existing TEs than to insert within non-TE intergenic regions. Our analysis demonstrates that TEs are highly biased to insert within certain TEs, in specific orientations and within specific targeted TE positions. TE nesting events also reveal new characteristics of the molecular mechanisms underlying transposition.

9.
The sequences of a 51-kb region containing the cluster of five rat gamma-crystallin-coding genes (CRYG) and of a 7-kb region surrounding the sixth rat CRYG gene were determined. Approximately 78% of the total sequence represents intergenic DNA. We also sequenced 22 kb of DNA from the human CRYG gene cluster. All CRYG genes are associated with CpG-rich regions. The sequence similarity between the human and rat gene regions drops sharply (to 65%) in intronic and 3'-flanking regions but decreases only gradually in the 5'-flanking region. Highly conserved regions (greater than 80%) are found as far upstream as 1.5 kb. Overall intergenic distances are conserved. The human region contains much more repetitive DNA (24% vs. 10%) but less simple-sequence (sps) DNA (0.7% vs. 4%) than the rat region. Almost all repeats and spsDNA elements are located in the intergenic region. The location of repetitive and spsDNA differs between the orthologous regions, and these elements were probably inserted after the evolutionary separation of rat and man. The Alu repeats in man and the B3 repeats in the rat are close copies of their respective consensus sequences and are bordered by virtually perfect repeats. In contrast, the B1 and B2 repeats in the rat have diverged considerably from the consensus sequence and the surrounding direct repeats are usually imperfect. Thus the dispersion of the B1 and B2 repeats in the rat probably preceded that of the B3 repeats. Within the rat genomic region the spacing of Z-DNA elements is surprisingly regular: they are located about 12 kb apart. A search for putative matrix-associated regions suggests that the rat CRYG gene cluster is organized into two chromosomal domains.

10.
To study the genome-wide impact of transposable elements (TEs) on the evolution of protein-coding regions, we examined 13 799 human genes and found 533 (approximately 4%) cases of TEs within protein-coding regions. The majority of these TEs (approximately 89.5%) reside within 'introns' and were recruited into coding regions as novel exons. We found that TE integration often has an effect on gene function. In particular, there were two mouse genes whose coding regions consist largely of TEs, suggesting that TE insertion might create new genes. Thus, there is increasing evidence for an important role of TEs in gene evolution. Because many TEs are taxon-specific, their integration into coding regions could accelerate species divergence.

11.
Comparative genomics is a superior way to identify phylogenetically conserved features like genes or regions involved in gene regulation. The comparison of extended orthologous chromosomal regions should also reveal other characteristic traits essential for chromosome or gene function. In the present study we have sequenced and compared a region of conserved synteny from human chromosome 11p15.3 and mouse chromosome 7. In human, this region is known to contain several genes involved in the development of various disorders like Beckwith-Wiedemann overgrowth syndrome and other tumor diseases. Furthermore, in the neighboring chromosome region 11p15.5 extensive imprinting of genes has been reported, which might extend to region 11p15.3. The analysis of approximately 730 kb in human and 620 kb in mouse led to the identification of eleven genes. All putative genes found in the mouse DNA were also present in the same order and orientation in the human chromosome. However, in the human DNA one putative gene of unknown function could be identified which is not present in the orthologous position of the mouse chromosome. The sequence similarity between human and mouse is higher in transcribed and exon regions than in non-transcribed segments. Dot plot analysis, however, reveals a surprisingly well-conserved sequence similarity over the entire analyzed region. In particular, the positions of CpG islands, short regions of very high GC content in the 5' region of putative genes, are similar in human and mouse. With respect to base composition, two distinct segments of significantly different GC content exist in human as well as in the mouse. With a GC content of 45%, one segment would correspond to "isochore H1" and the other segment (39% GC in human, 40% GC in mouse) to "isochore L1/L2". The gene density (one gene per 66 kb) is slightly higher than the average calculated for the complete human genome (one gene per 90 kb).
The comparison of the number and distribution of repetitive elements shows that the proportion of human DNA made up by interspersed repeats (43.8%) is significantly higher than in the corresponding mouse DNA (30.1%). This partly explains why the human DNA is longer between the landmark genes used to define the orthologous positions in human and mouse.
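The base-composition comparison in this abstract (segments of about 45% versus 39-40% GC) rests on a simple calculation that can be sketched as follows. This is only an illustration: the function names `gc_content` and `windowed_gc`, the choice to ignore ambiguous bases such as N, and the window and step sizes are assumptions, not the procedure used in the study.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T).

    Ambiguous symbols such as N are ignored (an assumption of this sketch).
    Returns 0.0 when the sequence contains no unambiguous bases.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    return gc / total if total else 0.0


def windowed_gc(seq: str, window: int, step: int) -> list[float]:
    """GC fraction in consecutive windows of length `window`, moved by `step`.

    Only full-length windows are reported; a trailing partial window is dropped.
    """
    return [
        gc_content(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Hypothetical toy sequence: a GC-rich half followed by an AT-rich half.
    toy = "GGCCGGCCAATTAATT"
    print(gc_content(toy))
    print(windowed_gc(toy, window=8, step=8))
```

Plotting such windowed values along a region is one common way to see a shift between GC-richer and GC-poorer segments; real analyses typically use windows of many kilobases.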

12.
Patterns of similarity between genomes of related species reflect the distribution of selective constraint within DNA. We analyzed alignments of 142 orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae and found a mosaic pattern with regions of high similarity (phylogenetic footprints) interspersed with non-alignable sequences. Footprints cover ~20% of intergenic regions, often occur in clumps and are rare within 5′ UTRs but common within 3′ UTRs. The footprints have a higher ratio of transitions to transversions than expected at random and a higher GC content than the rest of the intergenic region. The number of footprints and the GC content of footprints within an intergenic region are higher when genes are oriented so that their 5′ ends form the boundaries of the intergenic region. Overall, the patterns and characteristics identified here, along with other comparative and experimental studies, suggest that many footprints have a regulatory function, although other types of function are also possible. These conclusions may be quite general across eukaryotes, and the characteristics of conserved regulatory elements determined from genomic comparisons can be useful in prediction of regulation sites within individual DNA sequences.
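The transition-to-transversion comparison mentioned in this abstract can be illustrated with a minimal sketch that counts both kinds of mismatch in a pairwise alignment. The function name `ts_tv_counts` and the decision to skip gaps and ambiguous symbols are assumptions made for illustration; the abstract does not describe how the authors computed the ratio.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def ts_tv_counts(a: str, b: str) -> tuple[int, int]:
    """Count transitions and transversions between two aligned sequences.

    A transition swaps purine for purine (A<->G) or pyrimidine for
    pyrimidine (C<->T); every other mismatch is a transversion.
    Columns containing a gap or an ambiguous symbol are skipped
    (an assumption of this sketch).
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    transitions = transversions = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == y or x not in BASES or y not in BASES:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    return transitions, transversions


if __name__ == "__main__":
    # Hypothetical toy alignment, not data from the study.
    ts, tv = ts_tv_counts("AACCGT", "GACTTT")
    print(ts, tv)
```

If all substitutions were equally likely, each base would have one possible transition and two possible transversions, so transitions would make up about one third of mismatches; a ratio above that baseline is the kind of excess the abstract refers to.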

13.
14.
Hida A, Koike N, Hirose M, Hattori M, Sakaki Y, Tei H. Genomics, 2000, 65(3): 224-233
The clock gene, Period1, from human and mouse was sequenced and characterized. Both human PERIOD1 (human PER1) and mouse Period1 (mouse Per1) consisted of 23 exons spanning approximately 16 kb, and their structures showed strong similarity to each other. For example, six highly conserved regions were identified in the 5' upstream sequences. These conserved segments exhibited 77-88% identity and possessed several potential regulatory elements including five E-boxes (the binding site of the CLOCK-BMAL1 complex) and four cyclic AMP response elements. Transient transfection assays using a mPer1-luciferase fusion gene revealed that each of the conserved E-boxes additively functions as an enhancer for the transactivation of mPer1 by mCLOCK and mBMAL1.

15.
16.
17.
18.
19.
20.

Copyright © Beijing Qinyun Technology Development Co., Ltd. (北京勤云科技发展有限公司)  京ICP备09084417号