首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
The subtelomeric regions of organisms ranging from protists to fungi undergo a much higher rate of rearrangement than is observed in the rest of the genome. While characterizing these ~40-kb regions of the human fungal pathogen Cryptococcus neoformans, we have identified a recent gene amplification event near the right telomere of chromosome 3 that involves a gene encoding an arsenite efflux transporter (ARR3). The 3,177-bp amplicon exists in a tandem array of 2-15 copies and is present exclusively in strains with the C. neoformans var. grubii subclade VNI A5 MLST profile. Strains bearing the amplification display dramatically enhanced resistance to arsenite that correlates with the copy number of the repeat; the origin of increased resistance was verified as transport-related by functional complementation of an arsenite transporter mutant of Saccharomyces cerevisiae. Subsequent experimental evolution in the presence of increasing concentrations of arsenite yielded highly resistant strains with the ARR3 amplicon further amplified to over 50 copies, accounting for up to ~1% of the whole genome and making the copy number of this repeat as high as that seen for the ribosomal DNA. The example described here therefore represents a rare evolutionary intermediate-an array that is currently in a state of dynamic flux, in dramatic contrast to relatively common, static relics of past tandem duplications that are unable to further amplify due to nucleotide divergence. Beyond identifying and engineering fungal isolates that are highly resistant to arsenite and describing the first reported instance of microevolution via massive gene amplification in C. neoformans, these results suggest that adaptation through gene amplification may be an important mechanism that C. neoformans employs in response to environmental stresses, perhaps including those encountered during infection. More importantly, the ARR3 array will serve as an ideal model for further molecular genetic analyses of how tandem gene duplications arise and expand.  相似文献   

3.
The olfactory receptor (OR) multigene family is widely distributed in the human genome. We characterize here a new cluster of four OR genes (HGMW-approved symbols OR7E20P, OR7E6P, OR7E21P, and OR7E22P) on human chromosome 3p13 that is contained in an approximately 250-kb region. This region has been physically mapped, and a 106-kb portion containing the OR genes has been sequenced. All the OR sequences are disrupted by frameshifts and stop codons and appear to have arisen through local duplications. A myosin light chain kinase pseudogene (HGMW-approved symbol MYLKP) lies at one end of the OR gene cluster. Sequences spanning the entire region are also present at 3q13-q21, the site of the functional MYLK gene. This region duplicated locally before the divergence of primates, and the two paralogous copies were later separated to sites on either side of the centromere. This study increases our understanding of the evolution of the human genome. The 3p13 cluster is the first example of a tandem array of OR pseudogenes, and duplications of such clusters may account for the accumulation of a large number of pseudogenes in the human genome.  相似文献   

4.
In order to identify human lineage specific (HLS) copy number differences (CNDs) compared to other primates, we performed pair wise comparisons (human vs. chimpanzee, gorilla and orangutan) by using cDNA array comparative genomic hybridization (CGH). A set of 23 genes with HLS duplications were identified, as well as other lineage differences in gene copy number specific of chimpanzee, gorilla and orangutan. Each species has gained more copies of specific genes rather than losing gene copies. Eleven of the 23 genes have only been observed to have undergone HLS duplication in Fortna et al. (2004) and in the present study. Then, seven of these 11 genes were analyzed by quantitative PCR in chimpanzee, gorilla and orangutan, as well as in other six primate species (Hylobates lar, Cercopithecus aethiops, Papio hamadryas, Macaca mulatta, Lagothrix lagothricha, and Saimiri sciureus). Six genes confirmed array CGH data, and four of them appeared to have bona fide HLS duplications (ABCB10, E2F6, CDH12, and TDG genes). We propose that these gene duplications have a potential to contribute to specific human phenotypes.  相似文献   

5.
6.
7.
Tandemly arrayed genes (TAGs) account for about one-third of the duplicated genes in eukaryotic genomes. They provide raw genetic material for biological evolution, and play important roles in genome evolution. The 22-kDa prolamin genes in cereal genomes represent typical TAG organization, and provide the good material to investigate gene amplification of TAGs in closely related grass genomes. Here, we isolated and sequenced the Coix 22-kDa prolamin (coixin) gene cluster (283 kb), and carried out a comparative analysis with orthologous 22-kDa prolamin gene clusters from maize and sorghum. The 22-kDa prolamin gene clusters descended from orthologous ancestor genes, but underwent independent gene amplification paths after the separation of these species, therefore varied dramatically in sequence and organization. Our analysis indicated that the gene amplification model of 22-kDa prolamin gene clusters can be divided into three major stages. In the first stage, rare gene duplications occurred from the ancestor gene copy accidentally. In the second stage, rounds of gene amplification occurred by unequal crossing over to form tandem gene array(s). In the third stage, gene array was further diverged by other genomic activities, such as transposon insertions, segmental rearrangements, etc. Unlike their highly conserved sequences, the amplified 22-kDa prolamin genes diverged rapidly at their expression capacities and expression levels. Such processes had no apparent correlation to age or order of amplified genes within TAG cluster, suggesting a fast evolving nature of TAGs after gene amplification. These results provided insights into the amplification and evolution of TAG families in grasses.  相似文献   

8.
Traditional sequence comparison by alignment employs a mutation model comprised of two events, substitutions and indels (insertions or deletions) of single positions. However, modern genetic analysis knows a variety of more complex mutation events (e.g., duplications, excisions, and rearrangements), especially regarding DNA. With ever more DNA sequence data becoming available, the need to accurately compare sequences which have clearly undergone more complicated types of mutational processes is becoming critical. Herein we introduce a new method for pairwise alignment and comparison of sequences with respect to the special evolution of tandem repeats: substitutions and indels of single positions and, additionally, duplications and excisions of variable degree (i.e., of one or more repeat copies simultaneously) are taken into account. To evaluate our method, we apply it to the spa VNTR (variable number of tandem repeats) cluster of Staphylococcus aureus, a bacterium of high medical importance  相似文献   

9.
10.
A mutant strain of Thermus thermophilus which contains deletions in the 3'-terminal region of its leuB gene showed a temperature-sensitive growth phenotype in the absence of leucine. Three phenotypically thermostable mutants were isolated from the temperature-sensitive strain by spontaneous evolution. Each pseudorevertant carried a tandem sequence duplication in the 3' region of its leuB gene. The mutated 3-isopropylmalate dehydrogenases encoded by the leuB genes from the pseudorevertants were more thermostable than the enzyme from the temperature-sensitive strain. Structural analyses suggested that the decreased thermostability of the enzyme from the temperature-sensitive strain was caused by reducing hydrophobic and electrostatic interactions in the carboxyl-terminal region and that the recovered stability of the enzymes from the pseudorevertants was due to the restoration of the hydrophobic interaction. Our results indicate that tandem sequence duplications are the general genetic way to alter protein characteristics in evolution.  相似文献   

11.
High-throughput sequencing technologies have offered in recent years new opportunities to study genome variations. These studies have mostly focused on single nucleotide polymorphisms, small insertions or deletions and on copy number variants. Other structural variants, such as large insertions or deletions, tandem duplications, translocations, and inversions are less well-studied, despite that some have an important impact on phenotypes. In the present study, we performed a large-scale survey of structural variants in cattle. We report the identification of 6,426 putative structural variants in cattle extracted from whole-genome sequence data of 62 bulls representing the three major French dairy breeds. These genomic variants affect DNA segments greater than 50 base pairs and correspond to deletions, inversions and tandem duplications. Out of these, we identified a total of 547 deletions and 410 tandem duplications which could potentially code for CNVs. Experimental validation was carried out on 331 structural variants using a novel high-throughput genotyping method. Out of these, 255 structural variants (77%) generated good quality genotypes and 191 (75%) of them were validated. Gene content analyses in structural variant regions revealed 941 large deletions removing completely one or several genes, including 10 single-copy genes. In addition, some of the structural variants are located within quantitative trait loci for dairy traits. This study is a pan-genome assessment of genomic variations in cattle and may provide a new glimpse into the bovine genome architecture. Our results may also help to study the effects of structural variants on gene expression and consequently their effect on certain phenotypes of interest.  相似文献   

12.
13.
K J Danna  R Workman  V Coryell  P Keim 《Génome》1996,39(2):445-455
The organization of 5S rRNA genes in plants belonging to tribe Phaseoleae was investigated by clamped homogeneous electric field gel electrophoresis and Southern blot hybridization. Representatives of subtribe Glycininae included the diploid species Neonotonia wightii and Teramnus labialis, as well as three soybean accessions: an elite Glycine max (L.) Merr. cultivar (BSR101), an unadapted G. max introduction (PI 437.654), and a wild Glycine soja (PI 468.916). A cultivar of Phaseolus vulgaris (kidney bean), a member of subtribe Phaseolinae, was also examined. We determined the number of 5S rDNA arrays and estimated the size and copy number of the repeat unit for each array. The three soybean accessions all have a single 5S locus, with a repeat unit size of ~345 bp and a copy number ranging from about 600 in 'BSR101' to about 4600 in the unadapted soybean introduction. The size of the 5S gene cluster in 'BSR101' is the same in roots, shoots, and trifoliate leaves. Given that the genus Glycine probably has an allotetraploid origin, our data strongly suggest that one of the two progenitor 5S loci has been lost during diploidization of soybean. Neonotonia wightii, the diploid species most closely related to soybean, also has a single locus but has a repeat unit of 520 bp and a copy number of about 1300. The more distantly related species T. labialis and P. vulgaris exhibited a more complex arrangement of 5S rRNA genes, having at least three arrays, each comprising a few hundred copies of a distinct repeat unit. Although each array in P. vulgaris exhibits a high degree of homogeneity with regard to the sequence of the repeat unit, heterogeneity in array size (copy number) was evident when individual plants were compared. A cis-dependent molecular drive process, such as unequal crossing-over, could account for both the homogenization of repeat units within individual arrays and the observed variation in copy number among individuals. Key words : pulsed-field gel electrophoresis, rRNA genes, soybean, tandem arrays.  相似文献   

14.
Liu Q  Nguyen VQ  Li X  Sommer SS 《BioTechniques》2006,40(5):661-668
Large heterozygous chromosomal deletions and gene duplications are important classes of mutations that are generally missed by standard PCR amplification and sequencing. Multiplex dosage pyrophosphorolysis-activated polymerization (MD-PAP), a derivative of PAP, was utilized to detect these types of mutations. PAP is a method for nucleic acid amplification in which 3' blocked oligonucleotides (P*) are activated by pyrophosphorolysis when annealed to the target template and subsequently extended. A key advantage to this technology is that PAP reactions produce little or no primer-dimer or false priming. As a result of this enhanced specificity, MD-PAP is easy to optimize. Herein, we utilize MD-PAP to determine gene dosage of each exon of the human factor IX gene by comparison with one endogenous internal control from the ATM gene. Estimated dosage is proportional to the actual template copy number over a minimum dynamic range from 1 to 16 copies. A blinded analysis detected 100% of 43 heterozygous deletions of exons in the human factor IX gene.  相似文献   

15.
Jabbari K  Rayko E  Bernardi G 《Gene》2003,317(1-2):203-208
Since many gene duplications in the human genome are ancient duplications going back to the origin of vertebrates, the question may be asked about the fate of such duplicated genes at the compositional genome transitions that occurred between cold- and warm-blooded vertebrates. Indeed, at that transition, about half of the (GC-poor) genes of cold-blooded vertebrates (the genes of the gene-dense "ancestral genome core") underwent a GC enrichment to become the genes of the "genome core" of warm-blooded vertebrates. Since the compositional distribution of the human duplicated genes investigated (1111 pairs) mimics the general distribution of human genes (about 50% GC(3)-poor and 50% GC(3)-rich genes, the border being at 60% GC(3)), we considered two possibilities, namely that the compositional transition affected either (i) about half of the copies on a random basis, or (ii) preferentially only one copy of the duplicated genes. The two possibilities could be distinguished if each copy is put into one of two subsets according to its GC(3) level. Indeed, in the first case, the two distributions would be similar, whereas in the second case, the two distributions would be different, one copy having maintained the ancestral GC-poor composition, and one copy having undergone the compositional change. Using this approach, we could show that, by far and large, one copy of the duplicated genes preferentially underwent the GC enrichment. This result implies that this copy, which had possibly acquired a different function and/or regulation, was preferentially translocated into the gene-dense compartment of the genome, the "ancestral genome core", namely the "gene space" which underwent the compositional transition at the emergence of warm-blooded vertebrates.  相似文献   

16.
Concerted and divergent evolution within the rat gamma-crystallin gene family   总被引:11,自引:0,他引:11  
The nucleotide sequences of six rat gamma-crystallin genes have been determined. All genes have the same mosaic structure: the first exons contain a relatively short (25 to 44 base-pair) 5' non-coding region and the first nine base-pairs of the coding sequence, the second exons encode protein motifs I and II, while protein motifs III and IV are encoded by the third exons. The third exons also contain a 60 to 67-base-pair long 3' non-coding region. In the gamma 1-2 gene, the splice acceptor site of the third exon has been shifted three base-pairs upstream. Hence, the protein product of this gene is one amino acid residue longer. The first introns, though varying in length from 85 to 100 base-pairs, are conserved in sequence. The second introns vary considerably in length (0.9 X 10(3) to 1.9 X 10(3) base-pairs) and sequence. The second exons of the genes show concerted evolution and have undergone multiple gene conversions. In contrast, the third exons show divergent evolution. From the sequences of the third exons, an evolutionary tree of the gene family was constructed. This tree suggests that three of the present genes derive directly from the genes that originated from a tandem duplication of a two-gene cluster. Two duplications of the last gene of the four-gene cluster then yielded the other three genes. Region a' of the third exon, encoding protein motif III, is variable, while the region encoding protein motif IV (b') is constant. We postulate that this variability in region a' is due to a period of radiation after each gene duplication. A comparison of the rat sequences with those of orthologous sequences from other species shows that the variation in region a' is now preserved. Hence, it might specify the specific functional property of each gamma-crystallin protein within the lens.  相似文献   

17.
The rRNA genes in the somatic macronucleus of Tetrahymena thermophila are normally on 21 kb linear palindromic molecules (rDNA). We examined the effect on rRNA gene dosage of transforming T.thermophila macronuclei with plasmid constructs containing a pair of tandemly repeated rDNA replication origin regions unlinked to the rRNA gene. A significant proportion of the plasmid sequences were maintained as high copy circular molecules, eventually consisting solely of tandem arrays of origin regions. As reported previously for cells transformed by a construct in which the same tandem rDNA origins were linked to the rRNA gene [Yu, G.-L. and Blackburn, E. H. (1990) Mol. Cell. Biol., 10, 2070-2080], origin sequences recombined to form linear molecules bearing several tandem repeats of the origin region, as well as rRNA genes. The total number of rDNA origin sequences eventually exceeded rRNA gene copies by approximately 20- to 40-fold and the number of circular replicons carrying only rDNA origin sequences exceeded rRNA gene copies by 2- to 3-fold. However, the rRNA gene dosage was unchanged. Hence, simply monitoring the total number of rDNA origin regions is not sufficient to regulate rRNA gene copy number.  相似文献   

18.
Extensive gene rearrangement is reported in the mitochondrial genomes of lungless salamanders (Plethodontidae). In each genome with a novel gene order, there is evidence that the rearrangement was mediated by duplication of part of the mitochondrial genome, including the presence of both pseudogenes and additional, presumably functional, copies of duplicated genes. All rearrangement-mediating duplications include either the origin of light-strand replication and the nearby tRNA genes or the regions flanking the origin of heavy-strand replication. The latter regions comprise nad6, trnE, cob, trnT, an intergenic spacer between trnT and trnP and, in some genomes, trnP, the control region, trnF, rrnS, trnV, rrnL, trnL1, and nad1. In some cases, two copies of duplicated genes, presumptive regulatory regions, and/or sequences with no assignable function have been retained in the genome following the initial duplication; in other genomes, only one of the duplicated copies has been retained. Both tandem and nontandem duplications are present in these genomes, suggesting different duplication mechanisms. In some of these mitochondrial DNAs, up to 25% of the total length is composed of tandem duplications of noncoding sequence that includes putative regulatory regions and/or pseudogenes of tRNAs and protein-coding genes along with the otherwise unassignable sequences. These data indicate that imprecise initiation and termination of replication, slipped-strand mispairing, and intramolecular recombination may all have played a role in generating repeats during the evolutionary history of plethodontid mitochondrial genomes.  相似文献   

19.
The genes encoding ribosomal RNA are the most abundant in the eukaryotic genome. They reside in tandem repetitive clusters, in some cases totaling hundreds of copies. Due to their repetitive structure, ribosomal RNA genes (rDNA) are easily lost by recombination events within the repeated cluster. We previously identified a unique gene amplification system driven by unequal sister-chromatid recombination during DNA replication. The system compensates for such copy number losses, thus maintaining proper copy number. Here, through a genome-wide screen for genes regulating rDNA copy number, we found that the rtt109 mutant exhibited a hyper-amplification phenotype (∼3 times greater than the wild-type level). RTT109 encodes an acetyl transferase that acetylates lysine 56 of histone H3 and which functions in replication-coupled nucleosome assembly. Relative to unequal sister-chromatid recombination-based amplification (∼1 copy/cell division), the rate of the hyper-amplification in the rtt109 mutant was extremely high (>100 copies/cell division). Cohesin dissociation that promotes unequal sister-chromatid recombination was not observed in this mutant. During hyper-amplification, production level of extra-chromosomal rDNA circles (ERC) by intra-chromosomal recombination in the rDNA was reduced. Interestingly, during amplification, a plasmid containing an rDNA unit integrated into the rDNA as a tandem array. These results support the idea that tandem DNA arrays are produced and incorporated through rolling-circle-type replication. We propose that, in the rtt109 mutant, rDNA hyper-amplification is caused by uncontrolled rolling-circle-type replication.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号