Similar literature
 20 similar articles found (search time: 9 ms)
1.
2.
Evolutionarily conserved non-coding genomic sequences represent a potentially rich source for the discovery of gene regulatory regions. Since these elements are subject to stabilizing selection they evolve much more slowly than adjacent non-functional DNA. These so-called phylogenetic footprints can be detected by comparison of the sequences surrounding orthologous genes in different species. Therefore the loss of phylogenetic footprints as well as the acquisition of conserved non-coding sequences in some lineages, but not in others, can provide evidence for the evolutionary modification of cis-regulatory elements. We introduce here a statistical model of footprint evolution that allows us to estimate the loss of sequence conservation that can be attributed to gene loss and other structural reasons. This approach to studying the pattern of cis-regulatory element evolution, however, requires the comparison of relatively long sequences from many species. We have therefore developed an efficient software tool for the identification of corresponding footprints in long sequences from multiple species. We apply this novel method to the published sequences of HoxA clusters of shark, human, and the duplicated zebrafish and Takifugu clusters as well as the published HoxB cluster sequences. We find that there is a massive loss of sequence conservation in the intergenic region of the HoxA clusters, consistent with the finding in [Chiu et al., PNAS 99 (2002) 5492]. The loss of conservation after cluster duplication is more extensive than expected from structural reasons. This suggests that binding site turnover and/or adaptive modification may also contribute to the loss of sequence conservation.

3.
Horne I  Haritos VS 《Gene》2008,411(1-2):27-37
We have examined a highly dynamic section of the Drosophila melanogaster genome which contains neutral lipase family genes that have undergone multiple tandem duplication events. We have identified the orthologous clusters, encoding between five and eight apparently functional lipases, in other Drosophila genomes: yakuba, ananassae, pseudoobscura, virilis, mojavensis, persimilis, grimshawi and willistoni. We examined their gene structure, duplication and pseudogene formation, and the presence of transposable elements. Based on phylogenetic comparisons, the lipase genes contained in each of the clusters fall into four distinct clades. Clades I and II are under evolutionary constraints distinct from those on clades III and IV. Multiple gene duplications have occurred in different lineages of clades I and II, while clades III and IV contain a single lipase gene from each species. Compared with lipases from other clades, clade IV genes contain an additional 3' domain of tandemly repeated sequence of varying length and composition, and a substitution in the residue adjacent to the key catalytic serine in the encoded proteins. A comparison of non-synonymous to synonymous nucleotide substitution (dN/dS) rates within each clade showed that the highest rate of divergence was between paralogous lipase gene pairs, suggesting selection pressure on duplicated genes. Analysis of the encoded lipase protein sequences within each species using PAML identified positively selected sites; structure homology modeling based on human pancreatic lipase indicated that many of these residues formed part of the active site of the enzyme. As some of the cluster lipase genes are known to be expressed in the insect midgut and respond to changes in dietary components, we propose that the lipase cluster has undergone dynamic evolutionary changes to maximize absorption of lipid nutrients from the diet.

4.
In order to understand the coordinate regulation between the alpha-like and beta-like globins during the developmental switches in hemoglobin synthesis, we have studied the rabbit alpha-like globin gene family. A cluster of six linked genes arranged 5'-zeta 1-alpha 1-theta 1-zeta 2-zeta 3-theta 2-3' has been isolated as a set of overlapping clones from a library of rabbit genomic DNA. Blot-hybridization analysis of genomic DNA not only confirms this linkage arrangement but also reveals the presence of additional zeta and theta genes. We propose that this gene cluster was generated by a block duplication of a set of alpha-like genes; the proposed duplication unit is zeta-zeta-alpha-theta. Further duplications of a zeta-zeta-theta set are also proposed to have occurred. As expected for a duplicated locus, the rabbit alpha-like gene cluster contains long blocks of internal homology. The Z homology block is about 7.2 kilobase pairs long and contains the zeta genes; the T homology block is about 4.7 kilobase pairs long and contains a theta gene. Surprisingly, both Z and T homology blocks are flanked by a common junction sequence (J) which contains a region very similar to the 3'-untranslated sequence of an alpha-globin gene. Analysis of the J sequences suggests a recombination mechanism by which the alpha gene could have been deleted from the second set of genes in the cluster (zeta 2-zeta 3-theta 2). The relationships among the genes in characterized alpha-like gene clusters in mammals are summarized. The rabbit gene cluster differs from those of other mammals principally in the loss of a gene orthologous to the human psi alpha 1 and in the block duplication of the zeta-zeta-alpha-theta gene set.

5.
Extensive copy-number variation of the human olfactory receptor gene family   (total citations: 3; self-citations: 0; citations by others: 3)
As much as a quarter of the human genome has been reported to vary in copy number between individuals, including regions containing about half of the members of the olfactory receptor (OR) gene family. We have undertaken a detailed study of copy-number variation of ORs to elucidate the selective and mechanistic forces acting on this gene family and the true impact of copy-number variation on human OR repertoires. We argue that the properties of copy-number variants (CNVs) and other sets of large genomic regions violate the assumptions of statistical methods that are commonly used in the assessment of gene enrichment. Using more appropriate methods, we provide evidence that OR enrichment in CNVs is not due to positive selection but is because of OR preponderance in segmentally duplicated regions, which are known to be frequently copy-number variable, and because purifying selection against CNVs is lower in OR-containing regions than in regions containing essential genes. We also combine multiplex ligation-dependent probe amplification (MLPA) and PCR to assay the copy numbers of 37 candidate CNV ORs in a panel of ~50 human individuals. We confirm copy-number variation of 18 ORs but find no variation in this human-diversity panel for 16 other ORs, highlighting the caveat that reported intervals often overrepresent true CNVs. The copy-number variation we describe is likely to underpin significant variation in olfactory abilities among human individuals. Finally, we show that both homology-based and homology-independent processes have played a recent role in remodeling the OR family.

6.
Analysis of an inversion within the human beta globin gene cluster.   (total citations: 14; self-citations: 2; citations by others: 12)
We have cloned and sequenced the DNA from two regions of the defective beta-globin gene cluster from a patient with Indian A gamma delta beta thalassaemia, and confirmed the complex and unusual pattern of rearrangement involving two separate deletions (0.8 kb and 7.5 kb) and the inversion of the 15.5 kb segment separating them, as previously proposed from gene mapping studies [1]. All four breakpoints occur within the transcribed region of the globin genes, and six nucleotides of unknown origin are found at one junction. This unique rearrangement results in enhanced expression of the upstream fetal gene, and is therefore pertinent to the localisation of any putative control region involved in the coordinate expression of fetal and adult genes.

7.
The nucleotide sequence of a 9937 base-pair portion of human chromosome 9, which contains two complete leukocyte interferon genes (LeIF-L and J), the complete intergenic region, and part of a third related possible pseudogene (LeIF-M), has been determined. The coding regions of the L and J genes are separated by 4363 nucleotides. The coding regions for the putative L and J interferons are 96% homologous and are each surrounded by about 3500 nucleotides of flanking sequences, which are also highly homologous. The L and J genes and their respective flanking sequences comprise a 4000 nucleotide leukocyte interferon gene repeat unit; the L gene repeat unit contains two major insertions not present in the J gene repeat unit. The J gene repeat unit is flanked by sequence features reminiscent of those found surrounding transposable elements. Both the L and J gene repeat units are embedded within sequences that are highly repeated in the human genome. Structural features identified within this portion of chromosome 9 may have been important for the generation of this interferon gene cluster.

8.
Ancient duplications of the human proglucagon gene   (total citations: 5; self-citations: 0; citations by others: 5)
Irwin DM 《Genomics》2002,79(5):741-746
The human proglucagon gene (GCG) is encoded within a finished 576-kb DNA sequence generated by the Human Genome Project. GCG is flanked by 18 kb and 65 kb of DNA, 5' and 3', respectively, that do not encode genes. The genomic sequence that includes GCG was found to have a long history of gene duplication events. Some members of the glucagon-like family of genes, GCG on chromosome 2 and GIP on chromosome 17, may be products of ancient genome duplications on the early vertebrate lineage. A large genomic tandem duplication event that included DPP4-like and GCG genes occurred before the amphibian-mammal divergence, but one of the duplicated copies of GCG has been lost on the human lineage. Recently, a processed pseudogene of the X-chromosome-linked gene TIMM8A was inserted downstream of GCG. Some ancient duplicates of GCG may retain physiological functions in other vertebrates.

9.
10.
The olfactory receptor (OR) gene cluster on human chromosome 17p13.3 was subjected to mixed shotgun automated DNA sequencing. The resulting 412 kb of genomic sequence include 17 OR coding regions, 6 of which are pseudogenes. Six of the coding regions were discovered only upon genomic sequencing, while the others were previously reported as partial sequences. A comparison of DNA sequences in the vicinity of the OR coding regions revealed a common gene structure with an intronless coding region and at least one upstream noncoding exon. Potential gene control regions including specific pyrimidine:purine tracts and Olf-1 sites have been identified. One of the pseudogenes apparently has evolved into a CpG island. Four extensive CpG islands can be discerned within the cluster, not coupled to specific OR genes. The cluster is flanked at its telomeric end by an unidentified open reading frame (C17orf2) with no significant similarity to any known protein. A high proportion of the cluster sequence (about 60%) belongs to various families of interspersed repetitive elements, with a clear predominance of LINE repeats. The OR genes in the cluster belong to two families and seven subfamilies, which show a relatively high degree of intermixing along the cluster, in seemingly random orientations. This genomic organization may be best accounted for by a complex series of evolutionary events.

11.
Sharon D  Gilad Y  Glusman G  Khen M  Lancet D  Kalush F 《Gene》2000,260(1-2):87-94
Single-nucleotide polymorphisms (SNPs) were studied in 15 olfactory receptor (OR) coding regions, one control region and two noncoding sequences all residing within a 412 kb OR gene cluster on human chromosome 17p13.3, as well as in other G-protein coupled receptors (GPCRs). A total of 26 SNPs were identified in ORs, 21 of which are coding SNPs (cSNPs). The mean nucleotide diversity of OR coding regions was 0.078% (ranging from 0 to 0.16%), which is about twice that of other GPCRs, and similar to the nucleotide diversity levels of noncoding regions along the human genome. The high polymorphism level in the OR coding regions might be due to a weak positive selection pressure acting on the OR genes. In two cases, OR genes have been found to share the same cSNP. This could be explained by recent gene conversion events, which might be a part of a concerted evolution mechanism acting on the OR superfamily. Using the genotype data of 85 unrelated individuals in 15 SNPs, we found linkage disequilibrium (LD) between pairs of SNPs located on the centromeric part of the cluster. On the other hand, no LD was found between SNPs located on the telomeric part of the cluster, suggesting the presence of several hot-spots for recombination within this cluster. Thus, different regions of this gene cluster may have been subject to different recombination rates.
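
For readers unfamiliar with the nucleotide diversity figure quoted in this abstract, the sketch below shows one common way such a value can be computed: the average number of pairwise differences per site across a set of aligned sequences. It is an illustration only, not the authors' procedure; the function name `nucleotide_diversity` and the toy sequences are assumptions made for this example, and the abstract does not say how its estimates were derived.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site for aligned sequences.

    Hypothetical helper for illustration: assumes all sequences are
    already aligned, have equal non-zero length, and contain no gaps.
    """
    if len(sequences) < 2:
        raise ValueError("need at least two sequences")
    length = len(sequences[0])
    if length == 0 or any(len(s) != length for s in sequences):
        raise ValueError("sequences must share the same non-zero length")

    # Count mismatching sites for every unordered pair of sequences.
    pairs = list(combinations(sequences, 2))
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    # Normalise by the number of pairs and by the alignment length.
    return total_diffs / (len(pairs) * length)


if __name__ == "__main__":
    toy = ["AAAA", "AAAT", "AAAA"]  # invented data, not from the study
    print(f"pi = {nucleotide_diversity(toy):.4f}")
```

Multiplying the returned fraction by 100 gives a percentage comparable in form to the 0.078% reported above.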

12.
13.

Background  

One of the many gene families that expanded in early vertebrate evolution is the neuropeptide Y (NPY) receptor family of G-protein coupled receptors. Earlier work by our lab suggested that several of the NPY receptor genes found in extant vertebrates resulted from two genome duplications before the origin of jawed vertebrates (gnathostomes) and one additional genome duplication in the actinopterygian lineage, based on their location on chromosomes sharing several gene families. In this study we have investigated, in five vertebrate genomes, 45 gene families with members close to the NPY receptor genes in the compact genomes of the teleost fishes Tetraodon nigroviridis and Takifugu rubripes. These correspond to Homo sapiens chromosomes 4, 5, 8 and 10.

14.
A large gene cluster for the Clostridium cellulovorans cellulosome has been cloned and sequenced upstream and downstream of the cbpA and exgS genes (C.-C. Liu and R. H. Doi, Gene 211:39-47, 1998). Gene walking revealed that the engL gene cluster (Y. Tamaru and R. H. Doi, J. Bacteriol. 182:244-247, 2000) was located downstream of the cbpA-exgS genes. Further DNA sequencing revealed that this cluster contains the genes for the scaffolding protein CbpA, the exoglucanase ExgS, several endoglucanases of family 9, the mannanase ManA, and the hydrophobic protein HbpA containing a surface layer homology domain and a hydrophobic (or cohesin) domain. The sequence of the clustered genes is cbpA-exgS-engH-engK-hbpA-engL-manA-engM-engN and is about 22 kb in length. The engN gene did not have a complete catalytic domain, indicating that engN is a truncated gene. This large gene cluster is flanked at the 5' end by a putative noncellulosomal operon consisting of nifV-orf1-sigX-regA and at the 3' end by noncellulosomal genes with homology to transposase (trp) and malate permease (mle). Since gene clusters for the cellulosome are also found in C. cellulolyticum and C. josui, they seem to be typical of mesophilic clostridia, indicating that the large gene clusters may arise from a common ancestor with some evolutionary modifications.

15.
16.
Neocentromeres are fully functional centromeres that have arisen in previously noncentromeric chromosomal locations on rearranged chromosomes. The formation of neocentromeres results in the mitotic stability of chromosomal fragments that do not contain endogenous centromeres and that would normally be lost. Here we describe a unique collection of eight independent patient-derived cell lines, each of which contains a neocentromere on a supernumerary inversion duplication of a portion of human chromosome 13q. Findings in these patients reveal insight into the clinical manifestations associated with polysomy for portions of chromosome 13q. The results of FISH and immunofluorescent analysis of the neocentromeres in these chromosomes confirm the lack of alpha-satellite DNA and the presence of CENtromere proteins (CENP)-C, -E, and hMAD2. The positions of the inversion breakpoints in these chromosomes have been placed onto the physical map of chromosome 13, by means of FISH mapping with cosmid probes. These cell lines define, within chromosome 13q, at least three distinct locations where neocentromeres have formed, with five independent neocentromeres in band 13q32, two in band 13q21, and one in band 13q31. The results of examination of the set of 40 neocentromere-containing chromosomes that have thus far been described, including the 8 neocentromere-containing chromosomes from chromosome 13q that are described in the present study, suggest that chromosome 13q has an increased propensity for neocentromere formation, relative to some other human chromosomes. These neocentromeres will provide the means for testing hypotheses about sequence requirements for human centromere formation.

17.
Several eukaryotic genomes have been completely sequenced and this provides an opportunity to investigate the extent and characteristics (e.g., single gene duplication, block duplication, etc.) of gene duplication in a genome. Detecting duplicate genes in a genome, however, is not a simple problem because of several complications such as domain shuffling, the existence of isoforms derived from alternative splicing, and annotational errors in the databases. We describe a method for overcoming these difficulties and the extents of gene duplication in the genomes of Drosophila melanogaster, Caenorhabditis elegans, and yeast inferred from this method. We also describe a method for detecting block duplications in a genome. Application of this method showed that block duplication is a common phenomenon in both yeast and nematode. The patterns of block duplication in the two species are, however, markedly different. Yeast shows much more extensive block duplication than nematode, with some chromosomes having more than 40% of the duplications derived from block duplications. Moreover, in yeast the majority of block duplications occurred between chromosomes, while in nematode most block duplications occurred within chromosomes.

18.
Ikuta K  Ogura T  Shimizu A  Honjo T 《Nucleic Acids Research》1986,14(12):4899-4909
We have analysed an inactive allele of the beta-chain gene of the T-cell receptor in a human T-cell line HPB-ALL. Comparison with germline sequences showed that HPB-ALL has a joining (J)-diversity (D)-J complex recombined in head-to-head configuration and a variable (V)-D complex in tail-to-tail configuration. These results demonstrate that the inversion mechanism functions in the beta-chain gene of the T-cell receptor. The presence of the V-D complex suggests that V-D recombination could occur prior to D-J recombination although there is no definite proof that the V-D complex is an intermediate to form the V-D-J complex.

19.
20.