首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Homeobox gene clusters and the human paralogy map   总被引:6,自引:0,他引:6  
Homeobox genes encode important developmental control proteins. In vertebrates, those encoding the proteins of the HOX class and their most closely related families, including paraHOX and metaHOX classes, are clustered in paralogous regions (or paralogons). We show that the majority of the other homeobox genes (we called contraHOX) can also be clustered and belong to paralogons in humans. This suggests that they duplicated during vertebrate evolution along the same processes as the HOX genes. We tentatively assembled several paralogons in superparalogons. One of the superparalogons contains the contraHOX genes. These observations were extended to hundreds of genes, and allowed to describe a primary human genome paralogy map.  相似文献   

2.
With about 24,000 extant species, teleosts are the largest group of vertebrates. They constitute more than 99% of the ray-finned fishes (Actinopterygii) that diverged from the lobe-finned fish lineage (Sarcopterygii) about 450 MYA. Although the role of genome duplication in the evolution of vertebrates is now established, its role in structuring the teleost genomes has been controversial. At least two hypotheses have been proposed: a whole-genome duplication in an ancient ray-finned fish and independent gene duplications in different lineages. These hypotheses are, however, based on small data sets and lack adequate statistical and phylogenetic support. In this study, we have made a systematic comparison of the draft genome sequences of Fugu and humans to identify paralogous chromosomal regions ("paralogons") in the Fugu that arose in the ray-finned fish lineage ("fish-specific"). We identified duplicate genes in the Fugu by phylogenetic analyses of the Fugu, human, and invertebrate sequences. Our analyses provide evidence for 425 fish-specific duplicate genes in the Fugu and show that at least 6.6% of the genome is represented by fish-specific paralogons. We estimated the ages of Fugu duplicate genes and paralogons using the molecular clock. Remarkably, the ages of duplicate genes and paralogons are clustered, with a peak around 350 MYA. These data strongly suggest a whole-genome duplication event early during the evolution of ray-finned fishes, probably before the origin of teleosts.  相似文献   

3.
Gene duplication is a crucial mechanism of evolutionary innovation. A substantial fraction of eukaryotic genomes consists of paralogous gene families. We assess the extent of ancestral paralogy, which dates back to the last common ancestor of all eukaryotes, and examine the origins of the ancestral paralogs and their potential roles in the emergence of the eukaryotic cell complexity. A parsimonious reconstruction of ancestral gene repertoires shows that 4137 orthologous gene sets in the last eukaryotic common ancestor (LECA) map back to 2150 orthologous sets in the hypothetical first eukaryotic common ancestor (FECA) [paralogy quotient (PQ) of 1.92]. Analogous reconstructions show significantly lower levels of paralogy in prokaryotes, 1.19 for archaea and 1.25 for bacteria. The only functional class of eukaryotic proteins with a significant excess of paralogous clusters over the mean includes molecular chaperones and proteins with related functions. Almost all genes in this category underwent multiple duplications during early eukaryotic evolution. In structural terms, the most prominent sets of paralogs are superstructure-forming proteins with repetitive domains, such as WD-40 and TPR. In addition to the true ancestral paralogs which evolved via duplication at the onset of eukaryotic evolution, numerous pseudoparalogs were detected, i.e. homologous genes that apparently were acquired by early eukaryotes via different routes, including horizontal gene transfer (HGT) from diverse bacteria. The results of this study demonstrate a major increase in the level of gene paralogy as a hallmark of the early evolution of eukaryotes.  相似文献   

4.
BackgroundSusumu Ohno’s idea that modern vertebrates are degenerate polyploids (concept referred as 2R hypothesis) has been the subject of intense debate for past four decades. It was proposed that intra-genomic synteny regions (paralogons) in human genome are remains of ancient polyploidization events that occurred early in the vertebrate history. The quadruplicated paralogon centered on human HOX clusters is taken as evidence that human HOX-bearing chromosomes were structured by two rounds of whole genome duplication (WGD) events.ResultsEvolutionary history of human HOX-bearing chromosomes (chromosomes 2/7/12/17) was evaluated by the phylogenetic analysis of multigene families with triplicated or quadruplicated distribution on these chromosomes. Topology comparison approach categorized the members of 44 families into four distinct co-duplicated groups. Distinct gene families belonging to a particular co-duplicated group, exhibit similar evolutionary history and hence have duplicated simultaneously, whereas genes of two distinct co-duplicated groups do not share their evolutionary history and have not duplicated in concert with each other.ConclusionThe recovery of co-duplicated groups suggests that “ancient segmental duplications and rearrangements” is the most rational model of evolutionary events that have generated the triplicated and quadruplicated paralogy regions seen on the human HOX-bearing chromosomes.  相似文献   

5.
As a result of two-round whole genome duplications, four or more paralogous Hox clusters exist in vertebrate genomes. The paralogous genes in the Hox clusters show similar expression patterns, implying shared regulatory mechanisms for expression of these genes. Previous studies partly revealed the expression mechanisms of Hox genes. However, cis-regulatory elements that control these paralogous gene expression are still poorly understood. Toward solving this problem, the authors searched conserved non-coding sequences (CNSs), which are candidates of cis-regulatory elements. When comparing orthologous Hox clusters of 19 vertebrate species, 208 intergenic conserved regions were found. The authors then searched for CNSs that were conserved not only between orthologous clusters but also among the four paralogous Hox clusters. The authors found three regions that are conserved among all the four clusters and eight regions that are conserved between intergenic regions of two paralogous Hox clusters. In total, 28 CNSs were identified in the paralogous Hox clusters, and nine of them were newly found in this study. One of these novel regions bears a RARE motif. These CNSs are candidates for gene expression regulatory regions among paralogous Hox clusters. The authors also compared vertebrate CNSs with amphioxus CNSs within the Hox cluster, and found that two CNSs in the HoxA and HoxB clusters retain homology with amphioxus CNSs through the two-round whole genome duplications.  相似文献   

6.
The aims of the study were to outline the sequence of eventsthat gave rise to the vertebrate insulin-relaxin gene familyand the chromosomal regions in which they reside. We analyzedthe gene content surrounding the human insulin/relaxin geneswith respect to what family they belonged to and if the duplicationhistory of investigated families parallels the evolution ofthe insulin-relaxin family members. Markov Clustering and phylogeneticanalysis were used to determine family identity. More than 15%of the genes belonged to families that have paralogs in theregions, defining two sets of quadruplicate paralogy regions.Thereby, the localization of insulin/relaxin genes in humansis in accordance with those regions on human chromosomes 1,11, 12, 19q (insulin/insulin-like growth factors) and 1, 6p/15q,9/5, 19p (insulin-like factors/relaxins) were formed duringtwo genome duplications. We compared the human genome with thatof Ciona intestinalis, a species that split from the vertebratelineage before the two suggested genome duplications. Two insulin-likeorthologs were discovered in addition to the already describedCi-insulin gene. Conserved synteny between the Ciona regionshosting the insulin-like genes and the two sets of human paralogonsimplies their common origin. Linkage of the two human paralogons,as seen in human chromosome 1, as well as the two regions hostingthe Ciona insulin-like genes suggests that a segmental duplicationgave rise to the region prior to the genome doublings. Thus,preserved gene content provides support that genome duplication(s)in addition to segmental and single-gene duplications shapedthe genomes of extant vertebrates.  相似文献   

7.
Aromatic amino acid hydroxylase (AAAH) genes and insulin-like genes form part of an extensive paralogy region shared by human chromosomes 11 and 12, thought to have arisen by tetraploidy in early vertebrate evolution. Cloning of a complementary DNA (cDNA) for an amphioxus (Branchiostoma floridae) hydroxylase gene (AmphiPAH) allowed us to investigate the ancestry of the human chromosome 11/12 paralogy region. Molecular phylogenetic evidence reveals that AmphiPAH is orthologous to vertebrate phenylalanine (PAH) genes; the implication is that all three vertebrate AAAH genes arose early in metazoan evolution, predating vertebrates. In contrast, our phylogenetic analysis of amphioxus and vertebrate insulin-related gene sequences is consistent with duplication of these genes during early chordate ancestry. The conclusion is that two tightly linked gene families on human chromosomes 11 and 12 were not duplicated coincidentally. We rationalize this paradox by invoking gene loss in the AAAH gene family and conclude that paralogous genes shared by paralogous chromosomes need not have identical evolutionary histories.  相似文献   

8.
The appearance of the vertebrates demarcates some of the most far-reaching changes of structure and function seen during the evolution of the metazoans. These drastic changes of body plan and expansion of the central nervous system among other organs coincide with increased gene numbers. The presence of several groups of paralogous chromosomal regions in the human genome is a reflection of this increase. The simplest explanation for the existence of these paralogies would be two genome doublings with subsequent silencing of many genes. It is argued that gene localization data and the delineation of paralogous chromosomal regions give more reliable information about these types of events than dendrograms of gene families as gene relationships are often obscured by uneven replacement rates as well as other factors. Furthermore, the topographical relations of some paralogy groups are discussed.  相似文献   

9.
Analysis of evolution of paralogous genes in a genome is central to our understanding of genome evolution. Comparison of closely related bacterial genomes, which has provided clues as to how genome sequences evolve under natural conditions, would help in such an analysis. With species Staphylococcus aureus, whole-genome sequences have been decoded for seven strains. We compared their DNA sequences to detect large genome polymorphisms and to deduce mechanisms of genome rearrangements that have formed each of them. We first compared strains N315 and Mu50, which make one of the most closely related strain pairs, at the single-nucleotide resolution to catalogue all the middle-sized (more than 10 bp) to large genome polymorphisms such as indels and substitutions. These polymorphisms include two paralogous gene sets, one in a tandem paralogue gene cluster for toxins in a genomic island and the other in a ribosomal RNA operon. We also focused on two other tandem paralogue gene clusters and type I restriction-modification (RM) genes on the genomic islands. Then we reconstructed rearrangement events responsible for these polymorphisms, in the paralogous genes and the others, with reference to the other five genomes. For the tandem paralogue gene clusters, we were able to infer sequences for homologous recombination generating the change in the repeat number. These sequences were conserved among the repeated paralogous units likely because of their functional importance. The sequence specificity (S) subunit of type I RM systems showed recombination, likely at the homology of a conserved region, between the two variable regions for sequence specificity. We also noticed novel alleles in the ribosomal RNA operons and suggested a role for illegitimate recombination in their formation. These results revealed importance of recombination involving long conserved sequence in the evolution of paralogous genes in the genome.  相似文献   

10.
Gene sequence similarity due to shared ancestry after a duplication event, that is paralogy, complicates the assessment of genetic variation, as sequences originating from paralogs can be difficult to distinguish. These confounded sequences are often removed prior to further analyses, leaving the underlying loci uncharacterized. Salmonids have only partially rediploidized subsequent to a whole‐genome duplication; residual tetrasomic inheritance has been observed in males. We present a maximum‐likelihood‐based method to resolve confounded paralogous loci by observing the segregation of alleles in gynogenetic haploid offspring and demonstrate its effectiveness by constructing two linkage maps for chum salmon (Oncorhynchus keta), with and without these newly resolved loci. We find that the resolved paralogous loci are not randomly distributed across the genome. A majority are clustered in expanded subtelomeric regions of 14 linkage groups, suggesting a significant fraction of the chum salmon genome may be missed by the exclusion of paralogous loci. Transposable elements have been proposed as drivers of genome evolution and, in salmonids, may have an important role in the rediploidization process by driving differentiation between homeologous chromosomes. Consistent with that hypothesis, we find a reduced fraction of transposable element annotations among paralogous loci, and these loci predominately occur in the genomic regions that lag in the rediploidization process.  相似文献   

11.
An antecedent of the MHC-linked genomic region in amphioxus   总被引:3,自引:0,他引:3  
The MHC genes on human chromosome 6 are located within one of the best-characterised paralogy regions of the human genome. Numerous genes mapping around this location, 6p21, have paralogues at one, two or three other chromosomal locations on HSA 1, 9 and 19. The similarity between these four chromosomal regions suggests the linkages may have adaptive significance, and/or they may be echoes of segmental or genome duplication in human ancestry. Here, we show that six amphioxus cosmids, containing genes orthologous to those from the human MHC-linked paralogy regions, map to a single amphioxus chromosome. The composition of the MHC-linked genomic region, therefore, pre-dates vertebrate origins.  相似文献   

12.
Bilaterian Hox genes play pivotal roles in the specification of positional identities along the anteroposterior axis. Particularly in vertebrates, their regulation is tightly coordinated by tandem arrays of genes [paralogy groups (PGs)] in four gene clusters (HoxA-D). Traditionally, the uninterrupted Hox cluster (Hox1-14) of the invertebrate chordate amphioxus was regarded as an archetype of the vertebrate Hox clusters. In contrast to Hox1-13 that are globally regulated by the "Hox code" and are often phylogenetically conserved, vertebrate Hox14 members were only recently revealed to be present in an African lungfish, a coelacanth, chondrichthyans and a lamprey, and decoupled from the Hox code. In this study we performed a PCR-based search of Hox14 members from diverse vertebrates, and identified one in the Australian lungfish, Neoceratodus forsteri. Based on a molecular phylogenetic analysis, this gene was designated NfHoxA14. Our real-time RT-PCR suggested its hindgut-associated expression, previously observed also in cloudy catshark HoxD14 and lamprey Hox14α. It is likely that this altered expression scheme was established before the Hox cluster quadruplication, probably at the base of extant vertebrates. To investigate the origin of vertebrate Hox14, by including this sarcopterygian Hox14 member, we performed focused phylogenetic analyses on its relationship with other vertebrate posterior Hox PGs (Hox9-13) as well as amphioxus posterior Hox genes. Our results confirmed the hypotheses previously proposed by other studies that vertebrate Hox14 does not have any amphioxus ortholog, and that none of 1-to-1 pairs of vertebrate and amphioxus posterior Hox genes, based on their relative location in the clusters, is orthologous.  相似文献   

13.
Two rounds of whole genome duplication in the ancestral vertebrate   总被引:5,自引:0,他引:5  
Dehal P  Boore JL 《PLoS biology》2005,3(10):e314
The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish–tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage.  相似文献   

14.
Homeobox genes encode important developmental control proteins. The Drosophila fruit fly HOM complex genes are clustered in region 84-89 of chromosome 3. Probably due to large-scale genome duplication events, their human HOX orthologs belong to four paralogous regions. A series of 13 other homeobox genes are also clustered in region 88-94, on the same chromosome of Drosophila. We suggest that they also duplicated during vertebrate evolution and belong to paralogous regions in humans. These regions are on chromosome arms 4p, 5q, 10q, and 2p or 8p. We coined the term "paralogon" to designate paralogous regions in general. We propose to call these genes "meta Hox" genes. Like Hox genes, metaHox genes are present in one cluster in Drosophila and four clusters (metaHox A-D) in humans on the 4p/5q/10q paralogon.  相似文献   

15.
Whole‐genome duplications are major evolutionary events with a lasting impact on genome structure. Duplication events complicate genetic analyses as paralogous sequences are difficult to distinguish; consequently, paralogs are often excluded from studies. The effects of an ancient whole‐genome duplication (approximately 88 MYA) are still evident in salmonids through the persistence of numerous paralogous gene sequences and partial tetrasomic inheritance. We use restriction site‐associated DNA sequencing on 10 collections of chum salmon from the Salish Sea in the USA and Canada to investigate genetic diversity and population structure in both tetrasomic and rediploidized regions of the genome. We use a pedigree and high‐density linkage map to identify paralogous loci and to investigate genetic variation across the genome. By applying multivariate statistical methods, we show that it is possible to characterize paralogous loci and that they display similar patterns of population structure as the diploidized portion of the genome. We find genetic associations with the adaptively important trait of run‐timing in both sets of loci. By including paralogous loci in genome scans, we can observe evolutionary signals in genomic regions that have routinely been excluded from population genetic studies in other polyploid‐derived species.  相似文献   

16.
From whole genome sequencing of an allotetraploid frog, Xenopus laevis, two homeologous sets (L and S) of four Hox clusters A through D (HoxA.L/S, HoxB.L/S, HoxC.L/S, and HoxD.L/S) and 13 paralogous groups (PGs) with 76 genes were identified, allowing us to carry out the first comprehensive analyses of hox gene expression in vertebrates. Expression of all hox genes during development and in adult tissues was analyzed by RNA‐sequencing. The expression levels of most hox genes were similar between homeologs, but in some pairs, large differences were observed and several of these were confirmed by RT‐PCR and whole mount in situ hybridization experiments. These results indicate that subfunctionalization of hox genes may have occurred since allotetraploidization. Furthermore, comprehensive analysis of hox gene expression during early development did not agree with the hypothesis of temporal collinearity especially in genes belonging to PG2 to PG10 .  相似文献   

17.
18.
The occurrence of a fish-specific genome duplication (FSGD) in the lineage leading to teleost fishes is widely accepted, but the consequences of this event remain elusive. Teleosts, and the cichlid fishes from the species flocks in the East African Great Lakes in particular, evolved a unique complexity and diversity of body coloration and color patterning. Several genes involved in pigment cell development have been retained in duplicate copies in the teleost genome after the FSGD. Here we investigate the evolutionary fate of one of these genes, the type III receptor tyrosine kinase (RTK) colony-stimulating factor 1 receptor (csf1r). We isolated and shotgun sequenced two paralogous csf1r genes from a bacterial artificial chromosome library of the cichlid fish Astatotilapia burtoni that are both linked to paralogs of the pdgfr beta gene, another type III RTK. Two pdgfr beta-csf1r paralogons were also identified in the genomes of pufferfishes and medaka, and our phylogenetic analyses suggest that the pdgfr beta-csf1r locus was duplicated during the course of the FSGD. Comparisons of teleosts and tetrapods suggest asymmetrical divergence at different levels of genomic organization between the teleost-specific pdgfr beta-csf1r paralogons, which seem to have evolved as coevolutionary units. The high-evolutionary rate in the teleost B-paralogon, consisting of csf1rb and pdgfr betab, further suggests neofunctionalization by functional divergence of the extracellular, ligand-binding region of these cell-surface receptors. Finally, we hypothesize that genome duplications and the associated expansion of the RTK family might be causally linked to the evolution of coloration in vertebrates and teleost fishes in particular.  相似文献   

19.
It has been proposed that two events of duplication of the entire genome occurred early in vertebrate history (2R hypothesis). Several phylogenetic studies with a few gene families (mostly Hox genes and proteins from the MHC) have tried to confirm these polyploidization events. However, data from a single locus cannot explain the evolutionary history of a complete genome. To study this 2R hypothesis, we have taken advantage of the phylogenetic position of the lamprey to study the history of gene duplications in vertebrates. We selected most gene families that contain several paralogous genes in vertebrates and for which lamprey genes and an out-group are known in databases. In addition, we isolated members of the nuclear receptor superfamily in lamprey. Hagfish genes were also analyzed and found to confirm the lamprey gene analysis. Consistent with the 2R hypothesis, the phylogenetic analysis of 33 selected gene families, dispersed through the whole genome, revealed that one period of gene duplication arose before the lamprey-gnathostome split and this was followed by a second period of gene duplication after the lamprey-gnathostome split. Nevertheless, our analysis suggests that numerous gene losses and other gene-genome duplications occurred during the evolution of the vertebrate genomes. Thus, the complexity of all the paralogy groups present in vertebrates should be explained by the contribution of genome duplications (2R hypothesis), extra gene duplications, and gene losses.  相似文献   

20.
Polyploidy in vertebrate ancestry: Ohno and beyond   总被引:1,自引:0,他引:1  
Over 30 years ago, Susumu Ohno proposed that two rounds of polyploidy occurred early in vertebrate evolution. We re-examine this proposal using three recent lines of evidence. First, total gene number estimates from completely sequenced genomes suggest an increase in total gene number somewhere along the vertebrate or prevertebrate lineage, compatible with Ohno's model. Second, analyses of homeobox and other genes from amphioxus reveal very extensive gene duplication specifically on the vertebrate lineage. This refines the timing of putative polyploidy to after the divergence of amphioxus and vertebrates. Third, the existence of four-fold paralogy regions in the human genome is suggestive of two rounds of polyploidy, although other explanations are possible. We propose an experimental test, based on chromosomal localization of genes in amphioxus, that should resolve whether paralogy regions are indeed remnants of duplication in vertebrate ancestry.  © 2004 The Linnean Society of London, Biological Journal of the Linnean Society , 2004, 82 , 425–430.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号