共查询到20条相似文献,搜索用时 15 毫秒
1.
The class I region of the major histocompatibility complex contains two subgenomic blocks (250–350 kb each), known as the
alpha and beta blocks. These blocks contain members of multicopy gene families including HLA class I, HERV-16 (previously
called P5 sequences), and PERB11 (MIC). We have previously shown that each block consists of imperfect duplicated segments
(duplicons) containing linked members of different gene families, retroelements and transposons that have coevolved as part
of two separate evolutionary events. Another region provisionally designated here as the kappa block is located between the
alpha and the beta blocks and contains HLA-E, -30, and -92, HERV-16 (P5.3), and PERB11.3 (MICC) within about 250 kb of sequence.
Using Alu elements to trace the evolutionary relationships between different class I duplicons, we have found that (a) the
kappa block contains paralogous (duplicated) Alu J sequences and other retroelement patterns more in common with the beta
than the alpha block; (b) the retroelement pattern associated with the HLA-E duplicon is different from all other HLA class
I duplicons, indicating a more complex evolution; (c) the HLA-92 duplicon, although substantially shorter, is closely related
in sequence to the HLA-B and -C duplicons; (d) two of the six paralogous Alu J elements within the HLA-B and -C duplicons
are associated with the HLA-X duplicon, confirming their evolutionary relationships within the beta block; and (e) the paralogous
Alu J elements within the alpha block are distinctly different from those identified within the beta and kappa blocks. The
sequence conservation and location of duplicated (paralogous) Alu J elements in the MHC class I region show that the beta
and kappa blocks have evolved separately from the alpha block beginning at a time before or during the evolution of Alu J
elements in primates.
Received: 22 September 1999 / Accepted: 24 January 2000 相似文献
2.
Jerzy K. Kulski Silvana Gaudieri Annalise Martin Roger L. Dawkins 《Journal of molecular evolution》1999,49(1):84-97
The recent availability of genomic sequence information for the class I region of the MHC has provided an opportunity to
examine the genomic organization of HLA class I (HLAcI) and PERB11/MIC genes with a view to explaining their evolution from
the perspective of extended genomic duplications rather than by simple gene duplications and/or gene conversion events. Analysis
of genomic sequence from two regions of the MHC (the alpha- and beta-blocks) revealed that at least 6 PERB11 and 14 HLAcI
genes, pseudogenes, and gene fragments are contained within extended duplicated segments. Each segment was searched for the
presence of shared (paralogous) retroelements by RepeatMasker in order to use them as markers of evolution, genetic rearrangements,
and evidence of segmental duplications. Shared Alu elements and other retroelements allowed the duplicated segments to be
classified into five distinct groups (A to E) that could be further distilled down to an ancient preduplication segment containing
a HLA and PERB11 gene, an endogenous retrovirus (HERV-16), and distinctive retroelements. The breakpoints within and between
the different HLAcI segments were found mainly within the PERB11 and HLA genes, HERV-16, and other retroelements, suggesting
that the latter have played a major role in duplication and indel events leading to the present organization of PERB11 and
HLAcI genes. On the basis of the features contained within the segments, a coevolutionary model premised on tandem duplication
of single and multipartite genomic segments is proposed. The model is used to explain the origins and genomic organization
of retroelements, HERV-16, DNA transposons, PERB11, and HLAcI genes as distinct segmental combinations within the alpha- and
beta-blocks of the human MHC.
Received: 5 December 1998 / Accepted: 27 January 1999 相似文献
3.
Jerzy K. Kulski David S. Dunn Silvana Gaudieri Takashi Shiina Hidetoshi Inoko 《Journal of molecular evolution》2001,53(6):642-650
The human CD1 proteins belong to a lipid-glycolipid antigen-presenting gene family and are related in structure and function to the MHC class I molecules. Previous mapping and DNA hybridization studies have shown that five linked genes located within a cluster on human chromosome 1q22-23 encode the CD1 protein family. We have analyzed the complete genomic sequence of the human CD1 gene cluster and found that the five active genes are distributed over 175,600 nucleotides and separated by four expanded intervening genomic regions (IGRs) ranging in length between 20 and 68 kb. The IGRs are composed mostly of retroelements including five full-length L1 PA sequences and various pseudogenes. Some L1 sequences have acted as receptors for other subtypes or families of retroelements. Alu molecular clocks that have evolved during primate history are found distributed within the HLA class I duplicated segments (duplicons) but not within the duplicons of CD1. Phylogeny of the alpha3 domain of the class I-like superfamily of proteins shows that the CD1 cluster is well separated from HLA class I by a number of superfamily members including MIC (PERB11), HFE, Zn-alpha2-GP, FcRn, and MR1. Phylogenetically, the human CD1 sequences are interspersed by CD1 sequences from other mammalian species, whereas the human HLA class I sequences cluster together and are separated from the other mammalian sequences. Genomic and phylogenetic analyses support the view that the human CD1 gene copies were duplicated prior to the evolution of primates and the bulk of the HLA class I genes found in humans. In contrast to the HLA class I genomic structure, the human CD1 duplicons are smaller in size, they lack Alu clocks, and they are interrupted by IGRs at least 4 to 14 times longer than the CD1 genes themselves. The IGRs seem to have been created as "buffer zones" to protect the CD1 genes from disruption by transposable elements. 相似文献
4.
Hughes AL 《Journal of molecular evolution》1999,48(5):565-576
It has been proposed that two rounds of duplication of the entire genome (polyploidization) occurred early in vertebrate
history (the 2R hypothesis); and the observation that certain gene families important in regulating development have four
members in vertebrates, as opposed to one in Drosophila, has been adduced as evidence in support of this hypothesis. However, such a pattern of relationship can be taken as support
of the 2R hypothesis only if (1) the four vertebrate genes can be shown to have diverged after the origin of vertebrates,
and (2) the phylogeny of the four vertebrate genes (A–D) exhibits a topology of the form (AB) (CD), rather than (A) (BCD).
In order to test the 2R hypothesis, I constructed phylogenies for nine protein families important in development. Only one
showed a topology of the form (AB) (CD), and that received weak statistical support. In contrast, four phylogenies showed
topologies of the form (A) (BCD) with statistically significant support. Furthermore, in two cases there was significant support
for duplication of the vertebrate genes prior to the divergence of deuterostomes and protostomes: in one case there was significant
support for duplication of the vertebrate genes at least prior to the divergence of vertebrates and urochordates, and in one
case there was weak support for duplication of the vertebrate genes prior to the divergence of deuterostomes and protostomes.
Taken together with other recently published phylogenies of developmentally important genes, these results provide strong
evidence against the 2R hypothesis.
Received: 22 December 1997 / Accepted: 5 October 1998 相似文献
5.
When divergence between viral species is large, the analysis and comparison of nucleotide or protein sequences are dependent
on mutation biases and multiple substitutions per site leading, among other things, to the underestimation of branch lengths
in phylogenetic trees. To avoid the problem of multiply substituted sites, a method not directly based on the nucleic or protein
sequences has been applied to retroviruses. It consisted of asking questions about genome structure or organization, and gene
function, the series of answers creating coded sequences analyzed by phylogenic software. This method recovered the principal
retroviral groups such as the lentiviruses and spumaviruses and highlighted questions and answers characteristic of each group
of retroviruses. In general, there was reasonable concordance between the coded genome methodology and that based on conventional
phylogeny of the integrase protein sequence, indicating that integrase was fixing mutations slowly enough to marginalize the
problem of multiple substitutions at sites. To a first approximation, this suggests that the acquisition of novel genetic
features generally parallels the fixation of amino acid substitutions.
Received: 18 May 2001 / Accepted: 7 September 2001 相似文献
6.
Glyoxalase I (GlxI) is the first of two enzymes involved in the cellular detoxification of methylglyoxal. A recent search
of the National Center for Biotechnology Information (NCBI) databases with the protein sequence of Salmonella typhimurium GlxI identified two new hypothetical proteins with unassigned function. These two sequences, from Brassica oleracea and Sporobolus stapfianus, have significant sequence similarity to known GlxI sequences, suggesting that these two open reading frames encode for GlxI
in these plants. Interestingly, analysis of these two new sequences indicates that they code for a protein composed of two
fused monomers, a situation previously found solely in the yeast GlxI enzymes.
Received: 10 May 1997 / Accepted: 15 October 1997 相似文献
7.
Takehiro Kusakabe Isato Araki Noriyuki Satoh William R. Jeffery 《Journal of molecular evolution》1997,44(3):289-298
The origin and evolutionary relationship of actin isoforms was investigated in chordates by isolating and characterizing
two new ascidian cytoplasmic and muscle actin genes. The exon–intron organization and sequences of these genes were compared
with those of other invertebrate and vertebrate actin genes. The gene HrCA1 encodes a cytoplasmic (nonmuscle)-type actin, whereas the MocuMA2 gene encodes an adult muscle-type actin. Our analysis of these genes showed that intron positions are conserved among the
deuterostome actin genes. This suggests that actin gene families evolved from a single actin gene in the ancestral deuterostome.
Sequence comparisons and molecular phylogenetic analyses also suggested a close relationship between the ascidian and vertebrate
actin isoforms. It was also found that there are two distinct lineages of muscle actin isoforms in ascidians: the larval muscle
and adult body-wall isoforms. The four muscle isoforms in vertebrates show a closer relationship to each other than to the
ascidian muscle isoforms. Similarly, the two cytoplasmic isoforms in vertebrates show a closer relationship to each other
than to the ascidian and echinoderm cytoplasmic isoforms. In contrast, the two types of ascidian muscle actin diverge from
each other. The close relationship between the ascidian larval muscle actin and the vertebrate muscle isoforms was supported
by both neighbor-joining and maximum parsimony analyses. These results suggest that the chordate ancestor had at least two
muscle actin isoforms and that the vertebrate actin isoforms evolved after the separation of the vertebrates and urochordates.
Received: 20 June 1996 / Accepted: 16 October 1996 相似文献
8.
Retrovirus-like sequences and their solitary (solo) long terminal repeats (LTRs) are common repetitive elements in eukaryotic
genomes. We reported previously that the tandemly arrayed genes encoding U2 snRNA (the RNU2 locus) in humans and apes contain a solo LTR (U2-LTR) which was presumably generated by homologous recombination between
the two LTRs of an ancestral provirus that is retained in the orthologous baboon RNU2 locus. We have now sequenced the orthologous U2-LTRs in human, chimpanzee, gorilla, orangutan, and baboon and examined numerous
homologs of the U2-LTR that are dispersed throughout the human genome. Although these U2-LTR homologs have been collectively
referred to as LTR13 in the literature, they do not display sequence similarity to any known retroviral LTRs; however, the
structure of LTR13 closely resembles that of other retroviral LTRs with a putative promoter, polyadenylation signal, and a
tandemly repeated 53-bp enhancer-like element. Genomic blotting indicates that LTR13 is primate-specific; based on sequence
analysis, we estimate there are about 2,500 LTR13 elements in the human genome. Comparison of the primate U2-LTR sequences
suggests that the homologous recombination event that gave rise to the solo U2-LTR occurred soon after insertion of the ancestral
provirus into the ancestral U2 tandem array. Phylogenetic analysis of the LTR13 family confirms that it is diverse, but the
orthologous U2-LTRs form a coherent group in which chimpanzee is closest to the humans; orangutan is a clear outgroup of human,
chimpanzee, and gorilla; and baboon is a distant relative of human, chimpanzee, gorilla, and orangutan. We compare the LTR13
family with other known LTRs and consider whether these LTRs might play a role in concerted evolution of the primate RNU2 locus.
Received: 29 September 1997 / Accepted: 16 January 1998 相似文献
9.
Hiroshi Suga Daisuke Hoshiyama Shigehiro Kuraku Kazutaka Katoh Kaoru Kubokawa Takashi Miyata 《Journal of molecular evolution》1999,49(5):601-608
Animals evolved a variety of gene families involved in cell–cell communication and developmental control by gene duplication
and domain shuffling. Each family is made up of several subtypes or subfamilies with distinct structures and functions, which
diverged by gene duplications and domain shufflings before the divergence of parazoans and eumetazoans. Since the separation
from protostomes, vertebrates expanded the multiplicity of members (isoforms) in the same subfamily by further gene duplications
in their early evolution before the fish–tetrapod split. To know the dates of isoform duplications more closely, we have conducted
isolation and sequencing cDNAs encoding the fibroblast growth factor receptor, Eph, src, and platelet-derived growth factor receptor subtypes belonging to the protein tyrosine kinase family from Branchiostoma belcheri, an amphioxus, Eptatretus burgeri, a hagfish, and Lampetra reissneri, a lamprey. From a phylogenetic tree of each subfamily inferred from a maximum likelihood (ML) method, together with a bootstrap
analysis based on the ML method, we have shown that the isoform duplications frequently occurred in the early evolution of
vertebrates around or just before the divergence of cyclostomes and gnathostomes by gene duplications and possibly chromosomal
duplications.
Received: 28 April 1998 / Accepted: 30 June 1999 相似文献
10.
M. Mar Albà Mauro F. Santibáñez-Koref John M. Hancock 《Journal of molecular evolution》2001,52(3):249-259
Polyglutamine repeats within proteins are common in eukaryotes and are associated with neurological diseases in humans. Many
are encoded by tandem repeats of the codon CAG that are likely to mutate primarily by replication slippage. However, a recent
study in the yeast Saccharomyces cerevisiae has indicated that many others are encoded by mixtures of CAG and CAA which are less likely to undergo slippage. Here we
attempt to estimate the proportions of polyglutamine repeats encoded by slippage-prone structures in species currently the
subject of genome sequencing projects. We find a general excess over random expectation of polyglutamine repeats encoded by
tandem repeats of codons. We nevertheless find many repeats encoded by nontandem codon structures. Mammals and Drosophila display extreme opposite patterns. Drosophila contains many proteins with polyglutamine tracts but these are generally encoded by interrupted structures. These structures
may have been selected to be resistant to slippage. In contrast, mammals (humans and mice) have a high proportion of proteins
in which repeats are encoded by tandem codon structures. In humans, these include most of the triplet expansion disease genes.
Received: 17 August 2000 / Accepted: 20 November 2000 相似文献
11.
A 2550-bp portion of the mitochondrial genome of a Demosponge, genus Tetilla, was amplified from whole genomic DNA extract and sequenced. The sequence was found to code for the 3′ end of the 16S rRNA
gene, cytochrome c oxidase subunit II, a lysine tRNA, ATPase subunit 8, and a 5′ portion of ATPase subunit 6. The Porifera cluster distinctly
within the eumetazoan radiation, as a sister group to the Cnidaria. Also, the mitochondrial genetic code of this sponge is
likely identical to that found in the Cnidaria. Both the full COII DNA and protein sequences and a portion of the 16S rRNA
gene were found to possess a striking similarity to published Cnidarian mtDNA sequences, allying the Porifera more closely
to the Cnidaria than to any other metazoan phylum. The gene arrangement, COII—tRNALys—ATP8—ATP6, is observed in many Eumetazoan phyla and is apparently ancestral in the metazoa.
Received: 24 November 1997 / Accepted: 14 September 1998 相似文献
12.
Manfred Steinemann Sigrid Steinemann Wilhelm Pinsker 《Journal of molecular evolution》1996,43(4):405-412
The larval cuticle protein genes (Lcps) represent a multigene family located at the right arm of the metacentric autosome 2 (2R) in Drosophila melanogaster. Due to a chromosome fusion the Lcp locus of Drosophila miranda is situated on a pair of secondary sex chromosomes, the X2 and neo-Y chromosome. Comparing the DNA sequences from D. miranda and D. melanogaster organization and the gene arrangement of Lcp1–Lcp4 are similar, although the intergene distances vary considerably. The greatest difference between Lcp1 and Lcp2 is due to the occurrence of a pseudogene in D. melanogaster which is not present in D. miranda. Thus the cluster of the four Lcp genes existed already before the separation of the melanogaster and obscura group. Intraspecific homogenizations of different cluster units must have occurred repeatedly between the Lcp1/Lcp2 and Lcp3/Lcp4 sequence types. The most obvious example is exon 2 of the Lcp3 gene in D. miranda, which has been substituted by the corresponding section of the Lcp4 gene rather recently. The homogenization must have occurred before the translocation which generated the neo-Y chromosome. Lcp3 of D. melanogaster has therefore no orthologous partner in D. miranda. Rearrangements in the promoter regions of the D. miranda Lcp genes have generated new, potentially functional CAAT-box motifs. Since three of the Lcp alleles on the neo-Y are not expressed and Lcp3 is expressed only at a reduced level, it is suggestive to speculate that the rearrangements might be involved as cis-regulatory elements in the up-regulation of the X2-chromosomal Lcp alleles, in Drosophila an essential process for dosage compensation. The Lcp genes on the neo-Y chromosome have accumulated more base substitutions than the corresponding alleles on the X2.
Received: 27 December 1995 / Accepted: 30 April 1996 相似文献
13.
The genomic organization of the hsp83 gene of Drosophila auraria, a far-eastern endemic species belonging to the montium subgroup of the melanogaster species group, is presented here. Based on in situ hybridization on polytene chromosomes, cDNA and genomic clone mapping,
nucleotide sequencing, and genomic Southern analysis, hsp83 is shown to be present as a single-copy gene at locus 64B on the 3L chromosome arm in D. auraria. This gene is organized into two exons separated by a 929-bp intron. The first exon represents the mRNA leader sequence and
is not translated, while the coding region, having a length of 2,151 bp, is solely included in the second exon. Nucleotide
sequence comparisons of D. auraria hsp83 with homologous sequences from other organisms show high conservation of the coding region (88–92% identity) in the genus
Drosophila, in addition to the conserved genomic organization of two-exons–one-intron, of comparable size and arrangement. A phylogenetic
tree based on the protein sequences of homologous genes from representative organisms is in accord with the accredited phylogenetic
position of D. auraria. In the hsp83 gene region, a second case of long antiparallel coupled open reading frames (LAC ORFs) for this species was found. The antiparallel
to the hsp83 gene ORF is 1,554 bases long, while the two ORFs overlap has a size of 1,548 bp. The anti-hsp83 ORF does not show significant homology to any known gene sequences. In addition, no similar LAC ORF structures were found
in homologous gene regions of other organisms.
Received: 18 April 1997 / Accepted: 1 August 1997 相似文献
14.
Margaret J. Beaton Andrew J. Roger Thomas Cavalier-Smith 《Journal of molecular evolution》1998,47(6):697-708
The nucleotide sequence for an 11,715-bp segment of the mitochondrial genome of the octocoral Sarcophyton glaucum is presented, completing the analysis of the entire genome for this anthozoan member of the phylum Cnidaria. The genome contained
the same 13 protein-coding and 2 ribosomal RNA genes as in other animals. However, it also included an unusual mismatch repair
gene homologue reported previously and codes for only a single tRNA gene. Intermediate in length compared to two other cnidarians
(17,443 and 18,911 bp), this organellar genome contained the smallest amount of noncoding DNA (428, compared to 1283 and 781
nt, respectively), making it the most compact one found for the phylum to date. The mitochondrial genes of S. glaucum exhibited an identical arrangement to that found in another octocoral, Renilla kolikeri, with five protein-coding genes in the same order as has been found in insect and vertebrate mitochondrial genomes. Although
gene order appears to be highly conserved among octocorals, compared to the hexacoral, Metridium senile, few similarities were found. Like other metazoan mitochondrial genomes, the A + T composition was elevated and a general
bias against codons ending in G or C was observed. However, an exception to this was the infrequent use of TGA compared to
TGG to code for tryptophan. This divergent codon bias is unusual but appears to be a conserved feature among two rather distantly
related anthozoans.
Received: 27 January 1998 / Accepted: 25 May 1998 相似文献
15.
Calmodulin is a calcium-binding EF-hand protein that is an activator of many enzymes as well as ion pumps and channels. Due
to its multiple targets and its central role in the cell, understanding the evolutionary history of calmodulin genes should
provide insights into the origin of genetic complexity in eukaryotes. We have previously isolated and characterized a calmodulin
gene from the early-diverging chordate Branchiostoma lanceolatum (CaM1). In this paper, we report the existence of a second calmodulin gene (CaM2) as well as two CaM-like genomic fragments (CaML-2, CaML-3) in B. lanceolatum and a CaM2 and three CaM-like genes (CaML-1, CaML-2, CaML-3) in B. floridae. The CaM-like genes were isolated using low-stringency PCR. Surprisingly, the nucleotide sequences of the B. lanceolatum CaM1 and CaM2 cDNAs differ by 19.3%. Moreover, the CaM2 protein differs at two positions from the amino acid sequence of CaM1; the latter
is identical to calmodulins in Drosophila melanogaster, the mollusc Aplysia californica, and the tunicate Halocynthia roretzi. The two B. lanceolatum CaM-like genes are more closely related to the CaM2 than to the CaM1 gene. This relationship is supported by the phylogenetic analyses and the identical exon/intron organization of these three
genes, a relationship unique among animal CaM sequences. These data demonstrate the existence of a CaM multigene family in the cephalochordate Branchiostoma, which may have evolved independently from the multigene family in vertebrates.
Received: 2 November 1999 / Accepted: 25 April 2000 相似文献
16.
In this paper we have analyzed 49 vertebrate gene families that were generated in the early stage of vertebrates and/or shortly
before the origin of vertebrates, each of which consists of three or four member genes. We have dated the first (T1) and second (T2) gene duplications of 26 gene families with 3 member genes. The means of T1 (594 mya) and T2 (488 mya) are largely consistent to a well-cited version of two-round (2R) genome duplication theory. Moreover, in most cases,
the time interval between two successive gene duplications is large enough that the fate of duplicate genes generated by the
first gene duplication was likely to be determined before the second one took place. However, the phylogenetic pattern of
23 gene families with 4 members is complicated; only 5 of them are predicted by 2R model, but 11 families require an additional
gene (or genome) duplication. For the rest (7 families), at least one gene duplication event had occurred before the divergence
between vertebrate and Drosophila, indicating a possible misleading of the 4:1 rule (member gene ratio between vertebrates and invertebrates). Our results show
that Ohno's 2R conjecture is valid as a working hypothesis for providing a most parsimonious explanation. Although for some
gene families, additional gene duplication is needed, the credibility of the third genome duplication (3R) remains to be investigated.
Received: 13 December 1999 / Accepted: 7 April 2000 相似文献
17.
Anton Karabinos Debashish Bhattacharya Hartmut D. Kratzin Norbert Hilschmann 《Journal of molecular evolution》1998,46(3):327-333
The human protein NEFA binds calcium, contains a leucine zipper repeat that does not form a homodimer, and is proposed (along
with the homologous Nuc protein) to have a common evolutionary history with an EF-hand ancestor. We have isolated and characterized
the N-terminal domain of NEFA that contains a signal sequence inferred from both endoproteinase Asp-N (Asp-N) and tryptic
digests. Analysis of this N-terminal sequence shows significant similarity to the conserved multiple domains of the mitochondrial
carrier family (MCF) proteins. The leader sequence of Nuc is, however, most similar to the signal sequences of membrane and/or
secreted proteins (e.g., mouse insulin-like growth factor receptor). We suggest that the divergent NEFA and Nuc N-terminal
sequences may have independent origins and that the common high hydrophobicity governs their targeting to the ER. These results
provide insights into signal sequence evolution and the multiple origins of protein targeting.
Received: 20 February 1997 / Accepted: 28 July 1997 相似文献
18.
In this paper we analyzed 49 lactate dehydrogenase (LDH) sequences, mostly from vertebrates. The amino acid sequence differences
were found to be larger for a human–killifish pair than a human–lamprey pair. This indicates that some protein sequence convergence
may occur and reduce the sequence differences in distantly related species. We also examined transitions and transversions
separately for several species pairs and found that the transitions tend to be saturated in the distantly related species
pair, while transversions are increasing. We conclude that transversions maintain a conservative rate through the evolutionary
time. Kimura's two-parameter model for multiple-hit correction on transversions only was used to derive a distance measure
and then construct a neighbor-joining (NJ) tree. Three findings were revealed from the NJ tree: (i) the branching order of
the tree is consistent with the common branch pattern of major vertebrates; (ii) Ldh-A and Ldh-B genes were duplicated near the origin of vertebrates; and (iii) Ldh-C and Ldh-A in mammals were produced by an independent gene duplication in early mammalian history. Furthermore, a relative rate test
showed that mammalian Ldh-C evolved more rapidly than mammalian Ldh-A. Under a two-rate model, this duplication event was calibrated to be approximately 247 million years ago (mya), dating back
to the Triassic period. Other gene duplication events were also discovered in Xenopus, the first duplication occurring approximately 60–70 mya in both Ldh-A and Ldh-B, followed by another recent gene duplication event, approximately 20 mya, in Ldh-B.
Received: 5 October 2001 / Accepted: 24 October 2001 相似文献
19.
20.
Cristian Cañestro Ricard Albalat Lars Hjelmqvist Laura Godoy Hans Jörnvall Roser Gonzàlez-Duarte 《Journal of molecular evolution》2002,54(1):81-89
The alcohol dehydrogenase (ADH) family has evolved into at least eight ADH classes during vertebrate evolution. We have characterized
three prevertebrate forms of the parent enzyme of this family, including one from an urochordate (Ciona intestinalis) and two from cephalochordates (Branchiostoma floridae and Branchiostoma lanceolatum). An evolutionary analysis of the family was performed gathering data from protein and gene structures, exon–intron distribution,
and functional features through chordate lines. Our data strongly support that the ADH family expansion occurred 500 million
years ago, after the cephalochordate/vertebrate split, probably in the gnathostome subphylum line of the vertebrates. Evolutionary
rates differ between the ancestral, ADH3 (glutathione-dependent formaldehyde dehydrogenase), and the emerging forms, including
the classical alcohol dehydrogenase, ADH1, which has an evolutionary rate 3.6-fold that of the ADH3 form. Phylogenetic analysis
and chromosomal mapping of the vertebrate Adh gene cluster suggest that family expansion took place by tandem duplications, probably concurrent with the extensive isoform
burst observed before the fish/tetrapode split, rather than through the large-scale genome duplications also postulated in
early vertebrate evolution. The absence of multifunctionality in lower chordate ADHs and the structures compared argue in
favor of the acquisition of new functions in vertebrate ADH classes. Finally, comparison between B. floridae and B. lanceolatum Adhs provides the first estimate for a cephalochordate speciation, 190 million years ago, probably concomitant with the beginning
of the drifting of major land masses from the Pangea.
Received: 10 April 2001 / Accepted: 23 May 2001 相似文献