共查询到20条相似文献,搜索用时 46 毫秒
1.
Diversity, Distribution, and Ancient Taxonomic Relationships Within the TIR and Non-TIR NBS-LRR Resistance Gene Subfamilies 总被引:1,自引:0,他引:1
Cannon SB Zhu H Baumgarten AM Spangler R May G Cook DR Young ND 《Journal of molecular evolution》2002,54(4):548-562
Phylogenetic relationships among the NBS-LRR (nucleotide binding site–leucine-rich repeat) resistance gene homologues (RGHs)
from 30 genera and nine families were evaluated relative to phylogenies for these taxa. More than 800 NBS-LRR RGHs were analyzed,
primarily from Fabaceae, Brassicaceae, Poaceae, and Solanaceae species, but also from representatives of other angiosperm
and gymnosperm families. Parsimony, maximum likelihood, and distance methods were used to classify these RGHs relative to
previously observed gene subfamilies as well as within more closely related sequence clades. Grouping sequences using a distance
cutoff of 250 PAM units (point accepted mutations per 100 residues) identified at least five ancient sequence clades with
representatives from several plant families: the previously observed TIR gene subfamily and a minimum of four deep splits
within the non-TIR gene subfamily. The deep splits in the non-TIR subfamily are also reflected in comparisons of amino acid
substitution rates in various species and in ratios of nonsynonymous-to-synonymous nucleotide substitution rates (K
A/K
S values) in Arabidopsis thaliana. Lower K
A/K
S values in the TIR than the non-TIR sequences suggest greater functional constraints in the TIR subfamily. At least three
of the five identified ancient clades appear to predate the angiosperm–gymnosperm radiation. Monocot sequences are absent
from the TIR subfamily, as observed in previous studies. In both subfamilies, clades with sequences separated by approximately
150 PAM units are family but not genus specific, providing a rough measure of minimum dates for the first diversification
event within these clades. Within any one clade, particular taxa may be dramatically over- or underrepresented, suggesting
preferential expansions or losses of certain RGH types within particular taxa and suggesting that no one species will provide
models for all major sequence types in other taxa.
Received: 13 June 2001 / Accepted: 22 October 2001 相似文献
2.
3.
Hiroshi Suga Daisuke Hoshiyama Shigehiro Kuraku Kazutaka Katoh Kaoru Kubokawa Takashi Miyata 《Journal of molecular evolution》1999,49(5):601-608
Animals evolved a variety of gene families involved in cell–cell communication and developmental control by gene duplication
and domain shuffling. Each family is made up of several subtypes or subfamilies with distinct structures and functions, which
diverged by gene duplications and domain shufflings before the divergence of parazoans and eumetazoans. Since the separation
from protostomes, vertebrates expanded the multiplicity of members (isoforms) in the same subfamily by further gene duplications
in their early evolution before the fish–tetrapod split. To know the dates of isoform duplications more closely, we have conducted
isolation and sequencing cDNAs encoding the fibroblast growth factor receptor, Eph, src, and platelet-derived growth factor receptor subtypes belonging to the protein tyrosine kinase family from Branchiostoma belcheri, an amphioxus, Eptatretus burgeri, a hagfish, and Lampetra reissneri, a lamprey. From a phylogenetic tree of each subfamily inferred from a maximum likelihood (ML) method, together with a bootstrap
analysis based on the ML method, we have shown that the isoform duplications frequently occurred in the early evolution of
vertebrates around or just before the divergence of cyclostomes and gnathostomes by gene duplications and possibly chromosomal
duplications.
Received: 28 April 1998 / Accepted: 30 June 1999 相似文献
4.
Dietmar Kültz 《Journal of molecular evolution》1998,46(5):571-588
All currently sequenced stress-activated protein kinases (SAPKs), extracellular signal-regulated kinases (ERKs), and other
mitogen-activated protein kinases (MAPKs) were analyzed by sequence alignment, phylogenetic tree construction, and three-dimensional
structure modeling in order to classify members of the MAPK family. Based on this analysis the MAPK family was divided into
three subgroups (SAPKs, ERKs, and MAPK3) that consist of at least nine subfamilies. Members of a given subfamily were exclusively
from animals, plants, or yeast/fungi. A single signature sequence, [LIVM][TS]XX[LIVM]XT[RK][WY]YRXPX[LIVM] [LIVM], was identified
that is characteristic for all MAPKs and sufficient to distinguish MAPKs from other members of the protein kinase superfamily.
This signature sequence contains the phosphorylation site and is located on loop 12 of the three-dimensional structure of
MAPKs. I also identified signature sequences that are characteristic for each of the nine subfamilies of MAPKs. By modeling
the three-dimensional structure of three proteins for each MAPK subfamily based on the resolved atomic structures of rat ERK2
and murine p38, it is demonstrated that amino acids conserved in all MAPKs are located primarily in the center of the protein
around the catalytic cleft. I conclude that these residues are important for maintaining proper folding into the gross structure
common to all MAPKs. On the other hand, amino acids conserved in a given subfamily are located mainly in the periphery of
MAPKs, indicating their possible importance for defining interactions with substrates, activators, and inhibitors. Within
these subfamily-specific regions, amino acids were identified that represent unique residues occurring in only a single subfamily
and their location was mapped in three-dimensional structure models. These unique residues are likely to be crucial for subfamily-specific
interactions of MAPKs with substrates, inhibitors, or activators and, therefore, represent excellent targets for site-directed
mutagenesis experiments.
Received: 13 August 1997 / Accepted: 21 November 1997 相似文献
5.
6.
The members of the PKA regulatory subunit family (PKA-R family) were analyzed by multiple sequence alignment and clustering
based on phylogenetic tree construction. According to the phylogenetic trees generated from multiple sequence alignment of
the complete sequences, the PKA-R family was divided into four subfamilies (types I to IV). Members of each subfamily were
exclusively from animals (types I and II), fungi (type III), and alveolates (type IV). Application of the same methodology
to the cAMP-binding domains, and subsequently to the region delimited by β-strands 6 and 7 of the crystal structures of bovine
RIα and rat RIIβ (the phosphate-binding cassette; PBC), proved that this highly conserved region was enough to classify unequivocally
the members of the PKA-R family. A single signature sequence, F–G–E–[LIV]–A–L–[LIMV]–x(3)–[PV]–R–[ANQV]–A, corresponding to
the PBC was identified which is characteristic of the PKA-R family and is sufficient to distinguish it from other members
of the cyclic nucleotide-binding protein superfamily. Specific determinants for the A and B domains of each R-subunit type
were also identified. Conserved residues defining the signature motif are important for interaction with cAMP or for positioning
the residues that directly interact with cAMP. Conversely, residues that define subfamilies or domain types are not conserved
and are mostly located on the loop that connects α-helix B′ and β strand 7.
Received: 2 November 2000/Accepted: 14 June 2001 相似文献
7.
Insect vitellogenin and yolk protein receptors (VgR/YPR) are newly discovered members of the low-density lipoprotein receptor
(LDLR) family, which is characterized by a highly conserved arrangement of repetitive modular elements homologous to functionally
unrelated proteins. The insect VgR/YPRs are unique in having two clusters of complement-type cysteine-rich (class A) repeats
or modules, with five modules in the first cluster and seven in the second cluster, unlike classical LDLRs which have a single
seven-module cluster, vertebrate VgRs and very low density lipoprotein receptors (VLDLR) which have a single eight-module
cluster, and LDLR-related proteins (LRPs) and megalins which have four clusters of 2–7, 8, 10, and 11 modules. Alignment of
clusters across subfamilies by conventional alignment programs is problematic because of the repetitive nature of the component
modules which may have undergone rearrangements, duplications, and deletions during evolution. To circumvent this problem,
we ``fingerprinted' each class A module in the different clusters by identifying those amino acids that are both relatively
conserved and relatively unique within the cluster. Intercluster reciprocal comparisons of fingerprints and aligned sequences
allowed us to distinguish four cohorts of modules reflecting shared recent ancestry. All but two of the 57 modules examined
could be assigned to one of these four cohorts designated A, B, C, and D. Alignment of clusters based on modular cohorts revealed
that all clusters are derived from a single primordial cluster of at least seven modules with a consensus arrangement of CDCADBC.
All extant clusters examined are consistent with this consensus, though none matches it perfectly. This analysis also revealed
that the eight-module clusters in vertebrate VgRs, insect VgR/YPRs, and LRP/megalins are not directly homologous with one
another. Assignment of modules to cohorts permitted us to properly align 32 class A clusters from all four LDLR subfamilies
for phylogenetic analysis. The results revealed that smaller one-cluster and two-cluster members of the family did not originate
from the breakup of a large two-cluster or four-cluster receptor. Similarly, the LRP/megalins did not arise from the duplication
of a two-cluster insect VgR/YPR-like progenitor. Rather, it appears that the multicluster receptors were independently constructed
from the same single-cluster ancestor.
Received: 16 January 1997 / Accepted: 21 August 1997 相似文献
8.
Philippe Castagnone-Sereno Hélène Leroy Jean-Philippe Semblat Frédéric Leroy Pierre Abad Carolien Zijlstra 《Journal of molecular evolution》1998,46(2):225-233
An AluI satellite DNA family has been isolated in the genome of the root-knot nematode Meloidogyne chitwoodi. This repeated sequence was shown to be present at approximately 11,400 copies per haploid genome, and represents about 3.5%
of the total genomic DNA. Nineteen monomers were cloned and sequenced. Their length ranged from 142 to 180 bp, and their A
+ T content was high (from 65.7 to 79.1%), with frequent runs of As and Ts. An unexpected heterogeneity in primary structure
was observed between monomers, and multiple alignment analysis showed that the 19 repeats could be unambiguously clustered
in six subfamilies. A consensus sequence has been deduced for each subfamily, within which the number of positions conserved
is very high, ranging from 86.7% to 98.6%. Even though blocks of conserved regions could be observed, multiple alignment of
the six consensus sequences did not enable the establishment of a general unambiguous consensus sequence. Screening of the
six consensus sequences for evidence of internal repeated subunits revealed a 6-bp motif (AAATTT), present in both direct
and inverted orientation. This motif was found up to nine times in the consensus sequences, also with the occurrence of degenerated
subrepeats. Along with the meiotic parthenogenetic mode of reproduction of this nematode, such structural features may argue
for the evolution of this satellite DNA family either (1) from a common ancestral sequence by amplification followed by mechanisms
of sequence divergence, or (2) through independent mutations of the ancestral sequence in isolated amphimictic nematode populations
and subsequent hybridization events. Overall, our results suggest the ancient origin of this satellite DNA family, and may
reflect for M. chitwoodi a phylogenetic position close to the ancestral amphimictic forms of root-knot nematodes.
Received: 23 April 1997 / Accepted: 9 July 1997 相似文献
9.
10.
11.
Phylogenetic relationships among the Japanese papilionid butterflies were analyzed by comparing 783 nucleotide sequences
of the mitochondrial gene encoding NADH dehydrogenase subunit 5 (ND5). Phylogenetic trees of the representative species from each family in the superfamily Papilionoidea revealed that the species
of the family Papilionidae and those of all other families formed distinct clusters, with a few species of the family Hesperiidae
(Hesperioidea) as an outgroup. In the phylogenetic trees of most Japanese species of the family Papilionidae with Nymphalis xanthomelas (Nymphalidae) as an outgroup, the tribe Parnassiini (Parnassiinae) formed a cluster, and the rest formed the other cluster
in which the tribe Zerynthiini (Parnassiinae) and the subfamily Papilioninae formed different subclusters. In the Papilioninae
cluster, the tribes Troidini and Graphiini formed a subcluster, and the tribe Papilionini formed the other subcluster. These
results generally agree with the traditional classification of the papilionid butterflies based on their morphological characteristics
and support the proposed evolutionary genealogy of the butterflies based on their morphology, behavior, and larval host plants,
except that the tribes Parnasiini and Zerynthiini (both Parnassiinae) are not in the same cluster.
Received: 16 March 1998 / Accepted: 28 April 1998 相似文献
12.
Colm O'hUigin Holger Sültmann Herbert Tichy Brent W. Murray 《Journal of molecular evolution》1998,47(5):578-585
We report the cDNA sequences for the DMA and DMB family of Mhc genes of the gray short-tailed opossum. Until now DM sequences were available only in eutherian mammals. The marsupial sequences indicate that both members of the family are
old and probably diverged from other classical class II families about the time of the radiation of jawed vertebrates some
450 million years ago. We examine the evolutionary rates of equivalent sets of classical and nonclassical genes to check for
rate heterogeneity. We find the α-1 domain of the DR genes to be untypically conservative in its evolutionary mode. The DM genes appear to evolve at rates typical of other class II genes, indicating that their placement at the root of class II
gene evolutionary trees may be justified.
Received: 2 March 1998 / Accepted: 2 June 1998 相似文献
13.
Short interspersed DNA elements (SINEs) amplify by retroposition either by (i) successive waves of amplification from one
or a few evolving master genes or by (ii) the generation of new master genes that coexist with their progenitors. Individual,
highly conserved, elements of the B1 SINE family were identified from the GenBank nucleotide database using various B1 subfamily
consensus query sequences to determine their integration times into the mouse genome. A comparison of orthologous loci in
various species of the genus Mus demonstrated that four subfamilies of B1 elements have been amplifying within the last 1–3 million years. Therefore, B1 sequences
are generated by coexisting source genes. Additionally, three B1 subfamilies have been concurrently propagated during subspecies
divergence and strain formation in Mus, indicating very recent activity of this retroposon family. The patterns of intra- and interspecies variations of orthologous
loci demonstrate the usefulness of B1 integrations as a phylogenetic tool. A single inconsistency in the phylogenetic trends
was depicted by the presence of a B1 insert in an orthologous locus exclusively in M. musculus and M. pahari. However, DNA sequence analysis revealed that these were independent integrations at the same genomic site. One highly conserved
B1 element that integrated at least 4–6 million years ago suggests the possibility of occasional function for B1 integrations.
Received: 25 February 2000 / Accepted: 5 June 2000 相似文献
14.
Sox genes encode proteins related to each other, and to the sex determining gene Sry, by the presence of a DNA binding motif known as the HMG domain. Although HMG domains can bind to related DNA sequences, Sox gene products may achieve target gene specificity by binding to preferred target sequences or by interacting with specific partner proteins. To assess their functional similarities, we replaced the HMG box of Sry with the HMG box of Sox3 or Sox9 and tested whether these constructs caused sex reversal in XX mice. Our results indicate that such chimeric transgenes can functionally replace Sry and elicit development of testis cords, male patterns of gene expression, and elaboration of male secondary sexual characteristics. This implies that chimeric SRY proteins with SOX HMG domains can bind to and regulate SRY target genes and that potential SRY partner factor interactions are not disrupted by HMG domain substitutions. genesis 28:111-124, 2000. 相似文献
15.
Two previously undetected domains were identified in a variety of RNA-binding proteins, particularly RNA-modifying enzymes,
using methods for sequence profile analysis. A small domain consisting of 60–65 amino acid residues was detected in the ribosomal
protein S4, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yeast protein containing
a pseudouridine synthetase and a deaminase domain, bacterial tyrosyl-tRNA synthetases, and a number of uncharacterized, small
proteins that may be involved in translation regulation. Another novel domain, designated PUA domain, after PseudoUridine
synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine
synthases, a family of predicted ATPases that may be involved in RNA modification, a family of predicted archaeal and bacterial
rRNA methylases. Additionally, the PUA domain was detected in a family of eukaryotic proteins that also contain a domain homologous
to the translation initiation factor eIF1/SUI1; these proteins may comprise a novel type of translation factors. Unexpectedly,
the PUA domain was detected also in bacterial and yeast glutamate kinases; this is compatible with the demonstrated role of
these enzymes in the regulation of the expression of other genes. We propose that the S4 domain and the PUA domain bind RNA
molecules with complex folded structures, adding to the growing collection of nucleic acid-binding domains associated with
DNA and RNA modification enzymes. The evolution of the translation machinery components containing the S4, PUA, and SUI1 domains
must have included several events of lateral gene transfer and gene loss as well as lineage-specific domain fusions.
Received: 15 May 1998 / Accepted: 20 July 1998 相似文献
16.
Getting In or Out: Early Segregation Between Importers and Exporters in the Evolution of ATP-Binding Cassette (ABC) Transporters 总被引:17,自引:0,他引:17
ATP-binding cassette (ABC) systems, also called traffic ATPases, are found in eukaryotes and prokaryotes and almost all participate
in the transport of a wide variety of molecules. ABC systems are characterized by a highly conserved ATPase module called
here the ABC module, involved in coupling transport to ATP hydrolysis. We have used the sequence of one of the first representatives
of bacterial ABC transporters, the MalK protein, to collect 250 closely related sequences from a nonredundant protein sequence
database. The sequences collected by this objective method are all known or putative ABC transporters. After having eliminated
short protein sequences and duplicates, the 197 remaining sequences were subjected to a phylogenetic analysis based on a mutational
similarity matrix. An unrooted tree for these modules was found to display two major branches, one grouping all collected
uptake systems and the other all collected export systems. This remarkable disposition strongly suggests that the divergence
between these two functionally different types of ABC systems occurred once in the history of these systems and probably before
the differentiation of prokaryotes and eukaryotes. We discuss the implications of this finding and we propose a model accounting
for the generation and the diversification of ABC systems.
Received: 23 February 1997 / Accepted: 7 April 1998 相似文献
17.
Molecular Evolution of the Myeloperoxidase Family 总被引:4,自引:0,他引:4
Animal myeloperoxidase and its relatives constitute a diverse protein family, which includes myeloperoxidase, eosinophil
peroxidase, thyroid peroxidase, salivary peroxidase, lactoperoxidase, ovoperoxidase, peroxidasin, peroxinectin, cyclooxygenase,
and others. The members of this protein family share a catalytic domain of about 500 amino acid residues in length, although
some members have distinctive mosaic structures. To investigate the evolution of the protein family, we performed a comparative
analysis of its members, using the amino acid sequences and the coordinate data available today. The results obtained in this
study are as follows: (1) 60 amino acid sequences belonging to this family were collected by database searching. We found
a new member of the myeloperoxidase family derived from a bacterium. This is the first report of a bacterial member of this
family. (2) An unrooted phylogenetic tree of the family was constructed according to the alignment. Considering the branching
pattern in the obtained phylogenetic tree, together with the mosaic features in the primary structures, 60 members of the
myeloperoxidase family were classified into 16 subfamilies. (3) We found two molecular features that distinguish cyclooxygenase
from the other members of the protein family. (4) Several structurally deviated segments were identified by a structural comparison
between cyclooxygenase and myeloperoxidase. Some of the segments seemed to be associated with the functional and/or structural
differences between the enzymes.
Received: 25 January 2000 / Accepted: 19 July 2000 相似文献
18.
Austin L. Hughes 《Immunogenetics》1999,49(2):106-114
Phylogenetic analysis of the prophenoloxidase/hexamerin family of arthropods revealed four well supported subfamilies: (1)
the arylphorin subfamily, including arylphorins, storage proteins, and other proteins of uncertain function from insects;
(2) the hemocyanins of branchiopod crustaceans, which are copper-binding proteins involved in oxygen transport; (3) the hemocyanins
of chelicerates; and (4) the prophenoloxidases (proPO) of both insects and branchiopods, which are copper-binding molecules
that play a role in sclerotization of cuticle and encapsulation of foreign particles. The phylogeny indicated that insect
and branchiopod proPO constitute a monophyletic group but that branchiopod and chelicerate hemocyanins do not constitute a
monophyletic group. Branchiopod hemocyanin and proPO diverged from each other prior to the divergence of insects from branchiopods
and probably prior to the divergence of chelicerates from the insect-branchiopod lineage. Likewise, the insect arylphorin
subfamily diverged from proPO prior to the divergence of insects from branchiopods and probably prior to the divergence of
chelicerates; thus, the results did not support the hypothesis that insect arylphorins represent hemocyanins freed to assume
a new function because the insect tracheal respiratory system removes the need for an oxygen-transport molecule. Nonetheless,
reconstruction of ancestral sequences by the maximum parsimony method suggested that the ancestors of the arylphorin family
were copper-binding. Regions corresponding to the copper-binding domains were found to have a faster rate of nonsynonymous
evolution in arylphorin subfamily genes than in other hexamerin family genes; this presumably reflects a relaxation of purifying
selection after the loss of copper-binding function.
Received: 25 March 1998 / Revised: 3 July 1998 相似文献
19.
Molecular Phylogeny of Metazoan Intermediate Filament Proteins 总被引:7,自引:0,他引:7
Andreas Erber Dieter Riemer Marc Bovenschulte Klaus Weber 《Journal of molecular evolution》1998,47(6):751-762
We have cloned cytoplasmic intermediate filament (IF) proteins from a large number of invertebrate phyla using cDNA probes,
the monoclonal antibody IFA, peptide sequence information, and various RT-PCR procedures. Novel IF protein sequences reported
here include the urochordata and nine protostomic phyla, i.e., Annelida, Brachiopoda, Chaetognatha, Echiura, Nematomorpha,
Nemertea, Platyhelminthes, Phoronida, and Sipuncula. Taken together with the wealth of data on IF proteins of vertebrates
and the results on IF proteins of Cephalochordata, Mollusca, Annelida, and Nematoda, two IF prototypes emerge. The L-type,
which includes 35 sequences from 11 protostomic phyla, shares with the nuclear lamins the long version of the coil 1b subdomain
and, in most cases, a homology segment of some 120 residues in the carboxyterminal tail domain. The S-type, which includes
all four subfamilies (types I to IV) of vertebrate IF proteins, lacks 42 residues in the coil 1b subdomain and the carboxyterminal
lamin homology segment. Since IF proteins from all three phyla of the chordates have the 42-residue deletion, this deletion
arose in a progenitor prior to the divergence of the chordates into the urochordate, cephalochordate, and vertebrate lineages,
possibly already at the origin of the deuterostomic branch. Four phyla recently placed into the protostomia on grounds of
their 18S rDNA sequences (Brachiopoda, Nemertea, Phoronida, and Platyhelminthes) show IF proteins of the L-type and fit by
sequence identity criteria into the lophotrochozoic branch of the protostomia.
Received: 2 April 1998 / Accepted: 19 June 1998 相似文献
20.
The secondary structure of rRNA internal transcribed spacer 2 is important in the process of ribosomal biogenesis. Trematode
ITS sequences are poorly conserved and difficult to align for phylogenetic comparisons above a family level. If a conserved
secondary structure can be identified, it can be used to guide primary sequence alignments. ITS2 sequences from 39 species
were compared. These species span four orders of trematodes (Echinostomiformes, Plagiorchiformes, Strigeiformes, and Paramphistomiformes)
and one monogenean (Gyrodactyliformes). The sequences vary in length from 251 to 431 bases, with an average GC content of
48%. The monogenean sequence could not be aligned with confidence to the trematodes. Above the family level trematode sequences
were alignable from the 5′ end for 139 bases. Secondary structure foldings predicted a four-domain model. Three folding patterns
were required for the apex of domain B. The folding pattern of domains C and D varies for each family. The structures display
a high GC content within stems. Bases A and U are favored in unpaired regions and variable sites cluster. This produces a
mosaic of conserved and variable regions with a structural conformation resistant to change. Two conserved strings were identified,
one in domain B and the other in domain C. The first site can be aligned to a processing site identified in yeast and rat.
The second site has been found in plants, and structural location appears to be important. A phylogenetic tree of the trematode
sequences, aligned with the aid of secondary structures, distinguishes the four recognized orders.
Received: 21 November 1997 / Accepted: 9 February 1998 相似文献