共查询到20条相似文献,搜索用时 15 毫秒
1.
Costas J 《Journal of molecular evolution》2001,53(3):237-243
Several distinct families of endogenous retroviruses exist in the genomes of primates. Most of them are remnants of ancient
germ-line infections. The human endogenous retrovirus family HERV-K represents the unique known case of endogenous retrovirus
that amplified in the human genome after the divergence of human and chimpanzee lineages. There are two types of HERV-K proviral
genomes differing by the presence or absence of 292 bp in the pol-env boundary. Human-specific insertions exist for both types. The analyses shown in the present work reveal that several lineages
of type 1 and type 2 HERV-K proviruses remained transpositionally active after the human/chimpanzee split. The data also reflect
the important role of mosaic evolution (either by recombination or gene conversion) during the evolutionary history of HERV-K.
Received: 5 February 2001 / Accepted: 22 March 2001 相似文献
2.
Ivan Laprevotte Sophie Brouillet Christophe Terzian Alain Hénaut 《Journal of molecular evolution》1997,44(2):214-225
A computer-assisted analysis was made of 24 complete nucleotide sequences selected from the vertebrate retroviruses to represent
the ten viral groups. The conclusions of this analysis extend and strengthen the previously made hypothesis on the Moloney
murine leukemia virus: The evolution of the nucleotide sequence appears to have occurred mainly through at least three overlapping
levels of duplication: (1) The distributions of overrepresented (3–6)-mers are consistent with the universal rule of a trend
toward TG/CT excess and with the persistence of a certain degree of symmetry between the two strands of DNA. This suggests
one or several original tandemly repeated sequences and some inverted duplications. (2) The existence of two general core
consensuses at the level of these (3–6)-mers supports the hypothesis of a common evolutionary origin of vertebrate retroviruses.
Consensuses more specific to certain sequences are compatible with phylogenetic trees established independently. The consensuses
could correspond to intermediary evolutionary stages. (3) Most of the (3–6)-mers with a significantly higher than average
frequency appear to be internally repeated (with monomeric or oligomeric internal iterations) and seem to be at least partly
the cause of the bias observed by other researchers at the level of retroviral nucleotide composition. They suggest a third
evolutionary stage by slippage-like stepwise local duplications.
Received: 3 January 1996 / Accepted: 27 March 1996 相似文献
3.
Nucleotide Composition Bias Affects Amino Acid Content in Proteins Coded by Animal Mitochondria 总被引:16,自引:0,他引:16
We show that in animal mitochondria homologous genes that differ in guanine plus cytosine (G + C) content code for proteins
differing in amino acid content in a manner that relates to the G + C content of the codons. DNA sequences were analyzed using
square plots, a new method that combines graphical visualization and statistical analysis of compositional differences in
both DNA and protein. Square plots divide codons into four groups based on first and second position A + T (adenine plus thymine)
and G + C content and indicate differences in amino acid content when comparing sequences that differ in G + C content. When
sequences are compared using these plots, the amino acid content is shown to correlate with the nucleotide bias of the genes.
This amino acid effect is shown in all protein-coding genes in the mitochondrial genome, including cox I, cox II, and cyt b, mitochondrial genes which are commonly used for phylogenetic studies. Furthermore, nucleotide content differences are shown
to affect the content of all amino acids with A + T- and G + C-rich codons. We speculate that phylogenetic analysis of genes
so affected may tend erroneously to indicate relatedness (or lack thereof) based only on amino acid content.
Received: 3 July 1996 / Accepted: 6 November 1996 相似文献
4.
When divergence between viral species is large, the analysis and comparison of nucleotide or protein sequences are dependent
on mutation biases and multiple substitutions per site leading, among other things, to the underestimation of branch lengths
in phylogenetic trees. To avoid the problem of multiply substituted sites, a method not directly based on the nucleic or protein
sequences has been applied to retroviruses. It consisted of asking questions about genome structure or organization, and gene
function, the series of answers creating coded sequences analyzed by phylogenic software. This method recovered the principal
retroviral groups such as the lentiviruses and spumaviruses and highlighted questions and answers characteristic of each group
of retroviruses. In general, there was reasonable concordance between the coded genome methodology and that based on conventional
phylogeny of the integrase protein sequence, indicating that integrase was fixing mutations slowly enough to marginalize the
problem of multiple substitutions at sites. To a first approximation, this suggests that the acquisition of novel genetic
features generally parallels the fixation of amino acid substitutions.
Received: 18 May 2001 / Accepted: 7 September 2001 相似文献
5.
Adam Eyre-Walker 《Journal of molecular evolution》1998,47(6):686-690
Parsimony is commonly used to infer the direction of substitution and mutation. However, it is known that parsimony is biased
when the base composition of the DNA sequence is skewed. Here I quantify this effect for several simple cases. The analysis
demonstrates that parsimony can be misleading even when levels of sequence divergence are as low as 10%; parsimony incorrectly
infers an excess of common to rare changes. Caution must therefore be excercised in the use of parsimony.
Received: 13 November 1997 / Accepted: 18 June 1998 相似文献
6.
Mayer J Stuhr T Reus K Maldener E Kitova M Asmus F Meese E 《Journal of molecular evolution》2005,61(5):706-715
We and others recently identified an almost-intact human endogenous retrovirus (HERV), termed HERV-K(HML-2.HOM), that is usually
organized as a tandem provirus. Studies on HERV proviral loci commonly rely on the analysis of single alleles being taken
as representative for a locus. We investigated the frequency of HERV-K(HML-2.HOM) single and tandem alleles in various human
populations. Our analysis revealed that another HERV-K(HML-2) locus, the so-called HERV-K(II) provirus, is also present as
a tandem provirus allele in the human population. Proviral tandem formations were identified in various nonhuman primate species.
We furthermore examined single nucleotide polymorphisms (SNPs) within the HERV-K(HML-2.HOM) proviral gag, prt, and pol genes, which all result in nonsense mutations. We identified four proviral haplotypes displaying different combinations of
gag, prt, and pol SNPs. Haplotypes harboring completely intact proviral genes were not found. For the left provirus of the tandem arrangement
a haplotype displaying intact gag and prt genes and a mutated pol was found in about two-thirds of individuals from different ethnogeographic origins. The same haplotype was always found
in the right provirus. The various haplotypes point toward multiple recombination events between HERV-K(HML-2.HOM) proviruses.
Based on these findings we derive a model for the evolution of the proviral locus since germ line integration.
[Reviewing Editor : Dr. Martin Kreitman] 相似文献
7.
Brian R. Morton Virginia M. Oberholzer Michael T. Clegg 《Journal of molecular evolution》1997,45(3):227-231
Substitutions occurring in noncoding sequences of the plant chloroplast genome violate the independence of sites that is
assumed by substitution models in molecular evolution. The probability that a substitution at a site is a transversion, as
opposed to a transition, increases significantly with increasing A + T content of the two adjacent nucleotides. In the present
study, this dependency of substitutions on local context is examined further in a number of noncoding regions from the chloroplast
genome of members of the grass family (Poaceae). Two features were examined; the influence of specific neighboring bases,
as opposed to the general A + T content, on transversion proportion and an influence on substitutions by nucleotides other
than the two immediately adjacent to the site of substitution. In both cases, a significant effect was found. In the case
of specific nucleotides, transversion proportion is significantly higher at sites with a pyrimidine immediately 5′ on either
strand. Substitutions at sites of the type YNR, where N is the site of substitution, have the highest rate of transversion.
This specific effect is secondary to the A + T content effect such that, in terms of proportion of substitutions that are
transversions, the nucleotides are ranked T > A > C > G as to their effect when they are immediately 5′ to the site of substitution.
In the case of nucleotides other than the immediate neighbors, a significant influence on substitution dynamics is observed
in the case where the two neighboring bases are both A and/or T. Thus, substitutions are primarily, but not exclusively, influenced
by the composition of the two nucleotides that are immediately adjacent. These results indicate that the pattern of molecular
evolution of the plant chloroplast genome is extremely complex as a result of a variety of inter-site dependencies.
Received: 18 October 1996 / Accepted: 12 April 1997 相似文献
8.
Along the gene, nucleotides in various codon positions tend to exert a slight but observable influence on the nucleotide
choice at neighboring positions. Such context biases are different in different organisms and can be used as genomic signatures.
In this paper, we will focus specifically on the dinucleotide composed of a third codon position nucleotide and its succeeding
first position nucleotide. Using the 16 possible dinucleotide combinations, we calculate how well individual genes conform
to the observed mean dinucleotide frequencies of an entire genome, forming a distance measure for each gene. It is found that
genes from different genomes can be separated with a high degree of accuracy, according to these distance values.
In particular, we address the problem of recent horizontal gene transfer, and how imported genes may be evaluated by their
poor assimilation to the host's context biases. By concentrating on the third- and succeeding first position nucleotides,
we eliminate most spurious contributions from codon usage and amino-acid requirements, focusing mainly on mutational effects.
Since imported genes are expected to converge only gradually to genomic signatures, it is possible to question whether a gene
present in only one of two closely related organisms has been imported into one organism or deleted in the other. Striking
correlations between the proposed distance measure and poor homology are observed when Escherichia coli genes are compared to Salmonella typhi, indicating that sets of outlier genes in E. coli may contain a high number of genes that have been imported into E. coli, and not deleted in S. typhi.
Received: 16 January 2001 / Accepted: 30 August 2001 相似文献
9.
The idea that the pattern of point mutation in Drosophila has remained constant during the evolution of the genus has recently been challenged. A study of the nucleotide composition
focused on the Drosophila
saltans group has evidenced unsuspected nucleotide composition differences among lineages. Compositional differences are associated
with an accelerated rate of amino acid replacement in functionally less constrained regions. Here we reassess this issue from
a different perspective. Adopting a maximum-likelihood estimation approach, we focus on the different predictions that mutation
and selection make about the nonsynonymous-to-synonymous rate ratio. We investigate two gene regions, alcohol dehydrogenase
(Adh) and xanthine dehydrogenase (Xdh), using a balanced data set that comprises representatives from the melangaster, obscura, saltans, and willistoni groups. We also consider representatives of the Hawaiian picture-winged group. These Hawaiian species are known to have experienced
repeated bottlenecks and are included as a reference for comparison. Our results confirm patterns previously detected. The
branch ancestral to the fast-evolving willistoni/saltans lineage, where most of the change in GC content has occurred, exhibits an excess of synonymous substitutions. The shift in
mutation bias has affected the extent of the rate variation among sites in Xdh.
Received: 4 May 1999 / Accepted: 26 July 1999 相似文献
10.
Characteristics of Nucleotide Substitution in the Hepatitis C Virus Genome: Constraints on Sequence Change in Coding Regions at Both Ends of the Genome 总被引:19,自引:0,他引:19
Comparison of complete genome sequences for different variants of hepatitis C virus (HCV) reveals several different constraints
on sequence change. Synonymous changes are suppressed in coding regions at both 5′ and 3′ ends of the genome. No evidence
was found for the existence of alternative reading frames or for a lower mutation frequency in these regions. Instead, suppression
may be due to constraints imposed by RNA secondary structures identified within the core and NS5b genes. Nonsynonymous substitutions
are less frequent than synonymous ones except in the hypervariable region of E2 and, to a lesser extent, in E1, NS2, and NS5b.
Transitions are more frequent than transversions, particularly at the third position of codons where the bias is 16:1. In
addition, nucleotide substitutions may not occur symmetrically since there is a bias toward G or C at the third position of
codons, while T ↔ C transitions were twice as frequent as A ↔ G transitions. These different biases do not affect the phylogenetic
analysis of HCV variants but need to be taken into account in interpreting sequence change in longitudinal studies.
Received: 9 September 1996 / Accepted: 20 April 1997 相似文献
11.
Sixteen human endogenous retrovirus (HERV) sequences were detected within 656 kb of genomic sequence obtained from the alpha-
and beta-block of the class I region of the major histocompatibility complex (MHC). The HERVs were identified and characterized as family members of HERV-16
(11 copies), HERV-L (1 copy), HERV-I (2 copies), HERV-K91 (1 copy), and HARLEQUIN (1 copy) by sequence comparison using CENSOR
or Repeat Masker, BLAST searches, and dot plots. The 11 copies of HERV-16 arose as products of duplication of genomic segments
containing HLA class I (HLAcI) and PERB11 (MIC) genes inter alia, whereas the other five HERVs arose after duplication probably as a consequence of single insertion events or translocations.
HERV-L and HERV-I are located between the duplicated genes PERB11.2 (MICB) and PERB11.1 (MICA), and HLA-B and HLA-C, respectively, whereas HERV-K91 and HARLEQUIN are located telomeric of HLA-C. A highly fragmented copy of HERV-I was also found telomeric of PERB11.4. Structural analysis of open reading frames (ORFs) revealed the absence of intact coding sequence within the putative gag, pol, and env gene regions of all the HERVs with the exception of HERV-K91, which had two large ORFs within the region of the putative
protease and pol genes. In addition, the 5′-LTR of HERV-L contained a 2.5-kb element that was AT-rich and large ORFs with putative amino acid
sequences rich in tyrosines and isoleucines. HERV-I, HARLEQUIN, and at least four copies of HERV-16 appear to have been receptors
for the insertion of other retrotransposons including Alu elements and fragments of L1 and THE1. Examination of flanking sequences
suggests that HERV-I and HERV-L had occurred by insertion into ancient L1 fragments. This study has revealed that the alpha-
and beta-block region within the MHC is rich in HERV sequences occurring at a much higher ratio (10 to 1) than normally observed
in the human genome. These HERV sequences will therefore enhance further studies on disease associations and differences between
human haplotypes and primates and their role in the evolution of class I genes in the MHC.
Received: 17 September 1998 / Accepted: 8 January 1999 相似文献
12.
J. Robert Macey Allan Larson Natalia B. Ananjeva Theodore J. Papenfuss 《Journal of molecular evolution》1997,44(6):660-674
A phylogenetic tree for major lineages of iguanian lizards is estimated from 1,488 aligned base positions (858 informative)
of newly reported mitochondrial DNA sequences representing coding regions for eight tRNAs, ND2, and portions of ND1 and COI.
Two well-supported groups are defined, the Acrodonta and the Iguanidae (sensu lato). This phylogenetic hypothesis is used
to investigate evolutionary shifts in mitochondrial gene order, origin for light-strand replication, and secondary structure
of tRNACys. These three characters shift together on the branch leading to acrodont lizards. Plate tectonics and the fossil record indicate
that these characters changed in the Jurassic. We propose that changes to the secondary structure of tRNACys may destroy function of the origin for light-strand replication which, in turn, may facilitate shifts in gene order.
Received: 28 May 1996 / Accepted: 27 December 1996 相似文献
13.
Thomas D. Kim Seong-Eun Cho Chul-Hak Yang Jongsun Kim 《Journal of molecular evolution》2001,53(1):1-9
Fcγ receptor III (FcγRIII), a low-affinity receptor for the Fc portion of immunoglobulin G (IgG Fc), targets antigen-antibody
complexes in a variety of effector cells of the immune system. We have investigated FcγRIII and IgG Fc polymorphism and made
comparative analysis of the functional and evolutionary implications of the interaction between these two molecules. Sequence
analysis and comparison of the three-dimensional structure suggest that the C-terminal Ig domain of FcγRIII is associated
with the binding of IgG. The polymorphic residues of FcγRIII are mainly located in the region of the C-terminal Ig domain
that might be involved in IgG binding. Therefore, polymorphism and functional binding affinity seems to be related to each
other as has been increasingly implicated in clinical observations. IgG Fcs, the natural ligand of FcγRs, also exhibit significant
polymorphism. Three regions have been identified where polymorphism frequently occurs: the putative FcR binding site, the
linker region, and the intermolecular domain-domain interface of the second Ig domain. The putative FcγR binding sites where
polymorphic, and isotype-specific residues cluster are consistent with the regions that have been identified by mutagenesis
and molecular modeling studies. The polymorphic residues of IgG Fc were mainly located in the molecular surface, which could
be used in the recognition of other binding molecules. These observations suggest that polymorphic and isotype-specific residues
in IgG Fc are closely related to their function and protein-protein interaction. Therefore, the colocalization of the polymorphic
residues of FcγRIII and IgG Fcs at their docking sites implies that the polymorphic residues would affect the IgG-FcγRIII
binding interactions to optimize their signaling through evolution.
Received: 9 December 1999 / Accepted: 15 February 2001 相似文献
14.
In this paper we have analyzed 49 vertebrate gene families that were generated in the early stage of vertebrates and/or shortly
before the origin of vertebrates, each of which consists of three or four member genes. We have dated the first (T1) and second (T2) gene duplications of 26 gene families with 3 member genes. The means of T1 (594 mya) and T2 (488 mya) are largely consistent to a well-cited version of two-round (2R) genome duplication theory. Moreover, in most cases,
the time interval between two successive gene duplications is large enough that the fate of duplicate genes generated by the
first gene duplication was likely to be determined before the second one took place. However, the phylogenetic pattern of
23 gene families with 4 members is complicated; only 5 of them are predicted by 2R model, but 11 families require an additional
gene (or genome) duplication. For the rest (7 families), at least one gene duplication event had occurred before the divergence
between vertebrate and Drosophila, indicating a possible misleading of the 4:1 rule (member gene ratio between vertebrates and invertebrates). Our results show
that Ohno's 2R conjecture is valid as a working hypothesis for providing a most parsimonious explanation. Although for some
gene families, additional gene duplication is needed, the credibility of the third genome duplication (3R) remains to be investigated.
Received: 13 December 1999 / Accepted: 7 April 2000 相似文献
15.
Plasma membrane anion channels are thought to play important roles in osmoregulation and signal transduction in higher plant
cells. Knowledge of their pharmacology and regulation is of importance to unravel their physiological functions. In this study,
we explore the pharmacological properties and the nucleotide regulation of the voltage-dependent anion channel of Arabidopsis hypocotyls. The pharmacological profile of this channel is characterized by a low sensitivity to most anion channel blockers.
It is inhibited by niflumic acid with an IC50 of 80 μm, but poorly sensitive to IAA-94 and NPPB and insensitive to 9-AC and DIDS. Nucleotides alter the amplitude, the kinetics
and the voltage-dependence of the channel. The main effect of nucleotides is a shift of the voltage-dependent gate of the
channel toward depolarized potentials leading to a strong reduction of the current amplitude. This regulation does not require
ATP hydrolysis as nonhydrolyzable ATP analogues—AMPPNP and ATPγS—also regulate the anion current. This suggests that a nucleotide
binding site is involved in the regulation. The study of the properties of this putative nucleotide binding site reveals that
(i) ATP regulates the channel with an EC50 of 0.7 mm, (ii) adenyl nucleotides modulate the channel with the following order of effectiveness: ATP > ADP ≫ AMP, and (iii) thiophosphate
nucleotide analogues are the most potent agonists with EC50 in the range of 80 μm. The hypothesis that this regulation may couple the electrical properties of the membrane with the metabolic status of the
cell is discussed.
Received: 26 December 1996/Revised: 21 March 1997 相似文献
16.
17.
We inferred the incidence of nucleotide conversions in the COI and 16S rRNA mitochondrial genes of members of the Symphyta
and basal Apocrita (Hymenoptera). Character-state reconstructions in both genes suggested that conversions between A and T
(AT transversions) occurred much more frequently than any other type of change, although we cannot wholly discount an underlying
transition bias. Parsimony analysis of COI nucleotide characters did not recover phylogeny; e.g., neither the Tenthredinoidea
nor Apocrita were recovered as monophyletic. However, analysis of COI amino acid characters did recover these relationships,
as well as others based on fossil and morphological evidence. Analysis of 16S rRNA characters also recovered these relationships
providing conversions between A and T were down-weighted. Analysis of the combined data sets gave relatively strong support
for various relationships, suggesting that both data sets supported similar topographies. These data sets, both separately
and combined, suggested that the phytophagous Siricidae were more closely related to the predominantly parasitic Apocrita
than were the ectoparasitic Orussoidea. This suggests that the wasp parasitic lifestyle did not have a single origin, unless
the Siricidae have more recently reverted to phytophagy. Alternatively, parasitism evolved twice independently, once in the
Orussoidea and again in the Apocrita. The latter scenario is supported by the observation that the evolution of parasitism
was accompanied by a tendency for the larvae to develop inside plant tissues. Adaptations that accompanied the movement of
wasps into a confined, wood-boring habitat may have preadapted them to becoming ectoparasitic.
Received: 27 March 1996 / Accepted: 2 August 1996 相似文献
18.
A model of nucleotide substitution that allows the transition/transversion rate bias to vary across sites was constructed.
We examined the fit of this model using likelihood-ratio tests by analyzing 13 protein coding genes and 1 pseudogene. Likelihood-ratio
testing indicated that a model that allows variation in the transition/transversion rate bias across sites provided a significant
improvement in fit for most protein coding genes but not for the pseudogene. When the analysis was repeated with parameters
estimated separately for first, second, and third codon positions, strong heterogeneity was uncovered for the first and second
codon positions; the variation in the transition/transversion rate was generally weaker at the third codon position. The transition
rate bias and branch lengths are underestimated when variation in the transition/transversion rate was not accommodated, suggesting
that it may be important to accommodate variation in the pattern of nucleotide substitution for accurate estimation of evolutionary
parameters.
Received: 4 November 1997 / Accepted: 19 May 1998 相似文献
19.
Berg OG 《Journal of molecular evolution》1999,48(4):398-407
The synonymous divergence between Escherichia coli and Salmonella typhimurium is explained in a model where there is a large variation between mutation rates at different nucleotide sites in the genome.
The model is based on the experimental observation that spontaneous mutation rates can vary over several orders of magnitude
at different sites in a gene. Such site-specific variation must be taken into account when studying synonymous divergence
and will result in an apparent saturation below the level expected from an assumption of uniform rates. Recently, it has been
suggested that codon preference in enterobacteria has a very large site-specific variation and that the synonymous divergence
between different species, e.g., E. coli and Salmonella, is saturated. In the present communication it is shown that when site-specific variation in mutation rates is introduced,
there is no need to invoke assumptions of saturation and a large variability in codon preference. The same rate variation
will also bring average mutation rates as estimated from synonymous sequence divergence into numerical agreement with experimental
values.
Received: 10 July 1998 / Accepted: 20 August 1998 相似文献