首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Microstructural changes such as insertions and deletions (=indels) are a major driving force in the evolution of non-coding DNA sequences. To better understand the mechanisms by which indel mutations arise, as well as the molecular evolution of non-coding regions, the number and pattern of indels and nucleotide substitutions were compared in the whole chloroplast genomes. Comparisons were made for a total of over 38 kb non-coding DNA sequences from 126 intergenic regions in two data sets representing species with different divergence times: sugarcane and maize and Oryza sativa var. indica and japonica. The main findings of this study are: (i) Approximately half of all indels are single nucleotide indels. This observation agrees with previous studies in various organisms. (ii) The distribution and number of indels was different between two data sets, and different patterns were observed for tandem repeat and non-repeat indels. (iii) Distribution pattern of tandem repeat indels showed statistically significant bias towards A/T-rich. (iv) The rate of indel mutation was estimated to be approximately 0.8 +/- 0.04 x 10(-9) per site per year, which was similar to previous estimates in other organisms. (v) The frequencies of nucleotide substitutions and indels were significantly lower in inverted repeat (IR).  相似文献   

2.
3.
During somatic hypermutation (SHM), deamination of cytidine by activation-induced cytidine deaminase and subsequent DNA repair generates mutations within immunoglobulin V-regions. Nucleotide insertions and deletions (indels) have recently been shown to be critical for the evolution of antibody binding. Affinity maturation of 53 antibodies using in vitro SHM in a non-B cell context was compared with mutation patterns observed for SHM in vivo. The origin and frequency of indels seen during in vitro maturation were similar to that in vivo. Indels are localized to CDRs, and secondary mutations within insertions further optimize antigen binding. Structural determination of an antibody matured in vitro and comparison with human-derived antibodies containing insertions reveal conserved patterns of antibody maturation. These findings indicate that activation-induced cytidine deaminase acting on V-region sequences is sufficient to initiate authentic formation of indels in vitro and in vivo and that point mutations, indel formation, and clonal selection form a robust tripartite system for antibody evolution.  相似文献   

4.
Molecular weight of heavy chains of immunoglobulin G typical of cancer is studied immunoglobulin and may be responsible for manifestation of certain anomalous acid and peptide composition of this protein heavy chains as compared with immunoglobulin G in blood serum of healthy people. Immunochemical methods helped detecting an antigenic determinant (or determinants) which is arranged in the heavy chains of the studied immunoglobulin and may be responsible for manifestation of certain anomalous properties of cancer-typical immunoglobulin G molecules. A set of bromo-cyanogenic fragments differing from the spectrum of these fragments in the heavy chains of normal immunoglobulin G is formed following a specific chemical effect of bromo-cyanogen on the heavy chains of immunoglobulin G typical of cancer. Essential differences are found in dancyl-fingerprints of the heavy chains of the compared proteins. Everything mentioned is a result of changes in the primary structure of the heavy chains of immunoglobulin G typical of cancer.  相似文献   

5.
Little is known about variation of nucleotide insertion/deletions (indels) within species. In Arabidopsis thaliana, we investigated indel polymorphism patterns between two genome sequences and among 96 accessions at 1215 loci. Our study identified patterns in the variation of indel density, size, GC content and distribution, and a correlation between indels and substitutions. We found that the GC content in indel sequences was lower than that in non-indel sequences and that indels typically occur in regions with lower GC content. Patterns of indel frequency distribution among populations were more consistent with neutral expectation than substitution patterns. We also found that the local level of substitutions is positively correlated with indel density and negatively correlated with their distance to the closed indel, suggesting that indels play an important role in nucleotide variation.  相似文献   

6.
It has become clear that a large proportion of functional DNA in the human genome does not code for protein. Identification of this non-coding functional sequence using comparative approaches is proving difficult and has previously been thought to require deep sequencing of multiple vertebrates. Here we introduce a new model and comparative method that, instead of nucleotide substitutions, uses the evolutionary imprint of insertions and deletions (indels) to infer the past consequences of selection. The model predicts the distribution of indels under neutrality, and shows an excellent fit to human–mouse ancestral repeat data. Across the genome, many unusually long ungapped regions are detected that are unaccounted for by the neutral model, and which we predict to be highly enriched in functional DNA that has been subject to purifying selection with respect to indels. We use the model to determine the proportion under indel-purifying selection to be between 2.56% and 3.25% of human euchromatin. Since annotated protein-coding genes comprise only 1.2% of euchromatin, these results lend further weight to the proposition that more than half the functional complement of the human genome is non-protein-coding. The method is surprisingly powerful at identifying selected sequence using only two or three mammalian genomes. Applying the method to the human, mouse, and dog genomes, we identify 90 Mb of human sequence under indel-purifying selection, at a predicted 10% false-discovery rate and 75% sensitivity. As expected, most of the identified sequence represents unannotated material, while the recovered proportions of known protein-coding and microRNA genes closely match the predicted sensitivity of the method. The method's high sensitivity to functional sequence such as microRNAs suggest that as yet unannotated microRNA genes are enriched among the sequences identified. Futhermore, its independence of substitutions allowed us to identify sequence that has been subject to heterogeneous selection, that is, sequence subject to both positive selection with respect to substitutions and purifying selection with respect to indels. The ability to identify elements under heterogeneous selection enables, for the first time, the genome-wide investigation of positive selection on functional elements other than protein-coding genes.  相似文献   

7.
The principal sources of genetic variation that can be assayed with restriction enzymes are base substitutions and insertions/deletions (indels). The likelihood of detecting indels as restriction fragment length polymorphisms (RFLPs) is determined by the size and frequency of the indels, and the ability to resolve small indels as RFLPs is limited by the distribution of restriction fragment sizes. In this study, we use aligned sequences from the indica and japonica subspecies of rice ( Oryza sativa L.) to quantify and compare the ability of restriction enzymes to detect indels. We look specifically at two abundant transposable element-derived indel sources: miniature inverted repeat transposable elements (MITEs) and long terminal repeat (LTR) retroelements. From this analysis we conclude that indels rather than base substitutions are the prevailing source of the polymorphism detected in rice. We show that, although MITE derived indels are more abundant than LTR-retroelement derived indels, LTR-retroelements have a greater capacity to generate visible restriction fragment length polymorphism because of their larger size. We find that the variation in the detectability of indels among restriction enzymes can be explained by differences in the frequency and dispersion of their restriction sites in the genome. The parameters that describe the fragment size distributions obtained with the restriction enzymes are highly correlated across the sequenced genomes of rice, Arabidopsis and human, with the exception of some extreme deviations in frequency for particular recognition sequences corresponding to variations in the levels and modes of DNA methylation in the three disparate organisms. Thus, we can predict the relative ability of a restriction enzyme to detect indels derived from a specific source based on the distribution of restriction fragment sizes, even when this is estimated for a distantly related genome.Electronic Supplementary Material Supplementary Material is available in the online version of this article at Communicated by M.-A. Grandbastien  相似文献   

8.
Traditional sequence comparison by alignment employs a mutation model comprised of two events, substitutions and indels (insertions or deletions) of single positions. However, modern genetic analysis knows a variety of more complex mutation events (e.g., duplications, excisions, and rearrangements), especially regarding DNA. With ever more DNA sequence data becoming available, the need to accurately compare sequences which have clearly undergone more complicated types of mutational processes is becoming critical. Herein we introduce a new method for pairwise alignment and comparison of sequences with respect to the special evolution of tandem repeats: substitutions and indels of single positions and, additionally, duplications and excisions of variable degree (i.e., of one or more repeat copies simultaneously) are taken into account. To evaluate our method, we apply it to the spa VNTR (variable number of tandem repeats) cluster of Staphylococcus aureus, a bacterium of high medical importance  相似文献   

9.
Opinions split when it comes to the significance and thus the weighting of indel characters as phylogenetic markers. This paper attempts to test the phylogenetic information content of indels and nucleotide substitutions by proposing an a priori weighting system of non-protein-coding genes. Theoretically, the system rests on a weighting scheme which is based on a falsificationist approach to cladistic inference. It provides insertions, deletions and nucleotide substitutions weights according to their specific number of identical classes of potential falsifiers, resulting in the following system: nucleotide substitutions weight = 3, deletions of n nucleotides weight = (2n–1), and insertions of n nucleotides weight = (5n–1). This weighting system and the utility of indels as phylogenetic markers are tested against a suitable data set of 18S rDNA sequences of Diptera and Strepsiptera taxa together with other Metazoa species. The indels support the same clades as the nucleotide substitution data, and the application of the weighting system increases the corresponding consistency indices of the differentially weighted character types. As a consequence, applying the weighting system seems to be reasonable, and indels appear to be good phylogenetic markers.  相似文献   

10.
Previous crystallographic studies in this laboratory demonstrated that immunoglobulin light chains with the same amino acid sequence can have at least two and probably three or more conformations, depending on whether the second member of an interacting pair is a light or heavy chain. If a heavy chain is not available in the assembly medium, a second light chain plays the structural role of the heavy chain in the formation of a dimer. In the present work, the lambda-type light chains were dissociated from the heavy chains of a serum IgG1 immunoglobulin from the patient Mcg and reassembled noncovalently into a dimer. The reassembly process was completed by allowing the penultimate half-cystine residues to form an interchain disulfide bond. The covalently linked dimer was compared with the Mcg urinary Bence-Jones dimer, for which an atomic model has been fitted to a 2.3-A electron density map. The assembled dimer and the native Bence-Jones protein were indistinguishable in their chromatographic and electrophoretic properties, as well as in their activity in the binding of bis(dinitrophenyl)lysine. These results indicate that the light chains can be converted into the two types of Bence-Jones conformational isomers. The procedure was also reversed: the two Bence-Jones isomers were dissociated and reassembled as the single type of isomer associating with each of two heavy chains in the IgG1 protein. The change in activity occurring when a light chain associates with a heavy chain instead of a second light chain is illustrated by the fact that the Mcg IgG1 immunoglobulin does not bind dis(dinitrophenyl)lysine in measurable amounts.  相似文献   

11.
Zhang Z  Wang Y  Wang L  Gao P 《PloS one》2010,5(12):e14316

Background

In the process of protein evolution, sequence variations within protein families can cause changes in protein structures and functions. However, structures tend to be more conserved than sequences and functions. This leads to an intriguing question: what is the evolutionary mechanism by which sequence variations produce structural changes? To investigate this question, we focused on the most common types of sequence variations: amino acid substitutions and insertions/deletions (indels). Here their combined effects on protein structure evolution within protein families are studied.

Results

Sequence-structure correlation analysis on 75 homologous structure families (from SCOP) that contain 20 or more non-redundant structures shows that in most of these families there is, statistically, a bilinear correlation between the amount of substitutions and indels versus the degree of structure variations. Bilinear regression of percent sequence non-identity (PNI) and standardized number of gaps (SNG) versus RMSD was performed. The coefficients from the regression analysis could be used to estimate the structure changes caused by each unit of substitution (structural substitution sensitivity, SSS) and by each unit of indel (structural indel sensitivity, SIDS). An analysis on 52 families with high bilinear fitting multiple correlation coefficients and statistically significant regression coefficients showed that SSS is mainly constrained by disulfide bonds, which almost have no effects on SIDS.

Conclusions

Structural changes in homologous protein families could be rationally explained by a bilinear model combining amino acid substitutions and indels. These results may further improve our understanding of the evolutionary mechanisms of protein structures.  相似文献   

12.
The genome-sequencing gold rush has facilitated the use of comparative genomics to uncover patterns of genome evolution, although their causal mechanisms remain elusive. One such trend, ubiquitous to prokarya and eukarya, is the association of insertion/deletion mutations (indels) with increases in the nucleotide substitution rate extending over hundreds of base pairs. The prevailing hypothesis is that indels are themselves mutagenic agents. Here, we employ population genomics data from Escherichia coli, Saccharomyces paradoxus, and Drosophila to provide evidence suggesting that it is not the indels per se but the sequence in which indels occur that causes the accumulation of nucleotide substitutions. We found that about two-thirds of indels are closely associated with repeat sequences and that repeat sequence abundance could be used to identify regions of elevated sequence diversity, independently of indels. Moreover, the mutational signature of indel-proximal nucleotide substitutions matches that of error-prone DNA polymerases. We propose that repeat sequences promote an increased probability of replication fork arrest, causing the persistent recruitment of error-prone DNA polymerases to specific sequence regions over evolutionary time scales. Experimental measures of the mutation rates of engineered DNA sequences and analyses of experimentally obtained collections of spontaneous mutations provide molecular evidence supporting our hypothesis. This study uncovers a new role for repeat sequences in genome evolution and provides an explanation of how fine-scale sequence contextual effects influence mutation rates and thereby evolution.  相似文献   

13.
Nucleotide substitutions, insertions, and deletions constitute the principal molecular mechanisms generating genetic variation on small length scales. In contrast to substitutions, the nature of short DNA insertions and deletions (indels) is far less understood. With the recent availability of whole-genome multiple alignments between human and other primates, detailed investigations on indel characteristics and origin have come within reach. Here, we show that the majority of short (1-100 bp) DNA insertions in the human lineage are tandem duplications of directly adjacent sequence segments with conserved polarity. Indels in microsatellites comprise only a small fraction. The underlying molecular processes generating indels do not necessarily rely on the presence of preexisting duplicates, as would be expected for unequal crossing over, as well as replication slippage. Instead, our findings point toward a mechanism that preferentially occurs in the male germline and is not recombination-mediated. Surprisingly, nonframeshifting tandem duplications and deletions in coding regions still occur at approximately 50% of their genomic background rates. As is already well established in the context of gene and segmental duplications, our results demonstrate that duplications are also likely to constitute the predominant process for rapid generation of new genetic material and function on smaller scales.  相似文献   

14.
To assess the impact of various heavy and light chain mutations on p-azophenylarsonate binding, murine antibodies have been produced in insect cells (SF9) utilizing a baculovirus expression system. When expressed in this system, an antibody composed of a canonical CRIA+ heavy and light chain can bind antigen and express idiotype indistinguishably from analogous hybridoma-derived antibodies. Antibodies comprised of either light chains mutant at the V-J junction or heavy chains mutant at the V-D junction were found to be incapable of binding arsonate. In addition, substitutions in the first and second complementarity determining regions of the heavy chain were shown to play a role in arsonate binding, most likely related to affinity maturation targeted at the carrier protein. These results confirm the obligatory role that junctional diversity plays in the generation of arsonate-specific antibodies, as well as extend our understanding of the role of other variable region amino acids in arsonate binding.  相似文献   

15.
16.
Summary Almost all of the body's extracellular immunoglobulin (Ig) is derived from Ig-secreting plasma cells of lymphoid tissues. The secreted material is a heterogeneous mixture of different classes and specificities. Lymphoid tissues also contain a large number of essentially non-secretory cells — B lymphocytes — which bear Ig firmly associated with their plasma membranes. Ig molecules thus exist in two functionally different forms, as membrane-bound antigen receptors on the surface of B lymphocytes on the one hand, and as humoral secreted Ig antibodies on the other. On B cells, membrane-bound heavy chains have an apparent mol. wt. slightly larger than that of secreted heavy chains from plasma cells. Membrane-bound but not secreted heavy chains bind detergents, thus suggesting the presence of a hydrophobic region in membrane-bound heavy chains, which is absent in secreted heavy chains. Most investigations have dealt with immunoglobulin M. The two types of IgM heavy chains differ at their carboxy termini. Recent investigations at the nucleic acid level demonstrate that membrane-associated µ chains contain a 41-residue hydrophobic tail adjacent to the last constant domain, whereas secretory µ chains contain a 20-residue hydrophilic tail. At the present time, evidence is accumulating that all membrane-bound Ig heavy chain classes may contain similar hydrophobic structures necessary for anchorage of the molecules into the lipid bilayer.  相似文献   

17.
;Fingerprints' of bovine colostrum and serum immunoglobulin G1 heavy chains were extremely similar, but different from serum immunoglobin G2 heavy chains. Serum immunoglobulin G1 and immunoglobulin G2 heavy chains were treated with cyanogen bromide. The fractions from the C-terminal end of the heavy chains were isolated and the amino acid sequence of this fraction from immunoglobulin G2 was:His-Glx-Ala-Leu-His-Asx-His-Tyr-Met-Gln-Lys-Ser-Thr-Ser-Lys-Ser-Ala-GlyThe amino acid composition of this fraction from immunoglobulin G1 was the same except for the methionine, which in immunoglobulin G1 was replaced by threonine.  相似文献   

18.
Insertions, deletions, and inversions in the chloroplast genome of higher plants have been shown to be extremely useful for resolving phylogenetic relationships both between closely related taxa and among more basal lineages. Introns and intergenic spacers from the chloroplast genome are now increasingly used for phylogenetic and population genetic studies of populations from a single species, and it is therefore interesting to know whether indels can provide useful data and hence increase the power of intraspecific studies. Here, we show that indels in three cpDNA intergenic spacers and one cpDNA intron for two species of Silene evolve at slightly higher rates than base pair substitutions. Repeat indels appear to have the highest rate of evolution and are thus more prone to homoplasy. We show that coded indel data have high information content for phylogenetic analysis, and indels thus provide useful information to infer phylogenetic relationships at the intraspecific level.  相似文献   

19.
Russell bodies are intracellular aggregates of immunoglobulins. Although the mechanism of Russell body biogenesis has been extensively studied by using truncated mutant heavy chains, the importance of the variable domain sequences in this process and in immunoglobulin biosynthesis remains largely unknown. Using a panel of structurally and functionally normal human immunoglobulin Gs, we show that individual immunoglobulin G clones possess distinctive Russell body inducing propensities that can surface differently under normal and abnormal cellular conditions. Russell body inducing predisposition unique to each immunoglobulin G clone was corroborated by the intrinsic physicochemical properties encoded in the heavy chain variable domain/light chain variable domain sequence combinations that define each immunoglobulin G clone. While the sequence based intrinsic factors predispose certain immunoglobulin G clones to be more prone to induce Russell bodies, extrinsic factors such as stressful cell culture conditions also play roles in unmasking Russell body propensity from immunoglobulin G clones that are normally refractory to developing Russell bodies. By taking advantage of heterologous expression systems, we dissected the roles of individual subunit chains in Russell body formation and examined the effect of non-cognate subunit chain pair co-expression on Russell body forming propensity. The results suggest that the properties embedded in the variable domain of individual light chain clones and their compatibility with the partnering heavy chain variable domain sequences underscore the efficiency of immunoglobulin G biosynthesis, the threshold for Russell body induction, and the level of immunoglobulin G secretion. We propose that an interplay between the unique properties encoded in variable domain sequences and the state of protein homeostasis determines whether an immunoglobulin G expressing cell will develop the Russell body phenotype in a dynamic cellular setting.  相似文献   

20.
The immunoglobulin (Ig) molecule is composed of two identical heavy chains and two identical light chains (H2L2). Transport of this heteromeric complex is dependent on the correct assembly of the component parts, which is controlled, in part, by the association of incompletely assembled Ig heavy chains with the endoplasmic reticulum (ER) chaperone, BiP. Although other heavy chain-constant domains interact transiently with BiP, in the absence of light chain synthesis, BiP binds stably to the first constant domain (CH1) of the heavy chain, causing it to be retained in the ER. Using a simplified two-domain Ig heavy chain (VH-CH1), we have determined why BiP remains bound to free heavy chains and how light chains facilitate their transport. We found that in the absence of light chain expression, the CH1 domain neither folds nor forms its intradomain disulfide bond and therefore remains a substrate for BiP. In vivo, light chains are required to facilitate both the folding of the CH1 domain and the release of BiP. In contrast, the addition of ATP to isolated BiP-heavy chain complexes in vitro causes the release of BiP and allows the CH1 domain to fold in the absence of light chains. Therefore, light chains are not intrinsically essential for CH1 domain folding, but play a critical role in removing BiP from the CH1 domain, thereby allowing it to fold and Ig assembly to proceed. These data suggest that the assembly of multimeric protein complexes in the ER is not strictly dependent on the proper folding of individual subunits; rather, assembly can drive the complete folding of protein subunits.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号