Similar literature
 Found 20 similar articles; search took 31 ms
1.
Maize (Zea mays) seeds are a good source of protein, despite being deficient in several essential amino acids. However, eliminating the highly abundant but poorly balanced seed storage proteins has revealed that the regulation of seed amino acids is complex and does not rely on only a handful of proteins. In this study, we used two complementary omics-based approaches to shed light on the genes and biological processes that underlie the regulation of seed amino acid composition. We first conducted a genome-wide association study to identify candidate genes involved in the natural variation of seed protein-bound amino acids. We then used weighted gene correlation network analysis to associate protein expression with seed amino acid composition dynamics during kernel development and maturation. We found that almost half of the proteome was significantly reduced during kernel development and maturation, including several translational machinery components such as ribosomal proteins, which strongly suggests translational reprogramming. The reduction was significantly associated with a decrease in several amino acids, including lysine and methionine, pointing to their role in shaping the seed amino acid composition. When we compared the candidate gene lists generated from both approaches, we found a nonrandom overlap of 80 genes. A functional analysis of these genes showed a tight interconnected cluster dominated by translational machinery genes, especially ribosomal proteins, further supporting the role of translation dynamics in shaping seed amino acid composition. These findings strongly suggest that seed biofortification strategies that target the translation machinery dynamics should be considered and explored further.

An integrated approach reveals the key role of translational machinery in maize kernel amino acid natural variation and homeostasis, highlighting targets for seed amino acid biofortification.
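The "nonrandom overlap" between two candidate gene lists mentioned in entry 1 can be illustrated with a hypergeometric tail probability. This is only a sketch: the abstract does not say which test the authors used, and the function name `overlap_p_value` and every number in the usage note are hypothetical.

```python
from math import comb

def overlap_p_value(universe, list_a, list_b, overlap):
    """Probability of seeing at least `overlap` shared genes by chance.

    Assumes list_b genes are drawn at random, without replacement, from
    `universe` genes, of which list_a belong to the first candidate list.
    (Illustrative model; not taken from the abstract.)
    """
    upper = min(list_a, list_b)
    total = comb(universe, list_b)
    # Sum the hypergeometric probabilities for k = overlap .. upper.
    tail = sum(
        comb(list_a, k) * comb(universe - list_a, list_b - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total
```

For instance, `overlap_p_value(4, 2, 2, 2)` is the chance that two random 2-gene lists drawn from 4 genes coincide completely, which is 1/6. A small value for the real lists would support calling the overlap nonrandom.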

2.
A polypeptide (polypeptide P39), which is presumed to be involved in the photosynthetic circadian rhythm in the green alga Acetabularia, was purified from the EDTA-insoluble chloroplast membrane fraction by means of preparative dodecylsulfate gel electrophoresis and then partially characterized. The purity of the isolated polypeptide P39 was confirmed by a further electrophoresis on an analytical dodecylsulfate gel and further elucidated by amino-terminal analysis, which showed that glycine is the only amino-terminal amino acid of the purified polypeptide material. The molecular weight of the polypeptide P39 was found to be about 39,000 on analytical gel electrophoresis, and the value was further supported by those obtained from amino acid composition and peptide mapping. The amino acid composition of polypeptide P39 showed that the proportion of intermediate amino acid groups is high while the proportion of hydrophilic amino acid groups is well balanced by that of hydrophobic amino acid groups, a property characteristic of membrane proteins.

3.
One of the well-known observations of proteins from thermophilic bacteria is the bias of the amino acid composition in which charged residues are present in large numbers, and polar residues are scarce. On the other hand, it has been reported that the molecular surfaces of proteins are adapted to their subcellular locations, in terms of the amino acid composition. Thus, it would be reasonable to expect that the differences in the amino acid compositions between proteins of thermophilic and mesophilic bacteria would be much greater on the protein surface than in the interior. We performed systematic comparisons between proteins from thermophilic bacteria and mesophilic bacteria, in terms of the amino acid composition of the protein surface and the interior, as well as the entire amino acid chains, by using sequence information from the genome projects. The biased amino acid composition of thermophilic proteins was confirmed, and the differences from those of mesophilic proteins were most obvious in the compositions of the protein surface. In contrast to the surface composition, the interior composition was not distinctive between the thermophilic and mesophilic proteins. The frequency of the amino acid pairs that are closely located in the space was also analyzed to show the same trend of the single amino acid compositions. Interestingly, extracellular proteins from mesophilic bacteria showed an inverse trend against thermophilic proteins (i.e. a reduced number of charged residues and rich in polar residues). Nuclear proteins from eukaryotes, which are known to be abundant in positive charges, showed different compositions as a whole from the thermophiles. These results suggest that the bias of the amino acid composition of thermophilic proteins is due to the residues on the protein surfaces, which may be constrained by the extreme environment.

4.
Sau K, Gupta SK, Sau S, Mandal SC, Ghosh TC. Bio Systems, 2006, 85(2): 107-113
Synonymous codon and amino acid usage biases have been investigated in 903 Mimivirus protein-coding genes in order to understand the architecture and evolution of the Mimivirus genome. As expected for an AT-rich genome, third codon positions of the synonymous codons of Mimivirus carry mostly A or T bases. It was found that codon usage bias in Mimivirus genes is dictated both by mutational pressure and translational selection. Evidence shows that four factors, namely mean molecular weight (MMW), hydropathy, aromaticity and cysteine content, are mostly responsible for the variation of amino acid usage in Mimivirus proteins. Based on our observations, we suggest that genes involved in translation, DNA repair, protein folding, etc., were laterally transferred to Mimivirus long ago from living organisms and that, with time, these genes acquired the codon usage pattern of other Mimivirus genes under selection pressure.

5.
In this study, an attempt has been made to predict the major functions of gram-negative bacterial proteins from their amino acid sequences. The dataset used for training and testing consists of 670 non-redundant gram-negative bacterial proteins (255 of cellular process, 60 of information molecules, 285 of metabolism, and 70 of virulence factors). First we developed an SVM-based method using amino acid and dipeptide composition and achieved overall accuracies of 52.39% and 47.01%, respectively. We introduced a new concept for the classification of proteins based on tetrapeptides, in which we identified the unique tetrapeptides significantly found in a class of proteins. These tetrapeptides were used as the input feature for predicting the function of a protein and achieved an overall accuracy of 68.66%. We also developed a hybrid method in which the tetrapeptide information was used with amino acid composition and achieved an overall accuracy of 70.75%. A five-fold cross validation was used to evaluate the performance of these methods. The web server VICMpred has been developed for predicting the function of gram-negative bacterial proteins (http://www.imtech.res.in/raghava/vicmpred/).
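The amino acid and dipeptide composition features named in entry 5 can be sketched as follows. This is a minimal illustration of how such fixed-length feature vectors are commonly built, not the VICMpred implementation; the function names and the handling of non-standard residues are assumptions.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues

def amino_acid_composition(seq):
    """Return 20 residue fractions, ordered as in AMINO_ACIDS."""
    # Assumption: non-standard symbols (X, B, Z, gaps) are simply dropped.
    residues = [aa for aa in seq.upper() if aa in AMINO_ACIDS]
    if not residues:
        return [0.0] * len(AMINO_ACIDS)
    return [residues.count(aa) / len(residues) for aa in AMINO_ACIDS]

def dipeptide_composition(seq):
    """Return 400 fractions of overlapping adjacent residue pairs."""
    cleaned = "".join(aa for aa in seq.upper() if aa in AMINO_ACIDS)
    pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    counts = dict.fromkeys(pairs, 0)
    for i in range(len(cleaned) - 1):
        counts[cleaned[i:i + 2]] += 1
    total = max(len(cleaned) - 1, 1)  # avoid division by zero
    return [counts[p] / total for p in pairs]
```

Each protein then maps to a 20-dimensional or 400-dimensional vector that a classifier such as an SVM can take as input, regardless of sequence length.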

6.
Nakashima H, Nishikawa K, Ooi T. Proteins, 1990, 8(2): 173-178
A compact mitochondrial gene contains all essential information about the synthesis of mitochondrial proteins which play their roles in a small compartment of the mitochondrion. Almost no noncoding regions have been found through the gene, but a necessary set of tRNAs for the 20 amino acids is provided for biosynthesis, some of them coding different amino acids from those in a usual cell. Since the gene is so compact that the produced proteins would have some characteristic aspects for the mitochondrion, amino acid compositions of mitochondrial proteins (mt-proteins) were examined in the 20-dimensional composition space. The results show that compositions of proteins translated from the mitochondrial genes have a distinct character, having more hydrophobic content than others, which is illustrated by a clustered distribution in the multidimensional composition space. The cluster is located at the tail edge of the global distribution pattern of a Gaussian shape for other various kinds of proteins in the space. The mt-proteins are rich in hydrophobic amino acids, as is a membrane protein, but are different from other membrane proteins in a lesser content of Val. A good correlation found between the base and amino acid compositions for the mitochondria was examined in comparison to those of organisms such as a thermophilic bacterium having an extremely G-C-rich base composition.

7.
8.
Goliaei B, Minuchehr Z. FEBS letters, 2003, 537(1-3): 121-127
Amino acids seem to have specific preferences for various locations in alpha-helices. These specific preferences, called singlet local propensity (SLP), have been determined by calculating the preference of occurrence of each amino acid in different positions of the alpha-helix. We have studied the occurrence of amino acids, single or pairs, in different positions, singlet or doublet, of alpha-helices in a database of 343 non-homologous proteins representing a unique superfamily from the SCOP database with a resolution better than 2.5 Å from the Protein Data Bank. The preference of single amino acids for various locations of the helix was shown by the relative entropy of each amino acid with respect to the background. Based on the total relative entropy of all amino acids occurring in a single position, the N(cap) position was found to be the most selective position in the alpha-helix. A rigorous statistical analysis of amino acid pair occurrences showed that there are exceptional pairs for which the observed frequency of occurrence in various doublet positions of the alpha-helix is significantly different from the expected frequency of occurrence in that position. The doublet local propensity (DLP) was defined as the preference of occurrences of amino acid pairs in different doublet positions of the alpha-helix. For most amino acid pairs, the observed DLP (DLP(O)) was nearly equal to the expected DLP (DLP(E)), which is the product of the related SLPs. However, for exceptional pairs of amino acids identified above, the DLP(O) and DLP(E) values were significantly different. Based on the relative values of DLP(O) and DLP(E), exceptional amino acid pairs were divided into two categories. Those for which the DLP(O) values are higher than DLP(E) should have a strong tendency to pair together in the specified position. For those pairs for which the DLP(O) values are less than DLP(E), there exists a hindrance in neighboring of the two amino acids in that specific position of the alpha-helix. These cases have been identified and listed in various tables in this paper. The amount of mutual information carried by the exceptional pairs of amino acids was significantly higher than the average mutual information carried by other amino acid pairs. The average mutual information conveyed by amino acid pairs in each doublet position was found to be very small but non-zero.
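Two quantities from entry 8 lend themselves to a short sketch: the relative entropy of a positional amino acid distribution with respect to the background, and the expected doublet propensity as the product of the related singlet propensities. The function names and the dictionary-based inputs are assumptions made for illustration; the abstract does not give the authors' exact formulas or log base.

```python
import math

def relative_entropy(position_freqs, background_freqs):
    """Kullback-Leibler divergence (bits) of a helix position from background.

    Both arguments map residue letters to frequencies that sum to 1.
    Higher values mean the position is more selective.
    """
    total = 0.0
    for aa, p in position_freqs.items():
        if p > 0:  # terms with p == 0 contribute nothing
            total += p * math.log2(p / background_freqs[aa])
    return total

def expected_dlp(slp_first, slp_second):
    """Expected doublet propensity: the product of the two singlet propensities."""
    return slp_first * slp_second

def is_exceptional(observed_dlp, slp_first, slp_second, tolerance=0.5):
    """Flag a pair whose observed DLP departs from expectation.

    `tolerance` is a hypothetical threshold; the paper uses a
    statistical test that the abstract does not describe.
    """
    return abs(observed_dlp - expected_dlp(slp_first, slp_second)) > tolerance
```

A position whose distribution equals the background has relative entropy 0, while a position occupied by a single residue that makes up half of the background scores 1 bit.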

9.
The amino acid sequences of proteins determine their three-dimensional structures and functions. However, how sequence information is related to structures and functions is still enigmatic. In this study, we show that at least a part of the sequence information can be extracted by treating amino acid sequences of proteins as a collection of English words, based on a working hypothesis that amino acid sequences of proteins are composed of short constituent amino acid sequences (SCSs) or “words”. We first confirmed that the English language highly likely follows Zipf's law, a special case of power law. We found that the rank-frequency plot of SCSs in proteins exhibits a similar distribution when low-rank tails are excluded. In comparison with natural English and “compressed” English without spaces between words, amino acid sequences of proteins show larger linear ranges and smaller exponents with heavier low-rank tails, demonstrating that the SCS distribution in proteins is largely scale-free. A distribution pattern of SCSs in proteins is similar among species, but species-specific features are also present. Based on the availability scores of SCSs, we found that sequence motifs are enriched in high-availability sites (i.e., “key words”) and vice versa. In fact, the highest availability peak within a given protein sequence often directly corresponds to a sequence motif. The amino acid composition of high-availability sites within motifs is different from that of entire motifs and all protein sequences, suggesting the possible functional importance of specific SCSs and their compositional amino acids within motifs. We anticipate that our availability-based word decoding approach is complementary to sequence alignment approaches in predicting functionally important sites of unknown proteins from their amino acid sequences.
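The rank-frequency analysis in entry 9 can be sketched as below. The sketch assumes that SCSs are overlapping fixed-length k-mers, which the abstract does not state, and fits the log-log slope by ordinary least squares; the function names are hypothetical.

```python
from collections import Counter
import math

def rank_frequency(sequences, k=3):
    """Count overlapping k-mers and return their counts, most frequent first."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return sorted(counts.values(), reverse=True)

def zipf_exponent(freqs):
    """Least-squares slope of log(frequency) against log(rank).

    Needs at least two ranks. A pure Zipf distribution
    (frequency proportional to 1/rank) gives a slope of -1.
    """
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx
```

A slope of smaller magnitude than -1 over the linear range would correspond to the "smaller exponents" the abstract reports for protein sequences.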

10.
Base composition, codon usages and amino acid usages have been analyzed by taking 529 orthologous sequences of Aquifex aeolicus and Bacillus subtilis, which have different optimal growth temperatures. These two bacteria do not differ significantly in overall GC composition, but GC(1+2) and GC3 levels were found to vary significantly. Significant increments in purine content and GC3 composition have been observed in the coding sequences of Aquifex aeolicus compared with their Bacillus subtilis counterparts. Correspondence analyses on codon and amino acid usages reveal that variation in base composition actually influences their codon and amino acid usages. Two selection pressures acting at the nucleotide level (GC3 and purine enrichment) cause variation in the amino acid usage differently in different protein secondary structures. Our results suggest that adaptation of amino acid usages in coil structure of Aquifex aeolicus proteins is under the control of both purine increment and GC3 composition, whereas the adaptation of the amino acids in the helical region of thermophilic bacteria is strongly influenced by the purine content. Evolutionary perspectives concerning the temperature adaptation of DNA and protein molecules of these two bacteria have been discussed on the basis of these results.
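The nucleotide-level measures in entry 10, GC(1+2), GC3 and purine content, can be computed from an in-frame coding sequence as sketched here. The function names are hypothetical, and the sketch assumes the sequence starts at codon position 1 and contains only A, C, G and T.

```python
def gc_by_codon_position(cds):
    """Return (GC12, GC3): GC fraction at codon positions 1+2 and at position 3."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    gc12 = gc3 = codons = 0
    for i in range(0, usable, 3):
        codon = cds[i:i + 3]
        gc12 += sum(base in "GC" for base in codon[:2])
        gc3 += codon[2] in "GC"
        codons += 1
    if codons == 0:
        return (0.0, 0.0)
    return (gc12 / (2 * codons), gc3 / codons)

def purine_content(cds):
    """Fraction of A and G bases in the sequence."""
    cds = cds.upper()
    if not cds:
        return 0.0
    return sum(base in "AG" for base in cds) / len(cds)
```

For the toy sequence `ATGGCC` (codons ATG and GCC), GC(1+2) is 0.5, GC3 is 1.0 and the purine content is 0.5, which shows how two genomes with similar overall GC can still differ by codon position.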

11.
1. The mitochondria isolated from human or rat liver were fractionated into submitochondrial particles and purified inner and outer membrane. According to different marker enzymes the inner membranes were enriched about 5-6-fold and the outer membranes about 12-14-fold. The electron microscopical appearance of the membranes was that expected on the basis of enzymic characterization. 2. A comparison of the average amino acid composition of the membrane proteins from the two types of mitochondria has been made. In the case of submitochondrial particles there were statistically significant differences between the human and rat hydrolysates for only five amino acids. Analysing the purified mitochondrial membranes there were significant differences between the two species for nine amino acids in the case of outer membranes and for 12 amino acids in the case of inner membranes. 3. With one exception all amino acids that were increased or decreased in the outer membrane exhibited a similar trend in the inner membrane of human compared with rat liver mitochondria. It appears that liver mitochondrial membranes have a species-dependent pattern of amino acid composition of their proteins.

12.
Human and mouse granulocyte-macrophage-colony-stimulating factors (hGM-CSF and mGM-CSF, respectively), isolated from Escherichia coli cells expressing the corresponding human and mouse genes, have been characterized. The observed properties of the proteins have been compared with those properties which can be deduced from the DNA sequence alone and the published properties of natural GM-CSFs. The purified E. coli-derived proteins were found to have the expected molecular masses, amino acid compositions and N- and C-terminal amino acid sequences. The finding of 70-90% unprocessed N-terminal methionine for both proteins is discussed. The four Cys residues were found to be involved in two intramolecular disulphide bonds, linking the first and third, and second and fourth Cys residues. This disulphide bond arrangement is probably the one existing in natural material, since, although not glycosylated, both E. coli-derived proteins showed biological activity (colony stimulating assay for hGM-CSF, and cell proliferation assay for mGM-CSF) comparable with that reported for the respective proteins purified from animal cells.

13.
Ofran Y, Margalit H. Proteins, 2006, 64(1): 275-279
It is well established that there is a relationship between the amino acid composition of a protein and its structural class (i.e., alpha, beta, alpha + beta, or alpha/beta). Several studies have even shown the power of amino acid composition in predicting the secondary structure class of a protein. Herein, we show that significant similarity in amino acid composition exists not only between proteins of the same class, but even between proteins of the same fold. To test conjectural explanations for this phenomenon, we analyzed a set of structurally similar proteins that are dissimilar in sequence. Based on this analysis, we suggest that specific residues that are involved in intramolecular interactions may account for this surprising relationship between composition and structure.

14.
Two glycoprotein bands isolated from the cyst wall protein pattern of two colpodid ciliates, Colpoda inflata (gp46CI) and Colpoda cucullus (gp46CC), were analysed for their amino acid composition. Both glycoproteins are very rich in glycine and have a relatively high hydrophobicity, containing additionally many leucine and alanine residues. Their high degree of similarity is both quantitative and qualitative. Compared with just two previously published reports, their amino acid compositions are similar to those found in the hydrolysed cyst wall total proteins from the ciliates C. steinii and Paraurostyla spp. The amino acid composition corroborates that they are indeed glycoproteins, because asparagine, an amino acid residue suitable for the attachment to N-acetylglucosamine by its amide group (N-glycan), is abundant in both proteins. We discuss our data in relation to other glycine-rich proteins and a comparison with amino acid composition protein databases is carried out.

15.
Structural genomics projects are producing many three-dimensional structures of proteins that have been identified only from their gene sequences. It is therefore important to develop computational methods that will predict sites involved in productive intermolecular interactions that might give clues about functions. Techniques based on evolutionary conservation of amino acids have the advantage over physicochemical methods in that they are more general. However, the majority of techniques neither use all available structural and sequence information, nor are able to distinguish between evolutionary restraints that arise from the need to maintain structure and those that arise from function. Three methods to identify evolutionary restraints on protein sequence and structure are described here. The first identifies those residues that have a higher degree of conservation than expected: this is achieved by comparing for each amino acid position the sequence conservation observed in the homologous family of proteins with the degree of conservation predicted on the basis of amino acid type and local environment. The second uses information theory to identify those positions where environment-specific substitution tables make poor predictions of the overall amino acid substitution pattern. The third method identifies those residues that have highly conserved positions when three-dimensional structures of proteins in a homologous family are superposed. The scores derived from these methods are mapped onto the protein three-dimensional structures and contoured, allowing identification of clusters of residues with strong evolutionary restraints that are sites of interaction in proteins involved in a variety of functions. Our method differs from other published techniques by making use of structural information to identify restraints that arise from the structure of the protein and differentiating these restraints from others that derive from intermolecular interactions that mediate functions in the whole organism.

16.
Genomic Regions Associated with Amino Acid Composition in Soybean   Total citations: 3, self-citations: 0, citations by others: 3
Soybean [Glycine max (L.) Merr.] is the single largest source of protein in animal feed. However, few studies have been conducted to evaluate genomic regions controlling amino acid composition in soybean. It is important to study the genetics of amino acid composition to achieve improvements through breeding. The objectives of this study were to determine the ratios of essential to non-essential (E:NE) and essential to total (E:T) amino acids, and to identify genomic regions controlling essential and non-essential amino acid composition in soybean seed. To achieve these objectives, 101 F6-derived recombinant inbred lines (RIL) developed from a cross of N87-984-16 × TN93-99 were used. Ground soybean seed samples were analyzed for amino acids using a near infrared spectroscopy (NIRS) instrument. A significant (p < 0.01) difference among the RIL was found for amino acid composition. Heritability estimates on an entry mean basis ranged from 0.13 for His to 0.67 for Tyr. A total of 94 polymorphic simple sequence repeat (SSR) molecular genetic markers were screened in DNA from progenies. Single factor ANOVA was used to identify candidate quantitative trait loci (QTL), which were then confirmed by QTL Cartographer. At least one QTL for each amino acid was detected in this population. QTL linked to molecular markers Satt143, Satt168, Satt203, Satt274 and Satt495 were associated with most of the amino acids. Phenotypic variation explained by an individual QTL ranged from 9.4 to 45.3%. QTL detected for amino acids in soybean in this experiment are expected to be useful for future breeding programs targeting development of improved soybean amino acid composition for human and animal nutrition.

17.
18.
Peptide maps of tryptic digests of the structural proteins from inner shells of intracisternal A-particles have shown common peptides for all the proteins. The terminal amino group of the three different structural proteins was identified as arginine. The major protein revealed approximately half the number of peptides expected from the amino acid composition. Since evidence for a cross-link bond has not been found, the main structural protein may be a single polypeptide chain containing a total or partial duplication of sequence.

19.
The parasite Plasmodium falciparum, responsible for the most deadly form of human malaria, has one of the most AT-rich genomes sequenced so far and is known to possess many atypical characteristics. Using multivariate statistical approaches, the present study analyzes the amino acid usage pattern in 5038 annotated protein-coding sequences in P. falciparum clone 3D7. The amino acid composition of individual proteins, though dominated by the directional mutational pressure, exhibits wide variation across the proteome. The Asn content, expression level, mean molecular weight, hydropathy, and aromaticity are found to be the major sources of variation in amino acid usage. At all stages of development, frequencies of residues encoded by GC-rich codons such as Gly, Ala, Arg, and Pro increase significantly in the products of the highly expressed genes. Investigation of nucleotide substitution patterns in P. falciparum and other Plasmodium species reveals that the nonsynonymous sites of highly expressed genes are more conserved than those of the lowly expressed ones, though for synonymous sites, the reverse is true. The highly expressed genes are, therefore, expected to be closer to their putative ancestral state in amino acid composition, and a plausible reason for their sequences being GC-rich at nonsynonymous codon positions could be that their ancestral state was less AT-biased. Negative correlation of the expression level of proteins with respective molecular weights supports the notion that P. falciparum, in spite of its intracellular parasitic lifestyle, follows the principle of cost minimization. [Reviewing Editor: Dr. Richard Kliman]

20.
Liang HK, Huang CM, Ko MT, Hwang JK. Proteins, 2005, 59(1): 58-63
Structural analysis is useful in elucidating structural features responsible for enhanced thermal stability of proteins. However, due to the rapid increase of sequenced genomic data, there are far more protein sequences than the corresponding three-dimensional (3D) structures. The usual sequence-based amino acid composition analysis provides useful but simplified clues about the amino acid types related to thermal stability of proteins. In this work, we developed a statistical approach to identify the significant amino acid coupling sequence patterns in thermophilic proteins. The amino acid coupling sequence pattern is defined as any 2 types of amino acids separated by 1 or more amino acids. Using this approach, we construct the rho profiles for the coupling patterns. The rho value gives a measure of the relative occurrence of a coupling pattern in thermophiles compared with mesophiles. We found that thermophiles and mesophiles exhibit significant bias in their amino acid coupling patterns. We showed that such bias is mainly due to temperature adaptation instead of species or GC content variations. Though no single outstanding coupling pattern can adequately account for protein thermostability, we can use a group of amino acid coupling patterns having strong statistical significance (p values < 10^-7) to distinguish between thermophilic and mesophilic proteins. We found a good correlation between the optimal growth temperatures of the genomes and the occurrences of the coupling patterns (the correlation coefficient is 0.89). Furthermore, we can separate the thermophilic proteins from their mesophilic orthologs using the amino acid coupling patterns. These results may be useful in the study of the enhanced stability of proteins from thermophiles, especially when structural information is scarce. Proteins 2005. © 2005 Wiley-Liss, Inc.
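The coupling pattern defined in entry 20, two amino acid types separated by one or more residues, can be counted as sketched below. The ratio computed by `relative_occurrence` is only a stand-in for the paper's rho value, whose exact definition the abstract does not give; both function names are hypothetical.

```python
from collections import Counter

def coupling_counts(seq, gap=1):
    """Count ordered residue pairs (a, b) separated by exactly `gap` residues."""
    counts = Counter()
    step = gap + 1  # index distance between the two coupled residues
    for i in range(len(seq) - step):
        counts[(seq[i], seq[i + step])] += 1
    return counts

def relative_occurrence(pattern, thermo_seqs, meso_seqs, gap=1):
    """Ratio of a pattern's frequency in thermophilic vs mesophilic sequences.

    Returns None when the pattern never occurs in the mesophilic set.
    (Illustrative measure only; not the published rho statistic.)
    """
    def frequency(seqs):
        total = Counter()
        for seq in seqs:
            total.update(coupling_counts(seq, gap))
        n = sum(total.values())
        return total[pattern] / n if n else 0.0

    meso = frequency(meso_seqs)
    if meso == 0:
        return None
    return frequency(thermo_seqs) / meso
```

Calling `coupling_counts` over a range of `gap` values gives one profile per residue pair, and a ratio above 1 marks a pattern that is more common in the thermophilic set.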
