首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 27 毫秒
1.
Evolution of protein superfamilies and bacterial genome size   总被引:1,自引:0,他引:1  
We present the structural annotation of 56 different bacterial species based on the assignment of genes to 816 evolutionary superfamilies in the CATH domain structure database. These assignments have enabled us to analyse the recurrence of specific superfamilies within and across the genomes. We have selected the superfamilies that have a very broad representation and therefore appear to be universally distributed in a significant number of bacterial lineages. Occurrence profiles of these universally distributed superfamilies are compared with genome size in order to estimate the correlation between superfamily duplication and the increase in proteome size. This distinguishes between those size-dependent superfamilies where frequency of occurrence is highly correlated with increase in genome size, and size-independent superfamilies where no correlation is observed. Consideration of the size correlation and the ratio between the mean and the standard deviations for all the superfamily profiles allows more detailed subdivisions and classification of superfamilies. For example, within the size-independent superfamilies, we distinguished a group that are distributed evenly amongst all the genomes. Within the size-dependent superfamilies we differentiated two groups: linearly distributed and non-linearly distributed. Functional annotation using the COG database was performed for all superfamilies in each of these groups, and this revealed significant differences amongst the three sets of superfamilies. Evenly distributed, size-independent domains are shown to be involved primarily in protein translation and biosynthesis. For the size-dependent superfamilies, linearly distributed superfamilies are involved mainly in metabolism, and non-linearly distributed superfamily domains are involved principally in gene regulation.  相似文献   

2.
The organization of ribosomal proteins in 16 prokaryotic genomes was studied as an example of comparative genome analyses of gene systems. Hypothetical ribosomal protein-containing operons were constructed. These operons also contained putative genes and other non-ribosomal genes. The correspondences among these genes across different organisms were clarified by sequence homology computations. In this way a cross tabulation of 70 ribosomal proteins genes was constructed. On average, these were organized into 9-14 operons in each genome. There were also 25 non-ribosomal or putative genes in these mainly ribosomal protein operons. Hence the table contains 95 genes in total. It was found that: (i) the conservation of the block of about 20 r-proteins in the L3 and L4 operons across almost the entire eubacteria and archaebacteria is remarkable; (ii) some operons only belong to eubacteria or archaebacteria; (iii) although the ribosomal protein operons are highly conserved within domain, there are fine variations in some operons across different organisms within each domain, and these variations are informative on the evolutionary relations among the organisms. This method provides a new potential for studying the origin and evolution of old species.  相似文献   

3.
The organization of ribosomal proteins in 16 prokaryotic genomes was studied as an example of comparative genome analyses of gene systems. Hypothetical ribosomal protein-containing operons were constructed. These operons also contained putative genes and other non-ribosomal genes. The correspondences among these genes across different organisms were clarified by sequence homology computations. In this way a cross tabulation of 70 ribosomal proteins genes was constructed. On average, these were organized into 9-14 operons in each genome. There were also 25 non-ribosomal or putative genes in these mainly ribosomal protein operons. Hence the table contains 95 genes in total. It was found that: (i) the conservation of the block of about 20 r-proteins in the L3 and L4 operons across almost the entire eubacteria and ar-chaebacteria is remarkable; (ii) some operons only belong to eubacteria or archaebacte-ria; (iii) although the ribosomal protein operons are highly conserved within domain, there are fine variat  相似文献   

4.
5.
The CATH database of domain structures has been used to explore the structural variation of homologous domains in 294 well populated domain structure superfamilies, each containing at least three sequence diverse relatives. Our analyses confirm some previously detected trends relating sequence divergence to structural variation but for a much larger dataset and in some superfamilies the new data reveal exceptional structural variation. Use of a new algorithm (2DSEC) to analyse variability in secondary structure compositions across a superfamily sheds new light on how structures evolve. 2DSEC detects inserted secondary structures that embellish the core of conserved secondary structures found throughout the superfamily. Analysis showed that for 56% of highly populated superfamilies (>9 sequence diverse relatives), there are twofold or more increases in the numbers of secondary structures in some relatives. In some families fivefold increases occur, sometimes modifying the fold of the domain. Manual inspection of secondary structure insertions or embellishments in 48 particularly variable superfamilies revealed that although these insertions were usually discontiguous in the sequence they were often co-located in 3D resulting in a larger structural motif that often modified the geometry of the active site or the surface conformation promoting diverse domain partnerships and protein interactions. These observations, supported by automatic analysis of all well populated CATH families, suggest that accretion of small secondary structure insertions may provide a simple mechanism for evolving new functions in diverse relatives. Some layered domain architectures (e.g. mainly-beta and alpha-beta sandwiches) that recur highly in the genomes more frequently exploit these types of embellishments to modify function. In these architectures, aggregation occurs most often at the edges, top or bottom of the beta-sheets. Information on structural variability across domain superfamilies has been made available through the CATH Dictionary of Homologous Structures (DHS).  相似文献   

6.
The recent advent of conformation capture techniques has provided unprecedented insights into the spatial organization of chromatin. We present a large-scale investigation of the inter-chromosomal segment and gene contact networks in embryonic stem cells of two mammalian organisms: humans and mice. Both interaction networks are characterized by a high degree of clustering of genome regions and the existence of hubs. Both genomes exhibit similar structural characteristics such as increased flexibility of certain Y chromosome regions and co-localization of centromere-proximal regions. Spatial proximity is correlated with the functional similarity of genes in both species. We also found a significant association between spatial proximity and the co-expression of genes in the human genome. The structural properties of chromatin are also species specific, including the presence of two highly interactive regions in mouse chromatin and an increased contact density on short, gene-rich human chromosomes, thereby indicating their central nuclear position. Trans-interacting segments are enriched in active marks in human and had no distinct feature profile in mouse. Thus, in contrast to interactions within individual chromosomes, the inter-chromosomal interactions in human and mouse embryonic stem cells do not appear to be conserved.  相似文献   

7.
The complete complement of C1q-domain-containing proteins in Homo sapiens   总被引:2,自引:0,他引:2  
The C-terminal domains of the A, B, C chains of C1q subcomponent of C1 complex represent a common structural motif, the C1q domain, that is found in a diverse range of proteins. We analyzed the human genome for the complete complement of this family and have identified a total of 31 independent gene sequences. The predominant organization of C1q-domain-containing (C1qDC) proteins includes a leading signal peptide, a collagen-like region of variable length, and a C-terminal C1q domain. There are 15 highly conserved residues within the C1q domain, among which 8 are invariant within the human gene set and these are predicted to cluster within the hydrophobic core of the protein. We suggest a 3-subfamily classification based on sequence homology. For some C1qDC-encoding genes, strict orthology has been retained throughout vertebrate evolution and these examples suggest a highly specific functional role for C1qDC proteins that has been under significant selective pressure. Alternatively, individual species have co-opted C1qDC proteins for roles that are highly specific to their biology, suggesting an evolutionary strategy of gene duplication and functional diversification. A more extensive analysis of the evolutionary relationship of C1qDC proteins reveals an ancient rooting, with clear members found in eubacterial species. Curiously, we have been unable to identify C1qDC-encoding genes in many eukaryotic genomcs, such as Sacchromyces cerivisae and C. elegans, suggesting that the retention or loss of this gene family throughout evolution has been sporadic.  相似文献   

8.
9.
We studied the 3D structural organization of the fission yeast genome, which emerges from the tethering of heterochromatic regions in otherwise randomly configured chromosomes represented as flexible polymer chains in an nuclear environment. This model is sufficient to explain in a statistical manner many experimentally determined distinctive features of the fission yeast genome, including chromatin interaction patterns from Hi-C experiments and the co-locations of functionally related and co-expressed genes, such as genes expressed by Pol-III. Our findings demonstrate that some previously described structure-function correlations can be explained as a consequence of random chromatin collisions driven by a few geometric constraints (mainly due to centromere-SPB and telomere-NE tethering) combined with the specific gene locations in the chromosome sequence. We also performed a comparative analysis between the fission and budding yeast genome structures, for which we previously detected a similar organizing principle. However, due to the different chromosome sizes and numbers, substantial differences are observed in the 3D structural genome organization between the two species, most notably in the nuclear locations of orthologous genes, and the extent of nuclear territories for genes and chromosomes. However, despite those differences, remarkably, functional similarities are maintained, which is evident when comparing spatial clustering of functionally related genes in both yeasts. Functionally related genes show a similar spatial clustering behavior in both yeasts, even though their nuclear locations are largely different between the yeast species.  相似文献   

10.
Genomes undergo changes in organization as a result of gene duplications, chromosomal rearrangements and local mutations, among other mechanisms. In contrast to prokaryotes, in which genes of a common function are often organized in operons and reside contiguously along the genome, most eukaryotes show much weaker clustering of genes by function, except for few concrete functional groups. We set out to check systematically if there is a relation between gene function and gene organization in the human genome. We test this question for three types of functional groups: pairs of interacting proteins, complexes and pathways. We find a significant concentration of functional groups both in terms of their distance within the same chromosome and in terms of their dispersal over several chromosomes. Moreover, using Hi-C contact map of the tendency of chromosomal segments to appear close in the 3D space of the nucleus, we show that members of the same functional group that reside on distinct chromosomes tend to co-localize in space. The result holds for all three types of functional groups that we tested. Hence, the human genome shows substantial concentration of functional groups within chromosomes and across chromosomes in space.  相似文献   

11.
12.
An in silico comparative genomics approach was used to identify putative orthologs to genetically mapped genes from the mosquito, Aedes aegypti, in the Drosophila melanogaster and Anopheles gambiae genome databases. Comparative chromosome positions of 73 D. melanogaster orthologs indicated significant deviations from a random distribution across each of the five A. aegypti chromosomal regions, suggesting that some ancestral chromosome elements have been conserved. However, the two genomes also reflect extensive reshuffling within and between chromosomal regions. Comparative chromosome positions of A. gambiae orthologs indicate unequivocally that A. aegypti chromosome regions share extensive homology to the five A. gambiae chromosome arms. Whole-arm or near-whole-arm homology was contradicted with only two genes among the 75 A. aegypti genes for which orthologs to A. gambiae were identified. The two genomes contain large conserved chromosome segments that generally correspond to break/fusion events and a reciprocal translocation with extensive paracentric inversions evident within. Only very tightly linked genes are likely to retain conserved linear orders within chromosome segments. The D. melanogaster and A. gambiae genome databases therefore offer limited potential for comparative positional gene determinations among even closely related dipterans, indicating the necessity for additional genome sequencing projects with other dipteran species.  相似文献   

13.
14.
Sex chromosomes differ from other chromosomes in the striking divergence they often show in size, structure, and gene content. Not only do they possess genes controlling sex determination that are restricted to either the X or Y (or Z or W) chromosomes, but in many taxa they also include recombining regions. In these 'pseudoautosomal regions' (PARs), sequence homology is maintained by meiotic pairing and exchange in the heterogametic sex. PARs are unique genomic regions, exhibiting some features of autosomes, but they are also influenced by their partial sex linkage. Here we review the distribution and structure of PARs among animals and plants, the theoretical predictions concerning their evolutionary dynamics, the reasons for their persistence, and the diversity and content of genes that reside within them. It is now clear that the evolution of the PAR differs in important ways from that of genes in either the non-recombining regions of sex chromosomes or the autosomes.  相似文献   

15.
Cloned genes were used to probe a highly redundant library of large cloned fragments of the Dictyostelium discoideum genome carried in yeast artificial chromosomes (YACs). Each gene recognized several independent YAC clones, thereby grouping them into a contig. Individual YACs were arranged within the contig by positioning genes relative to rare restriction sites and the YAC ends. Genes that had been previously assigned to one of the six linkage groups by parasexual genetics were used to establish physically mapped regions on specific chromosomes. Previously unmapped genes were assigned to specific chromosomes when they recognized members of a mapped contig. Linkage was confirmed by congruence of large-scale restriction maps centered on either the previously mapped or the newly mapped genes. At present, the chromosome-assigned map segments comprise approximately 50% of the genome. About half of each map segment is covered by overlapping YACs.  相似文献   

16.
Two v-erbA-related genes, named ear-2 and ear-3, have been identified in the human genome and characterized by cDNA cloning. These genes are predicted to encode proteins that are very similar in primary structure to receptors for steroid hormones or thyroid hormone (T3). In addition, amino acid sequences of the ear-2 and ear-3 gene products are very similar each other especially at the DNA binding domain (86% homology) and at the putative ligand binding domain (76% homology). Northern hybridization with ear DNA probes of RNAs from various tissues of a human fetus reveals that the expression of ear-2 is high in the liver whereas the expression of ear-3 is relatively ubiquitous. Hybridization analysis of DNAs from sorted chromosomes shows that the ear-2 gene is located on chromosome 19 and ear-3 on chromosome 5, indicating that the two genes are clearly different from each other.  相似文献   

17.
Genetic diversity among the T-protein genes of group A streptococci   总被引:2,自引:0,他引:2  
T protein is a trypsin- and pepsin-resistant molecule on the surface of group A streptococci used as a serological tool to differentiate streptococci of this group. The purpose of this study was to determine the relatedness among the T protein genes of the 25 known T serotypes. DNA probes were constructed which represented various regions of the structural gene for the T6 protein, tee6. The probes were assayed for their ability to hybridize HindIII digests of chromosomal DNA from the 25 different T serotypes. Probe pTEE6.3, coding for the entire T6 protein, and pTEE6(1-299), coding for the amino-terminal half of T6, displayed the highest amount of homology, each binding to 10 of 25 T serotypes. Probes coding for sequences in the carboxy-terminal half of T6 showed considerably less homology among T serotypes with one probe hybridizing with only three out of 25. A synthetic oligonucleotide coding for the carboxy-terminal hydrophobic domain of T6, an area conserved to some degree among several bacterial surface proteins, showed homology with only seven out of 25 T serotypes. Hybridization with sequences outside the tee6 coding area provided additional information on the relatedness of certain sets of T serotypes according to restriction-fragment size heterogeneity. Clearly, there is considerable diversity among T-serotype genes. The data suggest that two or more families of structurally variant T proteins exist, which share only the property of proteolytic resistance and/or, perhaps, some biological function.  相似文献   

18.

Background

Throughout evolution, mutations in particular regions of some protein structures have resulted in extra covalent bonds that increase the overall robustness of the fold: disulfide bonds. The two strategically placed cysteines can also have a more direct role in protein function, either by assisting thiol or disulfide exchange, or through allosteric effects. In this work, we verified how the structural similarities between disulfides can reflect functional and evolutionary relationships between different proteins. We analyzed the conformational patterns of the disulfide bonds in a set of disulfide-rich proteins that included twelve SCOP superfamilies: thioredoxin-like and eleven superfamilies containing small disulfide-rich proteins (SDP).

Results

The twenty conformations considered in the present study were characterized by both structural and energetic parameters. The corresponding frequencies present diverse patterns for the different superfamilies. The least-strained conformations are more abundant for the SDP superfamilies, while the “catalytic” +/−RHook is dominant for the thioredoxin-like superfamily. The “allosteric” -RHSaple is moderately abundant for BBI, Crisp and Thioredoxin-like superfamilies and less frequent for the remaining superfamilies. Using a hierarchical clustering analysis we found that the twelve superfamilies were grouped in biologically significant clusters.

Conclusions

In this work, we carried out an extensive statistical analysis of the conformational motifs for the disulfide bonds present in a set of disulfide-rich proteins. We show that the conformational patterns observed in disulfide bonds are sufficient to group proteins that share both functional and structural patterns and can therefore be used as a criterion for protein classification.  相似文献   

19.
FtsH(Filamentation temperature-sensitive H)是一种广泛存在于原核生物和真核生物中的ATP依赖型金属蛋白酶。同源性分析表明,在拟南芥和水稻基因组中分别有12个和9个ftsH基因。ftsH基因在染色体上的分布有明显的偏爱性,如拟南芥的1、2、5号染色体和水稻的1、5号染色体。亚细胞定位分析表明,所有FtsH蛋白均定位于叶绿体或线粒体中。系统进化分析表明,21个FtsH蛋白成员可分为8个类群,其中AtFtsH12在水稻中没有发现种间同源物。每个类群成员的蛋白序列高度保守,种内同源物显示出大于80%的相似性,而种间同源物的相似性也大于70%。类群内的同源基因并非平行进化产生的,拟南芥基因组中进化出AtftsH1/5、AtftsH2/8、AtftsH3/10和AtftsH7/9共4个同源基因对,而水稻基因组中只有OsftsH3/8和OsftsH4/5两个同源基因对。每一类群中的成员在基因外显子-内含子边界分布上表现出高度保守性,在蛋白功能结构域的可变残基上具有偏爱性,而内含子在碱基组成和序列长度上表现出广泛的变异。拟南芥和水稻ftsH基因家族的比较分析为其他物种ftsH基因的特...  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号