首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The present study examines 783 human-mouse orthologous gene pairs for their pattern of sequence evolution, contrasting mammalia, eukaryota, coelomata, and bilateria specific human intronless genes. Such comparisons may be of use in understanding the general evolution of human genome. Evolutionary rate analyses indicate that mammalia specific human intronless genes are evolving faster as compared to other intronless genes specific to eukaryotic lineage, indicating towards their rapid evolution. The observations indicates that the genes conserved in eukaryota, coelomata, and bilateria, that is, proteins that arose earlier in evolution as compared to mammalia specific genes evolve slowly and are subjected to negative selection. The cause underlying rate variations was also explored. Although mutational bias might slightly fasten the nonsynonymous rates in mammalia specific genes, it is unlikely to be major cause of rate difference between the various categories. Furthermore, rate of divergence of mammalia specific intronless genes has been related to functional classification using the protein family annotation. Protein function was found in some cases to have larger impact on the rate of evolution of genes. Also, the codon usage pattern of mammalia specific intronless genes do not seem to differ much from those of other intronless genes conserved solely in eukaryotic lineage.  相似文献   

2.
Using computational approaches we have identified 2017 expressed intronless genes in the mouse genome. Evolutionary analysis reveals that 56 intronless genes are conserved among the three domains of life--bacteria, archea and eukaryotes. These highly conserved intronless genes were found to be involved in essential housekeeping functions. About 80% of expressed mouse intronless genes have orthologs in eukaryotic genomes only, and thus are specific to eukaryotic organisms. 608 of these genes have intronless human orthologs and 302 of these orthologs have a match in OMIM database. Investigation into these mouse genes will be important in generating mouse models for understanding human diseases.  相似文献   

3.
4.
This study assessed the relationship between the occurrence and function of intronless or single exon genes (SEG) in the genome of five teleost species and their phylogenetic distance. The results revealed that Takifugu rubripes, Tetraodon nigroviridis, Oryzias latipes, Gasterosteus aculeatus and Danio rerio genomes are respectively comprised of 2.83%, 3.42%, 4.49%, 4.35% and 4.02% SEGs. These SEGs encode for a variety of family proteins including claudins, olfactory receptors and histones that are essential for various biological functions. Subsequently, we predicted and annotated SEGs in three European sea bass, Dicentrarchus labrax chromosomes that we have sequenced, and compared results with those of stickleback (G. aculeatus) homologous chromosomes. While the annotation features of three D. labrax chromosomes revealed 78 (5.30%) intronless genes, comparisons with G. aculeatus showed that SEG composition and their order varied significantly among corresponding chromosomes, even for those with nearly complete synteny. More than half of SEGs identified in most of the species have at least one ortholog multiple exon gene in the same genome, which provides insight to their possible origin by retrotransposition. In spite of the fact that they belong to the same lineage, the fraction of predicted SEGs varied significantly between the genomes analyzed, and only a low fraction of proteins (4.1%) is conserved between all five species. Furthermore, the inter-specific distribution of SEGs as well as the functional categories shared by species did not reflect their phylogenetic relationships. These results indicate that new SEGs are continuously and independently generated after species divergence over evolutionary time as evidenced by the phylogenetic results of single exon claudins genes. Although the origin of SEGs cannot be inferred directly from the phylogeny, our results provide strong support for the idea that retrotransposition followed by tandem duplications is the most probable event that can explain the expansion of SEGs in eukaryotic organisms.  相似文献   

5.
Siew N  Saini HK  Fischer D 《FEBS letters》2005,579(14):3175-3182
A large number of sequences in each newly sequenced genome correspond to lineage and species-specific proteins, also known as ORFans. Amongst these ORFans, a large number are sequences with unknown structures and functions. We have identified a family of sequences, annotated as hypothetical proteins, which are specific to Bacillus and have carried out a computational study aimed at characterizing this family. Fold-recognition methods predict that these sequences belong to the alpha/beta hydrolase fold. We suggest possible catalytic triads for the ORFans and propose a hypothesis regarding the possible families within the alpha/beta hydrolase superfamily to which they may belong.  相似文献   

6.
We have used the (nearly) completed eukaryotic genome sequences to trace the evolution of thirteen families of established vertebrate regulators of type-1 protein phosphatases (PP1). Two of these families are present in all lineages of the eukaryotic crown and therefore qualify as candidate primordial regulators that determined the surface of PP1. The set of regulators of PP1 has continued to expand ever since, often in response to functional innovations in different eukaryotic lineages. In particular, the development of metazoan multicellularity was accompanied by an explosive increase in the number of regulators of PP1. The further increase in the functional diversity of PP1 in the vertebrate lineage was mainly achieved by the duplication of genes for regulatory subunits and by the conversion of already existing proteins into regulators of PP1. Unexpectedly, our analysis has also enabled us to classify nine poorly characterized proteins as likely regulators of PP1.  相似文献   

7.
Stevens TJ  Arkin IT 《Proteins》2000,39(4):417-420
One may speculate that higher organisms require a proportionately greater abundance of membrane proteins within their genomes in order to furnish the requirements of differentiated cell types, compartmentalization, and intercellular signalling. With the recent availability of several complete prokaryotic genome sequences and sufficient progress in many eukaryotic genome sequencing projects, we seek to test this hypothesis. Using optimized hydropathy analysis of proteins in several, diverse proteomes, we show that organisms of the three domains of life-Eukarya, Eubacteria, and Archaea-have similar proportions of alpha-helical membrane proteins within their genomes and that these are matched by the complexity of the aqueous components.  相似文献   

8.
Members of the RNA-helicase family are defined by several evolutionary conserved motifs. They are found in all organisms - from bacteria to humans - and many viruses. The minimum number of RNA helicases present within a eukaryotic cell can be predicted from the complete sequence of the Saccharomyces cerevisiae genome. Recent progress in the functional analysis of various family members has given new insights into, and confirmed the significance of these proteins for, most cellular RNA metabolic processes.  相似文献   

9.
Intronless genes, a characteristic feature of prokaryotes, constitute a significant portion of the eukaryotic genomes. Our analysis revealed the presence of 11,109 (19.9%) and 5,846 (21.7%) intronless genes in rice and Arabidopsis genomes, respectively, belonging to different cellular role and gene ontology categories. The distribution and conservation of rice and Arabidopsis intronless genes among different taxonomic groups have been analyzed. A total of 301 and 296 intronless genes from rice and Arabidopsis, respectively, are conserved among organisms representing the three major domains of life, i.e., archaea, bacteria, and eukaryotes. These evolutionarily conserved proteins are predicted to be involved in housekeeping cellular functions. Interestingly, among the 68% of rice and 77% of Arabidopsis intronless genes present only in eukaryotic genomes, approximately 51% and 57% genes have orthologs only in plants, and thus may represent the plant-specific genes. Furthermore, 831 and 144 intronless genes of rice and Arabidopsis, respectively, referred to as ORFans, do not exhibit homology to any of the genes in the database and may perform species-specific functions. These data can serve as a resource for further comparative, evolutionary, and functional analysis of intronless genes in plants and other organisms. Electronic supplementary material The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

10.
Several thousand genes in the human genome have been linked to a heritable genetic disease. The majority of these appear to be nonessential genes (i.e., are not embryonically lethal when inactivated), and one could therefore speculate that they are late additions in the evolutionary lineage toward humans. Contrary to this expectation, we find that they are in fact significantly overrepresented among the genes that have emerged during the early evolution of the metazoa. Using a phylostratigraphic approach, we have studied the evolutionary emergence of such genes at 19 phylogenetic levels. The majority of disease genes was already present in the eukaryotic ancestor, and the second largest number has arisen around the time of evolution of multicellularity. Conversely, genes specific to the mammalian lineage are highly underrepresented. Hence, genes involved in genetic diseases are not simply a random subset of all genes in the genome but are biased toward ancient genes.  相似文献   

11.
ATP-binding cassette (ABC) proteins constitute a large superfamily found in all kingdoms of living organisms. The recent completion of two draft sequences of the rice (Oryza sativa) genome allowed us to analyze and classify its ABC proteins and to compare to those in Arabidopsis thaliana. We identified a similar number of ABC proteins in rice and Arabidopsis (121 versus 120), despite the rice genome being more than three times the size of Arabidopsis. Both Arabidopsis and rice have representative members in all seven major subfamilies of ABC ATPases (A to G) commonly found in eukaryotes. This comparative analysis allowed the detection of 29 potential orthologous sequences in Arabidopsis and rice. However, plant share with prokaryotes a specific set of ABC systems that is not detected in animals. These ABC systems might be inherited from the cyanobacterial ancestor of chloroplasts. The present work provides the first complete inventory of rice ABC proteins and an updated inventory of those proteins in Arabidopsis.  相似文献   

12.
Liu H  Fu Y  Xie J  Cheng J  Ghabrial SA  Li G  Peng Y  Yi X  Jiang D 《Journal of virology》2011,85(19):9863-9876
Parvoviruses infect humans and a broad range of animals, from mammals to crustaceans, and generally are associated with a variety of acute and chronic diseases. However, many others cause persistent infections and are not known to be associated with any disease. Viral persistence is likely related to the ability to integrate into the chromosomal DNA and to establish a latent infection. However, there is little evidence for genome integration of parvoviral DNA except for Adeno-associated virus (AAV). Here we performed a systematic search for homologs of parvoviral proteins in publicly available eukaryotic genome databases followed by experimental verification and phylogenetic analysis. We conclude that parvoviruses have frequently invaded the germ lines of diverse animal species, including mammals, fishes, birds, tunicates, arthropods, and flatworms. The identification of orthologous endogenous parvovirus sequences in the genomes of humans and other mammals suggests that parvoviruses have coexisted with mammals for at least 98 million years. Furthermore, some of the endogenized parvoviral genes were expressed in eukaryotic organisms, suggesting that these viral genes are also functional in the host genomes. Our findings may provide novel insights into parvovirus biology, host interactions, and evolution.  相似文献   

13.
We have developed a method to reliably identify partial membrane protein topologies using the consensus of five topology prediction methods. When evaluated on a test set of experimentally characterized proteins, we find that approximately 90% of the partial consensus topologies are correctly predicted in membrane proteins from prokaryotic as well as eukaryotic organisms. Whole-genome analysis reveals that a reliable partial consensus topology can be predicted for approximately 70% of all membrane proteins in a typical bacterial genome and for approximately 55% of all membrane proteins in a typical eukaryotic genome. The average fraction of sequence length covered by a partial consensus topology is 44% for the prokaryotic proteins and 17% for the eukaryotic proteins in our test set, and similar numbers are found when the algorithm is applied to whole genomes. Reliably predicted partial topologies may simplify experimental determinations of membrane protein topology.  相似文献   

14.
15.
16.
17.
BACKGROUND: Endogenous retroviruses contribute to the evolution of the host genome and can be associated with disease. Human endogenous retrovirus K (HERV-K) is related to the mouse mammary tumor virus and is present in the genomes of humans, apes and cercopithecoids (Old World monkeys). It is unknown how long ago in primate evolution the full-length HERV-K proviruses that are in the human genome today were formed. RESULTS: Ten full-length HERV-K proviruses were cloned from the human genome. Using provirus-specific probes, eight of the ten were found to be present in a genetically diverse set of humans but not in other extant hominoids. Intact preintegration sites for each of these eight proviruses were present in the apes. A ninth provirus was detected in the human, chimpanzee, bonobo and gorilla genomes, but not in the orang-utan genome. The tenth was found only in humans, chimpanzees and bonobos. Complete sequencing of six of the human-specific proviruses showed that full-length open reading frames for the retroviral protein precursors Gag-Pro-Pol or Env were each present in multiple proviruses. CONCLUSIONS: At least eight full-length HERV-K genomes that are in the human germline today integrated after humans diverged from chimpanzees. All of the viral open reading frames and cis-acting sequences necessary for HERV-K replication must have been intact during the recent time when these proviruses formed. Multiple full-length open reading frames for all HERV-K proteins are present in the human genome today.  相似文献   

18.
Novel H3N2 influenza viruses (H3N2v) containing seven genome segments from swine lineage triple-reassortant H3N2 viruses and a 2009 pandemic H1N1 (H1N1pdm09) matrix protein segment (pM) were isolated from 12 humans in the United States between August and December 2011. To understand the evolution of these novel H3N2 viruses in swine and humans, we undertook a phylogenetic analysis of 674 M sequences and 388 HA and NA sequences from influenza viruses isolated from North American swine during 2009-2011, as well as HA, NA, and M sequences from eight H3N2v viruses isolated from humans. We identified 34 swine influenza viruses (termed rH3N2p) with the same combination of H3, N2, and pM segments as the H3N2v viruses isolated from humans. Notably, these rH3N2p viruses were generated in swine via reassortment events between H3N2 viruses and the pM segment approximately 4 to 10 times since 2009. The pM segment has also reassorted with multiple distinct lineages of H1 virus, especially H1δ viruses. Importantly, the N2 segment of all H3N2v viruses isolated from humans is derived from a genetically distinct N2 lineage that has circulated in swine since being acquired by reassortment with seasonal human H3N2 viruses in 2001-2002, rather than from the N2 that is associated with the 1998 H3N2 swine lineage. The identification of this N2 variant may have implications for influenza vaccine design and the potential pandemic threat of H3N2v to human age groups with differing levels of prior exposure and immunity.  相似文献   

19.
20.
The SNARE proteins are required for membrane fusion during intracellular vesicular transport and for its specificity. Only the unique combination of SNARE proteins (cognates) can be bound and can lead to membrane fusion, although the characteristics of the possible specificity of the binding combinations encoded in the SNARE sequences have not yet been determined. We discovered by whole genome sequence analysis that sequence motifs (conserved sequences) in the SNARE motif domains for each protein group correspond to localization sites or transport pathways. We claim that these motifs reflect the specificity of the binding combinations of SNARE motif domains. Using these motifs, we could classify SNARE proteins from 48 organisms into their localization sites or transport pathways. The classification result shows that more than 10 SNARE subgroups are kingdom specific and that the SNARE paralogs involved in the plasma membrane-related transport pathways have developed greater variations in higher animals and higher plants than those involved in the endoplasmic reticulum-related transport pathways throughout eukaryotic evolution.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号