首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Although baker's yeast is a primary model organism for research on eukaryotic ribosome assembly and nucleoli, the list of its proteins that are functionally associated with nucleoli or ribosomes is still incomplete. We trained a naïve Bayesian classifier to predict novel proteins that are associated with yeast nucleoli or ribosomes based on parts lists of nucleoli in model organisms and large-scale protein interaction data sets. Phylogenetic profiling and gene expression analysis were carried out to shed light on evolutionary and regulatory aspects of nucleoli and ribosome assembly.

Results

We predict that, in addition to 439 known proteins, a further 62 yeast proteins are associated with components of the nucleolus or the ribosome. The complete set comprises a large core of archaeal-type proteins, several bacterial-type proteins, but mostly eukaryote-specific inventions. Expression of nucleolar and ribosomal genes tends to be strongly co-regulated compared to other yeast genes.

Conclusion

The number of proteins associated with nucleolar or ribosomal components in yeast is at least 14% higher than known before. The nucleolus probably evolved from an archaeal-type ribosome maturation machinery by recruitment of several bacterial-type and mostly eukaryote-specific factors. Not only expression of ribosomal protein genes, but also expression of genes encoding the 90S processosome, are strongly co-regulated and both regulatory programs are distinct from each other.  相似文献   

2.
3.
4.
5.
6.
7.

Key message

The core promoter of the antiquitin ALDH7B4 gene was compared between selected Brassicaceae. Conserved cis elements controlling osmotic stress and wound-induced expression were identified and analysed in Arabidopsis thaliana leaves and seeds.

Abstract

Aldehyde dehydrogenases metabolise a wide range of aliphatic and aromatic aldehydes, which become cytotoxic at high levels. Family 7 aldehyde dehydrogenase genes, often described as antiquitins or turgor-responsive genes in plants, are broadly conserved across all domains. Despite the high conservation of the plant ALDH7 proteins and their importance in stress responses, their regulation has not been investigated. Here, we compared ALDH7 genes of different Brassicaceae and found that, in contrast to the gene organisation and protein coding sequences, similarities in the promoter sequences were limited to the first few hundred nucleotides upstream of the translation start codon. The function of this region was studied by isolating the core promoter of the Arabidopsis thaliana ALDH7B4 gene, taken as model. The promoter was found to be responsive to wounding in addition to salt and dehydration stress. Cis-acting elements involved in stress responsiveness were analysed and two conserved ACGT-containing motifs proximal to the translation start codon were found to be essential for the responsiveness to osmotic stress in leaves and in seeds. The integrity of an upstream ACGT motif and a dehydration-responsive element/C-repeat—low temperature-responsive element was found to be necessary for ALDH7B4 expression in seeds and induction by salt, dehydration and ABA in leaves. The comparison of the gene expression in selected Arabidopsis mutants demonstrated that osmotic stress-induced ALDH7B4 expression in leaves and seeds involves both ABA- and lipid-signalling components.  相似文献   

8.
9.

Background

Understanding how DNA sequence polymorphism relates to variation in gene expression is essential to connecting genotypic differences with phenotypic differences among individuals. Addressing this question requires linking population genomic data with gene expression variation.

Results

Using whole genome expression data and recent light shotgun genome sequencing of six Drosophila simulans genotypes, we assessed the relationship between expression variation in males and females and nucleotide polymorphism across thousands of loci. By examining sequence polymorphism in gene features, such as untranslated regions and introns, we find that genes showing greater variation in gene expression between genotypes also have higher levels of sequence polymorphism in many gene features. Accordingly, X-linked genes, which have lower sequence polymorphism levels than autosomal genes, also show less expression variation than autosomal genes. We also find that sex-specifically expressed genes show higher local levels of polymorphism and divergence than both sex-biased and unbiased genes, and that they appear to have simpler regulatory regions.

Conclusion

The gene-feature-based analyses and the X-to-autosome comparisons suggest that sequence polymorphism in cis-acting elements is an important determinant of expression variation. However, this relationship varies among the different categories of sex-biased expression, and trans factors might contribute more to male-specific gene expression than cis effects. Our analysis of sex-specific gene expression also shows that female-specific genes have been overlooked in analyses that only point to male-biased genes as having unusual patterns of evolution and that studies of sexually dimorphic traits need to recognize that the relationship between genetic and expression variation at these traits is different from the genome as a whole.  相似文献   

10.
11.
12.
13.

Background

Existing clustering approaches for microarray data do not adequately differentiate between subsets of co-expressed genes. We devised a novel approach that integrates expression and sequence data in order to generate functionally coherent and biologically meaningful subclusters of genes. Specifically, the approach clusters co-expressed genes on the basis of similar content and distributions of predicted statistically significant sequence motifs in their upstream regions.

Results

We applied our method to several sets of co-expressed genes and were able to define subsets with enrichment in particular biological processes and specific upstream regulatory motifs.

Conclusions

These results show the potential of our technique for functional prediction and regulatory motif identification from microarray data.
  相似文献   

14.
Gasch AP  Eisen MB 《Genome biology》2002,3(11):research0059.1-research005922
  相似文献   

15.
16.

Background

The massive scale of microarray derived gene expression data allows for a global view of cellular function. Thus far, comparative studies of gene expression between species have been based on the level of expression of the gene across corresponding tissues, or on the co-expression of the gene with another gene.

Results

To compare gene expression between distant species on a global scale, we introduce the "expression context". The expression context of a gene is based on the co-expression with all other genes that have unambiguous counterparts in both genomes. Employing this new measure, we show 1) that the expression context is largely conserved between orthologs, and 2) that sequence identity shows little correlation with expression context conservation after gene duplication and speciation.

Conclusion

This means that the degree of sequence identity has a limited predictive quality for differential expression context conservation between orthologs, and thus presumably also for other facets of gene function.  相似文献   

17.

Background

Mimivirus isolated from A. polyphaga is the largest virus discovered so far. It is unique among all the viruses in having genes related to translation, DNA repair and replication which bear close homology to eukaryotic genes. Nevertheless, only a small fraction of the proteins (33%) encoded in this genome has been assigned a function. Furthermore, a large fraction of the unassigned protein sequences bear no sequence similarity to proteins from other genomes. These sequences are referred to as ORFans. Because of their lack of sequence similarity to other proteins, they can not be assigned putative functions using standard sequence comparison methods. As part of our genome-wide computational efforts aimed at characterizing Mimivirus ORFans, we have applied fold-recognition methods to predict the structure of these ORFans and further functions were derived based on conservation of functionally important residues in sequence-template alignments.

Results

Using fold recognition, we have identified highly confident computational 3D structural assignments for 21 Mimivirus ORFans. In addition, highly confident functional predictions for 6 of these ORFans were derived by analyzing the conservation of functional motifs between the predicted structures and proteins of known function. This analysis allowed us to classify these 6 previously unannotated ORFans into their specific protein families: carboxylesterase/thioesterase, metal-dependent deacetylase, P-loop kinases, 3-methyladenine DNA glycosylase, BTB domain and eukaryotic translation initiation factor eIF4E.

Conclusion

Using stringent fold recognition criteria we have assigned three-dimensional structures for 21 of the ORFans encoded in the Mimivirus genome. Further, based on the 3D models and an analysis of the conservation of functionally important residues and motifs, we were able to derive functional attributes for 6 of the ORFans. Our computational identification of important functional sites in these ORFans can be the basis for a subsequent experimental verification of our predictions. Further computational and experimental studies are required to elucidate the 3D structures and functions of the remaining Mimivirus ORFans.  相似文献   

18.

Background

Mitochondria mediate most of the energy production that occurs in the majority of eukaryotic organisms. These subcellular organelles contain a genome that differs from the nuclear genome and is referred to as mitochondrial DNA (mtDNA). Despite a disparity in gene content, all mtDNAs encode at least two components of the mitochondrial electron transport chain, including cytochrome c oxidase I (Cox1).

Presentation of the hypothesis

A positionally conserved ORF has been found on the complementary strand of the cox1 genes of both eukaryotic mitochondria (protist, plant, fungal and animal) and alpha-proteobacteria. This putative gene has been named gau for gene antisense ubiquitous in mtDNAs. The length of the deduced protein is approximately 100 amino acids. In vertebrates, several stop codons have been found in the mt gau region, and potentially functional gau regions have been found in nuclear genomes. However, a recent bioinformatics study showed that several hypothetical overlapping mt genes could be predicted, including gau; this involves the possible import of the cytosolic AGR tRNA into the mitochondria and/or the expression of mt antisense tRNAs with anticodons recognizing AGR codons according to an alternative genetic code that is induced by the presence of suppressor tRNAs. Despite an evolutionary distance of at least 1.5 to 2.0 billion years, the deduced Gau proteins share some conserved amino acid signatures and structure, which suggests a possible conserved function. Moreover, BLAST analysis identified rare, sense-oriented ESTs with poly(A) tails that include the entire gau region. Immunohistochemical analyses using an anti-Gau monoclonal antibody revealed strict co-localization of Gau proteins and a mitochondrial marker.

Testing the hypothesis

This hypothesis could be tested by purifying the gau gene product and determining its sequence. Cell biological experiments are needed to determine the physiological role of this protein.

Implications of the hypothesis

Studies of the gau ORF will shed light on the origin of novel genes and their functions in organelles and could also have medical implications for human diseases that are caused by mitochondrial dysfunction. Moreover, this strengthens evidence for mitochondrial genes coded according to an overlapping genetic code.  相似文献   

19.
20.

Background

High throughput techniques have generated a huge set of biological data, which are deposited in various databases. Efficient exploitation of these databases is often hampered by a lack of appropriate tools, which allow easy and reliable identification of genes that miss functional characterization but are correlated with specific biological conditions (e.g. organotypic expression).

Results

We have developed a simple algorithm (DGSA = Database-dependent Gene Selection and Analysis) to identify genes with unknown functions involved in organ development concentrating on the heart. Using our approach, we identified a large number of yet uncharacterized genes, which are expressed during heart development. An initial functional characterization of genes by loss-of-function analysis employing morpholino injections into zebrafish embryos disclosed severe developmental defects indicating a decisive function of selected genes for developmental processes.

Conclusion

We conclude that DGSA is a versatile tool for database mining allowing efficient selection of uncharacterized genes for functional analysis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号