1.
Background
Identification of phosphorylation sites by computational methods is becoming increasingly important because it reduces labor-intensive and costly experiments and can improve our understanding of the common properties and underlying mechanisms of protein phosphorylation.
Methods
This study presents a multitask learning framework that learns four kinase families simultaneously, instead of studying each kinase family of phosphorylation sites separately. The framework includes two multitask classification methods: the Multi-Task Least Squares Support Vector Machines (MTLS-SVMs) and the Multi-Task Feature Selection (MT-Feat3).
Results
Using the multitask learning framework, we successfully identify 18 common features shared by four kinase families of phosphorylation sites. The reliability of the selected features is demonstrated by their consistent performance in the two multitask learning methods.
Conclusions
The selected features can be used to build efficient multitask classifiers with good performance, suggesting they are important to protein phosphorylation across the four kinase families.
2.
Background
Fertilization in Caenorhabditis elegans requires functional SPE-9 protein in sperm. SPE-9 is a transmembrane protein with a predicted extracellular domain that contains ten epidermal growth factor (EGF)-like motifs. The presence of these EGF-like motifs suggests that SPE-9 is likely to function in gamete adhesive and/or ligand-receptor interactions.
Results
We obtained specific antisera directed against different regions of SPE-9 in order to determine its subcellular localization. SPE-9 is segregated to spermatids with a pattern that is consistent with localization to the plasma membrane. During spermiogenesis, SPE-9 becomes localized to spiky projections that coalesce to form a pseudopod. This leads to an accumulation of SPE-9 on the pseudopod of mature sperm.
Conclusions
The wild-type localization patterns of SPE-9 provide further evidence that, like the sperm of other species, C. elegans sperm have molecularly mosaic and dynamic regions. SPE-9 is redistributed by what is likely to be a novel mechanism that is very fast (~5 minutes) and is coincident with dramatic rearrangements in the major sperm protein cytoskeleton. We conclude that SPE-9 ends up in a location on mature sperm where it can function during fertilization, and this localization defines the sperm region required for these interactions.
3.
Background
The reconstruction of ancestral genomes must deal with the problem of resolution, necessarily involving a trade-off between trying to identify genomic details and being overwhelmed by noise at higher resolutions.
Results
We use the median reconstruction, at the synteny block level, of the ancestral genome of the order Gentianales, based on coffee, Rhazya stricta and grape, to exemplify the effects of resolution (granularity) on comparative genomic analyses.
Conclusions
We show how decreased resolution blurs the differences between evolving genomes, with respect to rate, mutational process and other characteristics.
4.
Background
Bacterial genomes develop new mechanisms to tide them over the imposing conditions they encounter during the course of their evolution. Acquisition of new genes by lateral gene transfer (LGT) may be one of the dominant ways of adaptation in bacterial genome evolution. Lateral gene transfer provides the bacterial genome with a new set of genes that help it to explore and adapt to new ecological niches.
Methods
A maximum likelihood analysis was done on the five sequenced corynebacterial genomes to model the rates of gene insertions/deletions at various depths of the phylogeny.
Results
The study shows that most of the laterally acquired genes are transient and the inferred rates of gene movement are higher on the external branches of the phylogeny and decrease as the phylogenetic depth increases. The newly acquired genes are under relaxed selection and evolve faster than their older counterparts. Analysis of some of the functionally characterised LGTs in each species has indicated that they may have a possible adaptive role.
Conclusion
The five corynebacterial genomes sequenced to date have evolved by acquiring between 8% and 14% of their genomes by LGT, and some of these genes may have a role in adaptation.
5.
Morten Muhlig Nielsen, Paula Tataru, Tobias Madsen, Asger Hobolth, Jakob Skou Pedersen. Algorithms for molecular biology : AMB, 2018, 13(1):17
Background
Motif analysis methods have long been central for studying biological function of nucleotide sequences. Functional genomics experiments extend their potential. They typically generate sequence lists ranked by an experimentally acquired functional property such as gene expression or protein binding affinity. Current motif discovery tools suffer from limitations in searching large motif spaces, and thus more complex motifs may not be included. There is thus a need for motif analysis methods that are tailored for analyzing specific complex motifs motivated by biological questions and hypotheses rather than acting as a screen-based motif finding tool.
Methods
We present Regmex (REGular expression Motif EXplorer), which offers several methods to identify overrepresented motifs in ranked lists of sequences. Regmex uses regular expressions to define motifs or families of motifs and embedded Markov models to calculate exact p-values for motif observations in sequences. Biases in motif distributions across ranked sequence lists are evaluated using random walks, Brownian bridges, or modified rank-based statistics. A modular setup and fast analytic p-value evaluations make Regmex applicable to diverse and potentially large-scale motif analysis problems.
Results
We demonstrate use cases of combined motifs on simulated data and on expression data from microRNA transfection experiments. We confirm previously obtained results and demonstrate the usability of Regmex to test a specific hypothesis about the relative location of microRNA seed sites and U-rich motifs. We further compare the tool with an existing motif discovery tool and show increased sensitivity.
Conclusions
Regmex is a useful and flexible tool to analyze motif hypotheses that relate to large data sets in functional genomics. The method is available as an R package (https://github.com/muhligs/regmex).
6.
Background
Existing clustering approaches for microarray data do not adequately differentiate between subsets of co-expressed genes. We devised a novel approach that integrates expression and sequence data in order to generate functionally coherent and biologically meaningful subclusters of genes. Specifically, the approach clusters co-expressed genes on the basis of similar content and distributions of predicted statistically significant sequence motifs in their upstream regions.
Results
We applied our method to several sets of co-expressed genes and were able to define subsets with enrichment in particular biological processes and specific upstream regulatory motifs.
Conclusions
These results show the potential of our technique for functional prediction and regulatory motif identification from microarray data.
7.
Background
Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topologies. Furthermore, because of poor taxon sampling, four-taxon analyses suffer from sensitivity to the long branch attraction artifact. Here we extend the probability mapping approach by improving taxon sampling of the analyzed datasets, and by using bootstrap support values, a more conservative tool to assess reliability.
Results
Quartets of orthologous proteins were complemented with homologs from selected reference genomes. The mapping of bootstrap support values from these extended datasets gives results similar to the original maximum likelihood and posterior probability mapping. The more conservative nature of the plotted support values allows further analyses to focus on those protein families that strongly disagree with the majority or plurality of genes present in the analyzed genomes.
Conclusion
Posterior probability is a non-conservative measure for support, and posterior probability mapping only provides a quick estimation of phylogenetic information content of four genomes. This approach can be utilized as a pre-screen to select genes that might have been horizontally transferred. Better taxon sampling combined with subtree analyses prevents the inconsistencies associated with four-taxon analyses, but retains the power of visual representation. Nevertheless, a case-by-case inspection of individual multi-taxon phylogenies remains necessary to differentiate unrecognized paralogy and shared phylogenetic reconstruction artifacts from horizontal gene transfer events.
8.
Background
The protein encoded by the gene ybgI was chosen as a target for a structural genomics project emphasizing the relation of protein structure to function.
Results
The structure of the ybgI protein is a toroid composed of six polypeptide chains forming a trimer of dimers. Each polypeptide chain binds two metal ions on the inside of the toroid.
Conclusion
The toroidal structure is comparable to that of some proteins that are involved in DNA metabolism. The di-nuclear metal site could imply that the specific function of this protein is as a hydrolase-oxidase enzyme.
9.
Jensen LJ, Skovgaard M, Sicheritz-Pontén T, Jørgensen MK, Lundegaard C, Pedersen CC, Petersen N, Ussery D. BMC genomics, 2003, 4(1):12
Background
For most sequenced prokaryotic genomes, about a third of the protein coding genes annotated are "orphan proteins", that is, they lack homology to known proteins. These hypothetical genes are typically short and randomly scattered throughout the genome. This trend is seen for most of the bacterial and archaeal genomes published to date.
Results
In contrast we have found that a large fraction of the genes coding for such orphan proteins in the Methanopyrus kandleri AV19 genome occur within two large regions. These genes have no known homologs except from other M. kandleri genes. However, analysis of their lengths, codon usage, and Ribosomal Binding Site (RBS) sequences shows that they are most likely true protein coding genes and not random open reading frames.
Conclusions
Although these regions can be considered as candidates for massive lateral gene transfer, our bioinformatics analysis suggests that this is not the case. We predict many of the organism specific proteins to be transmembrane and belong to protein families that are non-randomly distributed between the regions. Consistent with this, we suggest that the two regions are most likely unrelated, and that they may be integrated plasmids.
10.
N. Cesbron, A.-L. Royer, Y. Guitton, A. Sydor, B. Le Bizec, G. Dervilly-Pinel. Metabolomics : Official journal of the Metabolomic Society, 2017, 13(8):99
Introduction
Collecting feces is easy. It offers direct outcome to endogenous and microbial metabolites.
Objectives
Given the lack of consensus about fecal sample preparation, especially in animal species, we developed a robust protocol allowing untargeted LC-HRMS fingerprinting.
Methods
The conditions of extraction (quantity, preparation, solvents, dilutions) were investigated in bovine feces.
Results
A rapid and simple protocol involving feces extraction with methanol (1/3, M/V) followed by centrifugation and a filtration step (10 kDa) was developed.
Conclusion
The workflow generated repeatable and informative fingerprints for robust metabolome characterization.
11.
13.
Background
In many microbial genomes, a strong preference for a small number of codons can be observed in genes whose products are needed by the cell in large quantities. This codon usage bias (CUB) improves translational accuracy and speed and is one of several factors optimizing cell growth. Whereas CUB and the overrepresentation of individual proteins have been studied in detail, it is still unclear which high-level metabolic categories are subject to translational optimization in different habitats.
Results
In a systematic study of 388 microbial species, we have identified for each genome a specific subset of genes characterized by a marked CUB, which we named the effectome. As expected, gene products related to protein synthesis are abundant in both archaeal and bacterial effectomes. In addition, enzymes contributing to energy production and gene products involved in protein folding and stabilization are overrepresented. The comparison of genomes from eleven habitats shows that the environment has only a minor effect on the composition of the effectomes. As a paradigmatic example, we detailed the effectome content of 37 bacterial genomes that are most likely exposed to strongest selective pressure towards translational optimization. These effectomes accommodate a broad range of protein functions like enzymes related to glycolysis/gluconeogenesis and the TCA cycle, ATP synthases, aminoacyl-tRNA synthetases, chaperones, proteases that degrade misfolded proteins, protectants against oxidative damage, as well as cold shock and outer membrane proteins.
Conclusions
We made clear that effectomes consist of specific subsets of the proteome being involved in several cellular functions. As expected, some functions are related to cell growth and affect speed and quality of protein synthesis. Additionally, the effectomes contain enzymes of central metabolic pathways and cellular functions sustaining microbial life under stress situations. These findings indicate that cell growth is an important but not the only factor modulating translational accuracy and speed by means of CUB.
14.
Allegra Via, Pier Federico Gherardini, Enrico Ferraro, Gabriele Ausiello, Gianpaolo Scalia Tomba, Manuela Helmer-Citterich. BMC bioinformatics, 2007, 8(1):68
Background
False occurrences of functional motifs in protein sequences can be considered as random events due solely to the sequence composition of a proteome. Here we use a numerical approach to investigate the random appearance of functional motifs with the aim of addressing biological questions such as: How are organisms protected from undesirable occurrences of motifs otherwise selected for their functionality? Has the random appearance of functional motifs in protein sequences been affected during evolution?
Results
Here we analyse the occurrence of functional motifs in random sequences and compare it to that observed in biological proteomes; the behaviour of random motifs is also studied. Most motifs exhibit a number of false positives significantly similar to the number of times they appear in randomized proteomes (i.e., the expected number of false positives). Interestingly, about 3% of the analysed motifs show a different kind of behaviour and appear in biological proteomes less than they do in random sequences. In some of these cases, a mechanism of evolutionary negative selection is apparent; this helps to prevent unwanted functionalities which could interfere with cellular mechanisms.
Conclusion
Our thorough statistical and biological analysis showed that there are several mechanisms and evolutionary constraints both of which affect the appearance of functional motifs in protein sequences.
15.
Ling Bai, Wei He, Tianpeng Li, Cuiting Yang, Yingping Zhuang, Shu Quan. Biotechnology letters, 2017, 39(8):1191-1199
Objective
To investigate the application of the TEM-1 β-lactamase protein fragment complementation assay (PCA) in detecting weak and unstable protein–protein interactions as typically observed during chaperone-assisted protein folding in the periplasm of Escherichia coli.
Results
The TEM-1 β-lactamase PCA system effectively captured the interactions of three pairs of chaperones and substrates. Moreover, the strength of the interactions can be quantitatively analyzed by comparing different levels of penicillin resistance, and the assay can be performed under 0.5% butanol, a stress condition thought to be physiologically relevant.
Conclusions
The β-lactamase PCA system faithfully reports chaperone-substrate interactions in the bacterial cell envelope, and therefore this system has the potential to map the complex protein homeostasis network under a fluctuating environment.
16.
Anna Skorczyk-Werner, Anna Wawrocka, Natalia Kochalska, Maciej Robert Krawczynski. Orphanet journal of rare diseases, 2018, 13(1):221
Background
Choroideremia (CHM) is a rare X-linked recessive retinal dystrophy characterized by progressive chorioretinal degeneration in the males affected. The symptoms include night blindness in childhood, progressive peripheral vision loss and total blindness in the late stages. The disease is caused by mutations in the CHM gene encoding Rab Escort Protein 1 (REP-1). The aim of the study was to identify the molecular basis of choroideremia in five families of Polish origin.
Methods
Six male patients from five unrelated families of Polish ethnicity, who were clinically diagnosed with choroideremia, were examined in this study. An ophthalmologic examination performed in all the probands included: best-corrected visual acuity, slit-lamp examination, funduscopy, fluorescein angiography and perimetry. The entire coding region encompassing 15 exons and the flanking intronic sequences of the CHM gene were amplified with PCR and directly sequenced in all the patients.
Results
Five variants in the CHM gene were identified in the five families examined. Two of the variants were new: c.1175dupT and c.83C>G, while three had been previously reported.
Conclusions
This study provides the first molecular genetic characteristics of patients with choroideremia from the previously unexplored Polish population.
17.
Olesya I. Klimchuk, Kirill A. Konovalov, Vadim V. Perekhvatov, Konstantin V. Skulachev, Daria V. Dibrova, Armen Y. Mulkidjanian. Biology direct, 2017, 12(1):26
Background
In prokaryotic genomes, functionally coupled genes can be organized in conserved gene clusters enabling their coordinated regulation. Such clusters could contain one or several operons, which are groups of co-transcribed genes. Those genes that evolved from a common ancestral gene by speciation (i.e. orthologs) are expected to have similar genomic neighborhoods in different organisms, whereas those copies of the gene that are responsible for dissimilar functions (i.e. paralogs) could be found in dissimilar genomic contexts. Comparative analysis of genomic neighborhoods facilitates the prediction of co-regulated genes and helps to discern different functions in large protein families.
Aim
We intended, building on the attribution of gene sequences to the clusters of orthologous groups of proteins (COGs), to provide a method for visualization and comparative analysis of genomic neighborhoods of evolutionarily related genes, as well as a respective web server.
Results
Here we introduce the COmparative Gene Neighborhoods Analysis Tool (COGNAT), a web server for comparative analysis of genomic neighborhoods. The tool is based on the COG database, as well as the Pfam protein families database. As an example, we show the utility of COGNAT in identifying a new type of membrane protein complex that is formed by paralog(s) of one of the membrane subunits of the NADH:quinone oxidoreductase of type 1 (COG1009) and a cytoplasmic protein of unknown function (COG3002).
Reviewers
This article was reviewed by Drs. Igor Zhulin, Uri Gophna and Igor Rogozin.
18.
Andrei Prodan, Sultan Imangaliyev, Henk S. Brand, Martijn N. A. Rosema, Evgeni Levin, Wim Crielaard, Bart J. F. Keijser, Enno C. I. Veerman. Metabolomics : Official journal of the Metabolomic Society, 2016, 12(9):147
Introduction
Understanding the changes occurring in the oral ecosystem during development of gingivitis could help improve prevention and treatment strategies for oral health. Erythritol is a non-caloric polyol proposed to have beneficial effects on oral health.
Objectives
To examine the effect of experimental gingivitis and the effect of erythritol on the salivary metabolome and salivary functional biochemistry.
Methods
In a two-week experimental gingivitis challenge intervention study, non-targeted, mass spectrometry-based metabolomic profiling was performed on saliva samples from 61 healthy adults, collected at five time-points. The effect of erythritol was studied in a randomized, controlled trial setting. Fourteen salivary biochemistry variables were measured with antibody- or enzymatic activity-based assays.
Results
Bacterial amino acid catabolites (cadaverine, N-acetylcadaverine, and α-hydroxyisovalerate) and end-products of bacterial alkali-producing pathways (N-α-acetylornithine and γ-aminobutyrate) increased significantly during the experimental gingivitis. Significant changes were found in a set of 13 salivary metabolite ratios composed of host cell membrane lipids involved in cell signaling, host responses to bacteria, and defense against free radicals. An increase in mevalonate was also observed. There were no significant effects of erythritol. No significant changes were found in functional salivary biochemistry.
Conclusions
The findings underline a dynamic interaction between the host and the oral microbial biofilm during an experimental induction of gingivitis.
19.
MicroRNA-190b confers radio-sensitivity through negative regulation of Bcl-2 in gastric cancer cells
Objectives
To determine the role of miR-190b in radio-sensitivity of gastric cancer (GC).
Results
In radio-resistant GC cells, down-regulation of miR-190b and up-regulation of Bcl-2 were observed. The protein expression of Bcl-2 was negatively regulated by miR-190b. Overexpression of miR-190b significantly decreased cell viability and enhanced radio-sensitivity of GC cells. Of note, these effects of miR-190b on GC cells radio-sensitivity were abolished by Bcl-2.
Conclusion
miR-190b confers radio-sensitivity of GC cells, possibly via negative regulation of Bcl-2.
20.