Similar articles
 Found 20 similar articles; search took 15 ms
1.
Numerous high-throughput sequencing studies have focused on detecting conventionally spliced mRNAs in RNA-seq data. However, non-standard RNAs arising through gene fusion, circularization or trans-splicing are often neglected. We introduce a novel, unbiased algorithm to detect splice junctions from single-end cDNA sequences. In contrast to other methods, our approach accommodates multi-junction structures. Our method compares favorably with competing tools for conventionally spliced mRNAs and, with a gain of up to 40% in recall, systematically outperforms them on reads with multiple splits, trans-splicing and circular products. The algorithm is integrated into our mapping tool segemehl (http://www.bioinf.uni-leipzig.de/Software/segemehl/).
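
To illustrate how the read classes named in this abstract differ, the sketch below labels a read that has been split into two aligned segments using only the segment coordinates. It is not the segemehl algorithm: the `Segment` type, the `classify_junction` function and the 200 kb distance cutoff are hypothetical, and the segments are assumed to be given in transcript order.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One aligned piece of a split read (0-based, half-open reference interval)."""
    chrom: str
    strand: str  # '+' or '-'
    start: int
    end: int


def classify_junction(first: Segment, second: Segment, max_intron: int = 200_000) -> str:
    """Label the junction between two consecutive segments of one read.

    `first` precedes `second` in transcript order. The rule and the
    `max_intron` cutoff are illustrative assumptions, not a published method.
    """
    if first.chrom != second.chrom or first.strand != second.strand:
        return "trans-splice or fusion candidate"
    # Work in transcript orientation: on '-' the transcript runs towards
    # lower reference coordinates, so coordinates are negated.
    if first.strand == "+":
        donor, acceptor = first.end, second.start
    else:
        donor, acceptor = -first.start, -second.end
    gap = acceptor - donor
    if gap < 0:
        # The second segment maps upstream of the first one.
        return "backsplice (circular candidate)"
    if gap > max_intron:
        return "trans-splice or fusion candidate"
    return "linear splice"


if __name__ == "__main__":
    print(classify_junction(Segment("chr1", "+", 100, 150), Segment("chr1", "+", 300, 350)))
    print(classify_junction(Segment("chr1", "+", 300, 350), Segment("chr1", "+", 100, 150)))
```

A read with more than two segments would be handled by applying the same check to each consecutive pair of segments.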

2.
3.

Background

Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era, when large volumes of protein sequences are accumulating quickly without being functionally annotated.

Results

In this paper, a new predictor called TargetVita is designed and implemented for predicting protein-vitamin binding residues using protein sequences. In TargetVita, features derived from the position-specific scoring matrix (PSSM), predicted protein secondary structure, and vitamin binding propensity are combined to form the original feature space; then, several feature subspaces are selected by performing different feature selection methods. Finally, based on the selected feature subspaces, heterogeneous SVMs are trained and then ensembled for performing prediction.

Conclusions

The experimental results obtained with four separate vitamin-binding benchmark datasets demonstrate that the proposed TargetVita is superior to the state-of-the-art vitamin-specific predictor, and an average improvement of 10% in terms of the Matthews correlation coefficient (MCC) was achieved over independent validation tests. The TargetVita web server and the datasets used are freely available for academic use at http://csbio.njust.edu.cn/bioinf/TargetVita or http://www.csbio.sjtu.edu.cn/bioinf/TargetVita.
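
For readers unfamiliar with the metric, the Matthews correlation coefficient mentioned above can be computed from the four confusion-matrix counts. The sketch below uses the standard definition; the function name and the convention of returning 0.0 when the denominator vanishes are choices made here, and no counts from the TargetVita study are reproduced.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts.

    Ranges from -1 (total disagreement) to +1 (perfect prediction).
    Returns 0.0 when any marginal sum is zero (a common convention,
    assumed here).
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return numerator / denominator if denominator else 0.0


if __name__ == "__main__":
    # Hypothetical counts for an imbalanced binding-residue test set.
    print(matthews_corrcoef(tp=50, tn=900, fp=30, fn=20))
```

Because the MCC uses all four counts, it stays informative when binding residues are much rarer than non-binding residues, which is the usual situation in residue-level prediction.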

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-297) contains supplementary material, which is available to authorized users.

4.
Protein-nucleotide interactions are ubiquitous in a wide variety of biological processes. Accurately identifying interaction residues solely from protein sequences is useful for both protein function annotation and drug design, especially in the post-genomic era, as large volumes of protein data have not been functionally annotated. Protein-nucleotide binding residue prediction is a typical imbalanced learning problem, where binding residues are far fewer in number than non-binding residues. Alleviating the severity of class imbalance has been demonstrated to be a promising means of improving the prediction performance of a machine-learning-based predictor for class imbalance problems. However, little attention has been paid to the negative impact of class imbalance on protein-nucleotide binding residue prediction. In this study, we propose a new supervised over-sampling algorithm that synthesizes additional minority class samples to address class imbalance. The experimental results from protein-nucleotide interaction datasets demonstrate that the proposed supervised over-sampling algorithm can relieve the severity of class imbalance and help to improve prediction performance. Based on the proposed over-sampling algorithm, a predictor, called TargetSOS, is implemented for protein-nucleotide binding residue prediction. Cross-validation tests and independent validation tests demonstrate the effectiveness of TargetSOS. The web-server and datasets used in this study are freely available at http://www.csbio.sjtu.edu.cn/bioinf/TargetSOS/.
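
The general idea of synthesizing minority-class samples can be sketched with a plain interpolation scheme in the style of SMOTE. This is a substitute technique chosen for illustration, not the supervised over-sampling algorithm behind TargetSOS, whose details are not given in the abstract; the function name, the neighbour count `k` and the toy feature vectors are all assumptions.

```python
import random
from typing import List, Sequence

Vector = Sequence[float]


def oversample_minority(minority: List[Vector], n_new: int,
                        k: int = 3, seed: int = 0) -> List[List[float]]:
    """Create `n_new` synthetic minority samples by linear interpolation.

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its `k` nearest minority neighbours
    (squared Euclidean distance). Unsupervised, SMOTE-style sketch.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic: List[List[float]] = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # All other minority samples, nearest first (base excluded by identity).
        others = [v for v in minority if v is not base]
        others.sort(key=lambda v: sum((a - b) ** 2 for a, b in zip(base, v)))
        neighbour = rng.choice(others[:k])
        t = rng.random()  # interpolation weight in [0, 1)
        synthetic.append([a + t * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


if __name__ == "__main__":
    binding = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]  # toy feature vectors
    for point in oversample_minority(binding, n_new=4):
        print(point)
```

A supervised variant would additionally check each synthetic sample against a trained classifier or the majority class before accepting it; that filtering step is omitted here.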

5.
Central to Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-Cas systems are repeated RNA sequences that serve as Cas-protein–binding templates. Classification is based on the architectural composition of associated Cas proteins, but considering repeat evolution is essential to complete the picture. We compiled the largest data set of CRISPRs to date, performed comprehensive, independent clustering analyses and identified a novel set of 40 conserved sequence families and 33 potential structure motifs for Cas-endoribonucleases with some distinct conservation patterns. Evolutionary relationships are presented as a hierarchical map of sequence and structure similarities for both a quick and detailed insight into the diversity of CRISPR-Cas systems. In a comparison with Cas-subtypes, I-C, I-E, I-F and type II were strongly coupled and the remaining type I and type III subtypes were loosely coupled to repeat and Cas1 evolution, respectively. Subtypes with a strong link to CRISPR evolution were almost exclusive to bacteria; nevertheless, we identified rare examples of potential horizontal transfer of I-C and I-E systems into archaeal organisms. Our easy-to-use web server provides an automated assignment of newly sequenced CRISPRs to our classification system and enables more informed choices on future hypotheses in CRISPR-Cas research: http://rna.informatik.uni-freiburg.de/CRISPRmap.

6.
7.
Three-dimensional (3D) culture models are critical tools for understanding tissue morphogenesis. A key requirement for their analysis is the ability to reconstruct the tissue into computational models that allow quantitative evaluation of the formed structures. Here, we present Software for Automated Morphological Analysis (SAMA), a method by which epithelial structures grown in 3D cultures can be imaged, reconstructed and analyzed with minimum human intervention. SAMA allows quantitative analysis of key features of epithelial morphogenesis such as ductal elongation, branching and lumen formation that distinguish different hormonal treatments. SAMA is a user-friendly set of customized macros operated via FIJI (http://fiji.sc/Fiji), an open-source image analysis platform, in combination with a set of functions in R (http://www.r-project.org/), an open-source program for statistical analysis. SAMA enables a rapid, exhaustive and quantitative 3D analysis of the shape of a population of structures in a 3D image. SAMA is cross-platform, licensed under the GPLv3 and available at http://montevil.theobio.org/content/sama.

8.

Motivation

Genome-wide screens for structured ncRNA genes in mammals, urochordates, and nematodes have predicted thousands of putative ncRNA genes and other structured RNA motifs. A prerequisite for their functional annotation is to determine the reading direction with high precision.

Results

While folding energies of an RNA and its reverse complement are similar, the differences are sufficient at least in conjunction with substitution patterns to discriminate between structured RNAs and their complements. We present here a support vector machine that reliably classifies the reading direction of a structured RNA from a multiple sequence alignment and provides a considerable improvement in classification accuracy over previous approaches.
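
One well-known source of the energy asymmetry between strands is that G-U wobble pairs, which are allowed in RNA helices, turn into non-pairing C-A mismatches on the reverse complement. The sketch below makes this concrete on a toy hairpin; it is not part of RNAstrand, the helper names are hypothetical, and only uppercase sequences with simple dot-bracket structures are handled.

```python
_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence."""
    return rna.translate(_COMPLEMENT)[::-1]


def count_gu_pairs(seq: str, structure: str) -> int:
    """Count G-U wobble pairs implied by a dot-bracket structure."""
    stack, count = [], 0
    for i, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(i)
        elif symbol == ")":
            j = stack.pop()  # position of the matching opening bracket
            if {seq[j], seq[i]} == {"G", "U"}:
                count += 1
    return count


if __name__ == "__main__":
    seq = "GGGAAAUCU"          # toy hairpin, not a real ncRNA
    structure = "(((...)))"    # symmetric, so it also fits the mirrored strand
    print(count_gu_pairs(seq, structure))
    print(count_gu_pairs(reverse_complement(seq), structure))
```

In the toy hairpin the forward strand carries wobble pairs while the mirrored helix on the reverse complement has A-C mismatches at the same positions, so the two strands cannot fold into equally stable structures.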

Software

RNAstrand is freely available as a stand-alone tool from http://www.bioinf.uni-leipzig.de/Software/RNAstrand and is also included in the latest release of RNAz, a part of the Vienna RNA Package.

9.
PARma is a complete data analysis software package for AGO-PAR-CLIP experiments that identifies target sites of microRNAs as well as the microRNA binding to these sites. It integrates specific characteristics of the experiments into a generative model. The model and a novel pattern discovery tool are iteratively applied to data to estimate seed activity probabilities and cluster confidence scores and to assign the most probable microRNA. Based on differential PAR-CLIP analysis and comparison to RIP-Chip data, we show that PARma is more accurate than existing approaches. PARma is available from http://www.bio.ifi.lmu.de/PARma

10.
11.
12.
Saccharomyces cerevisiae Spt6 protein is a conserved chromatin factor with several distinct functional domains, including a natively unstructured 30-residue N-terminal region that binds competitively with Spn1 or nucleosomes. To uncover physiological roles of these interactions, we isolated histone mutations that suppress defects caused by weakening Spt6:Spn1 binding with the spt6-F249K mutation. The strongest suppressor was H2A-N39K, which perturbs the point of contact between the two H2A-H2B dimers in an assembled nucleosome. Substantial suppression also was observed when the H2A-H2B interface with H3-H4 was altered, and many members of this class of mutations also suppressed a defect in another essential histone chaperone, FACT. Spt6 is best known as an H3-H4 chaperone, but we found that it binds with similar affinity to H2A-H2B or H3-H4. Like FACT, Spt6 is therefore capable of binding each of the individual components of a nucleosome, but unlike FACT, Spt6 did not produce endonuclease-sensitive reorganized nucleosomes and did not displace H2A-H2B dimers from nucleosomes. Spt6 and FACT therefore have distinct activities, but defects can be suppressed by overlapping histone mutations. We also found that Spt6 and FACT together are nearly as abundant as nucleosomes, with ∼24,000 Spt6 molecules, ∼42,000 FACT molecules, and ∼75,000 nucleosomes per cell. Histone mutations that destabilize interfaces within nucleosomes therefore reveal multiple spatial regions that have both common and distinct roles in the functions of these two essential and abundant histone chaperones. We discuss these observations in terms of different potential roles for chaperones in both promoting the assembly of nucleosomes and monitoring their quality.

13.
Telomere length is tightly regulated in cells that express telomerase. The Saccharomyces cerevisiae Ku heterodimer, a DNA end-binding complex, positively regulates telomere length in a telomerase-dependent manner. Ku associates with the telomerase RNA subunit TLC1, and this association is required for TLC1 nuclear retention. Ku–TLC1 interaction also impacts the cell-cycle-regulated association of the telomerase catalytic subunit Est2 with telomeres. Promotion of TLC1 nuclear localization and promotion of Est2 recruitment have each been proposed to be the principal role of Ku in telomere length maintenance, but neither model has been directly tested. Here we study the impact of forced recruitment of Est2 to telomeres on telomere length in the absence of Ku’s ability to bind TLC1 or DNA ends. We show that tethering Est2 to telomeres does not promote efficient telomere elongation in the absence of Ku–TLC1 interaction or DNA end binding. Moreover, restoration of TLC1 nuclear localization, even when combined with Est2 recruitment, does not bypass the role of Ku. In contrast, forced recruitment of Est1, which has roles in telomerase recruitment and activation, to telomeres promotes efficient and progressive telomere elongation in the absence of Ku–TLC1 interaction, Ku DNA end binding, or Ku altogether. Ku associates with Est1 and Est2 in a TLC1-dependent manner and enhances Est1 recruitment to telomeres independently of Est2. Together, our results unexpectedly demonstrate that the principal role of Ku in telomere length maintenance is to promote the association of Est1 with telomeres, which may in turn allow for efficient recruitment and activation of the telomerase holoenzyme.

14.
The oocytes of most sexually reproducing animals arrest in meiotic prophase I. Oocyte growth, which occurs during this period of arrest, enables oocytes to acquire the cytoplasmic components needed to produce healthy progeny and to gain competence to complete meiosis. In the nematode Caenorhabditis elegans, the major sperm protein hormone promotes meiotic resumption (also called meiotic maturation) and the cytoplasmic flows that drive oocyte growth. Prior work established that two related TIS11 zinc-finger RNA-binding proteins, OMA-1 and OMA-2, are redundantly required for normal oocyte growth and meiotic maturation. We affinity purified OMA-1 and identified associated mRNAs and proteins using genome-wide expression data and mass spectrometry, respectively. As a class, mRNAs enriched in OMA-1 ribonucleoprotein particles (OMA RNPs) have reproductive functions. Several of these mRNAs were tested and found to be targets of OMA-1/2-mediated translational repression, dependent on sequences in their 3′-untranslated regions (3′-UTRs). Consistent with a major role for OMA-1 and OMA-2 in regulating translation, OMA-1-associated proteins include translational repressors and activators, and some of these proteins bind directly to OMA-1 in yeast two-hybrid assays, including OMA-2. We show that the highly conserved TRIM-NHL protein LIN-41 is an OMA-1-associated protein, which also represses the translation of several OMA-1/2 target mRNAs. In the accompanying article in this issue, we show that LIN-41 prevents meiotic maturation and promotes oocyte growth in opposition to OMA-1/2. Taken together, these data support a model in which the conserved regulators of mRNA translation LIN-41 and OMA-1/2 coordinately control oocyte growth and the proper spatial and temporal execution of the meiotic maturation decision.

15.
The nuclear envelope in Saccharomyces cerevisiae harbors two essential macromolecular protein assemblies: the nuclear pore complexes (NPCs) that enable nucleocytoplasmic transport, and the spindle pole bodies (SPBs) that mediate chromosome segregation. Previously, based on metazoan and budding yeast studies, we reported that reticulons and Yop1/DP1 play a role in the early steps of de novo NPC assembly. Here, we examined if Rtn1 and Yop1 are required for SPB function in S. cerevisiae. Electron microscopy of rtn1Δ yop1Δ cells revealed lobular abnormalities in SPB structure. Using an assay that monitors lateral expansion of the SPB central layer, we found that rtn1Δ yop1Δ SPBs had decreased connections to the NE compared to wild type, suggesting that SPBs are less stable in the NE. Furthermore, large budded rtn1Δ yop1Δ cells exhibited a high incidence of short mitotic spindles, which were frequently misoriented with respect to the mother–daughter axis. This correlated with cytoplasmic microtubule defects. We found that overexpression of the SPB insertion factors NDC1, MPS2, or BBP1 rescued the SPB defects observed in rtn1Δ yop1Δ cells. However, only overexpression of NDC1, which is also required for NPC biogenesis, rescued both the SPB and NPC associated defects. Rtn1 and Yop1 also physically interacted with Ndc1 and other NPC membrane proteins. We propose that NPC and SPB biogenesis are altered in cells lacking Rtn1 and Yop1 due to competition between these complexes for Ndc1, an essential common component of both NPCs and SPBs.

16.
17.
The Saccharomyces cerevisiae type 2C protein phosphatase Ptc1 is required for a wide variety of cellular functions, although only a few cellular targets have been identified. A genetic screen in search of mutations in protein kinase–encoding genes able to suppress multiple phenotypic traits caused by the ptc1 deletion yielded a single gene, MKK1, coding for a MAPK kinase (MAPKK) known to activate the cell-wall integrity (CWI) Slt2 MAPK. In contrast, mutation of the MKK1 paralog, MKK2, had a less significant effect. Deletion of MKK1 abolished the increased phosphorylation of Slt2 induced by the absence of Ptc1 both under basal and CWI pathway stimulatory conditions. We demonstrate that Ptc1 acts at the level of the MAPKKs of the CWI pathway, but only the Mkk1 kinase activity is essential for ptc1 mutants to display high Slt2 activation. We also show that Ptc1 is able to dephosphorylate Mkk1 in vitro. Our results reveal the preeminent role of Mkk1 in signaling through the CWI pathway and strongly suggest that hyperactivation of Slt2 caused by upregulation of Mkk1 is at the basis of most of the phenotypic defects associated with lack of Ptc1 function.

18.
Kinetochores are conserved protein complexes that bind the replicated chromosomes to the mitotic spindle and then direct their segregation. To better comprehend Saccharomyces cerevisiae kinetochore function, we dissected the phospho-regulated dynamic interaction between conserved kinetochore protein Cnn1^CENP-T, the centromere region, and the Ndc80 complex through the cell cycle. Cnn1 localizes to kinetochores at basal levels from G1 through metaphase but accumulates abruptly at anaphase onset. How Cnn1 is recruited and which activities regulate its dynamic localization are unclear. We show that Cnn1 harbors two kinetochore-localization activities: a C-terminal histone-fold domain (HFD) that associates with the centromere region and an N-terminal Spc24/Spc25 interaction sequence that mediates linkage to the microtubule-binding Ndc80 complex. We demonstrate that the established Ndc80 binding site in the N terminus of Cnn1, Cnn1^60–84, should be extended with flanking residues, Cnn1^25–91, to allow near maximal binding affinity to Ndc80. Cnn1 localization was proposed to depend on Mps1 kinase activity at Cnn1–S74, based on in vitro experiments demonstrating the Cnn1–Ndc80 complex interaction. We demonstrate that from G1 through metaphase, Cnn1 localizes via both its HFD and N-terminal Spc24/Spc25 interaction sequence, and deletion or mutation of either region results in anomalous Cnn1 kinetochore levels. At anaphase onset (when Mps1 activity decreases) Cnn1 becomes enriched mainly via the N-terminal Spc24/Spc25 interaction sequence. In sum, we provide the first in vivo evidence of Cnn1 preanaphase linkages with the kinetochore and enrichment of the linkages during anaphase.

19.
eIF5A is an essential and evolutionarily conserved translation elongation factor, which has recently been proposed to be required for the translation of proteins with consecutive prolines. The binding of eIF5A to ribosomes occurs upon its activation by hypusination, a modification that requires spermidine, an essential factor for mammalian fertility that also promotes yeast mating. We show that in response to pheromone, hypusinated eIF5A is required for shmoo formation, localization of polarisome components, induction of cell fusion proteins, and actin assembly in yeast. We also show that eIF5A is required for the translation of Bni1, a proline-rich formin involved in polarized growth during shmoo formation. Our data indicate that translation of the polyproline motifs in Bni1 is eIF5A dependent and this translation dependency is lost upon deletion of the polyprolines. Moreover, an exogenous increase in Bni1 protein levels partially restores the defect in shmoo formation seen in eIF5A mutants. Overall, our results identify eIF5A as a novel and essential regulator of yeast mating through formin translation. Since eIF5A and polyproline formins are conserved across species, our results also suggest that eIF5A-dependent translation of formins could regulate polarized growth in such processes as fertility and cancer in higher eukaryotes.

20.
Protein designers use a wide variety of software tools for de novo design, yet their repertoire still lacks a fast and interactive all-atom search engine. To solve this, we have built the Suns program: a real-time, atomic search engine integrated into the PyMOL molecular visualization system. Users build atomic-level structural search queries within PyMOL and receive a stream of search results aligned to their query within a few seconds. This instant feedback cycle enables a new “designability”-inspired approach to protein design where the designer searches for and interactively incorporates native-like fragments from proven protein structures. We demonstrate the use of Suns to interactively build protein motifs and tertiary interactions and to identify scaffolds compatible with hot-spot residues. The official web site and installer are located at http://www.degradolab.org/suns/ and the source code is hosted at https://github.com/godotgildor/Suns (PyMOL plugin, BSD license), https://github.com/Gabriel439/suns-cmd (command line client, BSD license), and https://github.com/Gabriel439/suns-search (search engine server, GPLv2 license).
This is a PLOS Computational Biology Software Article
