首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background  

Pathogenicity islands (PAIs), distinct genomic segments of pathogens encoding virulence factors, represent a subgroup of genomic islands (GIs) that have been acquired by horizontal gene transfer event. Up to now, computational approaches for identifying PAIs have been focused on the detection of genomic regions which only differ from the rest of the genome in their base composition and codon usage. These approaches often lead to the identification of genomic islands, rather than PAIs.  相似文献   

2.
3.
LL Zheng  YX Li  J Ding  XK Guo  KY Feng  YJ Wang  LL Hu  YD Cai  P Hao  KC Chou 《PloS one》2012,7(8):e42517
Bacterial pathogens continue to threaten public health worldwide today. Identification of bacterial virulence factors can help to find novel drug/vaccine targets against pathogenicity. It can also help to reveal the mechanisms of the related diseases at the molecular level. With the explosive growth in protein sequences generated in the postgenomic age, it is highly desired to develop computational methods for rapidly and effectively identifying virulence factors according to their sequence information alone. In this study, based on the protein-protein interaction networks from the STRING database, a novel network-based method was proposed for identifying the virulence factors in the proteomes of UPEC 536, UPEC CFT073, P. aeruginosa PAO1, L. pneumophila Philadelphia 1, C. jejuni NCTC 11168 and M. tuberculosis H37Rv. Evaluated on the same benchmark datasets derived from the aforementioned species, the identification accuracies achieved by the network-based method were around 0.9, significantly higher than those by the sequence-based methods such as BLAST, feature selection and VirulentPred. Further analysis showed that the functional associations such as the gene neighborhood and co-occurrence were the primary associations between these virulence factors in the STRING database. The high success rates indicate that the network-based method is quite promising. The novel approach holds high potential for identifying virulence factors in many other various organisms as well because it can be easily extended to identify the virulence factors in many other bacterial species, as long as the relevant significant statistical data are available for them.  相似文献   

4.
MOTIVATION: We consider the problem of identifying low-complexity regions (LCRs) in a protein sequence. LCRs are regions of biased composition, normally consisting of different kinds of repeats. RESULTS: We define new complexity measures to compute the complexity of a sequence based on a given scoring matrix, such as BLOSUM 62. Our complexity measures also consider the order of amino acids in the sequence and the sequence length. We develop a novel graph-based algorithm called GBA to identify LCRs in a protein sequence. In the graph constructed for the sequence, each vertex corresponds to a pair of similar amino acids. Each edge connects two pairs of amino acids that can be grouped together to form a longer repeat. GBA finds short subsequences as LCR candidates by traversing this graph. It then extends them to find longer subsequences that may contain full repeats with low complexities. Extended subsequences are then post-processed to refine repeats to LCRs. Our experiments on real data show that GBA has significantly higher recall compared to existing algorithms, including 0j.py, CARD, and SEG. AVAILABILITY: The program is available on request.  相似文献   

5.
Zang X  Komatsu S 《Phytochemistry》2007,68(4):426-437
Osmotic stress can endanger the survival of plants. To investigate the mechanisms of how plants respond to osmotic stress, rice protein profiles from mannitol-treated plants, were monitored using a proteomics approach. Two-week-old rice seedlings were treated with 400mM mannitol for 48h. After separation of proteins from the basal part of leaf sheaths by two-dimensional polyacrylamide gel electrophoresis, 327 proteins were detected. The levels of 12 proteins increased and the levels of three proteins decreased with increasing concentration or duration, of mannitol treatment. Levels of a heat shock protein and a dnaK-type molecular chaperone were reduced under osmotic, cold, salt and drought stresses, and ABA treatment, whereas a 26S proteasome regulatory subunit was found to be responsive only to osmotic stress. Furthermore, proteins whose accumulation was sensitive to osmotic stress are present in an osmotic-tolerant cultivar. These results indicate that specific proteins expressed in the basal part of rice leaf sheaths show a coordinated response to cope with osmotic stress.  相似文献   

6.
7.
ABSTRACT: BACKGROUND: Many studies have demonstrated genetic and environmental factors that lead to renal cell carcinoma (RCC) and that occur during a protracted period of tumourigenesis. It appears suitable to identify and characterise potential molecular markers that appear during tumourigenesis and that might provide rapid and effective possibilities for the early detection of RCC. EGFR activation induces cell cycle progression, inhibition of apoptosis and angiogenesis, promotion of invasion/metastasis, and other tumour promoting activities. Over-expression of EGFR is thought to play an important role in tumour initiation and progression of RCC because up-regulation of EGFR has been associated with high grade cancers and a worse prognosis. METHODS: Characterisation of the protein profile interacting with EGFR was performed using the following: an immunohistochemical (IHC) study of EGFR, a comprehensive computational study of EGFR protein-protein interactions, an analysis correlating the expression levels of EGFR with other significant markers in the tumourigenicity of RCC, and finally, an analysis of the utility of EGFR for prognosis in a cohort of patients with renal cell carcinoma. RESULTS: The cases that showed a higher level of this protein fell within the clear cell histological subtype (p = 0.001). The EGFR significance statistic was found with respect to a worse prognosis. In vivo significant correlations were found with PDGFR-beta, Flk-1, Hif1-alpha, proteins related to differentiation (such as DLL3 and DLL4 ligands), and certain metabolic proteins such as Glut5. In silico significant associations gave us a panel of 32 EGFR-interacting proteins (EIP) using the APID and STRING databases. CONCLUSIONS: This work summarises the multifaceted role of EGFR in the pathology of RCC, and it identifies EIPs that could help to provide mechanistic explanations for the different behaviours observed in tumours.  相似文献   

8.
9.
Protein tyrosine kinases (PTKs) play a central role in the modulation of a wide variety of cellular events such as differentiation, proliferation and metabolism, and their unregulated activation can lead to various diseases including cancer and diabetes. PTKs represent a diverse family of proteins including both receptor tyrosine kinases (RTKs) and non-receptor tyrosine kinases (NRTKs). Due to the diversity and important cellular roles of PTKs, accurate classification methods are required to better understand and differentiate different PTKs. In addition, PTKs have become important targets for drugs, providing a further need to develop novel methods to accurately classify this set of important biological molecules. Here, we introduce a novel statistical model for the classification of PTKs that is based on their structural features. The approach allows for both the recognition of PTKs and the classification of RTKs into their subfamilies. This novel approach had an overall accuracy of 98.5% for the identification of PTKs, and 99.3% for the classification of RTKs.  相似文献   

10.
The diagnosis of cancer by examination of the urine has the potential to improve patient outcomes by means of earlier detection. Due to the fact that the urine contains metabolic signatures of many biochemical pathways, this biofluid is ideally suited for metabolomic analysis, especially involving diseases of the kidney and urinary system. In this pilot study, we test three independent analytical techniques for suitability for detection of renal cell carcinoma (RCC) in urine of affected patients. Hydrophilic interaction chromatography (HILIC-LC-MS), reversed-phase ultra performance liquid chromatography (RP-UPLC-MS), and gas chromatography time-of-flight mass spectrometry (GC-TOF-MS) all were used as complementary separation techniques. The combination of these techniques is best suited to cover a very large part of the urine metabolome by enabling the detection of both lipophilic and hydrophilic metabolites present therein. In this study, it is demonstrated that sample pretreatment with urease dramatically alters the metabolome composition apart from removal of urea. Two new freely available peak alignment methods, MZmine and XCMS, are used for peak detection and retention time alignment. The results are analyzed by a feature selection algorithm with subsequent univariate analysis of variance (ANOVA) and a multivariate partial least squares (PLS) approach. From more than 2000 mass spectral features detected in the urine, we identify several significant components that lead to discrimination between RCC patients and controls despite the relatively small sample size. A feature selection process condensed the significant features to less than 30 components in each of the data sets. In future work, these potential biomarkers will be further validated with a larger patient cohort. Such investigation will likely lead to clinically applicable assays for earlier diagnosis of RCC, as well as other malignancies, and thereby improved patient prognosis.  相似文献   

11.
We employ a structurally-motivated phenomenological formulation to identify biomechanical experiments which can be used to determine a vascular constitutive relation directly from data. Large deformations, nonlinear material behavior, load-dependent anisotropy, material heterogeneity and incompressibility are accounted for in the analysis. For purposes of illustration, we outline a procedure for studying elastic arteries wherein the behavior of the media and adventitia is considered separately. This general approach for identifying vascular constitutive relations can be applied to any vessel or airway, however, and should provide certain advantages over previous microstructural or purely phenomenological formulations.  相似文献   

12.
13.
Heme is a key cofactor in aerobic life, both in eukaryotes and prokaryotes. Because of the high reactivity of ferrous protoporphyrin IX, the reactions of heme in cells are often carried out through heme-protein complexes. Traditionally studies of heme-binding proteins have been approached on a case by case basis, thus there is a limited global view of the distribution of heme-binding proteins in different cells or tissues. The procedure described here is aimed at profiling heme-binding proteins in mouse tissues sequentially by 1) purification of heme-binding proteins by heme-agarose, an affinity chromatographic resin; 2) isolation of heme-binding proteins by SDS-PAGE or two-dimensional electrophoresis; 3) identification of heme-binding proteins by mass spectrometry. In five mouse tissues, over 600 protein spots were visualized on 2DE gel stained by Commassie blue and 154 proteins were identified by MALDI-TOF, in which most proteins belong to heme related. This methodology makes it possible to globally c  相似文献   

14.
Chatterji S  Pachter L 《Genomics》2007,90(1):44-48
The exon-intron structure of eukaryotic genes allows for phenomena such as alternative splicing, nonsense-mediated decay, and regulation through untranslated regions. However, the evolution of the exon structure of genes is not well elucidated because of limited and phylogenetically sparse data sets. In this study, we use the phylogenetically diverse sequencing of the ENCODE regions to study gene structure evolution in mammalian genomes. This first phylogenetically diverse study of gene structure changes offers insights into the mode and tempo of mammalian gene structure evolution. The genes undergoing structure changes appear to be moderately to highly expressed in germline cells and show levels of selection similar to those of other ENCODE genes. Patterns of gene duplication of the affected genes are more complex than expected. The number of sampled genomes is sufficiently dense to infer that certain gene duplications happened after intron loss. Thus, although gene duplication is highly correlated with intron loss, we conclude that structural changes in genes are not necessarily due to a loss of constraint following gene duplication as previously suggested.  相似文献   

15.

Background  

We describe the distribution of indels in the 44 Encyclopedia of DNA Elements (ENCODE) regions (about 1% of the human genome) and evaluate the potential contributions of small insertion and deletion polymorphisms (indels) to human genetic variation. We relate indels to known genomic annotation features and measures of evolutionary constraint.  相似文献   

16.
17.
We propose a computational model of mating strategies for controlled animal breeding programs. A mating strategy in a controlled breeding program is a heuristic with some optimization criteria as a goal. Thus, it is appropriate to use the computational tools available for analysis of optimization heuristics. In this paper, we propose the first discrete model of the controlled animal breeding problem and analyse heuristics for two possible objectives: (1) breeding for maximum diversity and (2) breeding a target individual. These two goals are representative of conservation biology and agricultural livestock management, respectively. We evaluate several mating strategies and provide upper and lower bounds for the expected number of matings. While the population parameters may vary and can change the actual number of matings for a particular strategy, the order of magnitude of the number of expected matings and the relative competitiveness of the mating heuristics remains the same. Thus, our simple discrete model of the animal breeding problem provides a novel viable and robust approach to designing and comparing breeding strategies in captive populations.  相似文献   

18.
A new computational approach for real protein folding prediction   总被引:4,自引:0,他引:4  
An effective and fast minimization approach is proposed for the prediction of protein folding, in which the 'relative entropy' is used as a minimization function and the off-lattice model is used. In this approach, we only use the information of distances between the consecutive Calpha atoms along the peptide chain and a generalized form of the contact potential for 20 types of amino acids. Tests of the algorithm are performed on the real proteins. The root mean square deviations of the structures of eight folded target proteins versus the native structures are in a reasonable range. In principle, this method is an improvement on the energy minimization approach.  相似文献   

19.
A computational approach to motion perception   总被引:10,自引:0,他引:10  
In this paper it is shown that the computation of the optical flow from a sequence of timevarying images is not, in general, an underconstrained problem. A local algorithm for the computation of the optical flow which uses second order derivatives of the image brightness pattern, and that avoids the aperture problem, is presented. The obtained optical flow is very similar to the true motion field — which is the vector field associated with moving features on the image plane — and can be used to recover 3D motion information. Experimental results on sequences of real images, together with estimates of relevant motion parameters, like time-to-crash for translation and angular velocity for rotation, are presented and discussed. Due to the remarkable accuracy which can be achieved in estimating motion parameters, the proposed method is likely to be very useful in a number of computer vision applications.  相似文献   

20.
Dirithromycin is a macrolide antibiotic derived from erythromycin A. Dirithromycin is synthesized by the condensation of 9(S)-erythromycylamine with 2-(2-methoxyethoxy)-acetaldehyde. To gain insight into the synthesis, the condensation mechanism has been analyzed computationally by the AM1 method in the gas phase. First, the formation of the Schiff bases of dirithromycin and epidirithromycin from 9(S)-erythromycylamine and 2-(2-methoxyethoxy)-acetaldehyde were modeled. Then, the tautomerization of the Schiff bases to dirithromycin and epidirithromycin were considered. Finally, the epimerization of the Schiff base of epidirithromycin to the Schiff base of dirithromycin was investigated. Our results show that, even though carbinolamine forms faster for epidirithromycin than the corresponding structure for dirithromycin, dirithromycin is the major product of the synthesis. Figure Synthesis of dirithromycin  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号