共查询到15条相似文献,搜索用时 0 毫秒
1.
Attention deficit hyperactivity disorder (ADHD) is a common, highly heritable psychiatric disorder characterized by hyperactivity, inattention and increased impulsivity. In recent years, a large number of genetic studies for ADHD have been published and related genetic data has been accumulated dramatically. To provide researchers a comprehensive ADHD genetic resource, we previously developed the first genetic database for ADHD (ADHDgene). The abundant genetic data provides novel candidates for further study. Meanwhile, it also brings new challenge for selecting promising candidate genes for replication and verification research. In this study, we surveyed the computational tools for candidate gene prioritization and selected five tools, which integrate multiple data sources for gene prioritization, to prioritize ADHD candidate genes in ADHDgene. The prioritization analysis resulted in 16 prioritized candidate genes, which are mainly involved in several major neurotransmitter systems or in nervous system development pathways. Among these genes, nervous system development related genes, especially SNAP25 , STX1A and the gene-gene interactions related with each of them deserve further investigations. Our results may provide new insight for further verification study and facilitate the exploration of pathogenesis mechanism of ADHD. 相似文献
2.
Troyanskaya OG 《Briefings in bioinformatics》2005,6(1):34-43
In recent years, multiple types of high-throughput functional genomic data that facilitate rapid functional annotation of sequenced genomes have become available. Gene expression microarrays are the most commonly available source of such data. However, genomic data often sacrifice specificity for scale, yielding very large quantities of relatively lower-quality data than traditional experimental methods. Thus sophisticated analysis methods are necessary to make accurate functional interpretation of these large-scale data sets. This review presents an overview of recently developed methods that integrate the analysis of microarray data with sequence, interaction, localisation and literature data, and further outlines current challenges in the field. The focus of this review is on the use of such methods for gene function prediction, understanding of protein regulation and modelling of biological networks. 相似文献
3.
Graphical models play an important role in neuroscience studies, particularly in brain connectivity analysis. Typically, observations/samples are from several heterogenous groups and the group membership of each observation/sample is unavailable, which poses a great challenge for graph structure learning. In this paper, we propose a method which can achieve Simultaneous Clustering and Estimation of Heterogeneous Graphs (briefly denoted as SCEHG) for matrix-variate functional magnetic resonance imaging (fMRI) data. Unlike the conventional clustering methods which rely on the mean differences of various groups, the proposed SCEHG method fully exploits the group differences of conditional dependence relationships among brain regions for learning cluster structure. In essence, by constructing individual-level between-region network measures, we formulate clustering as penalized regression with grouping and sparsity pursuit, which transforms the unsupervised learning into supervised learning. A modified difference of convex programming with the alternating direction method of multipliers (DC-ADMM) algorithm is proposed to solve the corresponding optimization problem. We also propose a generalized criterion to specify the number of clusters. Extensive simulation studies illustrate the superiority of the SCEHG method over some state-of-the-art methods in terms of both clustering and graph recovery accuracy. We also apply the SCEHG procedure to analyze fMRI data associated with attention-deficit hyperactivity disorder (ADHD), which illustrates its empirical usefulness. 相似文献
4.
Model selection is an essential issue in longitudinal data analysis since many different models have been proposed to fit the covariance structure. The likelihood criterion is commonly used and allows to compare the fit of alternative models. Its value does not reflect, however, the potential improvement that can still be reached in fitting the data unless a reference model with the actual covariance structure is available. The score test approach does not require the knowledge of a reference model, and the score statistic has a meaningful interpretation in itself as a goodness-of-fit measure. The aim of this paper was to show how the score statistic may be separated into the genetic and environmental parts, which is difficult with the likelihood criterion, and how it can be used to check parametric assumptions made on variance and correlation parameters. Selection of models for genetic analysis was applied to a dairy cattle example for milk production. 相似文献
5.
Romano P 《Briefings in bioinformatics》2008,9(1):57-68
Data integration is needed in order to cope with the huge amounts of biological information now available and to perform data mining effectively. Current data integration systems have strict limitations, mainly due to the number of resources, their size and frequency of updates, their heterogeneity and distribution on the Internet. Integration must therefore be achieved by accessing network services through flexible and extensible data integration and analysis network tools. EXtensible Markup Language (XML), Web Services and Workflow Management Systems (WMS) can support the creation and deployment of such systems. Many XML languages and Web Services for bioinformatics have already been designed and implemented and some WMS have been proposed. In this article, we review a methodology for data integration in biomedical research that is based on these technologies. We also briefly describe some of the available WMS and discuss the current limitations of this methodology and the ways in which they can be overcome. 相似文献
6.
The program XEASY for computer-supported NMR spectral analysis of biological macromolecules 总被引:16,自引:0,他引:16
Summary A new program package, XEASY, was written for interactive computer support of the analysis of NMR spectra for three-dimensional structure determination of biological macromolecules. XEASY was developed for work with 2D, 3D and 4D NMR data sets. It includes all the functions performed by the precursor program EASY, which was designed for the analysis of 2D NMR spectra, i.e., peak picking and support of sequence-specific resonance assignments, cross-peak assignments, cross-peak integration and rate constant determination for dynamic processes. Since the program utilizes the X-window system and the Motif widget set, it is portable on a wide range of UNIX workstations. The design objective was to provide maximal computer support for the analysis of spectra, while providing the user with complete control over the final resonance assignments. Technically important features of XEASY are the use and flexible visual display of strips, i.e., two-dimensional spectral regions that contain the relevant parts of 3D or 4D NMR spectra, automated sorting routines to narrow down the selection of strips that need to be interactively considered in a particular assignment step, a protocol of resonance assignments that can be used for reliable bookkeeping, independent of the assignment strategy used, and capabilities for proper treatment of spectral folding and efficient transfer of resonance assignments between spectra of different types and different dimensionality, including projected, reduced-dimensionality triple-resonance experiments.Abbreviations 1D, 2D, 3D, 4D
one-, two-, three-, four-dimensional
- NOE
nuclear Overhauser enhancement
- NOESY
nuclear Overhauser enhancement spectroscopy
- TOCSY
total correlation spectroscopy
- COSY
correlation spectroscopy
- TPPI
time-proportional phase incrementation 相似文献
7.
In plants, more favourable environmental conditions can lead to dramatic increases in both mean fitness and variance in fitness. This results in data that violate the equality-of-variance assumption of anova, a problem that most empiricists would address by log-transforming fitness values. Using heuristic data sets and simple simulations, we show that anova on log-transformed fitness consistently fails to match the outcome of selection in a heterogeneous environment or its sensitivity to environmental frequency. Only anova based on relative fitness within environments accurately predicts the sensitivity of genotype selection to the frequency of alternative environments. Parallel analyses of variance based on absolute fitness and relative fitness can bracket the expected success of alternative genotypes under hard and soft selection, respectively. For example, for Sinapis arvensis growing in full sun and partial shade treatments, families achieving high fitness in the best environment are favoured under hard selection, whereas soft selection favours different families that achieve consistently good performance across environments. Based on these findings, we recommend that log-transformation of fitness should no longer be standard practice in ecological genetics studies. Weighted anova is a preferable method for dealing with unequal variances, and investigators should also make greater use of techniques such as quantile regression or resampling to describe and evaluate fitness variation across heterogeneous environments. 相似文献
8.
A systematic analysis of the hydrogen-bonding geometry in helices and beta sheets has been performed. The distances and angles between the backbone carbonyl O and amide N atoms were correlated considering more than 1500 protein chains in crystal structures determined to a resolution better than 1.5 A. They reveal statistically significant trends in the H-bond geometry across the different secondary structural elements. The analysis has been performed using Secbase, a modular extension of Relibase (Receptor Ligand Database) which integrates information about secondary structural elements assigned to individual protein structures with the various search facilities implemented into Relibase. A comparison of the mean hydrogen-bond distances in alpha helices and 3(10) helices of increasing length shows opposing trends. Whereas in alpha helices the mean H-bond distance shrinks with increasing helix length and turn number, the corresponding mean dimension in 3(10) helices expands in a comparable series. Comparing similarly the hydrogen-bond lengths in beta sheets there is no difference to be found between the mean H-bond length in antiparallel and parallel beta sheets along the strand direction. In contrast, an interesting systematic trend appears to be given for the hydrogen bonds perpendicular to the strands bridging across an extended sheet. With increasing number of accumulated strands, which results in a growing number of back-to-back piling hydrogen bonds across the strands, a slight decrease of the mean H-bond distance is apparent in parallel beta sheets whereas such trends are obviously not given in antiparallel beta sheets. This observation suggests that cooperative effects mutually polarizing spatially well-aligned hydrogen bonds are present either in alpha helices and parallel beta sheets whereas such influences seem to be lacking in 3(10) helices and antiparallel beta sheets. 相似文献
9.
This paper reports the CASP13 results of distance-based contact prediction, threading, and folding methods implemented in three RaptorX servers, which are built upon the powerful deep convolutional residual neural network (ResNet) method initiated by us for contact prediction in CASP12. On the 32 CASP13 FM (free-modeling) targets with a median multiple sequence alignment (MSA) depth of 36, RaptorX yielded the best contact prediction among 46 groups and almost the best 3D structure modeling among all server groups without time-consuming conformation sampling. In particular, RaptorX achieved top L/5, L/2, and L long-range contact precision of 70%, 58%, and 45%, respectively, and predicted correct folds (TMscore > 0.5) for 18 of 32 targets. Further, RaptorX predicted correct folds for all FM targets with >300 residues (T0950-D1, T0969-D1, and T1000-D2) and generated the best 3D models for T0950-D1 and T0969-D1 among all groups. This CASP13 test confirms our previous findings: (a) predicted distance is more useful than contacts for both template-based and free modeling; and (b) structure modeling may be improved by integrating template and coevolutionary information via deep learning. This paper will discuss progress we have made since CASP12, the strength and weakness of our methods, and why deep learning performed much better in CASP13. 相似文献
10.
Gerhard Klebe 《Proteins》2012,80(2):626-648
Small molecules are recognized in protein‐binding pockets through surface‐exposed physicochemical properties. To optimize binding, they have to adopt a conformation corresponding to a local energy minimum within the formed protein–ligand complex. However, their conformational flexibility makes them competent to bind not only to homologous proteins of the same family but also to proteins of remote similarity with respect to the shape of the binding pockets and folding pattern. Considering drug action, such observations can give rise tounexpected and undesired cross reactivity. In this study, datasets of six different cofactors (ADP, ATP, NAD(P)(H), FAD, and acetyl CoA, sharing an adenosine diphosphate moiety as common substructure), observed in multiple crystal structures of protein–cofactor complexes exhibiting sequence identity below 25%, have been analyzed for the conformational properties of the bound ligands, the distribution of physicochemical properties in the accommodating protein‐binding pockets, and the local folding patterns next to the cofactor‐binding site. State‐of‐the‐art clustering techniques have been applied to group the different protein–cofactor complexes in the different spaces. Interestingly, clustering in cavity (Cavbase) and fold space (DALI) reveals virtually the same data structuring. Remarkable relationships can be found among the different spaces. They provide information on how conformations are conserved across the host proteins and which distinct local cavity and fold motifs recognize the different portions of the cofactors. In those cases, where different cofactors are found to be accommodated in a similar fashion to the same fold motifs, only a commonly shared substructure of the cofactors is used for the recognition process. Proteins 2012. © 2011 Wiley Periodicals, Inc. 相似文献
11.
12.
Yu. F. Krupyansky M. G. Mikhailyuk S. V. Esin G. V. Eshchenko A. P. Moroz E. A. Okisheva N. Kh. Seifullina P. P. Knox A. B. Rubin 《Biophysics》2006,51(1):8-16
Radial distribution functions were deduced by Fourier transform analysis of the angular dependences of diffuse X-ray scattering intensities for the following proteins with different hydration degrees: water-soluble α-protein myoglobin, water-soluble (α + β) protein lysozyme, and transmembrane proteins from the photosynthetic reaction centers of purple bacteria Rhodobacter sphaeroides and Blastochlorii (Rhodopseudomonas) viridis. The results of Fourier transform analysis of X-ray scattering intensities give quantitative characteristics of the mechanism underlying the influence of water on the formation of biological macromolecules. On the one hand, water loosens the network of hydrogen bonds, which results in a considerable conformational mobility in the molecules of lysozyme and myoglobin and the reaction centers. On the other hand, water stabilizes and orders the protein globule. A strict correlation was found between the shift of the “first” maximum of the radial distribution function, loosening of the intraglobular hydrogen bonds, increase in the intramolecular mobility, and appearance of pronounced functional activity in macromolecules. The pattern of behavior of the first maximum in the transmembrane proteins of the reaction center was similar to that observed for the water-soluble proteins. However, the first maximum reached the limiting value of 2.9 Å at a considerably lower hydration degree compared with the water-soluble proteins. A quick transition of the protein complex of the reaction center to its native state is due to the fact that the dehydrated conformation of this complex is very close to the native conformation. Comparison of the radial distribution function for water, water-soluble proteins, and transmembrane proteins suggests a quantitative conclusion that water is the least densely packed and ordered system, the water-soluble proteins are more densely packed than water, and the transmembrane proteins are the most densely packed and ordered system. 相似文献
13.
14.
《Saudi Journal of Biological Sciences》2021,28(9):4938-4945
BackgroundAbout half-century ago, Immunoglobulin A nephropathy (IgAN) was discovered as a complicated disease with frequent clinical symptoms. Until now, exact mechanism underlying the pathogenesis of IgAN is poorly known. Therefore, current study was aimed to understand the molecular mechanism of IgAN by identifying the key miRNAs and their targeted hub genes. The key miRNAs might contribute to the diagnosis and therapy of IgAN, and could turn out to be a new star in the field of IgAN.MethodsThe microarray datasets were downloaded from Gene Expresssion Omnibus (GEO) database and analyzed using R package (LIMMA) in order to obtain differential expressed genes (DEGs). Then, the hub genes were identified using cytoHubba plugin of cytoscpae tool and other bioinformatics approaches including protein-protein interaction (PPI) network analysis, module analysis, and miRNA-hub gene network construction was also performed.ResultsA total of 348 DEGs were identified, of which 107 were upregulated genes and 241 were downregulated genes. Subsequently, the 12 overlapped genes were predicted from cytoHubba, and considered as hub genes. Moreover, a network among miRNA-hub genes was created to explore the correlation between the hub genes and their targeted miRNAs. Network construction ultimately lead to the identification of nine gene named FN1, EGR1, FOS, JUN, SERPINE1, MMP2, ATF3, MYC, and IL1B and one novel key miRNA namely, has-miR-144-3p as biomarker for diagnosis and therapy of IgAN.ConclusionThis study updates the information and yield a new perspective in context of understanding the pathogenesis and development of IgAN. In future, key miRNAs might be capable of improving the personalized detection and therapies for IgAN. In vivo and in vitro investigation of miRNAs and pathway interaction is essential to delineate the specific roles of the novel miRNAs, which may help to further reveal the mechanisms underlying IgAN. 相似文献
15.
《Saudi Journal of Biological Sciences》2022,29(7):103318
Breast cancer accounts for nearly half of all cancer-related deaths in women worldwide. However, the molecular mechanisms that lead to tumour development and progression remain poorly understood and there is a need to identify candidate genes associated with primary and metastatic breast cancer progression and prognosis. In this study, candidate genes associated with prognosis of primary and metastatic breast cancer were explored through a novel bioinformatics approach. Primary and metastatic breast cancer tissues and adjacent normal breast tissues were evaluated to identify biomarkers characteristic of primary and metastatic breast cancer. The Cancer Genome Atlas-breast invasive carcinoma (TCGA-BRCA) dataset (ID: HS-01619) was downloaded using the mRNASeq platform. Genevestigator 8.3.2 was used to analyse TCGA-BRCA gene expression profiles between the sample groups and identify the differentially-expressed genes (DEGs) in each group. For each group, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses were used to determine the function of DEGs. Networks of protein–protein interactions were constructed to identify the top hub genes with the highest degree of interaction. Additionally, the top hub genes were validated based on overall survival and immunohistochemistry using The Human Protein Atlas. Of the top 20 hub genes identified, four (KRT14, KIT, RAD51, and TTK) were considered as prognostic risk factors based on overall survival. KRT14 and KIT expression levels were upregulated while those of RAD51 and TTK were downregulated in patients with breast cancer. The four proposed candidate hub genes might aid in further understanding the molecular changes that distinguish primary breast tumours from metastatic tumours as well as help in developing novel therapeutics. Furthermore, they may serve as effective prognostic risk markers based on the strong correlation between their expression and patient overall survival. 相似文献