首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background  

Interpretation of lists of genes or proteins with altered expression is a critical and time-consuming part of microarray and proteomics research, but relatively little attention has been paid to methods for extracting biological meaning from these output lists. One powerful approach is to examine the expression of predefined biological pathways and gene sets, such as metabolic and signaling pathways and macromolecular complexes. Although many methods for measuring pathway expression have been proposed, a systematic analysis of the performance of multiple methods over multiple independent data sets has not previously been reported.  相似文献   

2.
Huang H  Eversley CD  Threadgill DW  Zou F 《Genetics》2007,176(4):2529-2540
A Bayesian methodology has been developed for multiple quantitative trait loci (QTL) mapping of complex binary traits that follow liability threshold models. Unlike most QTL mapping methods where only one or a few markers are used at a time, the proposed method utilizes all markers across the genome simultaneously. The outperformance of our Bayesian method over the traditional single-marker analysis and interval mapping has been illustrated via simulations and real data analysis to identify candidate loci associated with colorectal cancer.  相似文献   

3.
Protein 4.1 is a globular 80-kDa component of the erythrocyte membrane skeleton that enhances spectrin–actin interaction via its internal 10-kDa domain. Previous studies have shown that protein 4.1 mRNA is expressed as multiple alternatively spliced isoforms, resulting from the inclusion or exclusion of small cassette sequences called motifs. By tissue screening for protein 4.1 isoforms, we have observed new features of an already complex pattern of alternative splicing within the spectrin/actin binding domain. In particular, we found a new 51-nt exon that is present almost exclusively in muscle tissue. In addition, we have isolated multiple genomic clones spanning over 200 kb, containing the entire erythroid and nonerythroid coding sequence of the human locus. The exon/intron structure has now been characterized; with the exception of a 17-nt motif, all of the alternatively spliced motifs correspond to individual exons. The 3′-untranslated region (UTR) has also been completely sequenced using various PCR and genomic-sequencing methods. The 3′ UTR, over 3 kb, accounts for one-half of the mature mRNA.  相似文献   

4.
QTL mapping and the genetic basis of adaptation: recent developments   总被引:6,自引:0,他引:6  
Zeng ZB 《Genetica》2005,123(1-2):25-37
Quantitative trait loci (QTL) mapping has been used in a number of evolutionary studies to study the genetic basis of adaptation by mapping individual QTL that explain the differences between differentiated populations and also estimating their effects and interaction in the mapping population. This analysis can provide clues about the evolutionary history of populations and causes of the population differentiation. QTL mapping analysis methods and associated computer programs provide us tools for such an inference on the genetic basis and architecture of quantitative trait variation in a mapping population. Current methods have the capability to separate and localize multiple QTL and estimate their effects and interaction on a quantitative trait. More recent methods have been targeted to provide a comprehensive inference on the overall genetic architecture of multiple traits in a number of environments. This development is important for evolutionary studies on the genetic basis of multiple trait variation, genotype by environment interaction, host–parasite interaction, and also microarray gene expression QTL analysis.  相似文献   

5.
The activation of D-tyrosine by tyrosyl-tRNA synthetase has been investigated using single and multiple turnover kinetic methods. In the presence of saturating concentrations of D-tyrosine, the activation reaction displays sigmoidal kinetics with respect to ATP concentration under single turnover conditions. In contrast, when the kinetics for the activation reaction are monitored using a steady-state (multiple turnover) pyrophosphate exchange assay, Michaelis-Menten kinetics are observed. Previous investigations indicated that activation of l-tyrosine by the K233A variant of Bacillus stearothermophilus tyrosyl-tRNA synthetase displays sigmoidal kinetics similar to those observed for activation of d-tyrosine by the wild-type enzyme. Kinetic analyses indicate that the sigmoidal behavior of the d-tyrosine activation reaction is not enhanced when Lys-233 is replaced by alanine. This supports the hypothesis that the mechanistic basis for the sigmoidal behavior is the same for both d-tyrosine activation by wild-type tyrosyl-tRNA synthetase and activation of l-tyrosine by the K233A variant. The observed sigmoidal behavior presents a paradox, as tyrosyl-tRNA synthetase displays an extreme form of negative cooperativity, known as "half-of-the-sites reactivity," with respect to tyrosine binding and tyrosyl-adenylate formation. We propose that the binding of D-tyrosine weakens the affinity with which ATP binds to the functional subunit in tyrosyl-tRNA synthetase. This allows ATP to bind initially to the nonfunctional subunit, inducing a conformational change in the enzyme that enhances the affinity of the functional subunit for ATP. The observation that sigmoidal kinetics are observed only under single turnover conditions suggests that this conformational change is stable over multiple rounds of catalysis.  相似文献   

6.
Although a large number of methods have been proposed to control the non-linear dynamics of unstable populations, very few have been verified using biological populations. Here, we investigated the effects of two well-studied control methods (Both Limiter Control and Target-Oriented Control) on the dynamics of unstable populations of Drosophila melanogaster. We also perform biologically realistic simulations to ascertain the generalizability of our results. We show that both methods can significantly reduce population fluctuations, decrease extinction probability and increase effective population size simultaneously. This is in contrast with earlier studies on single parameter control methods that are not able to concurrently achieve multiple aspects of stability. We use the distribution of population sizes to derive biologically intuitive explanations for the mechanisms of how these two control methods attain stability. Finally, we show that non-Drosophila specific biologically realistic simulations are able to capture almost all the trends of our data. This shows that our results are likely generalizable over a wide range of taxa. Therefore, the control methods that incorporate both culling and restocking (like BLC and TOC) can simultaneously achieve multiple kinds of stability and therefore are strong candidates for field applications.  相似文献   

7.

Background  

The strength of selective constraints operating on amino acid sites of proteins has a multifactorial nature. In fact, amino acid sites within proteins coevolve due to their functional and/or structural relationships. Different methods have been developed that attempt to account for the evolutionary dependencies between amino acid sites. Researchers have invested a significant effort to increase the sensitivity of such methods. However, the difficulty in disentangling functional co-dependencies from historical covariation has fuelled the scepticism over their power to detect biologically meaningful results. In addition, the biological parameters connecting linear sequence evolution to structure evolution remain elusive. For these reasons, most of the evolutionary studies aimed at identifying functional dependencies among protein domains have focused on the structural properties of proteins rather than on the information extracted from linear multiple sequence alignments (MSA). Non-parametric methods to detect coevolution have been reported to be especially susceptible to produce false positive results based on the properties of MSAs. However, no formal statistical analysis has been performed to definitively test the differential effects of these properties on the sensitivity of such methods.  相似文献   

8.
Detecting departures from Hardy-Weinberg equilibrium (HWE) of marker-genotype frequencies is a crucial first step in almost all human genetic analyses. When a sample is stratified by multiple ethnic groups, it is important to allow the marker-allele frequencies to differ over the strata. In this situation, it is common to test for HWE by using an exact test within each stratum and then using the minimum P value as a global test. This approach does not account for multiple testing, and, because it does not combine information over strata, it does not have optimal power. Several approximate methods to combine information over strata have been proposed, but most of them sum over strata a measure of departure from HWE; if the departures are in different directions, then summing can diminish the overall evidence of departure from HWE. An exact stratified test is more appealing because it uses the probability of genotype configurations across the strata as evidence for global departures from HWE. We developed an exact stratified test for HWE for diallelic markers, such as single-nucleotide polymorphisms (SNPs), and an exact test for homogeneity of Hardy-Weinberg disequilibrium. By applying our methods to data from Perlegen and HapMap--a combined total of more than five million SNP genotypes, with three to four strata and strata sizes ranging from 23 to 60 subjects--we illustrate that the exact stratified test provides more-robust and more-powerful results than those obtained by either the minimum of exact test P values over strata or approximate stratified tests that sum measures of departure from HWE. Hence, our new methods should be useful for samples composed of multiple ethnic groups.  相似文献   

9.
Two common goals when choosing a method for performing all pairwise comparisons of J independent groups are controlling experiment wise Type I error and maximizing power. Typically groups are compared in terms of their means, but it has been known for over 30 years that the power of these methods becomes highly unsatisfactory under slight departures from normality toward heavy-tailed distributions. An approach to this problem, well-known in the statistical literature, is to replace the sample mean with a measure of location having a standard error that is relatively unaffected by heavy tails and outliers. One possibility is to use the trimmed mean. This paper describes three such multiple comparison procedures and compares them to two methods for comparing means.  相似文献   

10.
We have developed several new methods for blood-based cancer detection by diagnostic proteomics. Ultrasensitive methods of immunoassay using multiphoton-detection (IA/MPD) increase sensitivity by 200- to 1,000-fold (1 femtogram/mL). This has allowed the measurement of cancer biomarkers with very low concentrations in blood that could not be measured for full patient cohorts with conventional immunoassays. Sensitivity and specificity in cancer detection have been found to be potentiated by use of immunoassay panels which include tissue-specific cancer biomarkers as well as cytokines and angiogenic factors. The ultrasensitive immunoassays revealed that patient to patient variations in the concentrations of individual biomarkers in blood can extend over many orders of magnitude (up to six) and that the distributions of biomarker concentrations over patient cohorts are non-Gaussian. New methods of data analysis which correlate abundances of multiple, different biomarkers have been developed to deal with such data sets. Sensitivity and specificity of about 95% have been achieved for blood-based detection of breast cancer in pilot studies on 250 patients and 95 controls. Pilot studies indicate that this methodology may also allow differentiation of malignant breast cancer from benign lesions and can provide similar sensitivity and specificity for other epithelial cancers such as prostate cancer, ovarian cancer and melanoma. The methods developed for selection, application, and evaluation of very high sensitivity biomarker panels are expected to have general relevance for diagnostic proteomics.  相似文献   

11.
Hereditary multiple exostosis (HME), a bone tumor first described by Virchow, has been studied over a period of 15 years on a comparative basis. The horse, an excellent biomedical model for this physically deforming multiple bone tumor in man, has been utilized in this study. The etiology, hereditary pattern, potential for malignancy and other aspects of this strange affliction need additional clarification. This in-depth study of 261 individuals from 144 families was compared with that of 55 horses bearing the HME trait, selectively bred and studied over the same period. Important information has been collected and evaluated about this condition that is suspect of being frequently missed diagnostically, with a higher incidence in humans that recognized. Continuing development studies of offspring of the original study participants; sarcomatous transformation monitoring; and recently developed genetic techniques should add to our understanding of this puzzling hereditary condition.  相似文献   

12.
Understanding the mechanisms underlying the observed dynamics of complex biological systems requires the statistical assessment and comparison of multiple alternative models. Although this has traditionally been done using maximum likelihood-based methods such as Akaike''s Information Criterion (AIC), Bayesian methods have gained in popularity because they provide more informative output in the form of posterior probability distributions. However, comparison between multiple models in a Bayesian framework is made difficult by the computational cost of numerical integration over large parameter spaces. A new, efficient method for the computation of posterior probabilities has recently been proposed and applied to complex problems from the physical sciences. Here we demonstrate how nested sampling can be used for inference and model comparison in biological sciences. We present a reanalysis of data from experimental infection of mice with Salmonella enterica showing the distribution of bacteria in liver cells. In addition to confirming the main finding of the original analysis, which relied on AIC, our approach provides: (a) integration across the parameter space, (b) estimation of the posterior parameter distributions (with visualisations of parameter correlations), and (c) estimation of the posterior predictive distributions for goodness-of-fit assessments of the models. The goodness-of-fit results suggest that alternative mechanistic models and a relaxation of the quasi-stationary assumption should be considered.  相似文献   

13.
The four methods of examining the relation of amount of multiple crossing over to age of mothers agree in showing that the "internode length" or average distance required for double crossing over has changed in a characteristic fashion, giving an M-shaped curve. These changes have not been independent of changes in total recombination but concomitant with them. However, the changes in recombination percentages were far greater than could be accounted for by change in internode length, and the larger factor must be assumed to be changes in the coefficients of crossing over. The amounts of these changes are greatest for the mid-sections of the chromosome and least for the distal sections. The changes in the two limbs are of like amount for equal distances from the center of symmetry in the distribution of simple and multiple crossing over.  相似文献   

14.
The troponin complex was discovered over thirty years ago and since then much insight has been gained into how this complex senses fluctuating levels of Ca2+ and transmits this signal to the myofilament. Advances in genetics methods have allowed identification of mutations that lead to the phenotypically distinct cardiomyopathies: hypertrophic cardiomyopathy (HCM), restrictive cardiomyopathy (RCM) and dilated cardiomyopathy (DCM). This review serves to highlight key in vivo studies of mutation effects that have followed many years of functional studies and discusses how these mutations alter energetics and promote the characteristic remodeling associated with cardiomyopathic diseases. Studies have been performed that examine alterations in signaling and genomic methods have been employed to isolate upregulated proteins, however these processes are complex as there are multiple roads to hypertrophy or dilation associated with genetic cardiomyopathies. This review suggests future directions to explore in the troponin field that would heighten our understanding of the complex regulation of cardiac muscle contraction.  相似文献   

15.
DNA error correcting codes over the edit metric consist of embeddable markers for sequencing projects that are tolerant of sequencing errors. When a genetic library has multiple sources for its sequences, use of embedded markers permit tracking of sequence origin. This study compares different methods for synthesizing DNA error correcting codes. A new code-finding technique called the salmon algorithm is introduced and used to improve the size of best known codes in five difficult cases of the problem, including the most studied case: length six, distance three codes. An updated table of the best known code sizes with 36 improved values, resulting from three different algorithms, is presented. Mathematical background results for the problem from multiple sources are summarized. A discussion of practical details that arise in application, including biological design and decoding, is also given in this study.  相似文献   

16.
A heuristic approach to search for the maximum-likelihood (ML) phylogenetic tree based on a genetic algorithm (GA) has been developed. It outputs the best tree as well as multiple alternative trees that are not significantly worse than the best one on the basis of the likelihood criterion. These near-optimum trees are subjected to further statistical tests. This approach enables ones to infer phylogenetic trees of over 20 taxa taking account of the rate heterogeneity among sites on practical time scales on a PC cluster. Computer simulations were conducted to compare the efficiency of the present approach with that of several likelihood-based methods and distance-based methods, using amino acid sequence data of relatively large (5–24) taxa. The superiority of the ML method over distance-based methods increases as the condition of simulations becomes more realistic (an incorrect model is assumed or many taxa are involved). This approach was applied to the inference of the universal tree based on the concatenated amino acid sequences of vertically descendent genes that are shared among all genomes whose complete sequences have been reported. The inferred tree strongly supports that Archaea is paraphyletic and Eukarya is specifically related to Crenarchaeota. Apart from the paraphyly of Archaea and some minor disagreements, the universal tree based on these genes is largely consistent with the universal tree based on SSU rRNA. Received: 4 January 2001 / Accepted: 16 May 2001  相似文献   

17.
The classic structure–function paradigm holds that a protein exhibits a single well-defined native state that gives rise to its biological function. Nonetheless, over the past few decades, numerous examples of proteins exhibiting biological function arising from multiple structural states of varying disorder have been identified. Most recently, several examples of ‘metamorphic proteins’, able to interconvert between vastly different native-like topologies under physiological conditions, have been characterised with multiple functions. In this review, we look at the concept of protein metamorphosis in relation to the current understanding of the protein structure–function landscape. Although structural dynamism observed for metamorphic proteins provides a novel source of functional versatility, the dynamic nature of the metamorphic proteins generally makes them difficult to identify and probe using conventional protein structure determination methods. However, as the existence of metamorphic proteins has now been established and techniques enabling the analysis of multiple protein conformers are improving, it is likely that this class will continue to grow in number.  相似文献   

18.
The selective extinction of the dinosaurs and other giant reptiles has long been a topic of speculation and controversy. Everyone is familiar with the theory of the giant bollide colliding with Earth. But, would it not be more likely that that multiple factors acted over a relatively long period of time to produce this mass extinction?  相似文献   

19.
MOTIVATION: A quantitative study of molecular evolutionary events such as substitutions, insertions and deletions from closely related genomes requires (1) an accurate multiple sequence alignment program and (2) a method to annotate the insertions and deletions that explain the 'gaps' in the alignment. Although the former requirement has been extensively addressed, the latter problem has received little attention, especially in a comprehensive probabilistic framework. RESULTS: Here, we present Indelign, a program that uses a probabilistic evolutionary model to compute the most likely scenario of insertions and deletions consistent with an input multiple alignment. It is also capable of modifying the given alignment so as to obtain a better agreement with the evolutionary model. We find close to optimal performance and substantial improvement over alternative methods, in tests of Indelign on synthetic data. We use Indelign to analyze regulatory sequences in Drosophila, and find an excess of insertions over deletions, which is different from what has been reported for neutral sequences. AVAILABILITY: The Indelign program may be downloaded from the website http://veda.cs.uiuc.edu/indelign/ SUPPLEMENTARY INFORMATION: Supplementary material is available at Bioinformatics online.  相似文献   

20.
Rigorous assessments of protein structure prediction have demonstrated that fold recognition methods can identify remote similarities between proteins when standard sequence search methods fail. It has been shown that the accuracy of predictions is improved when refined multiple sequence alignments are used instead of single sequences and if different methods are combined to generate a consensus model. There are several meta-servers available that integrate protein structure predictions performed by various methods, but they do not allow for submission of user-defined multiple sequence alignments and they seldom offer confidentiality of the results. We developed a novel WWW gateway for protein structure prediction, which combines the useful features of other meta-servers available, but with much greater flexibility of the input. The user may submit an amino acid sequence or a multiple sequence alignment to a set of methods for primary, secondary and tertiary structure prediction. Fold-recognition results (target-template alignments) are converted into full-atom 3D models and the quality of these models is uniformly assessed. A consensus between different FR methods is also inferred. The results are conveniently presented on-line on a single web page over a secure, password-protected connection. The GeneSilico protein structure prediction meta-server is freely available for academic users at http://genesilico.pl/meta.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号