首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Statistical analysis of time series is still inadequate within circulation research. With the advent of increasing computational power and real-time recordings from hemodynamic studies, one is increasingly dealing with vast amounts of data in time series. This paper aims to illustrate how statistical analysis using the significant nonstationarities (SiNoS) method may complement traditional repeated-measures ANOVA and linear mixed models. We applied these methods on a dataset of local hepatic and systemic circulatory changes induced by aortoportal shunting and graded liver resection. We found SiNoS analysis more comprehensive when compared with traditional statistical analysis in the following four ways: 1) the method allows better signal-to-noise detection; 2) including all data points from real time recordings in a statistical analysis permits better detection of significant features in the data; 3) analysis with multiple scales of resolution facilitates a more differentiated observation of the material; and 4) the method affords excellent visual presentation by combining group differences, time trends, and multiscale statistical analysis allowing the observer to quickly view and evaluate the material. It is our opinion that SiNoS analysis of time series is a very powerful statistical tool that may be used to complement conventional statistical methods.  相似文献   

2.
BACKGROUND: HDX mass spectrometry is a powerful platform to probe protein structure dynamics during ligand binding, protein folding, enzyme catalysis, and such. HDX mass spectrometry analysis derives the protein structure dynamics based on the mass increase of a protein of which the backbone protons exchanged with solvent deuterium. Coupled with enzyme digestion and MS/MS analysis, HDX mass spectrometry can be used to study the regional dynamics of protein based on the m/z value or percentage of deuterium incorporation for the digested peptides in the HDX experiments. Various software packages have been developed to analyze HDX mass spectrometry data. Despite the progresses, proper and explicit statistical treatment is still lacking in most of the current HDX mass spectrometry software. In order to address this issue, we have developed the HDXanalyzer for the statistical analysis of HDX mass spectrometry data using R, Python, and RPY2. IMPLEMENTATION AND RESULTS: HDXanalyzer package contains three major modules, the data processing module, the statistical analysis module, and the user interface. RPY2 is employed to enable the connection of these three components, where the data processing module is implemented using Python and the statistical analysis module is implemented with R. RPY2 creates a low-level interface for R and allows the effective integration of statistical module for data processing. The data processing module generates the centroid for the peptides in form of m/z value, and the differences of centroids between the peptides derived from apo and ligand-bound protein allow us to evaluate whether the regions have significant changes in structure dynamics or not. Another option of the software is to calculate the deuterium incorporation rate for the comparison. The two types of statistical analyses are Paired Student's t-test and the linear combination of the intercept for multiple regression and ANCOVA model. The user interface is implemented with wxpython to facilitate the data visualization in graphs and the statistical analysis output presentation. In order to evaluate the software, a previously published xylanase HDX mass spectrometry analysis dataset is processed and presented. The results from the different statistical analysis methods are compared and shown to be similar. The statistical analysis results are overlaid with the three dimensional structure of the protein to highlight the regional structure dynamics changes in the xylanase enzyme. CONCLUSION: Statistical analysis provides crucial evaluation of whether a protein region is significantly protected or unprotected during the HDX mass spectrometry studies. Although there are several other available software programs to process HDX experimental data, HDXanalyzer is the first software program to offer multiple statistical methods to evaluate the changes in protein structure dynamics based on HDX mass spectrometry analysis. Moreover, the statistical analysis can be carried out for both m/z value and deuterium incorporation rate. In addition, the software package can be used for the data generated from a wide range of mass spectrometry instruments.  相似文献   

3.
等位基因多态性群体遗传结构的多元非线性分析方法   总被引:4,自引:0,他引:4  
长期以来,对于多维基因多态性数据的多元统计分析,如计算遗传距离时昕用的聚类分析、分析群体遗传结构时所用的主成分分析、因子分析和典型相关分析等,一直应用为无约束条件数据而设计的经典多元线性分析方法,并没有注意基因多态性数据的“闭合效应”所带来的问题。从分析基因多态性数据的分布和结构特征入手,文中指出了基因多态性分布具有“闭合数据”的特点,分析了由于“闭合效应”的影响,经典多元线性方法用于群体遗传结构分析昕面临的困难。根据成分数据统计分析的理论和方法,提出了基因多态性群体遗传结构的多元非线性分析基本方法。并以主成分分析为例,通过实例比较和分析了经典线性主成分分析和“对数比”非线性主成分分析的结果,证明“对数比”非线性主成分分析方法是研究基因多态性群体遗传结构的良好方法,具有特异、灵敏等优点,其结果符合群体遗传学规律。  相似文献   

4.
Appropriate study design and proper statistical analysis are necessary ingredients for improving the quality and reliability of the information in journal articles. General surgery and plastic surgery articles were compared for principal author's academic degree, a Ph.D.'s presence as a coauthor, the study type, the presence of statistical analysis, the analysis' appropriateness, and the types of errors in study design or statistical analysis. Ph. D. authorship was associated with increased percentage of articles using statistical analysis. When compared with general surgery articles, plastic surgery articles performed four times fewer statistical analyses. However, when statistical analyses were performed, there were few differences between these two specialties. Although there were no differences in the types of statistical analysis errors, there were differences in the types of study design errors. The causes of these discrepancies may lie in the nature of plastic surgery; they may be reduced by adherence to Feinstein's principles of study design and result interpretation.  相似文献   

5.
SUMMARY: SpA is a web-accessible system for the management, visualization and statistical analysis of T-cell receptor spectratype data. Users upload data from their spectratype analyzers to SpA, which saves the raw data and user-defined supplementary covariates to a secure database. The statistical engine performs several data analyses and statistical summaries. The visualization engine displays spectratype histograms in a Java applet and in an image file suitable for download. All of these results are also saved to the database and remain accessible to the user. Additional statistical tools specific to the analysis of multiple spectratypes are also available through the SpA interface. AVAILABILITY: The service is freely accessible via the web at http://www.duke.edu/~kepler/spa.html. Additional technical support and specialized statistical analysis and consultation are available by arrangement with the authors and, depending on the service requested, may be subject to fee.  相似文献   

6.
A screening method aimed at identifying potential human carcinogens using either animal cancer bioassays or short-term genotoxic assays has 4 possible results: true positive, true negative, false positive and false negative. Such a categorisation is superficially similar to the results of hypothesis testing in a statistical analysis. In this latter case the false positive rate is determined by the significance level of the test and the false negative rate by the statistical power of the test. Although the two types of categorisation appear somewhat similar, different statistical issues are involved in their interpretation. Statistical methods appropriate for the analysis of the results of a series of assays include the use of Bayes' theorem and multivariate methods such as clustering techniques for the selection of batteries of short-term test capable of a better prediction of potential carcinogens. The conclusions drawn from such studies are dependent upon the estimates of values of sensitivity and specificity used, the choice of statistical method and the nature of the data set. The statistical issues resulting from the analysis of specific genotoxicity experiments involve the choice of suitable experimental designs and appropriate analyses together with the relationship of statistical significance to biological importance. The purpose of statistical analysis should increasingly be to estimate and explore effects rather than for formal hypothesis testing.  相似文献   

7.
Information theoretic and statistical techniques for determining the number of discernible levels in cutaneous receptor neurons are reviewed. Reasons for the large variance in these results are discussed. A new continuous information theoretic analysis technique is presented that overcomes many of the problems in the other methods of analysis discussed. Comparison of this new method of analysis with a statistical technique developed by Schreiner et al. (1978) clearly shows some of the misconceptions that are associated with statistical analysis techniques, and why these problems cannot arise in the new information theoretic technique discussed here.This work was supported, in part, by USPHS grant NS08470 and a Rackham predoctoral fellowship  相似文献   

8.
Comparative proteomic studies often use statistical tests included in the software for the analysis of digitized images of two-dimensional electrophoresis gels. As these programs include only limited capabilities for statistical analysis, many studies do not further describe their statistical approach. To find potential differences produced by different data processing, we compared the results of (1) Student's t-test using a spreadsheet program, (2) the intrinsic algorithms implemented in the Phoretix 2D gel analysis software, and (3) the SAM algorithm originally developed for microarray analysis. We applied the algorithms to proteome data of undifferentiated neural stem cells versus in vitro differentiated neural stem cells. We found (1) 367 spots differentially expressed using Student's t-test, (2) 203 spots using the algorithms in Phoretix 2D, and (3) 119 spots using the algorithms in SAM, respectively, with an overlap of 42 spots detected by all three algorithms. Applying different statistical approaches on the same dataset resulted in divergent set of protein spots labeled as statistically "significant". Currently, there is no agreement on statistical data processing of 2DE datasets, but the statistical tests applied in 2DE studies should be documented. Tools for the statistical analysis of proteome data should be implemented and documented in the existing 2DE software.  相似文献   

9.
目的:采用常用的电子表格处理系统Microsoft Excel解决药学实验过程中遇到的数据分析问题。方法:应用工作表函数中内置的统计函数,以线性回归为例说明源数据的输入与结果返回的具体操作过程;对数据分析工具中的"描述统计"工具、t检验与方差分析,结合具体实例对药学实践中遇到的药学统计实际问题进行综合探讨。结果:用Excel表中内置的统计函数工具进行线性回归分析,方法简单、结果可靠;Excel表中的数据分析工具适用于日常药学实验数据分析过程中遇到的描述统计分析、t检验与方差分析。Excel与其它数据处理软件相比具有操作快捷、使用方便、计算精确、易于学习与掌握等优点。结论:Excel友好的界面,清晰的统计分析结果,使医药工作者在使用Excel的数据分析软件时会感到非常的方便快捷,灵活实用,值得在药学实践中应用推广。  相似文献   

10.
Tumour cell invasion is a complex process, which is essential for the formation of metastasis and is therefore of critical clinical importance. For detailed investigations of the invasive process, quantifiable in vitro models of invasion are necessary. In this study we describe an image analysis procedure and a statistical program which facilitate an objective analysis of experiments carried out using the embryonic chick heart invasion model of Mareel. Tumour multicellular spheroids are confronted with embryonic chick heart fragments in culture and are sampled after different time intervals for up to 7 days. Immunohistological sections are then evaluated by an image analysis procedure which provides 9 parameters indicating invasion, proliferation and destruction taking place in the confrontation cultures. The data obtained by image analysis are further evaluated by a statistical program which describes the change with time of each parameter by means of linear regression analysis. Thus the data obtained at various time intervals serve as the source data for a single statistic, namely the slope of the regression line. Confidence intervals and statistical differences between various experiments can be calculated. In order to make the procedure more comprehensible in biological terms, the program provides a full text interpretation of the experimental results. The image analysis procedure in conjunction with statistical evaluation and text interpretation provides a comprehensive tool for the quantitative assessment of experimental invasion in vitro.  相似文献   

11.
Identification of protein coding regions is fundamentally a statistical pattern recognition problem. Discriminant analysis is a statistical technique for classifying a set of observations into predefined classes and it is useful to solve such problems. It is well known that outliers are present in virtually every data set in any application domain, and classical discriminant analysis methods (including linear discriminant analysis (LDA) and quadratic discriminant analysis (QDA)) do not work well if the data set has outliers. In order to overcome the difficulty, the robust statistical method is used in this paper. We choose four different coding characters as discriminant variables and an approving result is presented by the method of robust discriminant analysis.  相似文献   

12.
Statistical methods for efficiency adjusted real-time PCR quantification   总被引:1,自引:0,他引:1  
The statistical treatment for hypothesis testing using real-time PCR data is a challenge for quantification of gene expression. One has to consider two key factors in precise statistical analysis of real-time PCR data: a well-defined statistical model and the integration of amplification efficiency (AE) into the model. Previous publications in real-time PCR data analysis often fall short in integrating the AE into the model. Novel, user-friendly, and universal AE-integrated statistical methods were developed for real-time PCR data analysis with four goals. First, we addressed the definition of AE, introduced the concept of efficiency-adjusted Delta Delta Ct, and developed a general mathematical method for its calculation. Second, we developed several linear combination approaches for the estimation of efficiency adjusted Delta Delta Ct and statistical significance for hypothesis testing based on different mathematical formulae and experimental designs. Statistical methods were also adopted to estimate the AE and its equivalence among the samples. A weighted Delta Delta Ct method was introduced to analyze the data with multiple internal controls. Third, we implemented the linear models with SAS programs and analyzed a set of data for each model. In order to allow other researchers to use and compare different approaches, SAS programs are included in the Supporting Information. Fourth, the results from analysis of different statistical models were compared and discussed. Our results underline the differences between the efficiency adjusted Delta Delta Ct methods and previously published methods, thereby better identifying and controlling the source of errors introduced by real-time PCR data analysis.  相似文献   

13.
14.
Terminal restriction fragment length polymorphism (T-RFLP) analysis is a popular high-throughput fingerprinting technique used to monitor changes in the structure and composition of microbial communities. This approach is widely used because it offers a compromise between the information gained and labor intensity. In this review, we discuss the progress made in T-RFLP analysis of 16S rRNA genes and functional genes over the last 10 years and evaluate the performance of this technique when used in conjunction with different statistical methods. Web-based tools designed to perform virtual polymerase chain reaction and restriction enzyme digests greatly facilitate the choice of primers and restriction enzymes for T-RFLP analysis. Significant improvements have also been made in the statistical analysis of T-RFLP profiles such as the introduction of objective procedures to distinguish between signal and noise, the alignment of T-RFLP peaks between profiles, and the use of multivariate statistical methods to detect changes in the structure and composition of microbial communities due to spatial and temporal variation or treatment effects. The progress made in T-RFLP analysis of 16S rRNA and genes allows researchers to make methodological and statistical choices appropriate for the hypotheses of their studies.  相似文献   

15.
Background: High resolution melting (HRM) is an emerging new method for interrogating and characterizing DNA samples. An important aspect of this technology is data analysis. Traditional HRM curves can be difficult to interpret and the method has been criticized for lack of statistical interrogation and arbitrary interpretation of results. Methods: Here we report the basic principles and first applications of a new statistical approach to HRM analysis addressing these concerns. Our method allows automated genotyping of unknown samples coupled with formal statistical information on the likelihood, if an unknown sample is of a known genotype (by discriminant analysis or “supervised learning”). It can also determine the assortment of alleles present (by cluster analysis or “unsupervised learning”) without a priori knowledge of the genotypes present. Conclusion: The new algorithms provide highly sensitive and specific auto-calling of genotypes from HRM data in both supervised an unsupervised analysis mode. The method is based on pure statistical interrogation of the data set with a high degree of standardization. The hypothesis-free unsupervised mode offers various possibilities for de novo HRM applications such as mutation discovery.  相似文献   

16.
The statistical analysis of cancer bioassay data has historically depended on the pathological determination of the experimental animal's cause of death. The poly-k statistical test has provided a method of statistical analysis of animal bioassay data without the need for cause of death information. The test has been shown to have good statistical properties in the typical 2-year cancer bioassay. However, while the poly-k test has been applied to chronic lifetime animal studies, it has not been formally evaluated with respect to the operating characteristics of this statistical test when applied to such studies. Thus, our objective is to assess the performance of the poly-k test for lifetime studies and to make comparisons with other tests. We observed in one recent lifetime study of the gasoline additive methyl tertiary butyl ether (MTBE) that the application of the poly-k test was not statistically robust. Simulation studies were subsequently conducted for a limited number of scenarios of lifetime cancer bioassays. These simulations showed that the poly-k test is not statistically robust for testing effect of increasing dose in some lifetime cancer studies.  相似文献   

17.
BackgroundAlthough a substantial number of studies focus on the teaching and application of medical statistics in China, few studies comprehensively evaluate the recognition of and demand for medical statistics. In addition, the results of these various studies differ and are insufficiently comprehensive and systematic.ObjectivesThis investigation aimed to evaluate the general cognition of and demand for medical statistics by undergraduates, graduates, and medical staff in China.MethodsWe performed a comprehensive database search related to the cognition of and demand for medical statistics from January 2007 to July 2014 and conducted a meta-analysis of non-controlled studies with sub-group analysis for undergraduates, graduates, and medical staff.ResultsThere are substantial differences with respect to the cognition of theory in medical statistics among undergraduates (73.5%), graduates (60.7%), and medical staff (39.6%). The demand for theory in medical statistics is high among graduates (94.6%), undergraduates (86.1%), and medical staff (88.3%). Regarding specific statistical methods, the cognition of basic statistical methods is higher than of advanced statistical methods. The demand for certain advanced statistical methods, including (but not limited to) multiple analysis of variance (ANOVA), multiple linear regression, and logistic regression, is higher than that for basic statistical methods. The use rates of the Statistical Package for the Social Sciences (SPSS) software and statistical analysis software (SAS) are only 55% and 15%, respectively.ConclusionThe overall statistical competence of undergraduates, graduates, and medical staff is insufficient, and their ability to practically apply their statistical knowledge is limited, which constitutes an unsatisfactory state of affairs for medical statistics education. Because the demand for skills in this area is increasing, the need to reform medical statistics education in China has become urgent.  相似文献   

18.
Lack of adequate statistical methods for the analysis of microarray data remains the most critical deterrent to uncovering the true potential of these promising techniques in basic and translational biological studies. The popular practice of drawing important biological conclusions from just one replicate (slide) should be discouraged. In this paper, we discuss some modern trends in statistical analysis of microarray data with a special focus on statistical classification (pattern recognition) and variable selection. In addressing these issues we consider the utility of some distances between random vectors and their nonparametric estimates obtained from gene expression data. Performance of the proposed distances is tested by computer simulations and analysis of gene expression data on two different types of human leukemia. In experimental settings, the error rate is estimated by cross-validation, while a control sample is generated in computer simulation experiments aimed at testing the proposed gene selection procedures and associated classification rules.  相似文献   

19.
Statistical methods for microarray assays   总被引:1,自引:0,他引:1  
The paper shortly reviews statistical methods used in the area of DNA microarray studies. All stages of the experiment are taken into account: planning, data collection, data preprocessing, analysis and validation. Among the methods of data analysis, the algorithms for estimating differential expression, multivariate approaches, clustering methods, as well as classification and discrimination are reviewed. The need is stressed for routine statistical data processing protocols and for the search of links of microarray data analysis with quantitative genetic models.  相似文献   

20.
LenaMånsson  PerLundberg 《Oikos》2006,113(2):217-225
Time series analysis of herbivore data with weather included as covariate is commonly used as a mean to shed light on the state and ecology of the studied population. Conclusions about the herbivore population are drawn from statistical parameter values and presence/absence in the most parsimonious model. However, this procedure is only reliable if the statistical parameters have general interpretations regardless of system characteristics. Here we investigated the extent to which this is true by deriving six different vegetation–herbivore-systems and analyzing their respective statistical parameters. The analysis was done in both continuous and discrete time. It turned out that both density parameters (a1 and a2) and rainfall coefficients change with biological interactions and amount of average rainfall, and they do so in different ways in different systems. This means that there is no valid general interpretation of them and, most important, the probability of detecting density dependence and effects of rainfall vary between systems. Hence, you can not make inference about the biological processes from statistical analysis without knowing the system that you study and what model best describes the interactions within it.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号