首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
High throughput macromolecular structure determination is very essential in structural genomics as the available number of sequence information far exceeds the number of available 3D structures. ACORN, a freely available resource in the CCP4 suite of programs is a comprehensive and efficient program for phasing in the determination of protein structures, when atomic resolution data are available. ACORN with the automatic model-building program ARP/wARP and refinement program REFMAC is a suitable combination for the high throughput structural genomics. ACORN can also be run with secondary structural elements like helices and sheets as inputs with high resolution data. In situations, where ACORN phasing is not sufficient for building the protein model, the fragments (incomplete model/dummy atoms) can again be used as a starting input. Iterative ACORN is proved to work efficiently in the subsequent model building stages in congerin (PDB-ID: lis3) and catalase (PDB-ID: 1gwe) for which models are available.  相似文献   

2.
SUMMARY: WebBLAST is a suite of programs intended to assist in organizing sequencing data and to provide first-pass sequence analysis in an automated fashion. Data processing is fully automated, with end-users being presented both graphical and tabular summaries of data that can be viewed using any Web browser. AVAILABILITY: The program is free and available at http://genome.nhgri.nih. gov/webblast.  相似文献   

3.
Cyclops is a new computer program designed as a graphical front-end that allows easy control and interaction with tasks and programs for 3D reconstruction of biological complexes using cryo-electron microscopy. Cyclops' current plug-ins are designed for automated particle picking and include two new algorithms, automated carbon masking and quaternion based rotation space sampling, which are also presented here. Additional plug-ins are in the pipeline. Cyclops allows straightforward organization and visualization of all data and tasks and allows both interactive and batch-wise processing. Furthermore, it was designed for straightforward implementation in grid architectures. As a front-end to a collection of programs it provides a common interface to these programs, thus enhancing the usability of the suite and the productivity of the user.  相似文献   

4.
The accuracy of the vast amount of genotypic information generated by high-throughput genotyping technologies is crucial in haplotype analyses and linkage-disequilibrium mapping for complex diseases. To date, most automated programs lack quality measures for the allele calls; therefore, human interventions, which are both labor intensive and error prone, have to be performed. Here, we propose a novel genotype clustering algorithm, GeneScore, based on a bivariate t-mixture model, which assigns a set of probabilities for each data point belonging to the candidate genotype clusters. Furthermore, we describe an expectation-maximization (EM) algorithm for haplotype phasing, GenoSpectrum (GS)-EM, which can use probabilistic multilocus genotype matrices (called "GenoSpectrum") as inputs. Combining these two model-based algorithms, we can perform haplotype inference directly on raw readouts from a genotyping machine, such as the TaqMan assay. By using both simulated and real data sets, we demonstrate the advantages of our probabilistic approach over the current genotype scoring methods, in terms of both the accuracy of haplotype inference and the statistical power of haplotype-based association analyses.  相似文献   

5.
A high-throughput crystallization-to-structure pipeline for structural genomics was recently developed at the Advanced Protein Crystallography Research Group of the RIKEN SPring-8 Center in Japan. The structure determination pipeline includes three newly developed technologies for automating X-ray protein crystallography: the automated crystallization and observation robot system "TERA", the SPring-8 Precise Automatic Cryosample Exchanger "SPACE" for automated data collection, and the Package of Expert Researcher's Operation Network "PERON" for automated crystallographic computation from phasing to model checking. During the 5 years following April, 2002, this pipeline was used by seven researchers to determine 138 independent crystal structures (resulting from 437 purified proteins, 234 cryoloop-mountable crystals, and 175 diffraction data sets). The protocols used in the high-throughput pipeline are described in this paper.  相似文献   

6.
In only a few years, multiple wavelength anomalous diffraction (MAD) phasing has advanced from an esoteric technique used in only a few favorable cases to the method of choice for solving new macromolecular structures. Before 1994, MAD phasing had been used for fewer than a dozen new structure determinations. In 1999 alone, well over 100 new structures were determined by MAD phasing. The meteoric rise in MAD applications resulted from the availability of new synchrotron beamlines, equipped with low bandpass optics, fast readout detectors, cryogenic cooling and user-friendly interfaces. The power of MAD phasing has been amplified by the availability of new computer programs for locating the positions of the anomalous scattering atoms and for calculating phases from the experimental data. Phasing by anomalous scattering techniques has been applied to structures as large as 640 kDa and 120 selenium atoms in the asymmetric unit. The practical size limitation for application of MAD phasing techniques has not yet been encountered.  相似文献   

7.
8.
Recent progress in macromolecular phasing, in part stimulated by the high-throughput structural biology initiatives, has made this crucial stage of the elucidation of crystal structures easier and more automatic. A quick soak in various salts leads to the rapid incorporation of the anomalously scattering ions, suitable for phasing by MAD (multiwavelength anomalous dispersion), SAD (single-wavelength anomalous dispersion) or MIR (multiple isomorphous replacement) methods. The availability of stable synchrotron beam lines equipped with elaborate hardware control and sophisticated data processing programs makes it possible to collect very accurate diffraction data and to solve structures from the very weak anomalous signal of such atoms as sulfur or phosphorus, inherently present in macromolecules. The current progress in phasing, coupled with the parallel advances in protein crystallization, diffraction data collection and so on, suggests that, in the near future, the process of macromolecular crystal structure elucidation may become fully automatic.  相似文献   

9.
NuMA/centrophilin: sequence analysis of the coiled-coil rod domain.   总被引:1,自引:0,他引:1       下载免费PDF全文
D A Parry 《Biophysical journal》1994,67(3):1203-1206
Nuclear mitotic apparatus protein (NuMA), also known as centrophilin, has been shown in previous work to contain a centrally located sequence of length 1485 residues that has both a heptad substructure and a high propensity for alpha-helix formation. Further analysis of this sequence here has revealed that NuMA will form a two-stranded coiled-coil structure with multiple (18) points at which the conformation is interrupted either by proline-containing segments or by discontinuities in the phasing of the heptad substructure. It has also been shown that the two chains will be parallel (rather than antiparallel), that they will lie in axial register, and that this arrangement will be stabilized by a large number of interchain ionic interactions. Interestingly the coiled-coil rod domain is also shown to lack any significant long-range periodicity in the linear distribution of either its acidic or its basic residues. Hence there is no direct evidence from the sequence data that NuMA molecules will aggregate to form closely packed filaments within nuclear space.  相似文献   

10.
This article describes the integration of programs from the widely used CCP4 macromolecular crystallography package into a modern data flow visualization environment (application visualization system [AVS]), which provides a simple graphical user interface, a visual programming paradigm, and a variety of 1-, 2-, and 3-D data visualization tools for the display of graphical information and the results of crystallographic calculations, such as electron density and Patterson maps. The CCP4 suite comprises a number of separate Fortran 77 programs, which communicate via common file formats. Each program is encapsulated into an AVS macro module, and may be linked to others in a data flow network, reflecting the nature of many crystallo-graphic calculations. Named pipes are used to pass input parameters from a graphical user interface to the program module, and also to intercept line printer output, which can be filtered to extract graphical information and significant numerical parameters. These may be passed to downstream modules, permitting calculations to be automated if no user interaction is required, or giving the user the opportunity to make selections in an interactive manner.  相似文献   

11.
12.
MOTIVATION: A number of free-standing programs have been developed in order to help researchers find potential coding regions and deduce gene structure for long stretches of what is essentially 'anonymous DNA'. As these programs apply inherently different criteria to the question of what is and is not a coding region, multiple algorithms should be used in the course of positional cloning and positional candidate projects to assure that all potential coding regions within a previously-identified critical region are identified. RESULTS: We have developed a gene identification tool called GeneMachine which allows users to query multiple exon and gene prediction programs in an automated fashion. BLAST searches are also performed in order to see whether a previously-characterized coding region corresponds to a region in the query sequence. A suite of Perl programs and modules are used to run MZEF, GENSCAN, GRAIL 2, FGENES, RepeatMasker, Sputnik, and BLAST. The results of these runs are then parsed and written into ASN.1 format. Output files can be opened using NCBI Sequin, in essence using Sequin as both a workbench and as a graphical viewer. The main feature of GeneMachine is that the process is fully automated; the user is only required to launch GeneMachine and then open the resulting file with Sequin. Annotations can then be made to these results prior to submission to GenBank, thereby increasing the intrinsic value of these data. AVAILABILITY: GeneMachine is freely-available for download at http://genome.nhgri.nih.gov/genemachine. A public Web interface to the GeneMachine server for academic and not-for-profit users is available at http://genemachine.nhgri.nih.gov. The Web supplement to this paper may be found at http://genome.nhgri.nih.gov/genemachine/supplement/.  相似文献   

13.
NMR-based metabolomics requires robust automated methodologies, and the accuracy of NMR-based metabolomics data is greatly influenced by the reproducibility of data acquisition and processing methods. Effective water resonance signal suppression and reproducible spectral phasing and baseline traces across series of related samples are crucial for statistical analysis. We assess robustness, repeatability, sensitivity, selectivity, and practicality of commonly used solvent peak suppression methods in the NMR analysis of biofluids with respect to the automated processing of the NMR spectra and the impact of pulse sequence and data processing methods on the sensitivity of pattern recognition and statistical analysis of the metabolite profiles. We introduce two modifications to the excitation sculpting pulse sequence whereby the excitation solvent suppression pulse cascade is preceded by low-power water resonance presaturation pulses during the relaxation delay. Our analysis indicates that combining water presaturation with excitation sculpting water suppression delivers the most reproducible and information-rich NMR spectra of biofluids.  相似文献   

14.
BioServe Space Technologies, a NASA Research Partnership Center (RPC), has developed and operated various middeck payloads launched on 23 shuttle missions since 1991 in support of commercial space biotechnology projects. Modular cell culture systems are contained within the Commercial Generic Bioprocessing Apparatus (CGBA) suite of flight-qualified hardware, compatible with Space Shuttle, SPACEHAB, Spacelab and International Space Station (ISS) EXPRESS Rack interfaces. As part of the CGBA family, the Isothermal Containment Module (ICM) incubator provides thermal control, data acquisition and experiment manipulation capabilities, including accelerometer launch detection for automated activation and thermal profiling for culture incubation and sample preservation. The ICM can accommodate up to 8 individually controlled temperature zones. Command and telemetry capabilities allow real-time downlink of data and video permitting remote payload operation and ground control synchronization. Individual cell culture experiments can be accommodated in a variety of devices ranging from 'microgravity test tubes' or standard 100 mm Petri dishes, to complex, fed-batch bioreactors with automated culture feeding, waste removal and multiple sample draws. Up to 3 levels of containment can be achieved for chemical fixative addition, and passive gas exchange can be provided through hydrophobic membranes. Many additional options exist for designing customized hardware depending on specific science requirements.  相似文献   

15.
High-throughput analyses of single-cell microscopy data are a critical tool within the field of bacterial cell biology. Several programs have been developed to specifically segment bacterial cells from phase-contrast images. Together with spot and object detection algorithms, these programs offer powerful approaches to quantify observations from microscopy data, ranging from cell-to-cell genealogy to localization and movement of proteins. Most segmentation programs contain specific post-processing and plotting options, but these options vary between programs and possibilities to optimize or alter the outputs are often limited. Therefore, we developed BactMAP (Bacterial toolbox for Microscopy Analysis & Plotting), a command-line based R package that allows researchers to transform cell segmentation and spot detection data generated by different programs into various plots. Furthermore, BactMAP makes it possible to perform custom analyses and change the layout of the output. Because BactMAP works independently of segmentation and detection programs, inputs from different sources can be compared within the same analysis pipeline. BactMAP complies with standard practice in R which enables the use of advanced statistical analysis tools, and its graphic output is compatible with ggplot2, enabling adjustable plot graphics in every operating system. User feedback will be used to create a fully automated Graphical User Interface version of BactMAP in the future. Using BactMAP, we visualize key cell cycle parameters in Bacillus subtilis and Staphylococcus aureus, and demonstrate that the DNA replication forks in Streptococcus pneumoniae dissociate and associate before splitting of the cell, after the Z-ring is formed at the new quarter positions. BactMAP is available from https://veeninglab.com/bactmap .  相似文献   

16.
From a user's point-of-view we are in the Golden Age of protein crystallographic software. In the past few decades, solving protein structures has gone from a task requiring man-months of effort to a process requiring minutes on an ordinary laptop with no human intervention required. The birth of XtalView coincided with the mainstream use of synchrotron radiation, seleno-Met phasing and it continues to be used in this age of robotic crystallization, Fed-Ex data collection and fully automated structure solution "pipelines". This article is a retrospective history of protein crystallographic computing and a discussion of the current state of the art.  相似文献   

17.
Next‐generation sequencing (NGS) experiments are often performed in biomedical research nowadays, leading to methodological challenges related to the high‐dimensional and complex nature of the recorded data. In this work we review some of the issues that arise in disorder detection from NGS experiments, that is, when the focus is the detection of deletion and duplication disorders for homozygosity and heterozygosity in DNA sequencing. A statistical model to cope with guanine/cytosine bias and phasing and prephasing phenomena at base level is proposed, and a goodness‐of‐fit procedure for disorder detection is derived. The method combines the proper evaluation of local p‐values (one for each DNA base) with suitable corrections for multiple comparisons and the discrete nature of the p‐values. A global test for the detection of disorders in the whole DNA region is proposed too. The performance of the introduced procedures is investigated through simulations. A real data illustration is provided.  相似文献   

18.
The molecular graphics program Sculptor and the command-line suite Situs are software packages for the integration of biophysical data across spatial resolution scales. Herein, we provide an overview of recently developed tools relevant to cryo-electron tomography (cryo-ET), with an emphasis on functionality supported by Situs 2.7.1 and Sculptor 2.1.1. We describe a work flow for automatically segmenting filaments in cryo-ET maps including denoising, local normalization, feature detection, and tracing. Tomograms of cellular actin networks exhibit both cross-linked and bundled filament densities. Such filamentous regions in cryo-ET data sets can then be segmented using a stochastic template-based search, VolTrac. The approach combines a genetic algorithm and a bidirectional expansion with a tabu search strategy to localize and characterize filamentous regions. The automated filament segmentation by VolTrac compares well to a manual one performed by expert users, and it allows an efficient and reproducible analysis of large data sets. The software is free, open source, and can be used on Linux, Macintosh or Windows computers.  相似文献   

19.
We report here the structure of BstYI, an "intermediate" type II restriction endonuclease with overlapping sequence specificities to BamHI and BglII. BstYI, a thermophilic endonuclease, recognizes and cleaves the degenerate hexanucleotide sequence 5'-RGATCY-3' (where R=A or G and Y=C or T), cleaving DNA after the 5'-R on each strand to produce four-base (5') staggered ends. The crystal structure of free BstYI was solved at 1.85A resolution by multi-wavelength anomalous dispersion (MAD) phasing. Comparison with BamHI and BglII reveals a strong structural consensus between all three enzymes mapping to the alpha/beta core domain and residues involved in catalysis. Unexpectedly, BstYI also contains an additional "arm" substructure outside of the core protein, which enables the enzyme to adopt a more compact, intertwined dimer structure compared with BamHI and BglII. This arm substructure may underlie the thermostability of BstYI. We identify putative DNA recognition residues and speculate as to how this enzyme achieves a "relaxed" DNA specificity.  相似文献   

20.
The detection of patterns in monitoring data of vital signs is of great importance for adequate bedside decision support in critical care. Currently used alarm systems, which are based on fixed thresholds and independency assumptions, are not satisfactory in clinical practice. Time series techniques such as AR‐models consider autocorrelations within the series, which can be used for pattern recognition in the data. For practical applications in intensive care the data analysis has to be automated. An important issue is the suitable choice of the model order which is difficult to accomplish online. In a comparative case‐study we analyzed 34564 univariate time series of hemodynamic variables in critically ill patients by autoregressive models of different orders and compared the results of pattern detection. AR(2)‐models seem to be most suitable for the detection of clinically relevant patterns, thus affirming that treating the data as independent leads to false alarms. Moreover, using AR(2)‐models affords only short estimation periods. These findings for pattern detection in intensive care data are of medical importance as they justify a preselection of a model order, easing further automated statistical online analysis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号