期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Occurrence of protein structure elements in conserved sequence regions

Einat Sitbon Shmuel Pietrokovski 《BMC structural biology》2007,7(1):3

Background

Conserved protein sequence regions are extremely useful for identifying and studying functionally and structurally important regions. By means of an integrated analysis of large-scale protein structure and sequence data, structural features of conserved protein sequence regions were identified. 相似文献

2.

ABC: software for interactive browsing of genomic multiple sequence alignment data

Gregory?M?Cooper Senthil?AG?Singaravelu Arend?Sidow Email author 《BMC bioinformatics》2004,5(1):192

Background

Alignment and comparison of related genome sequences is a powerful method to identify regions likely to contain functional elements. Such analyses are data intensive, requiring the inclusion of genomic multiple sequence alignments, sequence annotations, and scores describing regional attributes of columns in the alignment. Visualization and browsing of results can be difficult, and there are currently limited software options for performing this task. 相似文献

3.

Morphological and Hemodynamic Discriminators for Rupture Status in Posterior Communicating Artery Aneurysms

Nan Lv Chi Wang Christof Karmonik Yibin Fang Jinyu Xu Ying Yu Wei Cao Jianmin Liu Qinghai Huang 《PloS one》2016,11(2)

Background and Purpose

The conflicting findings of previous morphological and hemodynamic studies on intracranial aneurysm rupture may be caused by the relatively small sample sizes and the variation in location of the patient-specific aneurysm models. We aimed to determine the discriminators for aneurysm rupture status by focusing on only posterior communicating artery (PCoA) aneurysms.

Materials and Methods

In 129 PCoA aneurysms (85 ruptured, 44 unruptured), clinical, morphological and hemodynamic characteristics were compared between the ruptured and unruptured cases. Multivariate logistic regression analysis was performed to determine the discriminators for rupture status of PCoA aneurysms.

Results

While univariate analyses showed that the size of aneurysm dome, aspect ratio (AR), size ratio (SR), dome-to-neck ratio (DN), inflow angle (IA), normalized wall shear stress (NWSS) and percentage of low wall shear stress area (LSA) were significantly associated with PCoA aneurysm rupture status. With multivariate analyses, significance was only retained for higher IA (OR = 1.539, p < 0.001) and LSA (OR = 1.393, p = 0.041).

Conclusions

Hemodynamics and morphology were related to rupture status of intracranial aneurysms. Higher IA and LSA were identified as discriminators for rupture status of PCoA aneurysms. 相似文献

4.

A database of phylogenetically atypical genes in archaeal and bacterial genomes,identified using the <Emphasis Type="Italic">DarkHorse</Emphasis> algorithm

Sheila Podell Terry Gaasterland Eric E Allen 《BMC bioinformatics》2008,9(1):419

Background

The process of horizontal gene transfer (HGT) is believed to be widespread in Bacteria and Archaea, but little comparative data is available addressing its occurrence in complete microbial genomes. Collection of high-quality, automated HGT prediction data based on phylogenetic evidence has previously been impractical for large numbers of genomes at once, due to prohibitive computational demands. DarkHorse, a recently described statistical method for discovering phylogenetically atypical genes on a genome-wide basis, provides a means to solve this problem through lineage probability index (LPI) ranking scores. LPI scores inversely reflect phylogenetic distance between a test amino acid sequence and its closest available database matches. Proteins with low LPI scores are good horizontal gene transfer candidates; those with high scores are not. 相似文献

5.

Targeted enrichment beyond the consensus coding DNA sequence exome reveals exons with higher variant densities

Bainbridge MN Wang M Wu Y Newsham I Muzny DM Jefferies JL Albert TJ Burgess DL Gibbs RA 《Genome biology》2011,12(7):R68

Background

Enrichment of loci by DNA hybridization-capture, followed by high-throughput sequencing, is an important tool in modern genetics. Currently, the most common targets for enrichment are the protein coding exons represented by the consensus coding DNA sequence (CCDS). The CCDS, however, excludes many actual or computationally predicted coding exons present in other databases, such as RefSeq and Vega, and non-coding functional elements such as untranslated and regulatory regions. The number of variants per base pair (variant density) and our ability to interrogate regions outside of the CCDS regions is consequently less well understood. 相似文献

6.

Length-dependent prediction of protein intrinsic disorder 总被引：2，自引：0，他引：2

Kang Peng Predrag Radivojac Slobodan Vucetic A Keith Dunker Zoran Obradovic 《BMC bioinformatics》2006,7(1):208-17

Background

Due to the functional importance of intrinsically disordered proteins or protein regions, prediction of intrinsic protein disorder from amino acid sequence has become an area of active research as witnessed in the 6th experiment on Critical Assessment of Techniques for Protein Structure Prediction (CASP6). Since the initial work by Romero et al. (Identifying disordered regions in proteins from amino acid sequences, IEEE Int. Conf. Neural Netw., 1997), our group has developed several predictors optimized for long disordered regions (>30 residues) with prediction accuracy exceeding 85%. However, these predictors are less successful on short disordered regions (≤30 residues). A probable cause is a length-dependent amino acid compositions and sequence properties of disordered regions. 相似文献

7.

Reranking candidate gene models with cross-species comparison for improved gene prediction

Qian Liu Koby Crammer Fernando CN Pereira David S Roos 《BMC bioinformatics》2008,9(1):433

Background

Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc). Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. 相似文献

8.

Scipio: Using protein sequences to determine the precise exon/intron structures of genes and their orthologs in closely related species

Oliver Keller Florian Odronitz Mario Stanke Martin Kollmar Stephan Waack 《BMC bioinformatics》2008,9(1):278

Background

For many types of analyses, data about gene structure and locations of non-coding regions of genes are required. Although a vast amount of genomic sequence data is available, precise annotation of genes is lacking behind. Finding the corresponding gene of a given protein sequence by means of conventional tools is error prone, and cannot be completed without manual inspection, which is time consuming and requires considerable experience. 相似文献

9.

A comparative study of conservation and variation scores

Fredrik Johansson Hiroyuki Toh 《BMC bioinformatics》2010,11(1):388

Background

Conservation and variation scores are used when evaluating sites in a multiple sequence alignment, in order to identify residues critical for structure or function. A variety of scores are available today but it is not clear how different scores relate to each other. 相似文献

10.

Mapping sequences by parts

Gilles Didier Carito Guziolowski 《Algorithms for molecular biology : AMB》2007,2(1):11-15

Background:

We present the N-map method, a pairwise and asymmetrical approach which allows us to compare sequences by taking into account evolutionary events that produce shuffled, reversed or repeated elements. Basically, the optimal N-map of a sequence s over a sequence t is the best way of partitioning the first sequence into N parts and placing them, possibly complementary reversed, over the second sequence in order to maximize the sum of their gapless alignment scores. 相似文献

11.

Syntenator: Multiple gene order alignments with a gene-specific scoring function

Christian Rödelsperger Christoph Dieterich 《Algorithms for molecular biology : AMB》2008,3(1):14

Background

Identification of homologous regions or conserved syntenies across genomes is one crucial step in comparative genomics. This task is usually performed by genome alignment softwares like WABA or blastz. In case of conserved syntenies, such regions are defined as conserved gene orders. On the gene order level, homologous regions can even be found between distantly related genomes, which do not align on the nucleotide sequence level. 相似文献

12.

PacBio-LITS: a large-insert targeted sequencing method for characterization of human disease-associated chromosomal structural variations

Min Wang Christine R Beck Adam C English Qingchang Meng Christian Buhay Yi Han Harsha V Doddapaneni Fuli Yu Eric Boerwinkle James R Lupski Donna M Muzny Richard A Gibbs 《BMC genomics》2015,16(1)

Background

Generation of long (>5 Kb) DNA sequencing reads provides an approach for interrogation of complex regions in the human genome. Currently, large-insert whole genome sequencing (WGS) technologies from Pacific Biosciences (PacBio) enable analysis of chromosomal structural variations (SVs), but the cost to achieve the required sequence coverage across the entire human genome is high.

Results

We developed a method (termed PacBio-LITS) that combines oligonucleotide-based DNA target-capture enrichment technologies with PacBio large-insert library preparation to facilitate SV studies at specific chromosomal regions. PacBio-LITS provides deep sequence coverage at the specified sites at substantially reduced cost compared with PacBio WGS. The efficacy of PacBio-LITS is illustrated by delineating the breakpoint junctions of low copy repeat (LCR)-associated complex structural rearrangements on chr17p11.2 in patients diagnosed with Potocki–Lupski syndrome (PTLS; MIM#610883). We successfully identified previously determined breakpoint junctions in three PTLS cases, and also were able to discover novel junctions in repetitive sequences, including LCR-mediated breakpoints. The new information has enabled us to propose mechanisms for formation of these structural variants.

Conclusions

The new method leverages the cost efficiency of targeted capture-sequencing as well as the mappability and scaffolding capabilities of long sequencing reads generated by the PacBio platform. It is therefore suitable for studying complex SVs, especially those involving LCRs, inversions, and the generation of chimeric Alu elements at the breakpoints. Other genomic research applications, such as haplotype phasing and small insertion and deletion validation could also benefit from this technology.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1370-2) contains supplementary material, which is available to authorized users. 相似文献

13.

An algorithm for automated closure during assembly

Sergey Koren Jason R Miller Brian P Walenz Granger Sutton 《BMC bioinformatics》2010,11(1):457

Background

Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targeted addition by local re-assembly of gap regions. An obvious alternative uses de novo assembly of all the reads. 相似文献

14.

Applying genotyping (TILLING) and phenotyping analyses to elucidate gene function in a chemically induced sorghum mutant population 总被引：2，自引：0，他引：2

Zhanguo Xin Ming Li Wang Noelle A Barkley Gloria Burow Cleve Franks Gary Pederson John Burke 《BMC plant biology》2008,8(1):103

Background

Sorghum [Sorghum bicolor (L.) Moench] is ranked as the fifth most important grain crop and serves as a major food staple and fodder resource for much of the world, especially in arid and semi-arid regions. The recent surge in sorghum research is driven by its tolerance to drought/heat stresses and its strong potential as a bioenergy feedstock. Completion of the sorghum genome sequence has opened new avenues for sorghum functional genomics. However, the availability of genetic resources, specifically mutant lines, is limited. Chemical mutagenesis of sorghum germplasm, followed by screening for mutants altered in important agronomic traits, represents a rapid and effective means of addressing this limitation. Induced mutations in novel genes of interest can be efficiently assessed using the technique known as Targeting Induced Local Lesion IN Genomes (TILLING). 相似文献

15.

Predicting conserved protein motifs with Sub-HMMs

Kevin Horan Christian R Shelton Thomas Girke 《BMC bioinformatics》2010,11(1):205

Background

Profile HMMs (hidden Markov models) provide effective methods for modeling the conserved regions of protein families. A limitation of the resulting domain models is the difficulty to pinpoint their much shorter functional sub-features, such as catalytically relevant sequence motifs in enzymes or ligand binding signatures of receptor proteins. 相似文献

16.

Functional annotation by identification of local surface similarities: a novel tool for structural genomics

Fabrizio?Ferrè Email author Gabriele?Ausiello Andreas?Zanzoni Manuela?Helmer-Citterich 《BMC bioinformatics》2005,6(1):194

Background

Protein function is often dependent on subsets of solvent-exposed residues that may exist in a similar three-dimensional configuration in non homologous proteins thus having different order and/or spacing in the sequence. Hence, functional annotation by means of sequence or fold similarity is not adequate for such cases. 相似文献

17.

Multiple non-collinear TF-map alignments of promoter regions

Enrique Blanco Roderic Guigó Xavier Messeguer 《BMC bioinformatics》2007,8(1):138

Background

The analysis of the promoter sequence of genes with similar expression patterns is a basic tool to annotate common regulatory elements. Multiple sequence alignments are on the basis of most comparative approaches. The characterization of regulatory regions from co-expressed genes at the sequence level, however, does not yield satisfactory results in many occasions as promoter regions of genes sharing similar expression programs often do not show nucleotide sequence conservation. 相似文献

18.

A comparative map viewer integrating genetic maps for <Emphasis Type="Italic">Brassica</Emphasis> and <Emphasis Type="Italic">Arabidopsis</Emphasis>

Geraldine AC Lim Erica G Jewell Xi Li Timothy A Erwin Christopher Love Jacqueline Batley German Spangenberg David Edwards 《BMC plant biology》2007,7(1):40

Background

Molecular genetic maps provide a means to link heritable traits with underlying genome sequence variation. Several genetic maps have been constructed for Brassica species, yet to date, there has been no simple means to compare this information or to associate mapped traits with the genome sequence of the related model plant, Arabidopsis. 相似文献

19.

Identification of recurrent regions of copy-number variants across multiple individuals

Teo Shu Mei Agus Salim Stefano Calza Ku Chee Seng Chia Kee Seng Yudi Pawitan 《BMC bioinformatics》2010,11(1):147

Background

Algorithms and software for CNV detection have been developed, but they detect the CNV regions sample-by-sample with individual-specific breakpoints, while common CNV regions are likely to occur at the same genomic locations across different individuals in a homogenous population. Current algorithms to detect common CNV regions do not account for the varying reliability of the individual CNVs, typically reported as confidence scores by SNP-based CNV detection algorithms. General methodologies for identifying these recurrent regions, especially those directed at SNP arrays, are still needed. 相似文献

20.

Homopolymer tract length dependent enrichments in functional regions of 27 eukaryotes and their novel dependence on the organism DNA (G+C)% composition

Yue Zhou Jeffrey W Bizzaro Kenneth A Marx 《BMC genomics》2004,5(1):95

Background

DNA homopolymer tracts, poly(dA).poly(dT) and poly(dG).poly(dC), are the simplest of simple sequence repeats. Homopolymer tracts have been systematically examined in the coding, intron and flanking regions of a limited number of eukaryotes. As the number of DNA sequences publicly available increases, the representation (over and under) of homopolymer tracts of different lengths in these regions of different genomes can be compared. 相似文献