期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs

Victor X Jin Gregory AC Singer Francisco J Agosto-Pérez Sandya Liyanarachchi Ramana V Davuluri 《BMC bioinformatics》2006,7(1):114-13

Background

The canonical core promoter elements consist of the TATA box, initiator (Inr), downstream core promoter element (DPE), TFIIB recognition element (BRE) and the newly-discovered motif 10 element (MTE). The motifs for these core promoter elements are highly degenerate, which tends to lead to a high false discovery rate when attempting to detect them in promoter sequences. 相似文献

2.

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs

Bartek Wilczynski Norbert Dojer Mateusz Patelak Jerzy Tiuryn 《BMC bioinformatics》2009,10(1):82

相似文献

3.

HMM-ModE – Improved classification using profile hidden Markov models by optimising the discrimination threshold and modifying emission probabilities with negative training sequences

Prashant K Srivastava Dhwani K Desai Soumyadeep Nandi Andrew M Lynn 《BMC bioinformatics》2007,8(1):104

Background

Profile Hidden Markov Models (HMM) are statistical representations of protein families derived from patterns of sequence conservation in multiple alignments and have been used in identifying remote homologues with considerable success. These conservation patterns arise from fold specific signals, shared across multiple families, and function specific signals unique to the families. The availability of sequences pre-classified according to their function permits the use of negative training sequences to improve the specificity of the HMM, both by optimizing the threshold cutoff and by modifying emission probabilities to minimize the influence of fold-specific signals. A protocol to generate family specific HMMs is described that first constructs a profile HMM from an alignment of the family's sequences and then uses this model to identify sequences belonging to other classes that score above the default threshold (false positives). Ten-fold cross validation is used to optimise the discrimination threshold score for the model. The advent of fast multiple alignment methods enables the use of the profile alignments to align the true and false positive sequences, and the resulting alignments are used to modify the emission probabilities in the original model. 相似文献

4.

Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST

E Michael Gertz Yi-Kuo Yu Richa Agarwala Alejandro A Schäffer Stephen F Altschul 《BMC biology》2006,4(1):41-14

相似文献

5.

Effect of false positive and false negative rates on inference of binding target conservation across different conditions and species from ChIP-chip data

Debayan Datta Hongyu Zhao 《BMC bioinformatics》2009,10(1):23

相似文献

6.

CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes

Matthew S Hestand Michiel van Galen Michel P Villerius Gert-Jan B van Ommen Johan T den Dunnen Peter AC 't Hoen 《BMC bioinformatics》2008,9(1):495

相似文献

7.

A novel approach to denoising ion trap tandem mass spectra

Jiarui Ding Jinhong Shi Guy G Poirier Fang-Xiang Wu 《Proteome science》2009,7(1):9-10

Background

Mass spectrometers can produce a large number of tandem mass spectra. They are unfortunately noise-contaminated. Noises can affect the quality of tandem mass spectra and thus increase the false positives and false negatives in the peptide identification. Therefore, it is appealing to develop an approach to denoising tandem mass spectra. 相似文献

8.

False occurrences of functional motifs in protein sequences highlight evolutionary constraints

Allegra Via Pier Federico Gherardini Enrico Ferraro Gabriele Ausiello Gianpaolo Scalia Tomba Manuela Helmer-Citterich 《BMC bioinformatics》2007,8(1):68

Background

False occurrences of functional motifs in protein sequences can be considered as random events due solely to the sequence composition of a proteome. Here we use a numerical approach to investigate the random appearance of functional motifs with the aim of addressing biological questions such as: How are organisms protected from undesirable occurrences of motifs otherwise selected for their functionality? Has the random appearance of functional motifs in protein sequences been affected during evolution?

Results

Here we analyse the occurrence of functional motifs in random sequences and compare it to that observed in biological proteomes; the behaviour of random motifs is also studied. Most motifs exhibit a number of false positives significantly similar to the number of times they appear in randomized proteomes (=expected number of false positives). Interestingly, about 3% of the analysed motifs show a different kind of behaviour and appear in biological proteomes less than they do in random sequences. In some of these cases, a mechanism of evolutionary negative selection is apparent; this helps to prevent unwanted functionalities which could interfere with cellular mechanisms.

Conclusion

Our thorough statistical and biological analysis showed that there are several mechanisms and evolutionary constraints both of which affect the appearance of functional motifs in protein sequences.

相似文献

9.

Poly purine.pyrimidine sequences upstream of the beta-galactosidase gene affect gene expression in Saccharomyces cerevisiae

Amit K Maiti Samir K Brahmachari 《BMC molecular biology》2001,2(1):11-7

相似文献

10.

MCALIGN2: Faster, accurate global pairwise alignment of non-coding DNA sequences based on explicit models of indel evolution

Jun Wang Peter D Keightley Toby Johnson 《BMC bioinformatics》2006,7(1):292-15

Background

Non-coding DNA sequences comprise a very large proportion of the total genomic content of mammals, most other vertebrates, many invertebrates, and most plants. Unraveling the functional significance of non-coding DNA depends on how well we are able to align non-coding DNA sequences. However, the alignment of non-coding DNA sequences is more difficult than aligning protein-coding sequences. 相似文献

11.

Ultra-fast sequence clustering from similarity networks with <Emphasis FontCategory="NonProportional">SiLiX</Emphasis>

Vincent Miele Simon Penel Laurent Duret 《BMC bioinformatics》2011,12(1):116

Background

The number of gene sequences that are available for comparative genomics approaches is increasing extremely quickly. A current challenge is to be able to handle this huge amount of sequences in order to build families of homologous sequences in a reasonable time. 相似文献

12.

Dissecting systems-wide data using mixture models: application to identify affected cellular processes

J?Peter?Svensson Renée?X?de Menezes Ingela?Turesson Micheline?Giphart-Gassler Harry?Vrieling Email author 《BMC bioinformatics》2005,6(1):177

Background

Functional analysis of data from genome-scale experiments, such as microarrays, requires an extensive selection of differentially expressed genes. Under many conditions, the proportion of differentially expressed genes is considerable, making the selection criteria a balance between the inclusion of false positives and the exclusion of false negatives. 相似文献

13.

Exploration of phylogenetic data using a global sequence analysis method

Charles?Chapus Christine?Dufraigne Scott?Edwards Alain?Giron Bernard?Fertil Patrick?Deschavanne Email author 《BMC evolutionary biology》2005,5(1):63

Background

Molecular phylogenetic methods are based on alignments of nucleic or peptidic sequences. The tremendous increase in molecular data permits phylogenetic analyses of very long sequences and of many species, but also requires methods to help manage large datasets. 相似文献

14.

A human RNA polymerase II subunit is encoded by a recently generated multigene family

Sylvie Grandemange Sophie Schaller Shigeru Yamano Stanislas Du Manoir George V Shpakovski Marie-Geneviève Mattei Claude Kedinger Marc Vigneron 《BMC molecular biology》2001,2(1):14

Background

The sequences encoding the yeast RNA polymerase II (RPB) subunits are single copy genes. 相似文献

15.

A fast algorithm for determining the best combination of local alignments to a query sequence

Gavin?C?Conant Email author Andreas?Wagner 《BMC bioinformatics》2004,5(1):62

Background

Existing sequence alignment algorithms assume that similarities between DNA or amino acid sequences are linearly ordered. That is, stretches of similar nucleotides or amino acids are in the same order in both sequences. Recombination perturbs this order. An algorithm that can reconstruct sequence similarity despite rearrangement would be helpful for reconstructing the evolutionary history of recombined sequences. 相似文献

16.

A machine learning strategy to identify candidate binding sites in human protein-coding sequence

Thomas Down Bernard Leong Tim JP Hubbard 《BMC bioinformatics》2006,7(1):419-13

相似文献

17.

<Emphasis Type="Italic">Phylo-mLogo</Emphasis>: an interactive and hierarchical multiple-logo visualization tool for alignment of many sequences

Arthur Chun-Chieh Shih DT Lee Chin-Lin Peng Yu-Wei Wu 《BMC bioinformatics》2007,8(1):63

Background

When aligning several hundreds or thousands of sequences, such as epidemic virus sequences or homologous/orthologous sequences of some big gene families, to reconstruct the epidemiological history or their phylogenies, how to analyze and visualize the alignment results of many sequences has become a new challenge for computational biologists. Although there are several tools available for visualization of very long sequence alignments, few of them are applicable to the alignments of many sequences. 相似文献

18.

A robust linear regression based algorithm for automated evaluation of peptide identifications from shotgun proteomics by use of reversed-phase liquid chromatography retention time

Hua Xu Lanhao Yang Michael A Freitas 《BMC bioinformatics》2008,9(1):347

Background

Rejection of false positive peptide matches in database searches of shotgun proteomic experimental data is highly desirable. Several methods have been developed to use the peptide retention time as to refine and improve peptide identifications from database search algorithms. This report describes the implementation of an automated approach to reduce false positives and validate peptide matches. 相似文献

19.

Predicting the sensitivity and specificity of published real-time PCR assays

Gordon H Lemmon Shea N Gardner 《Annals of clinical microbiology and antimicrobials》2008,7(1):1-10

Background

In recent years real-time PCR has become a leading technique for nucleic acid detection and quantification. These assays have the potential to greatly enhance efficiency in the clinical laboratory. Choice of primer and probe sequences is critical for accurate diagnosis in the clinic, yet current primer/probe signature design strategies are limited, and signature evaluation methods are lacking.

Methods

We assessed the quality of a signature by predicting the number of true positive, false positive and false negative hits against all available public sequence data. We found real-time PCR signatures described in recent literature and used a BLAST search based approach to collect all hits to the primer-probe combinations that should be amplified by real-time PCR chemistry. We then compared our hits with the sequences in the NCBI taxonomy tree that the signature was designed to detect.

Results

We found that many published signatures have high specificity (almost no false positives) but low sensitivity (high false negative rate). Where high sensitivity is needed, we offer a revised methodology for signature design which may designate that multiple signatures are required to detect all sequenced strains. We use this methodology to produce new signatures that are predicted to have higher sensitivity and specificity.

Conclusion

We show that current methods for real-time PCR assay design have unacceptably low sensitivities for most clinical applications. Additionally, as new sequence data becomes available, old assays must be reassessed and redesigned. A standard protocol for both generating and assessing the quality of these assays is therefore of great value. Real-time PCR has the capacity to greatly improve clinical diagnostics. The improved assay design and evaluation methods presented herein will expedite adoption of this technique in the clinical lab. 相似文献

20.

Ranking genes with respect to differential expression

Broberg P 《Genome biology》2002,3(9):preprint00-23

Background

In the pharmaceutical industry and in academia substantial efforts are made to make the best use of the promising microarray technology. The data generated by microarrays are more complex than most other biological data attracting much attention at this point. A method for finding an optimal test statistic with which to rank genes with respect to differential expression is outlined and tested. At the heart of the method lies an estimate of the false negative and false positive rates. Both investing in false positives and missing true positives lead to a waste of resources. The procedure sets out to minimise these errors. For calculation of the false positive and negative rates a simulation procedure is invoked. 相似文献