期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

galaxieEST: addressing EST identity through automated phylogenetic analysis

R Henrik Nilsson Balaji Rajashekar Karl-Henrik Larsson Bj?rn M Ursing 《BMC bioinformatics》2004,5(1):87

Background

Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. 相似文献

2.

The Impact of Outgroup Choice and Missing Data on Major Seed Plant Phylogenetics Using Genome-Wide EST Data

Jose Eduardo de la Torre-Bárcena Sergios-Orestis Kolokotronis Ernest K. Lee Dennis Wm. Stevenson Eric D. Brenner Manpreet S. Katari Gloria M. Coruzzi Rob DeSalle 《PloS one》2009,4(6)

Background

Genome level analyses have enhanced our view of phylogenetics in many areas of the tree of life. With the production of whole genome DNA sequences of hundreds of organisms and large-scale EST databases a large number of candidate genes for inclusion into phylogenetic analysis have become available. In this work, we exploit the burgeoning genomic data being generated for plant genomes to address one of the more important plant phylogenetic questions concerning the hierarchical relationships of the several major seed plant lineages (angiosperms, Cycadales, Gingkoales, Gnetales, and Coniferales), which continues to be a work in progress, despite numerous studies using single, few or several genes and morphology datasets. Although most recent studies support the notion that gymnosperms and angiosperms are monophyletic and sister groups, they differ on the topological arrangements within each major group.

Methodology

We exploited the EST database to construct a supermatrix of DNA sequences (over 1,200 concatenated orthologous gene partitions for 17 taxa) to examine non-flowering seed plant relationships. This analysis employed programs that offer rapid and robust orthology determination of novel, short sequences from plant ESTs based on reference seed plant genomes. Our phylogenetic analysis retrieved an unbiased (with respect to gene choice), well-resolved and highly supported phylogenetic hypothesis that was robust to various outgroup combinations.

Conclusions

We evaluated character support and the relative contribution of numerous variables (e.g. gene number, missing data, partitioning schemes, taxon sampling and outgroup choice) on tree topology, stability and support metrics. Our results indicate that while missing characters and order of addition of genes to an analysis do not influence branch support, inadequate taxon sampling and limited choice of outgroup(s) can lead to spurious inference of phylogeny when dealing with phylogenomic scale data sets. As expected, support and resolution increases significantly as more informative characters are added, until reaching a threshold, beyond which support metrics stabilize, and the effect of adding conflicting characters is minimized. 相似文献

3.

OrthoInspector: comprehensive orthology analysis and visual exploration

Benjamin Linard Julie D Thompson Olivier Poch Odile Lecompte 《BMC bioinformatics》2011,12(1):11

Background

The accurate determination of orthology and inparalogy relationships is essential for comparative sequence analysis, functional gene annotation and evolutionary studies. Various methods have been developed based on either simple blast all-versus-all pairwise comparisons and/or time-consuming phylogenetic tree analyses. 相似文献

4.

A Hybrid Distance Measure for Clustering Expressed Sequence Tags Originating from the Same Gene Family

Keng-Hoong Ng Chin-Kuan Ho Somnuk Phon-Amnuaisuk 《PloS one》2012,7(10)

相似文献

5.

Using ESTs to improve the accuracy of de novo gene prediction

Chaochun Wei Michael R Brent 《BMC bioinformatics》2006,7(1):327

Background

ESTs are a tremendous resource for determining the exon-intron structures of genes, but even extensive EST sequencing tends to leave many exons and genes untouched. Gene prediction systems based exclusively on EST alignments miss these exons and genes, leading to poor sensitivity. De novo gene prediction systems, which ignore ESTs in favor of genomic sequence, can predict such "untouched" exons, but they are less accurate when predicting exons to which ESTs align. TWINSCAN is the most accurate de novo gene finder available for nematodes and N-SCAN is the most accurate for mammals, as measured by exact CDS gene prediction and exact exon prediction. 相似文献

6.

prot4EST: Translating Expressed Sequence Tags from neglected genomes

James?D?Wasmuth Email author Mark?L?Blaxter 《BMC bioinformatics》2004,5(1):187

相似文献

7.

HaMStR: Profile hidden markov model based search for orthologs in ESTs

Ingo Ebersberger Sascha Strauss Arndt von Haeseler 《BMC evolutionary biology》2009,9(1):157-9

Background

EST sequencing is a versatile approach for rapidly gathering protein coding sequences. They provide direct access to an organism's gene repertoire bypassing the still error-prone procedure of gene prediction from genomic data. Therefore, ESTs are often the only source for biological sequence data from taxa outside mainstream interest. The widespread use of ESTs in evolutionary studies and particularly in molecular systematics studies is still hindered by the lack of efficient and reliable approaches for automated ortholog predictions in ESTs. Existing methods either depend on a known species tree or cannot cope with redundancy in EST data. 相似文献

8.

EST2uni: an open,parallel tool for automated EST analysis and database creation,with a data mining web interface and microarray expression data integration

Javier Forment Francisco Gilabert Antonio Robles Vicente Conejero Fernando Nuez Jose M Blanca 《BMC bioinformatics》2008,9(1):5

Background

Expressed sequence tag (EST) collections are composed of a high number of single-pass, redundant, partial sequences, which need to be processed, clustered, and annotated to remove low-quality and vector regions, eliminate redundancy and sequencing errors, and provide biologically relevant information. In order to provide a suitable way of performing the different steps in the analysis of the ESTs, flexible computation pipelines adapted to the local needs of specific EST projects have to be developed. Furthermore, EST collections must be stored in highly structured relational databases available to researchers through user-friendly interfaces which allow efficient and complex data mining, thus offering maximum capabilities for their full exploitation. 相似文献

9.

Large-scale identification of polymorphic microsatellites using an <Emphasis Type="Italic">in silico</Emphasis> approach

Jifeng Tang Samantha J Baldwin Jeanne ME Jacobs C Gerard van der Linden Roeland E Voorrips Jack AM Leunissen Herman van Eck Ben Vosman 《BMC bioinformatics》2008,9(1):374

Background

Simple Sequence Repeat (SSR) or microsatellite markers are valuable for genetic research. Experimental methods to develop SSR markers are laborious, time consuming and expensive. In silico approaches have become a practicable and relatively inexpensive alternative during the last decade, although testing putative SSR markers still is time consuming and expensive. In many species only a relatively small percentage of SSR markers turn out to be polymorphic. This is particularly true for markers derived from expressed sequence tags (ESTs). In EST databases a large redundancy of sequences is present, which may contain information on length-polymorphisms in the SSR they contain, and whether they have been derived from heterozygotes or from different genotypes. Up to now, although a number of programs have been developed to identify SSRs in EST sequences, no software can detect putatively polymorphic SSRs. 相似文献

10.

Alternative splicing and protein function 总被引：1，自引：0，他引：1

AD?Neverov II?Artamonova RN?Nurtdinov D?Frishman MS?Gelfand Email author AA?Mironov 《BMC bioinformatics》2005,6(1):266

Background

Alternative splicing is a major mechanism of generating protein diversity in higher eukaryotes. Although at least half, and probably more, of mammalian genes are alternatively spliced, it was not clear, whether the frequency of alternative splicing is the same in different functional categories. The problem is obscured by uneven coverage of genes by ESTs and a large number of artifacts in the EST data. 相似文献

11.

Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

Shaolin Wang Eric Peatman Jason Abernathy Geoff Waldbieser Erika Lindquist Paul Richardson Susan Lucas Mei Wang Ping Li Jyothi Thimmapuram Lei Liu Deepika Vullaganti Huseyin Kucuktas Christopher Murdock Brian C Small Melanie Wilson Hong Liu Yanliang Jiang Yoona Lee Fei Chen Jianguo Lu Wenqi Wang Peng Xu Benjaporn Somridhivej Puttharat Baoprasertkul Jonas Quilang Zhenxia Sha Baolong Bao Yaping Wang Qun Wang Tomokazu Takano Samiran Nandi Shikai Liu Lilian Wong Ludmilla Kaltenboeck Sylvie Quiniou Eva Bengten Norman Miller John Trant Daniel Rokhsar Zhanjiang Liu 《Genome biology》2010,11(1):1-14

相似文献

12.

Detection and mapping of mtDNA SNPs in Atlantic salmon using high throughput DNA sequencing

Olafur Fridjonsson Kristinn Olafsson Scott Tompsett Snaedis Bjornsdottir Sonia Consuegra David Knox Carlos Garcia de Leaniz Steinunn Magnusdottir Gudbjorg Olafsdottir Eric Verspoor Sigridur Hjorleifsdottir 《BMC genomics》2011,12(1):1-10

相似文献

13.

Pepper EST database: comprehensive <Emphasis Type="Italic">in silico</Emphasis> tool for analyzing the chili pepper (<Emphasis Type="Italic">Capsicum annuum</Emphasis>) transcriptome

Hyun-Jin Kim Kwang-Hyun Baek Seung-Won Lee JungEun Kim Bong-Woo Lee Hye-Sun Cho Woo Taek Kim Doil Choi Cheol-Goo Hur 《BMC plant biology》2008,8(1):101

Background

There is no dedicated database available for Expressed Sequence Tags (EST) of the chili pepper (Capsicum annuum), although the interest in a chili pepper EST database is increasing internationally due to the nutritional, economic, and pharmaceutical value of the plant. Recent advances in high-throughput sequencing of the ESTs of chili pepper cv. Bukang have produced hundreds of thousands of complementary DNA (cDNA) sequences. Therefore, a chili pepper EST database was designed and constructed to enable comprehensive analysis of chili pepper gene expression in response to biotic and abiotic stresses. 相似文献

14.

Isolation, in silico characterization and chromosomal localization of a group of cDNAs from ciliated epithelial cells after in vitro ciliogenesis

Maiti AK Jorissen M Bouvagnet P 《Genome biology》2001,2(7):research0026.1-research00269

Background

Immotile cilia syndrome (ICS) or primary ciliary dyskinesia (PCD) is an autosomal recessive disorder in humans in which the beating of cilia and sperm flagella is impaired. Ciliated epithelial cell linings are present in many tissues. To understand ciliary assembly and motility, it is important to isolate those genes involved in the process.

Results

Total RNA was isolated from cultured ciliated nasal epithelial cells after in vitro ciliogenesis and expressed sequenced tags (ESTs) were generated. The functions and locations of 63 of these ESTs were derived by BLAST from two public databases. These ESTs are grouped into various classes. One group has high homology not only with the mitochondrial genome but also with one or more chromosomal DNAs, suggesting that very similar genes, or genes with very similar domains, are expressed from both mitochondrial and nuclear DNA. A second class comprises genes with complete homology with part of a known gene, suggesting that they are the same genes. A third group has partial homology with domains of known genes. A fourth group, constituting 33% of the ESTs characterized, has no significant homology with any gene or EST in the database.

Conclusions

We have shown that sufficient information about the location of ESTs could be derived electronically from the recently completed human genome sequences. This strategy of EST localization should be significantly useful for mapping and identification of new genes in the forthcoming human genome sequences with the vast number of ESTs in the dbEST database. 相似文献

15.

OligoSpawn: a software tool for the design of overgo probes from large unigene datasets

Jie Zheng Jan T Svensson Kavitha Madishetty Timothy J Close Tao Jiang Stefano Lonardi 《BMC bioinformatics》2006,7(1):7

Background

Expressed sequence tag (EST) datasets represent perhaps the largest collection of genetic information. ESTs can be exploited in a variety of biological experiments and analysis. Here we are interested in the design of overlapping oligonucleotide (overgo) probes from large unigene (EST-contigs) datasets. 相似文献

16.

Influence of genetic background on the occurrence of chromosomal rearrangements in Saccharomyces cerevisiae

Emilie S Fritsch Joseph Schacherer Claudine Bleykasten-Grosshans Jean-Luc Souciet Serge Potier Jacky de Montigny 《BMC genomics》2009,10(1):1-9

相似文献

17.

Utility of EST-derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachis wild species 总被引：1，自引：0，他引：1

Xuanqiang Liang Xiaoping Chen Yanbin Hong Haiyan Liu Guiyuan Zhou Shaoxiong Li Baozhu Guo 《BMC plant biology》2009,9(1):35-9

Background

Lack of sufficient molecular markers hinders current genetic research in peanuts (Arachis hypogaea L.). It is necessary to develop more molecular markers for potential use in peanut genetic research. With the development of peanut EST projects, a vast amount of available EST sequence data has been generated. These data offered an opportunity to identify SSR in ESTs by data mining. 相似文献

18.

Development and production of an oligonucleotide MuscleChip: use for validation of ambiguous ESTs

Rehannah?HA?Borup Stefano?Toppo Yi-Wen?Chen Tanya?M?Teslovich Gerolamo?Lanfranchi Giorgio?Valle Eric?P?Hoffman Email author 《BMC bioinformatics》2002,3(1):33

相似文献

19.

GO-Diff: Mining functional differentiation between EST-based transcriptomes

Zuozhou Chen Weilin Wang Xuefeng Bruce Ling Jane Jijun Liu Liangbiao Chen 《BMC bioinformatics》2006,7(1):72-13

相似文献

20.

Strengths and weaknesses of EST-based prediction of tissue-specific alternative splicing

Shobhit Gupta Dorothea Zink Bernhard Korn Martin Vingron Stefan A Haas 《BMC genomics》2004,5(1):72-8

相似文献