期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: High-resolution annotation for microarrays

Jun Lu Joseph C Lee Marc L Salit Margaret C Cam 《BMC bioinformatics》2007,8(1):108

Background

Extracting biological information from high-density Affymetrix arrays is a multi-step process that begins with the accurate annotation of microarray probes. Shortfalls in the original Affymetrix probe annotation have been described; however, few studies have provided rigorous solutions for routine data analysis. 相似文献

2.

Protozoan genomes: gene identification and annotation

Worthey EA Myler PJ 《International journal for parasitology》2005,35(5):495-512

The draft sequence of several complete protozoan genomes is now available and genome projects are ongoing for a number of other species. Different strategies are being implemented to identify and annotate protein coding and RNA genes in these genomes, as well as study their genomic architecture. Since the genomes vary greatly in size, GC-content, nucleotide composition, and degree of repetitiveness, genome structure is often a factor in choosing the methodology utilised for annotation. In addition, the approach taken is dictated, to a greater or lesser extent, by the particular reasons for carrying out genome-wide analyses and the level of funding available for projects. Nevertheless, these projects have provided a plethora of material that will aid in understanding the biology and evolution of these parasites, as well as identifying new targets that can be used to design urgently required drug treatments for the diseases they cause. 相似文献

3.

Pairagon+N-SCAN_EST: a model-based gene annotation pipeline

Arumugam M Wei C Brown RH Brent MR 《Genome biology》2006,7(Z1):S5.1-S510

相似文献

4.

GeneMachine: gene prediction and sequence annotation

Makalowska I Ryan JF Baxevanis AD 《Bioinformatics (Oxford, England)》2001,17(9):843-844

MOTIVATION: A number of free-standing programs have been developed in order to help researchers find potential coding regions and deduce gene structure for long stretches of what is essentially 'anonymous DNA'. As these programs apply inherently different criteria to the question of what is and is not a coding region, multiple algorithms should be used in the course of positional cloning and positional candidate projects to assure that all potential coding regions within a previously-identified critical region are identified. RESULTS: We have developed a gene identification tool called GeneMachine which allows users to query multiple exon and gene prediction programs in an automated fashion. BLAST searches are also performed in order to see whether a previously-characterized coding region corresponds to a region in the query sequence. A suite of Perl programs and modules are used to run MZEF, GENSCAN, GRAIL 2, FGENES, RepeatMasker, Sputnik, and BLAST. The results of these runs are then parsed and written into ASN.1 format. Output files can be opened using NCBI Sequin, in essence using Sequin as both a workbench and as a graphical viewer. The main feature of GeneMachine is that the process is fully automated; the user is only required to launch GeneMachine and then open the resulting file with Sequin. Annotations can then be made to these results prior to submission to GenBank, thereby increasing the intrinsic value of these data. AVAILABILITY: GeneMachine is freely-available for download at http://genome.nhgri.nih.gov/genemachine. A public Web interface to the GeneMachine server for academic and not-for-profit users is available at http://genemachine.nhgri.nih.gov. The Web supplement to this paper may be found at http://genome.nhgri.nih.gov/genemachine/supplement/. 相似文献

5.

MiMiR: a comprehensive solution for storage,annotation and exchange of microarray data

Mahendra?Navarange Laurence?Game Derek?Fowler Vihar?Wadekar Helen?Banks Nicola?Cooley Fatimah?Rahman Justin?Hinshelwood Peter?Broderick Helen?C?Causton Email author 《BMC bioinformatics》2005,6(1):268

Background

The generation of large amounts of microarray data presents challenges for data collection, annotation, exchange and analysis. Although there are now widely accepted formats, minimum standards for data content and ontologies for microarray data, only a few groups are using them together to build and populate large-scale databases. Structured environments for data management are crucial for making full use of these data. 相似文献

6.

GFam: a platform for automatic annotation of gene families

Rajkumar Sasidharan Tam��s Nepusz David Swarbreck Eva Huala Alberto Paccanaro 《Nucleic acids research》2012,40(19):e152

We have developed GFam, a platform for automatic annotation of gene/protein families. GFam provides a framework for genome initiatives and model organism resources to build domain-based families, derive meaningful functional labels and offers a seamless approach to propagate functional annotation across periodic genome updates. GFam is a hybrid approach that uses a greedy algorithm to chain component domains from InterPro annotation provided by its 12 member resources followed by a sequence-based connected component analysis of un-annotated sequence regions to derive consensus domain architecture for each sequence and subsequently generate families based on common architectures. Our integrated approach increases sequence coverage by 7.2 percentage points and residue coverage by 14.6 percentage points higher than the coverage relative to the best single-constituent database within InterPro for the proteome of Arabidopsis. The true power of GFam lies in maximizing annotation provided by the different InterPro data sources that offer resource-specific coverage for different regions of a sequence. GFam’s capability to capture higher sequence and residue coverage can be useful for genome annotation, comparative genomics and functional studies. GFam is a general-purpose software and can be used for any collection of protein sequences. The software is open source and can be obtained from http://www.paccanarolab.org/software/gfam/. 相似文献

7.

EST analysis and annotation of transcripts derived from a trichome-specific cDNA library from Salvia fruticosa

Fani M. Chatzopoulou Antonios M. Makris Anagnostis Argiriou Jörg Degenhardt Angelos K. Kanellis 《Plant cell reports》2010,29(5):523-534

相似文献

8.

Community-based gene structure annotation

Schlueter SD Wilkerson MD Huala E Rhee SY Brendel V 《Trends in plant science》2005,10(1):9-14

Uncertainty and inconsistency of gene structure annotation remain limitations on research in the genome era, frustrating both biologists and bioinformaticians, who have to sort out annotation errors for their genes of interest or to generate trustworthy datasets for algorithmic development. It is unrealistic to hope for better software solutions in the near future that would solve all the problems. The issue is all the more urgent with more species being sequenced and analyzed by comparative genomics - erroneous annotations could easily propagate, whereas correct annotations in one species will greatly facilitate annotation of novel genomes. We propose a dynamic, economically feasible solution to the annotation predicament: broad-based, web-technology-enabled community annotation, a prototype of which is now in use for Arabidopsis. 相似文献

9.

On gene ontology and function annotation

下载免费PDF全文

Pal D 《Bioinformation》2006,1(3):97-98

The effort of function annotation does not merely involve associating a gene with some structured vocabulary that describes action. Rather the details of the actions, the components of the actions, the larger context of the actions are important issues that are of direct relevance, because they help understand the biological system to which the gene/protein belongs. Currently Gene Ontology (GO) Consortium offers the most comprehensive sets of relationships to describe gene/protein activity. However, its choice to segregate gene ontology to subdomains of molecular function, biological process and cellular component is creating significant limitations in terms of future scope of use. If we are to understand biology in its total complexity, comprehensive ontologies in larger biological domains are essential. A vigorous discussion on this topic is necessary for the larger benefit of the biological community. I highlight this point because larger-bio-domain ontologies cannot be simply created by integrating subdomain ontologies. Relationships in larger bio-domain-ontologies are more complex due to larger size of the system and are therefore more labor intensive to create. The current limitations of GO will be a handicap in derivation of more complex relationships from the high throughput biology data. 相似文献

10.

Text detective: a rule-based system for gene annotation in biomedical texts

Tamames J 《BMC bioinformatics》2005,6(Z1):S10

相似文献

11.

ExonHunter: a comprehensive approach to gene finding

Brejová B Brown DG Li M Vinar T 《Bioinformatics (Oxford, England)》2005,21(Z1):i57-i65

相似文献

12.

Complexity of automated gene annotation

Nikoloski Z Grimbs S Klie S Selbig J 《Bio Systems》2011,104(1):1-8

Describing the determinants of robustness of biological systems has become one of the central questions in systems biology. Despite the increasing research efforts, it has proven difficult to arrive at a unifying definition for this important concept. We argue that this is due to the multifaceted nature of the concept of robustness and the possibility to formally capture it at different levels of systemic formalisms (e.g., topology and dynamic behavior). Here we provide a comprehensive review of the existing definitions of robustness pertaining to metabolic networks. As kinetic approaches have been excellently reviewed elsewhere, we focus on definitions of robustness proposed within graph-theoretic and constraint-based formalisms. 相似文献

13.

Evidence-based annotation of transcripts and proteins in the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough

Price MN Deutschbauer AM Kuehl JV Liu H Witkowska HE Arkin AP 《Journal of bacteriology》2011,193(20):5716-5727

相似文献

14.

Mass spectrometry-based prokaryote gene annotation

Ishino Y Okada H Ikeuchi M Taniguchi H 《Proteomics》2007,7(22):4053-4065

MS combined with database searching has become the preferred method for identifying proteins present in cell or tissue samples. The technique enables us to execute large-scale proteome analyses of species whose genomes have already been sequenced. Searching mass spectrometric data against protein databases composed of annotated genes has been widely conducted. However, there are some issues with this technique; wrong annotations in protein databases cause deterioration in the accuracy of protein identification, and only proteins that have already been annotated can be identified. We propose a new framework that can detect correct ORFs by integrating an MS/MS proteomic data mapping and a knowledge-based system regarding the translation initiation sites. This technique can provide correction of predicted coding sequences, together with the possibility of identifying novel genes. We have developed a computational system; it should first conduct the probabilistic peptide-matching against all possible translational frames using MS/MS data, then search for discriminative DNA patterns around the detected peptides, and lastly integrate the facts using empirical knowledge stored in knowledge bases to obtain correct ORFs. We used photosynthetic bacteria Synechocystis sp. PCC6803 as a sample prokaryote, resulting in the finding of 14 N-terminus annotation errors and several new candidate genes. 相似文献

15.

Vimentin gene expression during myogenesis: two functional transcripts from a single copy gene 总被引：7，自引：2，他引：7

下载免费PDF全文

Z E Zehner B M Paterson 《Nucleic acids research》1983,11(23):8317-8332

相似文献

16.

LongSAGE analysis significantly improves genome annotation: identifications of novel genes and alternative transcripts in the mouse

Wahl MB Heinzmann U Imai K 《Bioinformatics (Oxford, England)》2005,21(8):1393-1400

相似文献

17.

WNP: a novel algorithm for gene products annotation from weighted functional networks

Magi A Tattini L Benelli M Giusti B Abbate R Ruffo S 《PloS one》2012,7(6):e38767

Predicting the biological function of all the genes of an organism is one of the fundamental goals of computational system biology. In the last decade, high-throughput experimental methods for studying the functional interactions between gene products (GPs) have been combined with computational approaches based on Bayesian networks for data integration. The result of these computational approaches is an interaction network with weighted links representing connectivity likelihood between two functionally related GPs. The weighted network generated by these computational approaches can be used to predict annotations for functionally uncharacterized GPs. Here we introduce Weighted Network Predictor (WNP), a novel algorithm for function prediction of biologically uncharacterized GPs. Tests conducted on simulated data show that WNP outperforms other 5 state-of-the-art methods in terms of both specificity and sensitivity and that it is able to better exploit and propagate the functional and topological information of the network. We apply our method to Saccharomyces cerevisiae yeast and Arabidopsis thaliana networks and we predict Gene Ontology function for about 500 and 10000 uncharacterized GPs respectively. 相似文献

18.

ChipInfo: Software for extracting gene annotation and gene ontology information for microarray analysis

下载免费PDF全文

Zhong S Li C Wong WH 《Nucleic acids research》2003,31(13):3483-3486

To date, assembling comprehensive annotation information for all probe sets of any Affymetrix microarrays remains a time-consuming, error-prone and challenging task. ChipInfo is designed for retrieving annotation information from online databases such as NetAffx and Gene Ontology and organizing such information into easily interpretable tabular format outputs. As companion software to dChip and GoSurfer, ChipInfo enables users to independently update the information resource files of these software packages. It also has functions for computing related summary statistics of probe sets and Gene Ontology terms. ChipInfo is available at http://biosun1.harvard.edu/complab/chipinfo/. 相似文献

19.

IDconverter and IDClight: Conversion and annotation of gene and protein IDs

Andreu Alibés Patricio Yankilevich Andrés Cañada Ramón Díaz-Uriarte 《BMC bioinformatics》2007,8(1):9

Background

Researchers involved in the annotation of large numbers of gene, clone or protein identifiers are usually required to perform a one-by-one conversion for each identifier. When the field of research is one such as microarray experiments, this number may be around 30,000. 相似文献

20.

The odorant-binding proteins of Drosophila melanogaster: annotation and characterization of a divergent gene family 总被引：3，自引：0，他引：3

Graham LA Davies PL 《Gene》2002,292(1-2):43-55

相似文献