Similar articles
1.

Background

Complete genome annotation is a necessary tool as Anopheles gambiae researchers probe the biology of this potent malaria vector.

Results

We reannotate the A. gambiae genome by synthesizing comparative and ab initio sets of predicted coding sequences (CDSs) into a single set using an exon-gene-union algorithm followed by an open-reading-frame-selection algorithm. The reannotation predicts 20,970 CDSs supported by at least two lines of evidence, and it lowers the proportion of CDSs lacking start and/or stop codons to approximately 4%. The reannotated CDS set includes 4,681 novel CDSs not represented in the Ensembl annotation but with EST support, and another 4,031 Ensembl-supported genes that undergo major structural and, therefore, probably functional changes in the reannotated set. The quality and accuracy of the reannotation were assessed by comparison with end sequences from 20,249 full-length cDNA clones, and evaluation of mass spectrometry peptide hit rates from an A. gambiae shotgun proteomic dataset confirms that the reannotated CDSs offer a high-quality protein database for proteomics. We provide a functional proteomics annotation, ReAnoXcel, obtained by analysis of the new CDSs through the AnoXcel pipeline, which allows functional comparisons of the CDS sets within the same bioinformatic platform. CDS data are available for download.
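The abstract does not describe how its open-reading-frame-selection step works. As a rough illustration only, the sketch below shows one generic way to pick an ORF from a merged exon sequence: scan the three forward frames and keep the longest stretch running from an ATG to the first in-frame stop codon. The function name `longest_orf` and the longest-ORF criterion are assumptions made for this example, not the authors' method.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq: str) -> str:
    """Return the longest ATG...stop ORF on the forward strand.

    Hypothetical helper for illustration; returns "" if no complete ORF exists.
    """
    seq = seq.upper()
    best = ""
    for frame in range(3):
        start = None  # index of the currently open ATG in this frame, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]  # include the stop codon
                if len(orf) > len(best):
                    best = orf
                start = None
    return best


print(longest_orf("CCATGAAATAGGATGCCCGGGTTTTAA"))
```

A candidate that has both a start and a stop codon yields a non-empty result, which is the property the reannotation reports for all but roughly 4% of its CDSs.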

Conclusion

Comprehensive A. gambiae genome reannotation is achieved through a combination of comparative and ab initio gene prediction algorithms.

2.

Background

Infection of plants by pathogens and the subsequent disease development involves substantial changes in the biochemistry and physiology of both partners. Analysis of genes that are expressed during these interactions represents a powerful strategy to obtain insights into the molecular events underlying these changes. We have employed expressed sequence tag (EST) analysis to identify rice genes involved in defense responses against infection by the blast fungus Magnaporthe oryzae and fungal genes involved in infectious growth within the host during a compatible interaction.

Results

A cDNA library was constructed with RNA from rice leaves (Oryza sativa cv. Hwacheong) infected with M. oryzae strain KJ201. To enrich for fungal genes, a subtraction library was constructed by PCR-based suppression subtractive hybridization, with RNA from infected rice leaves as the tester and RNA from uninfected rice leaves as the driver. A total of 4,148 clones from the two libraries were sequenced to generate 2,302 non-redundant ESTs. Of these, 712 and 1,562 ESTs were identified as encoding fungal and rice genes, respectively. To predict gene function, Gene Ontology (GO) analysis was applied, with 31% and 32% of rice and fungal ESTs being assigned to GO terms, respectively. One hundred uniESTs were found to be specific to the fungal-infection EST set. More than 80 full-length fungal cDNA sequences were used to validate ab initio annotated gene models of the M. oryzae genome sequence.

Conclusion

This study shows the power of ESTs to refine genome annotation and functional characterization. The results of this work have advanced our understanding of the molecular mechanisms underpinning fungal-plant interactions and formed the basis for new hypotheses.

3.

Background

Anopheles gambiae is the main vector of Plasmodium falciparum in Africa. The mosquito midgut constitutes a barrier that the parasite must cross if it is to develop and be transmitted. Despite the central role of the mosquito midgut in the host/parasite interaction, little is known about its protein composition. Characterisation of An. gambiae midgut proteins may identify the proteins that render An. gambiae receptive to the malaria parasite.

Methods

We carried out two-dimensional gel electrophoresis of An. gambiae midgut proteins and compared protein profiles for midguts from males, sugar-fed females and females fed on human blood.

Results

Very few differences were detected between male and female mosquitoes for the approximately 375 silver-stained proteins. Male midguts contained ten proteins not detected in sugar-fed or blood-fed females, which are therefore probably involved in male-specific functions; conversely, female midguts contained twenty-three proteins absent from male midguts. Eight of these proteins were specific to sugar-fed females, and another ten, to blood-fed females.

Conclusion

Mass spectrometry analysis of the proteins found only in blood-fed female midguts, together with data from the recent sequencing of the An. gambiae genome, should make it possible to determine the role of these proteins in blood digestion or parasite receptivity.

4.
5.
6.

Background

The question of whether Plasmodium falciparum infection affects the fitness of mosquito vectors remains open. A hurdle to resolving this question is the lack of appropriate control, non-infected mosquitoes that can be compared to the infected ones. It was shown recently that heating P. falciparum gametocyte-infected blood before feeding it to malaria vectors inhibits the infection. Therefore, the same source of gametocyte-infected blood could be divided into two parts, one heated, serving as the control, and the other unheated, allowing the comparison of infected and uninfected mosquitoes that fed on otherwise identical blood. However, before using this method to characterize the cost of infection to mosquitoes, it is necessary to establish whether feeding on previously heated blood affects the survival and fecundity of mosquito females.

Methods

Anopheles gambiae M molecular form females were exposed to heated versus non-heated, parasite-free human blood to mimic blood meals on non-infectious versus infectious gametocyte-containing blood. Life history traits of mosquito females fed on heat-treated or untreated blood were then compared.

Results

The results reveal that heat treatment of the blood did not affect the survival and fecundity of mosquito females. Consistently, blood heat treatment did not affect the quantity of blood ingested.

Conclusions

The study indicates that heat inactivation of gametocyte-infected blood will only inhibit mosquito infection and that this method is suitable for quantifying the fitness cost incurred by mosquitoes upon infection by P. falciparum.

7.
8.

Background

In eukaryotic cells, oxidative phosphorylation (OXPHOS) uses the products of both nuclear and mitochondrial genes to generate cellular ATP. Interspecies comparative analysis of these genes, which appear to be under strong functional constraints, may shed light on the evolutionary mechanisms that act on a set of genes correlated by function and subcellular localization of their products.

Results

We have identified and annotated the Drosophila melanogaster, D. pseudoobscura and Anopheles gambiae orthologs of 78 nuclear genes encoding mitochondrial proteins involved in oxidative phosphorylation by a comparative analysis of their genomic sequences and organization. We have also identified 47 genes in these three dipteran species each of which shares significant sequence homology with one of the above-mentioned OXPHOS orthologs, and which are likely to have originated by duplication during evolution. Gene structure and intron length are essentially conserved in the three species, although gain or loss of introns is common in A. gambiae. In most tissues of D. melanogaster and A. gambiae the expression level of the duplicate gene is much lower than that of the original gene, and in D. melanogaster at least, its expression is almost always strongly testis-biased, in contrast to the soma-biased expression of the parent gene.

Conclusions

Quickly achieving an expression pattern different from the parent genes may be required for new OXPHOS gene duplicates to be maintained in the genome. This may be a general evolutionary mechanism for originating phenotypic changes that could lead to species differentiation.

9.

Background

Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project.

Results

We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, sequence proximity and "bridges" were identified with an e-value distance graph and used to manually break the clusters into smaller groups. Assembly of the resolved ESTs generated a set of 75,488 unique sequences (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology.
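The abstract does not define its "bridges" precisely, and the clusters were broken manually. One plausible reading, offered here only as an assumption, is the graph-theoretic one: in a graph whose nodes are ESTs and whose edges are significant pairwise hits, a bridge is an edge whose removal splits a cluster in two. The minimal sketch below finds such edges by brute force on a small, connected toy graph; `connected` and `find_bridges` are hypothetical helper names.

```python
from collections import defaultdict


def connected(nodes, edges):
    """Return True if the undirected graph (nodes, edges) is one component."""
    adj = defaultdict(set)
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    nodes = list(nodes)
    seen, stack = {nodes[0]}, [nodes[0]]
    while stack:
        for nxt in adj[stack.pop()]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return len(seen) == len(nodes)


def find_bridges(edges):
    """Return edges whose removal disconnects the graph.

    Assumes the input edges form a single connected cluster.
    """
    nodes = {n for edge in edges for n in edge}
    return [e for e in edges
            if not connected(nodes, [x for x in edges if x != e])]


# Toy cluster: two tight groups (a, b, c) and (d, e, f) joined by one hit.
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"),
             ("c", "d"),
             ("d", "e"), ("e", "f"), ("d", "f")]
print(find_bridges(toy_edges))
```

Cutting the single reported edge separates the toy cluster into its two groups. Brute force is quadratic in the number of edges, so a cluster with thousands of ESTs would need a linear-time bridge-finding algorithm instead.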

Conclusion

We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals.

10.
Maiti AK, Jorissen M, Bouvagnet P. Genome Biology 2001, 2(7):research0026.1-research0026.9

Background

Immotile cilia syndrome (ICS) or primary ciliary dyskinesia (PCD) is an autosomal recessive disorder in humans in which the beating of cilia and sperm flagella is impaired. Ciliated epithelial cell linings are present in many tissues. To understand ciliary assembly and motility, it is important to isolate those genes involved in the process.

Results

Total RNA was isolated from cultured ciliated nasal epithelial cells after in vitro ciliogenesis, and expressed sequence tags (ESTs) were generated. The functions and locations of 63 of these ESTs were derived by BLAST from two public databases. These ESTs are grouped into various classes. One group has high homology not only with the mitochondrial genome but also with one or more chromosomal DNAs, suggesting that very similar genes, or genes with very similar domains, are expressed from both mitochondrial and nuclear DNA. A second class comprises genes with complete homology with part of a known gene, suggesting that they are the same genes. A third group has partial homology with domains of known genes. A fourth group, constituting 33% of the ESTs characterized, has no significant homology with any gene or EST in the database.

Conclusions

We have shown that sufficient information about the location of ESTs could be derived electronically from the recently completed human genome sequences. This strategy of EST localization should be significantly useful for mapping and identification of new genes in the forthcoming human genome sequences with the vast number of ESTs in the dbEST database.

11.

Background

Plants growing in their natural habitat represent a valuable resource for elucidating mechanisms of acclimation to environmental constraints. Populus euphratica is a salt-tolerant tree species growing in saline semi-arid areas. To identify genes involved in abiotic stress responses under natural conditions we constructed several normalized and subtracted cDNA libraries from control, stress-exposed and desert-grown P. euphratica trees. In addition, we identified several metabolites in desert-grown P. euphratica trees.

Results

About 14,000 expressed sequence tag (EST) sequences were obtained with a good representation of genes putatively involved in resistance and tolerance to salt and other abiotic stresses. A P. euphratica DNA microarray with a uni-gene set of ESTs representing approximately 6,340 different genes was constructed. The microarray was used to study gene expression in adult P. euphratica trees growing in the desert canyon of Ein Avdat in Israel. In parallel, 22 selected metabolites were profiled in the same trees.

Conclusion

Of the obtained ESTs, 98% were found in the sequenced P. trichocarpa genome and 74% in other Populus EST collections. This implies that the P. euphratica genome does not contain different genes per se, but that regulation of gene expression might be different and that P. euphratica expresses a different set of genes that contribute to adaptation to saline growth conditions. Also, all of the five measured amino acids show increased levels in trees growing in the more saline soil.

12.
13.

Background

Since the initial publication of its complete genome sequence, Arabidopsis thaliana has become more important than ever as a model for plant research. However, the initial genome annotation was submitted by multiple centers using inconsistent methods, making the data difficult to use for many applications.

Results

Over the course of three years, TIGR has completed its effort to standardize the structural and functional annotation of the Arabidopsis genome. Using both manual and automated methods, Arabidopsis gene structures were refined and gene products were renamed and assigned to Gene Ontology categories. We present an overview of the methods employed, tools developed, and protocols followed, summarizing the contents of each data release with special emphasis on our final annotation release (version 5).

Conclusion

Over the entire period, several thousand new genes and pseudogenes were added to the annotation. Approximately one third of the originally annotated gene models were significantly refined yielding improved gene structure annotations, and every protein-coding gene was manually inspected and classified using Gene Ontology terms.

14.
15.
16.
17.
18.
19.

Background

Large collections of expressed sequence tags (ESTs) are a fundamental resource for analysis of gene expression and annotation of genome sequences. We generated 116,899 ESTs from 17 normalized and two non-normalized cDNA libraries representing 16 tissues from tilapia, a cichlid fish widely used in aquaculture and biological research.

Results

The ESTs were assembled into 20,190 contigs and 36,028 singletons for a total of 56,218 unique sequences and a total assembled length of 35,168,415 bp. Over the whole project, a unique sequence was discovered for every 2.079 sequence reads. Of these unique sequences, 17,722 (31.5%) had significant BLAST hits (e-value < 10^-10) to the UniProt database.
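The summary figures in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below is not part of the original study; the variable names are illustrative, and every number is taken from the abstract.

```python
# Figures quoted in the abstract (variable names are illustrative).
total_reads = 116_899   # ESTs generated
contigs = 20_190
singletons = 36_028
blast_hits = 17_722     # unique sequences with significant UniProt hits

unique_sequences = contigs + singletons
reads_per_unique = total_reads / unique_sequences
hit_fraction = blast_hits / unique_sequences

print(f"unique sequences: {unique_sequences}")
print(f"reads per unique sequence: {reads_per_unique:.3f}")
print(f"share with UniProt hits: {hit_fraction:.1%}")
```

The three derived values agree with the reported 56,218 unique sequences, 2.079 reads per unique sequence, and 31.5% hit rate.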

Conclusion

Normalization of the cDNA pools with double-stranded nuclease allowed us to efficiently sequence a large collection of ESTs. These sequences are an important resource for studies of gene expression, comparative mapping and annotation of the forthcoming tilapia genome sequence.

20.