首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
【背景】由于甲基营养菌被发现的时间较短,而且可以生产吡咯喹啉醌(pyrroloquinoline quinone,PQQ)的甲基杆菌属细菌只有少数菌株的全基因组序列被公布,增加了该类细菌基因组学和生物代谢途径研究的难度。【目的】将本实验室筛选的PQQ生产菌经多种诱变方式处理,用于提高PQQ的发酵产量。对高产突变菌株进行全基因组解析,以探究甲基杆菌PQQ合成的分子机制,为后续分子育种提供序列背景信息。【方法】将野生型PQQ生产菌株进行紫外诱变、亚硝基胍诱变、甲基磺酸乙酯诱变、硫酸二乙酯诱变和紫外-氯化锂复合诱变。将突变菌株利用PromethION三代测序平台和MGISEQ-2000二代测序平台测序,然后进行组装和功能注释。组装得到的全基因组序列与模式菌株扭脱甲基杆菌AM1 (Methylobacterium extorquens AM1)进行比较基因组学分析。【结果】经11轮诱变获得一株突变菌株NI91,其PQQ产量为19.49mg/L,相较原始菌株提高44.91%。突变菌株NI91的基因组由一个5 409 262 bp的染色体组成,共编码4 957个蛋白,与模式菌株M. extorqu...  相似文献   

2.
We have collected over half a million splice sites from five species-Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana-and classified them into four subtypes: U2-type GT-AG and GC-AG and U12-type GT-AG and AT-AC. We have also found new examples of rare splice-site categories, such as U12-type introns without canonical borders, and U2-dependent AT-AC introns. The splice-site sequences and several tools to explore them are available on a public website (SpliceRack). For the U12-type introns, we find several features conserved across species, as well as a clustering of these introns on genes. Using the information content of the splice-site motifs, and the phylogenetic distance between them, we identify: (i) a higher degree of conservation in the exonic portion of the U2-type splice sites in more complex organisms; (ii) conservation of exonic nucleotides for U12-type splice sites; (iii) divergent evolution of C.elegans 3' splice sites (3'ss) and (iv) distinct evolutionary histories of 5' and 3'ss. Our study proves that the identification of broad patterns in naturally-occurring splice sites, through the analysis of genomic datasets, provides mechanistic and evolutionary insights into pre-mRNA splicing.  相似文献   

3.
The human genome sequence provides a reference point from which we can compare ourselves with other organisms. Interspecies comparison is a powerful tool for inferring function from genomic sequence and could ultimately lead to the discovery of what makes humans unique. To date, most comparative sequencing has focused on pair-wise comparisons between human and a limited number of other vertebrates, such as mouse. Targeted approaches now exist for mapping and sequencing vertebrate bacterial artificial chromosomes (BACs) from numerous species, allowing rapid and detailed molecular and phylogenetic investigation of multi-megabase loci. Such targeted sequencing is complementary to current whole-genome sequencing projects, and would benefit greatly from the creation of BAC libraries from a diverse range of vertebrates.  相似文献   

4.
Restauro-G: A Rapid Genome Re-Annotation System for Comparative Genomics   总被引:1,自引:0,他引:1  
of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid data preparation of a large number of complete genomes, automation and speed are important for genome re-annotation. Here we introduce an open-source rapid genome re-annotation software system, Restauro-G, specialized for bacterial genomes. Restauro-G re-annotates a genome by similarity searches utilizing the BLASTLike Alignment Tool, referring to protein databases such as UniProt KB, NCBI nr, NCBI COGs, Pfam, and PSORTb. Re-annotation by Restauro-G achieved over 98% accuracy for most bacterial chromosomes in comparison with the original manually curated annotation of EMBL releases. Restauro-G was developed in the generic bioinformatics workbench G-language Genome Analysis Environment and is distributed at http://restauro-g.iab.keio.ac.jp/ under the GNU General Public License.  相似文献   

5.
草地贪夜蛾基因组注释及分析   总被引:2,自引:0,他引:2       下载免费PDF全文
草地贪夜蛾Spodoptera frugiperda近年来在我国迅速扩散,造成了重大的经济损失,引起社会关注。草地贪夜蛾基因组序列对深入研究其迁飞、入侵和抗药性等特性具有十分重要的作用。目前,已有5个版本的基因组序列被公开报道,但3个版本无基因组注释信息。除以Sf 9细胞系为DNA来源的基因组版本外,其他版本的scaffold N50过小,拼接质量偏低。为此,本研究选取了scaffold N50最大的草地贪夜蛾Sf 9细胞系基因组进行了蛋白编码基因注释。该版本的基因组重复序列占比28.1%。CEGMA评估显示该本版本基因组可覆盖93.6%的核心基因,BUSCO评估显示可覆盖90.8%的核心基因。利用OMIGA注释流程预测到25 699个蛋白质编码基因,详细的基因序列可从InsectBase网站获得(http://www.insect-genome.com/FAW/),其中具有GO注释的基因为15 623个,具有KEGG注释的基因共有9 213个。选取了12个鳞翅目昆虫进行比较基因组学分析,发现草地贪夜蛾与斜纹夜蛾的亲缘关系最近,两者分化时间大约在1 284万年前。对12个鳞翅目昆虫蛋白质编码基因进行同源分析,在草地贪夜蛾中发现了2 490个单拷贝基因、891个鳞翅目特有基因、2 360个物种特异扩增基因和4 180个物种特异基因。GO富集分析显示,2 360个物种特异扩增基因主要参与DNA整合、代谢相关的生物过程;4 180个物种特异基因主要参与酶活性、光感受、糖代谢等,KEGG通路富集发现草地贪夜蛾特异基因主要参与氨基酸代谢、糖代谢和Wnt信号通路。本研究结果丰富了草地贪夜蛾的基因信息,对进一步了解其生物学特性、开发新型绿色防控方法具有指导意义。  相似文献   

6.
7.
For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown that the minimum number of genes from each species that need to be compared to produce a reliable phylogeny is about 20. Yeast has also become an attractive model to study speciation in eukaryotes, especially to understand molecular mechanisms behind the establishment of reproductive isolation. Comparison of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide the background to use more yeast species in model studies, to combat pathogens and for efficient manipulation of industrial strains.  相似文献   

8.
The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found strong evidence for 1,300 new C. elegans genes. In addition, comparisons of the two genomes will help to understand the evolutionary forces that mold nematode genomes.  相似文献   

9.
10.
11.
12.
13.
The animal in the genome: comparative genomics and evolution   总被引:1,自引:0,他引:1  
Comparisons between completely sequenced metazoan genomes have generally emphasized how similar their encoded protein content is, even when the comparison is between phyla. Given the manifest differences between phyla and, in particular, intuitive notions that some animals are more complex than others, this creates something of a paradox. Simplistic explanations have included arguments such as increased numbers of genes; greater numbers of protein products produced through alternative splicing; increased numbers of regulatory non-coding RNAs and increased complexity of the cis-regulatory code. An obvious value of complete genome sequences lies in their ability to provide us with inventories of such components. I examine progress being made in linking genome content to the pattern of animal evolution, and argue that the gap between genomic and phenotypic complexity can only be understood through the totality of interacting components.  相似文献   

14.

Background

Is it possible to construct an accurate and detailed subgene-level map of a genome using bacterial artificial chromosome (BAC) end sequences, a sparse marker map, and the sequences of other genomes?

Results

A sheep BAC library, CHORI-243, was constructed and the BAC end sequences were determined and mapped with high sensitivity and low specificity onto the frameworks of the human, dog, and cow genomes. To maximize genome coverage, the coordinates of all BAC end sequence hits to the cow and dog genomes were also converted to the equivalent human genome coordinates. The 84,624 sheep BACs (about 5.4-fold genome coverage) with paired ends in the correct orientation (tail-to-tail) and spacing, combined with information from sheep BAC comparative genome contigs (CGCs) built separately on the dog and cow genomes, were used to construct 1,172 sheep BAC-CGCs, covering 91.2% of the human genome. Clustered non-tail-to-tail and outsize BACs located close to the ends of many BAC-CGCs linked BAC-CGCs covering about 70% of the genome to at least one other BAC-CGC on the same chromosome. Using the BAC-CGCs, the intrachromosomal and interchromosomal BAC-CGC linkage information, human/cow and vertebrate synteny, and the sheep marker map, a virtual sheep genome was constructed. To identify BACs potentially located in gaps between BAC-CGCs, an additional set of 55,668 sheep BACs were positioned on the sheep genome with lower confidence. A coordinate conversion process allowed us to transfer human genes and other genome features to the virtual sheep genome to display on a sheep genome browser.

Conclusion

We demonstrate that limited sequencing of BACs combined with positioning on a well assembled genome and integrating locations from other less well assembled genomes can yield extensive, detailed subgene-level maps of mammalian genomes, for which genomic resources are currently limited.  相似文献   

15.
InterPro, an integrated documentation resource for protein families, protein domains, and functional sites, was developed to amalgamate the individual efforts of the PROSITE, PRINTS, Pfam, and ProDom databases. InterPro can be used for the computational functional classification of newly determined amino acid sequences that lack biochemical characterization and for comparative genome analysis. InterPro contains over 3500 entries for more than 1 000 000 hits in SWISS-PROT and TrEMBL. The database is accessible for text-and sequence-based searches at http://www.ebi.ac.uk/interpro/. InterPro was used for the complete analysis of the proteome of the pathogenic microorganism Mycobacterium tuberculosis and the comparison with the predicted protein-coding sequences of the complete genomes of Bacillus subtilis and Escherichia coli. It was found that 64.8% of proteins in the proteome of M. tuberculosis matched InterPro entries and can be classified by their functions. The comparison with B. subtilis and E. coli provided information on the most common protein families and domains and on the most highly represented protein families in each organism. Thus, InterPro is a useful tool for general comparison of complete proteomes and their compositions.  相似文献   

16.
17.
Systematic comparison of the current repertoire of virulence-associated genes for three Pseudomonas syringae strains with complete genome sequences, P. syringae pv. tomato DC3,000, P. syringae pv. phaseolicola 1448A, and P. syringae pv. syringae B728a, is prompted by recent advances in virulence factor identification in P. syringae and other bacteria. Among these are genes linked to epiphytic fitness, plant- and insect-active toxins, secretion pathways, and virulence regulators, all reflected in the recently updated DC3,000 genome annotation. Distribution of virulence genes in relation to P. syringae genome organization was analyzed to distinguish patterns of conservation among genomes and association between genes and mobile genetic elements. Variable regions were identified on the basis of deviation in sequence composition and gaps in syntenic alignment among the three genomes. Mapping gene location relative to the genome structure revealed strong segregation of the HrpL regulon with variable genome regions (VR), divergent distribution patterns for toxin genes depending on association with plant or insect pathogenesis, and patterns of distribution for other virulence genes that highlight potential sources of strain-to-strain differences in host interaction. Distribution of VR among other sequenced bacterial genomes was analyzed and future plans for characterization of this potential reservoir of virulence genes are discussed.  相似文献   

18.
Bonin A 《Molecular ecology》2008,17(16):3583-3584
Population genomics is an increasingly popular approach to investigate the genetic basis of adaptation and speciation at the genome scale. However, it has so far largely failed to go beyond the mere identification of anonymous markers displaying selection signatures. Will population genomics ever be up to our expectations and able to really pinpoint genes underlying adaptation and speciation processes? In this issue of Molecular Ecology, Namroud et al. use population genomics to investigate local adaptation in natural populations of a conifer tree, the white spruce (Picea glauca). They show how population and functional genomics can finally converge with the deployment of the next generation of genome scans, which target gene‐rich regions rather than the whole genome.  相似文献   

19.
All bovine chromosomes are now represented by a syntenic group, a linkage map and at least one in situ hybridization. Almost 1000 loci are mapped, about 300 of which are coding sequences useful for comparative mapping and evolutionary inference. Economically important loci are beginning to appear on bovine linkage maps and enhanced comparative maps are likely to be necessary to identify many of these genes by a comparative positional candidate gene approach.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号