共查询到19条相似文献,搜索用时 46 毫秒
1.
近年来,随着测序技术的不断发展,基因组测序技术渐趋成熟并在动物和植物基因组上获得了越来越多的成功,大量植物的基因组的草图和精细图不断地被公布出来。比较和分析了三代测序技术各自的特点,对测序前的准备、基因组组装、注释和比较基因组学等方面的研究进展进行了详细的评述,阐明了植物基因组研究的特点和难点。通过植物的全基因组测序,研究者不仅可以获得该植物基因组和重要功能基因的序列信息,为从分子水平研究植物的分子进化、基因组成和基因调控等提供了一定的依据,而且还对即将测序的植物基因组研究具有重要的借鉴意义。 相似文献
2.
3.
金针菇是我国一种重要的食药用真菌,具有较高的营养和药用价值,尤其是从金针菇中分离得到了多种麦角甾醇、倍半萜等萜类,具有抗菌、抗肿瘤和降血脂等功效。以往关于金针菇的研究多集中在营养成分、栽培和市场开发等方面,而很少报道有关生物活性成分的合成和代谢研究。从福建省金针菇主要栽培品种中获得原生质体单核化L11菌株,我们进行了金针菇基因组测序与初步分析,并利用生物信息学手段,研究了其主要的生物活性成分——萜类合成途径中的关键基因。结果显示,金针菇单孢全基因组序列长度为34.75Mb。通过分析萜类合成途径中的相关基因,确定了4个萜类合成关键基因的基因结构,并且表明其蛋白结构具有稳定性,该类基因还含有多个信号肽和跨膜结构。金针菇基因组测序的完成,为下一步功能基因的筛选、分析等研究工作奠定了基础。 相似文献
4.
rDNA序列中的ITS作为DNA barcoding广泛应用于真菌的系统发育与物种辅助鉴定,IGS被认为可以用于种内水平不同菌株的鉴别。食用菌中还没有完整的rDNA序列的报道。本研究采用二代和三代测序技术分别对金针菇单核菌株“6-3”进行测序,用二代测序的数据对三代测序组装得到的基因组序列进行修正,得到一个在基因完整性、连续性和准确性均较好的基因组序列,对比Fibroporia vaillantii rDNA序列,获得金针菇完整的rDNA序列。金针菇rDNA序列结构分析表明,它有8个rDNA转录单元,长度均为5 903bp,有9个基因间隔区,其长度有较大差异,3 909-4 566bp。rDNA转录单元中,各元件的序列长度分别为:18S rDNA 1 796bp、ITS1 234bp、5.8S rDNA 173bp、ITS2 291bp、28S rDNA 3 410bp。基因间间隔区中,IGS1 1 351-1 399bp、5S rDNA 124bp、IGS2 2 435-3 092bp。金针菇的5S、5.8S、18S、28S rDNA序列准确性得到转录组数据的验证,也得到系统发育分析结果的支持。多序列比对发现,不同拷贝的基因间间隔区序列(IGS1和IGS2)存在丰富的多态性,多态性来源于SNP、InDel和TRS(串联重复序列),而TRS来源于重复单元的类型和数量。9个基因间间隔区之间,IGS1只有少量的SNP和InDel,IGS2不仅有SNP和InDel,还有TRS。本研究结果提示,在应用IGS进行种内水平不同菌株之间的鉴别时,需要选取不同拷贝之间的保守IGS序列。 相似文献
5.
二代测序技术的发展对测序数据的处理分析提出了很高的要求。目前二代测序数据分析软件很多, 但是绝大多数软件仅能完成单一的分析功能(例如:仅进行序列比对或变异读取或功能注释等), 如何能正确高效地选择整合这些软件已成为迫切需求。文章设计了一套基于perl语言和SGE资源管理的自动化处理流程来分析Illumina平台基因组测序数据。该流程以测序原始序列数据作为输入, 调用业界标准的数据处理软件(如:BWA, Samtools, GATK, ANNOVAR等), 最终生成带有相应功能注释、便于研究者进一步分析的变异位点列表。该流程通过自动化并行脚本控制流程的高效运行, 一站式输出分析结果和报告, 简化了数据分析过程中的人工操作, 大大提高了运行效率。用户只需填写配置文件或使用图形界面输入即可完成全部操作。该工作为广大研究者分析二代测序数据提供了便利的途径。 相似文献
6.
7.
香瓜茄又名人参果,具有抗氧化、抗肿瘤、抗糖尿病等多种生物活性。为丰富茄科作物基因组信息及进化发育历程,获取香瓜茄全基因组序列信息,同时为香瓜茄相关分子研究奠定基础。以香瓜茄植物组织为试验材料,基于Illumina HiSeq构建小片段文库进行基因组特征评估,利用PacBio三代测序技术、Hi-C技术构建及组装香瓜茄全基因组数据库。利用生物信息学方法对获得的基因组序列进行组装、功能注释以及进化分析研究。结果表明,获得54.11 Gb Illumina HiSeq数据;获得55.08 Gb PacBio数据,reads平均长度为14 179 bp;获得Hi-C数据量约143 Gb;拼接得到该基因组contig序列总长为1.16 Gb,Hi-C纠错后contig N50为22.63 Mb;Hi-C挂载染色体,共有1.12 Gb长度的序列可以挂载到12条染色体上,占比97.16%;其中,能够确定顺序和方向的序列长度为1.08 Gb,占定位染色体序列总长度的96.11%,得到基因组大小1.25 Gb;预测有64.22%的重复序列,41 571个基因,99.06%的基因可以注释到NR、GO、KEGG等数据库中;预测得到4 360个tRNA、5 677个rRNA、154个miRNA;得到449个假基因。香瓜茄与马铃薯的进化时间大约在12.82 MYA。 相似文献
8.
复杂基因组测序技术研究进展 总被引:1,自引:0,他引:1
复杂基因组指的是无法使用常规测序和组装手段直接解析的一类基因组,通常指包含高比例重复序列、高杂合度、极端GC含量、存在难消除异源DNA污染的基因组。为了解决复杂基因组的测序和组装问题,需要分别从基因组测序实验方法、测序技术平台、组装算法与策略3个方面进行深入研究。本文详细介绍了复杂基因组测序组装相关的现有技术与方法,并结合复杂基因组经典实例介绍了复杂基因组测序的技术解决途径和发展历程,可为制订合适的复杂基因组测序策略提供参考。 相似文献
9.
本研究对金针菇Flammulina velutipes的一个RNAi转化子菌株1382R3进行了高通量测序,以本实验室先前获得的野生型W23基因组数据为参考,分析了该转化子的基因插入位点以及拷贝数。转化子菌株1382R3是通过农杆菌介导将fv-hmg1-RNAi载体转化至金针菇菌株并通过PCR检测筛选标记而得到。通过BLAST将转化子测序的reads对外源载体和基因组定位,找到具有基因组序列(GS)和外源载体序列(ES)两种序列的临界reads,并据此使用PERL语言程序成功在转化子1382R3菌株中找到两个插入位点。对两个插入位置的序列分析表明:在插入位点1,T-DNA片段部分插入;在插入位点2,T-DNA全部插入到基因组。两个插入位点都对基因组内源基因的表达造成了一定的干扰。此方法拓宽了高通量测序技术的应用范围,将其运用到遗传转化插入位置和拷贝数的研究中,有利于食用菌的功能基因组及基因工程研究。 相似文献
10.
白芷为常用的药食同源物种,既是临床常用中药,又是香料,用途十分广泛。为获取白芷全基因组序列信息,该研究首次以杭白芷叶片DNA为材料,采用Nanopore测序技术构建杭白芷全基因组数据库,并利用生物信息学方法对获得的核苷酸序列进行组装、功能注释以及进化分析研究。结果表明:(1)原始测序数据过滤后获得662 Gb三代数据,Read N50约为32 932 bp,经过组装得到杭白芷基因组大小为5.6 Gb, Contig N50约为806 638 bp。(2)组装后的序列通过与KOG、GO、KEGG等功能数据库比对,得到了功能注释的基因占66.47%,KOG功能注释结果表明杭白芷的蛋白功能主要集中在一般功能预测、翻译后修饰、蛋白质转换、伴侣以及信号转导机制;GO功能分类表明杭白芷的基因集中在生物学过程及细胞组分;KEGG通路注释表明参与代谢途径的基因占主要地位。(3)杭白芷中鉴定到45个BGLU家族基因。该研究首次利用第三代测序技术对杭白芷全基因组进行解析,为杭白芷的系统生物学研究和BGLU在杭白芷生长发育中的后续功能研究提供了重要的理论参考。 相似文献
11.
D Scaglione S Lanteri A Acquadro Z Lai SJ Knapp L Rieseberg E Portis 《Plant biotechnology journal》2012,10(8):956-969
12.
13.
Graham M. Hughes Li Gang William J. Murphy Desmond G. Higgins Emma C. Teeling 《Molecular ecology resources》2013,13(3):510-521
The advent of Next Generation Sequencing Technology (NGST) has revolutionized molecular biology research, allowing for rapid gene/genome sequencing from a multitude of diverse species. As high throughput sequencing becomes more accessible, more efficient workflows must be developed to deal with the amounts of data produced and better assemble the genomes of de novo lineages. We combine traditional laboratory methods with Illumina NGST to amplify and sequence the largest mammalian multigene family, the Olfactory Receptor gene family, for species with and without a reference genome. We develop novel assembly methods to annotate and filter these data, which can be utilized for any gene family or any species. We find no significant difference between the ratio of genes within their respective gene families of our data compared with available genomic data. Using simulated data we explore the limitations of short‐read sequence data and our assembly in recovering this gene family. We highlight the benefits and shortcomings of these methods. Compared with data generated from traditional polymerase chain reaction, cloning and Sanger sequencing methodologies, sequence data generated using our pipeline increases yield and sequencing efficiency without reducing the number of unique genes amplified. A cloning step is not required, therefore shortening data generation time. The novel downstream methodologies and workflows described provide a tool to be utilized by many fields of biology, to access and analyze the vast quantities of data generated. By combining laboratory and in silico methods, we provide a means of extracting genomic information for multigene families without complete genome sequencing. 相似文献
14.
15.
Xiaoling Yu Wenqian Jiang Yang Shi Hanhui Ye Jun Lin 《Journal of cellular and molecular medicine》2019,23(11):7143-7150
Infectious diseases are a type of disease caused by pathogenic microorganisms. Although the discovery of antibiotics changed the treatment of infectious diseases and reduced the mortality of bacterial infections, resistant bacterial strains have emerged. Anti‐infective therapy based on aetiological evidence is the gold standard for clinical treatment, but the time lag and low positive culture rate of traditional methods of pathogen diagnosis leads to relative difficulty in obtaining the evidence of pathogens. Compared with traditional methods of pathogenic diagnosis, next‐generation and third‐generation sequencing technologies have many advantages in the detection of pathogenic microorganisms. In this review, we mainly introduce recent progress in research on pathogenic diagnostic technology and the applications of sequencing technology in the diagnosis of pathogenic microorganisms. This review provides new insights into the application of sequencing technology in the clinical diagnosis of microorganisms. 相似文献
16.
17.
In the realm of bioinformatics and computational biology,the most rudimentary data upon which all the analysis is built is the sequence data of genes,proteins and RNA.The sequence data of the entire genome is the solution to the genome assembly problem.The scope of this contribution is to provide an overview on the art of problem-solving applied within the domain of genome assembly in the nextgeneration sequencing(NGS) platforms.This article discusses the major genome assemblers that were proposed in the literature during the past decade by outlining their basic working principles.It is intended to act as a qualitative,not a quantitative,tutorial to all working on genome assemblers pertaining to the next generation of sequencers.We discuss the theoretical aspects of various genome assemblers,identifying their working schemes.We also discuss briefly the direction in which the area is headed towards along with discussing core issues on software simplicity. 相似文献
18.
Background: Next-generation sequencing (NGS) technologies have fostered an unprecedented proliferation of high-throughput sequencing projects and a concomitant development of novel algorithms for the assembly of short reads. However, numerous technical or computational challenges in de novo assembly still remain, although many new ideas and solutions have been suggested to tackle the challenges in both experimental and computational settings.Results: In this review, we first briefly introduce some of the major challenges faced by NGS sequence assembly. Then, we analyze the characteristics of various sequencing platforms and their impact on assembly results. After that, we classify de novo assemblers according to their frameworks (overlap graph-based, de Bruijn graph-based and string graph-based), and introduce the characteristics of each assembly tool and their adaptation scene. Next, we introduce in detail the solutions to the main challenges of de novo assembly of next generation sequencing data, single-cell sequencing data and single molecule sequencing data. At last, we discuss the application of SMS long reads in solving problems encountered in NGS assembly.Conclusions: This review not only gives an overview of the latest methods and developments in assembly algorithms, but also provides guidelines to determine the optimal assembly algorithm for a given input sequencing data type. 相似文献
19.
Lilian Pukk Freed Ahmad Shihab Hasan Veljo Kisand Riho Gross Anti Vasemgi 《Molecular ecology resources》2015,15(5):1145-1152
Massively parallel sequencing a small proportion of the whole genome at high coverage enables answering a wide range of questions from molecular evolution and evolutionary biology to animal and plant breeding and forensics. In this study, we describe the development of restriction‐site associated DNA (RAD) sequencing approach for Ion Torrent PGM platform. Our protocol results in extreme genome complexity reduction using two rare‐cutting restriction enzymes and strict size selection of the library allowing sequencing of a relatively small number of genomic fragments with high sequencing depth. We applied this approach to a common freshwater fish species, the Eurasian perch (Perca fluviatilis L.), and generated over 2.2 MB of novel sequence data consisting of ~17 000 contigs, identified 1259 single nucleotide polymorphisms (SNPs). We also estimated genetic differentiation between the DNA pools from freshwater (Lake Peipus) and brackish water (the Baltic Sea) populations and identified SNPs with the strongest signal of differentiation that could be used for robust individual assignment in the future. This work represents an important step towards developing genomic resources and genetic tools for the Eurasian perch. We expect that our ddRAD sequencing protocol for semiconductor sequencing technology will be useful alternative for currently available RAD protocols. 相似文献