首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Use of sequencing approaches is an important aspect in the field of cancer genomics, where next‐generation sequencing has already been utilized for targeting oncogenes or tumour‐suppressor genes, that can be sequenced in a short time period. Alterations such as point mutations, insertions/deletions, copy number alterations, chromosomal rearrangements and epigenetic changes are encountered in cancer cell genomes, and application of various NGS technologies in cancer research will encounter such modifications. Rapid advancement in technology has led to exponential growth in the field of genomic analysis. The $1000 Genome Project (in which the goal is to sequence an entire human genome for $1000), and deep sequencing techniques (which have greater accuracy and provide a more complete analysis of the genome), are examples of rapid advancements in the field of cancer genomics. In this mini review, we explore sequencing techniques, correlating their importance in cancer therapy and treatment.  相似文献   

2.
The goal of human genome re-sequencing is obtaining an accurate assembly of an individual's genome. Recently, there has been great excitement in the development of many technologies for this (e.g. medium and short read sequencing from companies such as 454 and SOLiD, and high-density oligo-arrays from Affymetrix and NimbelGen), with even more expected to appear. The costs and sensitivities of these technologies differ considerably from each other. As an important goal of personal genomics is to reduce the cost of re-sequencing to an affordable point, it is worthwhile to consider optimally integrating technologies. Here, we build a simulation toolbox that will help us optimally combine different technologies for genome re-sequencing, especially in reconstructing large structural variants (SVs). SV reconstruction is considered the most challenging step in human genome re-sequencing. (It is sometimes even harder than de novo assembly of small genomes because of the duplications and repetitive sequences in the human genome.) To this end, we formulate canonical problems that are representative of issues in reconstruction and are of small enough scale to be computationally tractable and simulatable. Using semi-realistic simulations, we show how we can combine different technologies to optimally solve the assembly at low cost. With mapability maps, our simulations efficiently handle the inhomogeneous repeat-containing structure of the human genome and the computational complexity of practical assembly algorithms. They quantitatively show how combining different read lengths is more cost-effective than using one length, how an optimal mixed sequencing strategy for reconstructing large novel SVs usually also gives accurate detection of SNPs/indels, how paired-end reads can improve reconstruction efficiency, and how adding in arrays is more efficient than just sequencing for disentangling some complex SVs. Our strategy should facilitate the sequencing of human genomes at maximum accuracy and low cost.  相似文献   

3.
Applications of the polymerase chain reaction to genome analysis   总被引:2,自引:0,他引:2  
E A Rose 《FASEB journal》1991,5(1):46-54
The objectives of the Human Genome Project are to create high-resolution genetic and physical maps, and ultimately to determine the complete nucleotide sequence of the human genome. The result of this initiative will be to localize the estimated 50,000-100,000 human genes, and acquire information that will enable development of a better understanding of the relationship between genome structure and function. To achieve these goals, new methodologies that provide more rapid, efficient, and cost effective means of genomic analysis will be required. From both conceptual and practical perspectives, the polymerase chain reaction (PCR) represents a fundamental technology for genome mapping and sequencing. The availability of PCR has allowed definition of a technically credible form that the final composite map of the human genome will take, as described in the sequence-tagged site proposal. Moreover, applications of PCR have provided efficient approaches for identifying, isolating, mapping, and sequencing DNA, many of which are amenable to automation. The versatility and power provided by PCR have encouraged its involvement in almost every aspect of human genome research, with new applications of PCR being developed on a continual basis.  相似文献   

4.
Advances in sequencing technology   总被引:8,自引:0,他引:8  
Chan EY 《Mutation research》2005,573(1-2):13-40
Faster sequencing methods will undoubtedly lead to faster single nucleotide polymorphism (SNP) discovery. The Sanger method has served as the cornerstone for genome sequence production since 1977, close to almost 30 years of tremendous utility [Sanger, F., Nicklen, S., Coulson, A.R, DNA sequencing with chain-terminating inhibitors, Proc. Natl. Acad. Sci. U.S.A. 74 (1977) 5463-5467]. With the completion of the human genome sequence [Venter, J.C. et al., The sequence of the human genome, Science 291 (2001) 1304-1351; Lander, E.S. et al., Initial sequencing and analysis of the human genome, Nature 409 (2001) 860-921], there is now a focus on developing new sequencing methodologies that will enable "personal genomics", or the routine study of our individual genomes. Technologies that will lead us to this lofty goal are those that can provide improvements in three areas: read length, throughput, and cost. As progress is made in this field, large sections of genomes and then whole genomes of individuals will become increasingly more facile to sequence. SNP discovery efforts will be enhanced lock-step with these improvements. Here, the breadth of new sequencing approaches will be summarized including their status and prospects for enabling personal genomics.  相似文献   

5.
The emergence of third‐generation sequencing (3GS; long‐reads) is bringing closer the goal of chromosome‐size fragments in de novo genome assemblies. This allows the exploration of new and broader questions on genome evolution for a number of nonmodel organisms. However, long‐read technologies result in higher sequencing error rates and therefore impose an elevated cost of sufficient coverage to achieve high enough quality. In this context, hybrid assemblies, combining short‐reads and long‐reads, provide an alternative efficient and cost‐effective approach to generate de novo, chromosome‐level genome assemblies. The array of available software programs for hybrid genome assembly, sequence correction and manipulation are constantly being expanded and improved. This makes it difficult for nonexperts to find efficient, fast and tractable computational solutions for genome assembly, especially in the case of nonmodel organisms lacking a reference genome or one from a closely related species. In this study, we review and test the most recent pipelines for hybrid assemblies, comparing the model organism Drosophila melanogaster to a nonmodel cactophilic Drosophila, D. mojavensis. We show that it is possible to achieve excellent contiguity on this nonmodel organism using the dbg2olc pipeline.  相似文献   

6.
The study of nematode genomes over the last three decades has relied heavily on the model organism Caenorhabditis elegans, which remains the best-assembled and annotated metazoan genome. This is now changing as a rapidly expanding number of nematodes of medical and economic importance have been sequenced in recent years. The advent of sequencing technologies to achieve the equivalent of the $1000 human genome promises that every nematode genome of interest will eventually be sequenced at a reasonable cost. As the sequencing of species spanning the nematode phylum becomes a routine part of characterizing nematodes, the comparative approach and the increasing use of ecological context will help us to further understand the evolution and functional specializations of any given species by comparing its genome to that of other closely and more distantly related nematodes. We review the current state of nematode genomics and discuss some of the highlights that these genomes have revealed and the trend and benefits of ecological genomics, emphasizing the potential for new genomes and the exciting opportunities this provides for nematological studies.  相似文献   

7.
The goal of the Hungate1000 project is to generate a reference set of rumen microbial genome sequences. Toward this goal we have carried out a meta-analysis using information from culture collections, scientific literature, and the NCBI and RDP databases and linked this with a comparative study of several rumen 16S rRNA gene-based surveys. In this way we have attempted to capture a snapshot of rumen bacterial diversity to examine the culturable fraction of the rumen bacterial microbiome. Our analyses have revealed that for cultured rumen bacteria, there are many genera without a reference genome sequence. Our examination of culture-independent studies highlights that there are few novel but many uncultured taxa within the rumen bacterial microbiome. Taken together these results have allowed us to compile a list of cultured rumen isolates that are representative of abundant, novel and core bacterial species in the rumen. In addition, we have identified taxa, particularly within the phylum Bacteroidetes, where further cultivation efforts are clearly required.This information is being used to guide the isolation efforts and selection of bacteria from the rumen microbiota for sequencing through the Hungate1000.  相似文献   

8.
Next-generation sequencing technologies are revolutionizing the field of phylogenetics by making available genome scale data for a fraction of the cost of traditional targeted sequencing. One challenge will be to make use of these genomic level data without necessarily resorting to full-scale genome assembly and annotation, which is often time and labor intensive. Here we describe a technique, the Target Restricted Assembly Method (TRAM), in which the typical process of genome assembly and annotation is in essence reversed. Protein sequences of phylogenetically useful genes from a species within the group of interest are used as targets in tblastn searches of a data set from a lane of Illumina reads for a related species. Resulting blast hits are then assembled locally into contigs and these contigs are then aligned against the reference “cDNA” sequence to remove portions of the sequences that include introns. We illustrate the Target Restricted Assembly Method using genomic scale datasets for 20 species of lice (Insecta: Psocodea) to produce a test phylogenetic data set of 10 nuclear protein coding gene sequences. Given the advantages of using DNA instead of RNA, this technique is very cost effective and feasible given current technologies.  相似文献   

9.
While recently developed short-read sequencing technologies may dramatically reduce the sequencing cost and eventually achieve the $1000 goal for re-sequencing, their limitations prevent the de novo sequencing of eukaryotic genomes with the standard shotgun sequencing protocol. We present SHRAP (SHort Read Assembly Protocol), a sequencing protocol and assembly methodology that utilizes high-throughput short-read technologies. We describe a variation on hierarchical sequencing with two crucial differences: (1) we select a clone library from the genome randomly rather than as a tiling path and (2) we sample clones from the genome at high coverage and reads from the clones at low coverage. We assume that 200 bp read lengths with a 1% error rate and inexpensive random fragment cloning on whole mammalian genomes is feasible. Our assembly methodology is based on first ordering the clones and subsequently performing read assembly in three stages: (1) local assemblies of regions significantly smaller than a clone size, (2) clone-sized assemblies of the results of stage 1, and (3) chromosome-sized assemblies. By aggressively localizing the assembly problem during the first stage, our method succeeds in assembling short, unpaired reads sampled from repetitive genomes. We tested our assembler using simulated reads from D. melanogaster and human chromosomes 1, 11, and 21, and produced assemblies with large sets of contiguous sequence and a misassembly rate comparable to other draft assemblies. Tested on D. melanogaster and the entire human genome, our clone-ordering method produces accurate maps, thereby localizing fragment assembly and enabling the parallelization of the subsequent steps of our pipeline. Thus, we have demonstrated that truly inexpensive de novo sequencing of mammalian genomes will soon be possible with high-throughput, short-read technologies using our methodology.  相似文献   

10.
Anticipating the $1,000 genome   总被引:1,自引:0,他引:1  
A new generation of DNA-sequencing platforms will become commercially available over the next few years. These instruments will enable re-sequencing of human genomes at a previously unimagined throughput and low cost. Here, I examine why the $1,000 human genome is an important goal for research and clinical diagnostics, and what will be required to achieve it.  相似文献   

11.
A new generation of DNA-sequencing platforms will become commercially available over the next few years. These instruments will enable re-sequencing of human genomes at a previously unimagined throughput and low cost. Here, I examine why the 1,000 dollar human genome is an important goal for research and clinical diagnostics, and what will be required to achieve it.  相似文献   

12.
《动物学研究》2017,(6):449-458
Eukaryotic genome size data are important both as the basis for comparative research into genome evolution and as estimators of the cost and difficulty of genome sequencing programs for non-model organisms.In this study,the genome size of 14 species of fireflies (Lampyridae) (two genera in Lampyrinae,three genera in Luciolinae,and one genus in subfamily incertae sedis) were estimated by propidium iodide (PI)-based flow cytometry.The haploid genome sizes of Lampyridae ranged from 0.42 to 1.31 pg,a 3.1-fold span.Genome sizes of the fireflies varied within the tested subfamilies and genera.Lamprigera and Pyrocoelia species had large and small genome sizes,respectively.No correlation was found between genome size and morphological traits such as body length,body width,eye width,and antennal length.Our data provide additional information on genome size estimation of the firefly family Lampyridae.Furthermore,this study will help clarify the cost and difficulty of genome sequencing programs for non-model organisms and will help promote studies on firefly genome evolution.  相似文献   

13.
MegaBACETM 1000 DNA测序仪是由美国Pharmacia公司生产的一种高通量的DNA测序仪,利用这套测序仪,可以进行DNA测序、遗传分析等相关研究。但该公司生产的用于遗传分析的DNA标记物(marker)(ET550-R Size Standards)价格不菲,出于节省成本的考虑,自行开发了荧光标记物,经过验证,这个标记完全可以在MegaBACE 1000 DNA测序仪上使用。利用这套标记物和自己开发的标记物识别软件,构建了一套基于MegaBACE 1000 DNA测序仪的改良的、高通量的AFLP操作流程。  相似文献   

14.
Target sequence capture is an efficient technique to enrich specific genomic regions for high‐throughput sequencing in ecological and evolutionary studies. In recent years, many sequence capture approaches have been proposed, but most of them rely on commercial synthetic baits which make the experiment expensive. Here, we present a novel sequence capture approach called AFLP‐based genome sequence capture (AFLP Capture). This method uses the AFLP (amplified fragment length polymorphism) technique to generate homemade capture baits without the need for prior genome information, thus is applicable to any organisms. In this approach, biotinylated AFLP fragments representing a random fraction of the genome are used as baits to capture the homologous fragments from genomic shotgun sequencing libraries. In a trial study, by using AFLP Capture, we successfully obtained 511 orthologous loci (>700,000 bp in total length) from 11 Odorrana species and more than 100,000 single nucleotide polymorphisms (SNPs) in four analyzed individuals of an Odorrana species. This result shows that our method can be used to address questions of various evolutionary depths (from interspecies level to intraspecies level). We also discuss the flexibility in bait preparation and how the sequencing data are analyzed. In summary, AFLP Capture is a rapid and flexible tool and can significantly reduce the experimental cost for phylogenetic studies that require analyzing genome‐scale data (hundreds or thousands of loci).  相似文献   

15.
The increased numbers of genetic markers produced by genomic techniques have the potential to both identify hybrid individuals and localize chromosomal regions responding to selection and contributing to introgression. We used restriction-site-associated DNA sequencing to identify a dense set of candidate SNP loci with fixed allelic differences between introduced rainbow trout (Oncorhynchus mykiss) and native westslope cutthroat trout (Oncorhynchus clarkii lewisi). We distinguished candidate SNPs from homeologs (paralogs resulting from whole-genome duplication) by detecting excessively high observed heterozygosity and deviations from Hardy-Weinberg proportions. We identified 2923 candidate species-specific SNPs from a single Illumina sequencing lane containing 24 barcode-labelled individuals. Published sequence data and ongoing genome sequencing of rainbow trout will allow physical mapping of SNP loci for genome-wide scans and will also provide flanking sequence for design of qPCR-based TaqMan(?) assays for high-throughput, low-cost hybrid identification using a subset of 50-100 loci. This study demonstrates that it is now feasible to identify thousands of informative SNPs in nonmodel species quickly and at reasonable cost, even if no prior genomic information is available.  相似文献   

16.
The yeast genetics community has embraced genomic biology, and there is a general understanding that obtaining a full encyclopedia of functions of the approximately 6000 genes is a worthwhile goal. The yeast literature comprises over 40,000 research papers, and the number of yeast researchers exceeds the number of genes. There are mutated and tagged alleles for virtually every gene, and hundreds of high-throughput data sets and computational analyses have been described. Why, then, are there >1000 genes still listed as uncharacterized on the Saccharomyces Genome Database, 10 years after sequencing the genome of this powerful model organism? Examination of the currently uncharacterized gene set suggests that while some are small or newly discovered, the vast majority were evident from the initial genome sequence. Most are present in multiple genomics data sets, which may provide clues to function. In addition, roughly half contain recognizable protein domains, and many of these suggest specific metabolic activities. Notably, the uncharacterized gene set is highly enriched for genes whose only homologs are in other fungi. Achieving a full catalog of yeast gene functions may require a greater focus on the life of yeast outside the laboratory.  相似文献   

17.
Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. Here, we present SeXY, a novel sex‐identification pipeline, for very low‐coverage shotgun sequencing data from a single individual. SeXY was designed to utilize low‐effort screening data for sex identification and does not require a conspecific sex‐chromosome assembly as reference. We assess the accuracy of our pipeline to data quantity by downsampling sequencing data from 100,000 to 1000 mapped reads and to reference genome selection by mapping to a variety of reference genomes of various qualities and phylogenetic distance. We show that our method is 100% accurate when mapping to a high‐quality (highly contiguous N50 > 30 Mb) conspecific genome, even down to 1000 mapped reads. For lower‐quality reference assemblies (N50 < 30 Mb), our method is 100% accurate with 50,000 mapped reads, regardless of reference assembly quality or phylogenetic distance. The SeXY pipeline provides several advantages over previously implemented methods; SeXY (i) requires sequencing data from only a single individual, (ii) does not require assembled conspecific sex chromosomes, or even a conspecific reference assembly, (iii) takes into account variation in coverage across the genome, and (iv) is accurate with only 1000 mapped reads in many cases.  相似文献   

18.
Next generation sequencing (NGS) allows whole exome or whole genome sequencing for a given patient to be performed timely and at reasonable cost. This diagnostic quantum leap not only has various legal, ethical and economical aspects but will naturally also impact upon patient care. Currently, however, the wide-spread introduction of NGS into routine diagnostics is facing many obstacles. In particular, it is to be expected that NGS will identify a large number of rare variants in a given patient that are of (yet) unknown clinical significance. As a first step towards solving this problem, we introduce the concept of a database that will systematically integrate genotypic and phenotypic information from the German health care context. Not only will this resource be of great scientific value, but the database shall also provide human geneticists with the evidence base necessary for the reliable evaluation of their patient-related sequencing data.  相似文献   

19.
The genome of tomato (Solanum lycopersicum) is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, The Netherlands, France, Japan, Spain, Italy and the United States) as part of a larger initiative called the ‘International Solanaceae Genome Project (SOL): Systems Approach to Diversity and Adaptation’. The goal of this grassroots initiative, launched in November 2003, is to establish a network of information, resources and scientists to ultimately tackle two of the most significant questions in plant biology and agriculture: (1) How can a common set of genes/proteins give rise to a wide range of morphologically and ecologically distinct organisms that occupy our planet? (2) How can a deeper understanding of the genetic basis of plant diversity be harnessed to better meet the needs of society in an environmentally friendly and sustainable manner? The Solanaceae and closely related species such as coffee, which are included in the scope of the SOL project, are ideally suited to address both of these questions. The first step of the SOL project is to use an ordered BAC approach to generate a high quality sequence for the euchromatic portions of the tomato as a reference for the Solanaceae. Due to the high level of macro and micro-synteny in the Solanaceae the BAC-by-BAC tomato sequence will form the framework for shotgun sequencing of other species. The starting point for sequencing the genome is BACs anchored to the genetic map by overgo hybridization and AFLP technology. The overgos are derived from approximately 1500 markers from the tomato high density F2-2000 genetic map (http://sgn.cornell.edu/). These seed BACs will be used as anchors from which to radiate the tiling path using BAC end sequence data. Annotation will be performed according to SOL project guidelines. All the information generated under the SOL umbrella will be made available in a comprehensive website. The information will be interlinked with the ultimate goal that the comparative biology of the Solanaceae—and beyond—achieves a context that will facilitate a systems biology approach.  相似文献   

20.

Background  

The decrease in cost for sequencing and improvement in technologies has made it easier and more common for the re-sequencing of large genomes as well as parallel sequencing of small genomes. It is possible to completely sequence a small genome within days and this increases the number of publicly available genomes. Among the types of genomes being rapidly sequenced are those of microbial and viral genomes responsible for infectious diseases. However, accurate gene prediction is a challenge that persists for decoding a newly sequenced genome. Therefore, accurate and efficient gene prediction programs are highly desired for rapid and cost effective surveillance of RNA viruses through full genome sequencing.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号