首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Crop plants not only have economic significance, but also comprise important botanical models for evolution and development. This is reflected by the recent increase in the percentage of publicly available sequence data that are derived from angiosperms. Further genome sequencing of the major crop plants will offer new learning opportunities, but their large, repetitive, and often polyploid genomes present challenges. Reduced-representation approaches - such as EST sequencing, methyl filtration and Cot-based cloning and sequencing - provide increased efficiency in extracting key information from crop genomes without full-genome sequencing. Combining these methods with phylogenetically stratified sampling to allow comparative genomic approaches has the potential to further accelerate progress in angiosperm genomics.  相似文献   

2.
The question of where retroviral DNA becomes integrated in chromosomes is important for understanding (i) the mechanisms of viral growth, (ii) devising new anti-retroviral therapy, (iii) understanding how genomes evolve, and (iv) developing safer methods for gene therapy. With the completion of genome sequences for many organisms, it has become possible to study integration targeting by cloning and sequencing large numbers of host–virus DNA junctions, then mapping the host DNA segments back onto the genomic sequence. This allows statistical analysis of the distribution of integration sites relative to the myriad types of genomic features that are also being mapped onto the sequence scaffold. Here we present methods for recovering and analyzing integration site sequences.  相似文献   

3.
The success in complete sequencing of "small" genomes and development of new technologies which sharply accelerate processes of cloning and sequencing made real an intensive development of plant genomics and complete sequencing of DNA of some species. It is assumed that the success in plant genomics will result in revolutionary changes in biotechnology and plant breeding. However, the enormous size of genomes (tens of billions bp), their extraordinary enrichment in repetitive sequences, and allopolyploidy (the presence in a nucleus of several related but not identical genomes) force us to think that only few "basic" will undergo complete sequencing, whereas the genome investigations in other species will follow principles of comparative genomics. By the present time, complete sequencing of the Arabidopsis genome (125 Mbp) is completed and that of the rice genome (about 430 Mbp) is close to its end. Studying other plant genomes, including those economically valuable, already began on the basis of these investigations. Peculiarities of plant genomes make extraordinarily important the knowledge on plant chromosomes which, in its turn, requires expansion of investigations in this direction and development of new chromosome technologies, including the DNA-sparing methods of high-resolution banding.  相似文献   

4.
细菌人工染色体文库的构建及应用   总被引:1,自引:0,他引:1  
细菌人工染色体(BAC)是第二代大片段DNA的克隆载体系统,具有容量大、嵌合率低、遗传特性稳定、转化效率高、插入片段易回收、操作简便等优点,因而被广泛应用于基因组较大的真核生物基因组研究中,并发挥着前所未有的重要作用。本文综述了BAC的发展,利用此载体构建基因组文库的程序和鉴定方法,及其在物理图谱构建、图位克隆、基因组测序、转基因技术等研究中的应用。  相似文献   

5.
Plant, and particularly cereal genomes, are challenging to sequence due to their large size and high repetitive DNA content. Gene-enrichment strategies are alternative or complementary approaches to complete genome sequencing that yield, rapidly and inexpensively, useful sequence data from large and complex genomes. The maize genome is large (2.7 Gbp) and contains large amounts of conserved repetitive elements. Furthermore, the high allelic diversity found between maize inbred lines may necessitate sequencing several inbred lines in order to recover the maize "gene pool". Two gene-enrichment approaches, methylation filtration (MF) and high C(o)t (HC) sequencing have been tested in maize and their ability to sample the gene space has been examined. Combined with other genomic sequencing strategies, gene-enriched genomic sequencing is a practical way to examine the maize gene pool, to order and orient the genic sequences on the genome, and to enable investigation of gene content of other complex plant genomes.  相似文献   

6.
Xu J  Fonseca DM 《Mitochondrial DNA》2011,22(5-6):155-158
Repetitive DNA sequences not only exist abundantly in eukaryotic nuclear genomes, but also occur as tandem repeats in many animal mitochondrial DNA (mtDNA) control regions. Due to concerted evolution, these repetitive sequences are highly similar or even identical within a genome. When long repetitive regions are the targets of amplification for the purpose of sequencing, multiple amplicons may result if one primer has to be located inside the repeats. Here, we show that, without separating these amplicons by gel purification or cloning, directly sequencing the mitochondrial repeats with the primer outside repetitive region is feasible and efficient. We exemplify it by sequencing the mtDNA control region of the mosquito Aedes albopictus, which harbors typical large tandem DNA repeats. This one-way sequencing strategy is optimal for population surveys.  相似文献   

7.
Next-generation DNA sequencing   总被引:17,自引:0,他引:17  
Shendure J  Ji H 《Nature biotechnology》2008,26(10):1135-1145
  相似文献   

8.
Genome analysis in many plant species is hampered by large genome size and by sequence redundancy due to the presence of repetitive DNA and polyploidy. One solution is to reduce the sample complexity by dissecting the genomes to single chromosomes. This can be realized by flow cytometric sorting, which enables purification of chromosomes in large numbers. Coupling the chromosome sorting technology with next generation sequencing provides a targeted and cost effective way to tackle complex genomes. The methods outlined in this article describe a procedure for preparation of chromosomal DNA suitable for next-generation sequencing.  相似文献   

9.
Sequencing of megabase plus DNA by hybridization: theory of the method   总被引:42,自引:0,他引:42  
A mismatch-free hybridization of oligonucleotides containing from 11 to 20 monomers to unknown DNA represents, in essence, a sequencing of a complementary target. Realizing this, we have used probability calculations and, in part, computer simulations to estimate the types and numbers of oligonucleotides that would have to be synthesized in order to sequence a megabase plus segment of DNA. We estimate that 95,000 specific mixes of 11-mers, mainly of the 5'(A,T,C,G)(A,T,C,G)N8(A,T,C,G)3' type, hybridized consecutively to dot blots of cloned genomic DNA fragments would provide primary data for the sequence assembly. An optimal mixture of representative libraries in M13 vector, having inserts of (i) 7 kb, (ii) 0.5 kb genomic fragments randomly ligated in up to 10-kb inserts, and (iii) tandem "jumping" fragments 100 kb apart in the genome, will be needed. To sequence each million base pairs of DNA, one would need hybridization data from about 2100 separate hybridization sample dots. Inevitable gaps and uncertainties in alignment of sequenced fragments arising from nonrandom or repetitive sequence organization of complex genomes and difficulties in cloning "poisonous" sequences in Escherichia coli, inherent to large sequencing by any method, have been considered and minimized by choice of libraries and number of subclones used for hybridization. Because it is based on simpler biochemical procedures, our method is inherently easier to automate than existing sequencing methods. The sequence can be derived from simple primary data only by extensive computing. Phased experimental tests and computer simulations increasing in complexity are needed before accurate estimates can be made in terms of cost and speed of sequencing by the new approach. Nevertheless, sequencing by hybridization should show advantages over existing methods because of the inherent redundancy and parallelism in its data gathering.  相似文献   

10.
Bacterial artificial chromosomes (BACs) derived from genomes of large DNA viruses are powerful tools for functional delineation of viral genes. Current methods for cloning the genomes of large DNA viruses as BACs require prior knowledge of the viral sequences or the cloning of viral DNA fragments, and are tedious because of the laborious process of multiple plaque purifications, which is not feasible for some fastidious viruses. Here, we describe a novel method for cloning the genomes of large DNA viruses as BACs, which entails direct in vitro transposition of viral genomes with a BAC cassette, and subsequent recovery in Escherichia coli. Determination of insertion sites and adjacent viral sequences identify the BAC clones for genetic manipulation and functional characterization. Compared to existing methods, this new approach is highly efficient, and does not require any information on viral sequences or cloning of viral DNA fragments, and plaque purifications. This method could potentially be used for discovering previously unidentified viruses.  相似文献   

11.
12.
The discovery of novel viruses has often been accomplished by using hybridization-based methods that necessitate the availability of a previously characterized virus genome probe or knowledge of the viral nucleotide sequence to construct consensus or degenerate PCR primers. In their natural replication cycle, certain viruses employ a rolling-circle mechanism to propagate their circular genomes, and multiply primed rolling-circle amplification (RCA) with phi29 DNA polymerase has recently been applied in the amplification of circular plasmid vectors used in cloning. We employed an isothermal RCA protocol that uses random hexamer primers to amplify the complete genomes of papillomaviruses without the need for prior knowledge of their DNA sequences. We optimized this RCA technique with extracted human papillomavirus type 16 (HPV-16) DNA from W12 cells, using a real-time quantitative PCR assay to determine amplification efficiency, and obtained a 2.4 x 10(4)-fold increase in HPV-16 DNA concentration. We were able to clone the complete HPV-16 genome from this multiply primed RCA product. The optimized protocol was subsequently applied to a bovine fibropapillomatous wart tissue sample. Whereas no papillomavirus DNA could be detected by restriction enzyme digestion of the original sample, multiply primed RCA enabled us to obtain a sufficient amount of papillomavirus DNA for restriction enzyme analysis, cloning, and subsequent sequencing of a novel variant of bovine papillomavirus type 1. The multiply primed RCA method allows the discovery of previously unknown papillomaviruses, and possibly also other circular DNA viruses, without a priori sequence information.  相似文献   

13.
Recently, the nucleotide sequences of entire genomes became available. This information combined with older sequencing data discloses the exact chromosomal location of millions of nucleotide markers stored in the databases at NCBI, EMBO or DDBJ. Despite having resolved the intron/exon structures of all described genes within these genomes with a stroke of a pen, the sequencing data opens up other interesting possibilities. For example, the genomic mapping of the end sequences of the human, murine and rat BAC libraries generated at The Institute for Genomic Research (TIGR), reveals now the entire encompassed sequence of the inserts for more than a million of these clones. Since these clones are individually stored, they are now an invaluable source for experiments which depend on genomic DNA. Isolation of smaller fragments from such clones with standard methods is a time consuming process. We describe here a reliable one-step cloning technique to obtain a DNA fragment with a defined size and sequence from larger genomic clones in less than 48 hours using a standard vector with a multiple cloning site, and common restriction enzymes and equipment. The only prerequisites are the sequences of ends of the insert and of the underlying genome.  相似文献   

14.

Background  

DNA sequencing is used ubiquitously: from deciphering genomes[1] to determining the primary sequence of small RNAs (smRNAs) [25]. The cloning of smRNAs is currently the most conventional method to determine the actual sequence of these important regulators of gene expression. Typical smRNA cloning projects involve the sequencing of hundreds to thousands of smRNA clones that are delimited at their 5' and 3' ends by fixed sequence regions. These primers result from the biochemical protocol used to isolate and convert the smRNA into clonable PCR products. Recently we completed a smRNA cloning project involving tobacco plants, where analysis was required for ~700 smRNA sequences[6]. Finding no easily accessible research tool to enter and analyze smRNA sequences we developed Ebbie to assist us with our study.  相似文献   

15.
Next generation sequencing (NGS) is revolutionizing genomics and is providing novel insights into genome organization, evolution and function. The number of plant genomes targeted for sequencing is rising. For the moment, however, the acquisition of full genome sequences in large genome species remains difficult, largely because the short reads produced by NGS platforms are inadequate to cope with repeat-rich DNA, which forms a large part of these genomes. The problem of sequence redundancy is compounded in polyploids, which dominate the plant kingdom. An approach to overcoming some of these difficulties is to reduce the full nuclear genome to its individual chromosomes using flow-sorting. The DNA acquired in this way has proven to be suitable for many applications, including PCR-based physical mapping, in situ hybridization, forming DNA arrays, the development of DNA markers, the construction of BAC libraries and positional cloning. Coupling chromosome sorting with NGS offers opportunities for the study of genome organization at the single chromosomal level, for comparative analyses between related species and for the validation of whole genome assemblies. Apart from the primary aim of reducing the complexity of the template, taking a chromosome-based approach enables independent teams to work in parallel, each tasked with the analysis of a different chromosome(s). Given that the number of plant species tractable for chromosome sorting is increasing, the likelihood is that chromosome genomics – the marriage of cytology and genomics – will make a significant contribution to the field of plant genetics.  相似文献   

16.
Common wheat (Triticum aestivum L., 2n = 6x = 42) is a polyploid species possessing one of the largest genomes among the cultivated crops (1C is approximately 17 000 Mb). The presence of three homoeologous genomes (A, B and D), and the prevalence of repetitive DNA make sequencing the wheat genome a daunting task. We have developed a novel 'chromosome arm-based' strategy for wheat genome sequencing to simplify this task; this relies on sub-genomic libraries of large DNA inserts. In this paper, we used a di-telosomic line of wheat to isolate six million copies of the short arm of chromosome 1B (1BS) by flow sorting. Chromosomal DNA was partially digested with HindIII and used to construct an arm-specific BAC library. The library consists of 65 280 clones with an average insert size of 82 kb. Almost half of the library (45%) has inserts larger than 100 kb, while 18% of the inserts range in size between 75 and 100 kb, and 37% are shorter than 75 kb. We estimated the chromosome arm coverage to be 14.5-fold, giving a 99.9% probability of identifying a clone corresponding to any sequence on the short arm of 1B. Each chromosome arm in wheat can be flow sorted from an appropriate cytogenetic stock, and we envisage that the availability of chromosome arm-specific BAC resources in wheat will greatly facilitate the development of ready-to-sequence physical maps and map-based gene cloning.  相似文献   

17.
Comparative analysis is one of the most powerful methods available for understanding the diverse and complex systems found in biology, but it is often limited by a lack of comprehensive taxonomic sampling. Despite the recent development of powerful genome technologies capable of producing sequence data in large quantities (witness the recently completed first draft of the human genome), there has been relatively little change in how evolutionary studies are conducted. The application of genomic methods to evolutionary biology is a challenge, in part because gene segments from different organisms are manipulated separately, requiring individual purification, cloning, and sequencing. We suggest that a feasible approach to collecting genome-scale data sets for evolutionary biology (i.e., evolutionary genomics) may consist of combination of DNA samples prior to cloning and sequencing, followed by computational reconstruction of the original sequences. This approach will allow the full benefit of automated protocols developed by genome projects to be realized; taxon sampling levels can easily increase to thousands for targeted genomes and genomic regions. Sequence diversity at this level will dramatically improve the quality and accuracy of phylogenetic inference, as well as the accuracy and resolution of comparative evolutionary studies. In particular, it will be possible to make accurate estimates of normal evolution in the context of constant structural and functional constraints (i.e., site-specific substitution probabilities), along with accurate estimates of changes in evolutionary patterns, including pairwise coevolution between sites, adaptive bursts, and changes in selective constraints. These estimates can then be used to understand and predict the effects of protein structure and function on sequence evolution and to predict unknown details of protein structure, function, and functional divergence. In order to demonstrate the practicality of these ideas and the potential benefit for functional genomic analysis, we describe a pilot project we are conducting to simultaneously sequence large numbers of vertebrate mitochondrial genomes.  相似文献   

18.
Bulked segregant analysis (BSA) is used to identify existing or induced variants that are linked to phenotypes. Although it is widely used in Arabidopsis and rice, it remains challenging for crops with large genomes, such as maize. Moreover, analysis of huge data sets can present a bottleneck linking phenotypes to their molecular basis, especially for geneticists without programming experience. Here, we identified two genes of maize defective kernel mutants with newly developed analysis pipelines that require no programing skills and should be applicable to any large genome. In the 1970s, Neuffer and Sheridan generated a chemically induced defective kernel (dek) mutant collection with the potential to uncover critical genes for seed development. To locate such mutations, the dek phenotypes were introgressed into two inbred lines to take advantage of maize haplotype variations and their sequenced genomes. We generated two pipelines that take fastq files derived from next‐generation (nextGen) paired‐end DNA and cDNA sequencing as input, call on several well established and freely available genomic analysis tools to call SNPs and INDELs, and generate lists of the most likely causal mutations together with variant index plots to locate the mutation to a specific sequence position on a chromosome. The pipelines were validated with a known strawberry mutation before cloning the dek mutants, thereby enabling phenotypic analysis of large genomes by next‐generation sequencing.  相似文献   

19.
The genomes of grasses are very different in terms of size, ploidy level and chromosome number. Despite these significant differences, it was found by comparative mapping that the linear order (colinearity) of genetic markers and genes is very well conserved between different grass genomes. The potential of such conservation has been exploited in several directions, e.g. in defining rice as a model genome for grasses and in designing better strategies for positional cloning in large genomes. Recently, the development of large insert libraries in species such as maize, rice, barley and diploid wheat has allowed the study of large stretches of DNA sequence and has provided insight into gene organization in grasses. It was found that genes are not distributed randomly along the chromosomes and that there are clusters of high gene density in species with large genomes. Comparative analysis performed at the DNA sequence level has demonstrated that colinearity between the grass genomes is retained at the molecular level (microcolinearity) in most cases. However, detailed analysis has also revealed a number of exceptions to microcolinearity, which have given insight into mechanisms that are involved in grass-genome evolution. In some cases, the use of rice as a model to support gene isolation from other grass genomes will be complicated by local rearrangements. In this Botanical Briefing, we present recent progress and future prospects of comparative genomics in grasses.  相似文献   

20.
通过PCR技术从产蛋鹅输卵管基因组中扩增出1.2kb的鹅清蛋白基因5’端调控区,将其亚克隆入phD18-T载体的多克隆位点(标记为pOV),经酶切和测序鉴定:扩增产物只有3个碱基发生了突变,其TATA框、组织特异性因子和卵清蛋白上游启动子均未发生变异,表明鹅清蛋白基因5’端调控区可作为启动外源基因表达的调控序列。为构建其启动外源基因的质粒表达载体奠定了研究基础。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号