首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
DNA markers play important roles in plant breeding and genetics.The Insertion/Deletion(InDel) marker is one kind of co-dominant DNA markers widely used due to its low cost and high precision.However,the canonical way of searching for InDel markers is time-consuming and laborintensive.We developed an end-to-end computational solution(InDel Markers Development Platform,IMDP) to identify genome-wide InDel markers under a graphic pipeline environment.IMDP constitutes assembled genome sequences alignment pipeline(AGA-pipe) and next-generation resequencing data mapping pipeline(NGS-pipe).With AGA-pipe we are able to identify 12,944 markers between the genome of rice cultivars Nipponbare and 93-11.Using NGS-pipe,we reported 34,794 InDels from re-sequencing data of rice cultivars Wu-Yun-Geng7 and Guang-Lu-Ai4.Combining AGApipe and NGS-pipe,we developed 205,659 InDels in eight japonica and nine indica cultivars and 2,681 InDels showed a subgroup-specific pattern.Polymerase chain reaction(PCR)analysis of subgroup-specific markers indicated that the precision reached 90%(86 of 95).Finally,to make them available to the public,we have integrated the InDels/markers information into a website(Rice InDel Marker Database,RIMD,http://202.120.45.71/).The application of IMDP in rice will facilitate efficiency for development of genome-wide InDel markers,in addition it can be used in other species with reference genome sequences and NGS data.  相似文献   

2.
This article reviews basic concepts,general applications,and the potential impact of next-generation sequencing(NGS)technologies on genomics,with particular reference to currently available and possible future platforms and bioinformatics.NGS technologies have demonstrated the capacity to sequence DNA at unprecedented speed,thereby enabling previously unimaginable scientific achievements and novel biological applications.But,the massive data produced by NGS also presents a significant challenge for data storage,analyses,and management solutions.Advanced bioinformatic tools are essential for the successful application of NGS technology.As evidenced throughout this review,NGS technologies will have a striking impact on genomic research and the entire biological field.With its ability to tackle the unsolved challenges unconquered by previous genomic technologies,NGS is likely to unravel the complexity of the human genome in terms of genetic variations,some of which may be confined to susceptible loci for some common human conditions.The impact of NGS technologies on genomics will be far reaching and likely change the field for years to come.  相似文献   

3.
4.

Background

CRISPR-Cas9 is a revolutionary genome editing technique that allows for efficient and directed alterations of the eukaryotic genome. This relatively new technology has already been used in a large number of ‘loss of function’ experiments in cultured cells. Despite its simplicity and efficiency, screening for mutated clones remains time-consuming, laborious and/or expensive.

Results

Here we report a high-throughput screening strategy that allows parallel screening of up to 96 clones, using next-generation sequencing. As a proof of principle, we used CRISPR-Cas9 to disrupt the coding sequence of the homeobox gene, Evx1 in mouse embryonic stem cells. We screened 67 CRISPR-Cas9 transfected clones simultaneously by next-generation sequencing on the Ion Torrent PGM. We were able to identify both homozygous and heterozygous Evx1 mutants, as well as mixed clones, which must be identified to maintain the integrity of subsequent experiments.

Conclusions

Our CRISPR-Cas9 screening strategy could be widely applied to screen for CRISPR-Cas9 mutants in a variety of contexts including the generation of mutant cell lines for in vitro research, the generation of transgenic organisms and for assessing the veracity of CRISPR-Cas9 homology directed repair. This technique is cost and time-effective, provides information on clonal heterogeneity and is adaptable for use on various sequencing platforms.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1002) contains supplementary material, which is available to authorized users.  相似文献   

5.
张阁  黄原 《生命科学》2010,(9):896-900
插入和缺失(insertion and deletion)是DNA和蛋白质在进化过程中发生的序列长度上的改变,由于缺乏祖先序列的信息,不能肯定其到底是插入事件还是缺失事件,故统称之为增减(indel)。indel是分子水平进化变异的主要来源之一,近年来对这种进化事件的研究已经涵盖了其发生频率、大小、分布模式、序列进化模型及应用等各个方面。该文总结了基因组水平上插入和缺失的研究进展和发生机制;介绍了已经提出的插入和缺失进化模型,包括TKF91、TKF92、"Long Indel"模型和序列环境模型;讨论了插入和缺失作为分子标记在分子进化、基因分型和药物设计等方面的应用。  相似文献   

6.
《遗传学报》2021,48(11):961-971
In plants, transposable element (TE)-triggered mutants are important resources for functional genomic studies. However, conventional approaches for genome-wide identification of TE insertion sites are costly and laborious. This study developed a novel, rapid, and high-throughput TE insertion site identification workflow based on next-generation sequencing and named it Transposable Element Amplicon Sequencing (TEAseq). Using TEAseq, we systemically profiled the Dissociation (Ds) insertion sites in 1606 independent Ds insertional mutants in advanced backcross generation using K17 as background. The Ac-containing individuals were excluded for getting rid of the potential somatic insertions. We characterized 35,696 germinal Ds insertions tagging 10,323 genes, representing approximately 23.3% of the total genes in the maize genome. The insertion sites were presented in chromosomal hotspots around the ancestral Ds loci, and insertions occurred preferentially in gene body regions. Furthermore, we mapped a loss-of-function AGL2 gene using bulked segregant RNA-sequencing assay and proved that AGL2 is essential for seed development. We additionally established an open-access database named MEILAM for easy access to Ds insertional mutations. Overall, our results have provided an efficient workflow for TE insertion identification and rich sequence-indexed mutant resources for maize functional genomic studies.  相似文献   

7.
The Chinese walnut (Juglans cathayensis L.), valued for both its nut and wood, is an ecologically important tree species endemic temperate southern China. Investigation of the genetic diversity of Chinese walnut has been limited to natural population genetics and genetic germplasm resources. Here, we describe the development of 12 polymorphic microsatellite markers using next-generation sequencing to screen 96 Chinese walnut individuals collected from 11 natural populations. The number of alleles per locus ranged from 5 to 12. The observed heterozygosity (0.288–0.748) overlapped well with the expected heterozygosity (0.337–0.751). This species has high genetic diversity and gene flow among different populations (FST = 0.075, Nm = 3.088). These markers will be useful for future studies on population genetic structure, evolutionary ecology, and genetic breeding of this walnut tree or other Juglans species.  相似文献   

8.
Animal models provide myriad benefits to both experimental and clinical research. Unfortunately, in many situations, they fall short of expected results or provide contradictory results. In part, this can be the result of traditional molecular biological approaches that are relatively inefficient in elucidating underlying molecular mechanism. To improve the efficacy of animal models, a technological breakthrough is required. The growing availability and application of the high-throughput methods make systematic comparisons between human and animal models easier to perform. In the present study, we introduce the concept of the comparative systems biology, which we define as "comparisons of biological systems in different states or species used to achieve an integrated understanding of life forms with all their characteristic complexity of interactions at multiple levels". Furthermore, we discuss the applications of RNA-seq and ChIP-seq technologies to comparative systems biology between human and animal models and assess the potential applications for this approach in the future studies.  相似文献   

9.

Background

Cancer immunotherapy has recently entered a remarkable renaissance phase with the approval of several agents for treatment. Cancer treatment platforms have demonstrated profound tumor regressions including complete cure in patients with metastatic cancer. Moreover, technological advances in next-generation sequencing (NGS) as well as the development of devices for scanning whole-slide bioimages from tissue sections and image analysis software for quantitation of tumor-infiltrating lymphocytes (TILs) allow, for the first time, the development of personalized cancer immunotherapies that target patient specific mutations. However, there is currently no bioinformatics solution that supports the integration of these heterogeneous datasets.

Results

We have developed a bioinformatics platform – Personalized Oncology Suite (POS) – that integrates clinical data, NGS data and whole-slide bioimages from tissue sections. POS is a web-based platform that is scalable, flexible and expandable. The underlying database is based on a data warehouse schema, which is used to integrate information from different sources. POS stores clinical data, genomic data (SNPs and INDELs identified from NGS analysis), and scanned whole-slide images. It features a genome browser as well as access to several instances of the bioimage management application Bisque. POS provides different visualization techniques and offers sophisticated upload and download possibilities. The modular architecture of POS allows the community to easily modify and extend the application.

Conclusions

The web-based integration of clinical, NGS, and imaging data represents a valuable resource for clinical researchers and future application in medical oncology. POS can be used not only in the context of cancer immunology but also in other studies in which NGS data and images of tissue sections are generated. The application is open-source and can be downloaded at http://www.icbi.at/POS.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-306) contains supplementary material, which is available to authorized users.  相似文献   

10.
Little is known about variation of nucleotide insertion/deletions (indels) within species. In Arabidopsis thaliana, we investigated indel polymorphism patterns between two genome sequences and among 96 accessions at 1215 loci. Our study identified patterns in the variation of indel density, size, GC content and distribution, and a correlation between indels and substitutions. We found that the GC content in indel sequences was lower than that in non-indel sequences and that indels typically occur in regions with lower GC content. Patterns of indel frequency distribution among populations were more consistent with neutral expectation than substitution patterns. We also found that the local level of substitutions is positively correlated with indel density and negatively correlated with their distance to the closed indel, suggesting that indels play an important role in nucleotide variation.  相似文献   

11.
The application of next-generation sequencing (NGS) technology in cancer is influenced by the quality and purity of tissue samples. This issue is especially critical for patient-derived xenograft (PDX) models, which have proven to be by far the best preclinical tool for investigating human tumor biology, because the sensitivity and specificity of NGS analysis in xenograft samples would be compromised by the contamination of mouse DNA and RNA. This definitely affects downstream analyses by causing inaccurate mutation calling and gene expression estimates. The reliability of NGS data analysis for cancer xenograft samples is therefore highly dependent on whether the sequencing reads derived from the xenograft could be distinguished from those originated from the host. That is, each sequence read needs to be accurately assigned to its original species. Here, we review currently available methodologies in this field, including Xenome, Disambiguate, bamcmp and pdxBlacklist, and provide guidelines for users.  相似文献   

12.

Background

Transposable elements constitute an important part of the genome and are essential in adaptive mechanisms. Transposition events associated with phenotypic changes occur naturally or are induced in insertional mutant populations. Transposon mutagenesis results in multiple random insertions and recovery of most/all the insertions is critical for forward genetics study. Using genome next-generation sequencing data and appropriate bioinformatics tool, it is plausible to accurately identify transposon insertion sites, which could provide candidate causal mutations for desired phenotypes for further functional validation.

Results

We developed a novel bioinformatics tool, ITIS (Identification of Transposon Insertion Sites), for localizing transposon insertion sites within a genome. It takes next-generation genome re-sequencing data (NGS data), transposon sequence, and reference genome sequence as input, and generates a list of highly reliable candidate insertion sites as well as zygosity information of each insertion. Using a simulated dataset and a case study based on an insertional mutant line from Medicago truncatula, we showed that ITIS performed better in terms of sensitivity and specificity than other similar algorithms such as RelocaTE, RetroSeq, TEMP and TIF. With the case study data, we demonstrated the efficiency of ITIS by validating the presence and zygosity of predicted insertion sites of the Tnt1 transposon within a complex plant system, M. truncatula.

Conclusion

This study showed that ITIS is a robust and powerful tool for forward genetic studies in identifying transposable element insertions causing phenotypes. ITIS is suitable in various systems such as cell culture, bacteria, yeast, insect, mammal and plant.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0507-2) contains supplementary material, which is available to authorized users.  相似文献   

13.
14.
Kong Y 《Genomics》2011,98(2):152-153
Btrim is a fast and lightweight software to trim adapters and low quality regions in reads from ultra high-throughput next-generation sequencing machines. It also can reliably identify barcodes and assign the reads to the original samples. Based on a modified Myers's bit-vector dynamic programming algorithm, Btrim can handle indels in adapters and barcodes. It removes low quality regions and trims off adapters at both or either end of the reads. A typical trimming of 30 M reads with two sets of adapter pairs can be done in about a minute with a small memory footprint. Btrim is a versatile stand-alone tool that can be used as the first step in virtually all next-generation sequence analysis pipelines. The program is available at http://graphics.med.yale.edu/trim/.  相似文献   

15.
To investigate the genetic diversity and population structure of Iksookimia koreensis, we characterized 11 novel polymorphic microsatellite markers developed using next-generation sequencing. The number of alleles per locus ranged from 4 to 10 (average = 6.26). Observed and expected heterozygosities ranged from 0.333 to 0.866 and from 0.375 to 0.866, respectively. No loci showed significant deviation from Hardy–Weinberg equilibrium (HWE). These loci were also used successfully to study the genetic diversity of a closely related species, Iksookimia longicorpa. Four of the 11 loci amplified in the two species showed different allele frequency and distribution, indicating deep genetic divergence between I. koreensis and I. longicorpa. The newly developed microsatellite markers reported here will provide a useful tool for examining gene flow, population genetic structure, and genetic diversity of these species.  相似文献   

16.
Helminths (worms) include parasitic nematodes (roundworms) and platyhelminths (flatworms). These worms are abundant, and many of them are of agricultural, aquacultural, veterinary and medical importance and cause substantial socioeconomic losses worldwide. The genetic characterization of parasitic nematodes using advanced molecular tools is central to the diagnosis of infections and the control of parasitism. The accurate analysis of genetic variation also underpins studies of their taxonomy, epidemiology and evolutionary history. Although the nuclear genome contains suitable genetic markers (e.g., in ribosomal DNA) for the identification of many species, the large size and high variability of the mt genome consistently provides a rich source of such markers for informative systematic and epidemiological studies both within and among species. There is significant value in establishing a practical platform for the rapid sequencing, annotation and analysis of mt genomic datasets to underpin such fundamental and applied studies of parasitic worms (= helminths). In the last decade, there have been some important advances in the mt genomics of helminths, but next-generation sequencing (NGS) technologies now provide opportunities for high throughput sequencing, assembly and annotation. In this article, we provide a background on mt genomics, cover technological challenges and recent advances, and provide a perspective on future mt genome research of parasitic helminths and its fundamental scientific and biotechnological implications.  相似文献   

17.
18.

Background

Targeting Induced Local Lesions IN Genomes (TILLING) is a reverse genetics approach to directly identify point mutations in specific genes of interest in genomic DNA from a large chemically mutagenized population. Classical TILLING processes, based on enzymatic detection of mutations in heteroduplex PCR amplicons, are slow and labor intensive.

Results

Here we describe a new TILLING strategy in zebrafish using direct next generation sequencing (NGS) of 250bp amplicons followed by Paired-End Low-Error (PELE) sequence analysis. By pooling a genomic DNA library made from over 9,000 N-ethyl-N-nitrosourea (ENU) mutagenized F1 fish into 32 equal pools of 288 fish, each with a unique Illumina barcode, we reduce the complexity of the template to a level at which we can detect mutations that occur in a single heterozygous fish in the entire library. MiSeq sequencing generates 250 base-pair overlapping paired-end reads, and PELE analysis aligns the overlapping sequences to each other and filters out any imperfect matches, thereby eliminating variants introduced during the sequencing process. We find that this filtering step reduces the number of false positive calls 50-fold without loss of true variant calls. After PELE we were able to validate 61.5% of the mutant calls that occurred at a frequency between 1 mutant call:100 wildtype calls and 1 mutant call:1000 wildtype calls in a pool of 288 fish. We then use high-resolution melt analysis to identify the single heterozygous mutation carrier in the 288-fish pool in which the mutation was identified.

Conclusions

Using this NGS-TILLING protocol we validated 28 nonsense or splice site mutations in 20 genes, at a two-fold higher efficiency than using traditional Cel1 screening. We conclude that this approach significantly increases screening efficiency and accuracy at reduced cost and can be applied in a wide range of organisms.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1263-4) contains supplementary material, which is available to authorized users.  相似文献   

19.

Background

Recent developments in deep (next-generation) sequencing technologies are significantly impacting medical research. The global analysis of protein coding regions in genomes of interest by whole exome sequencing is a widely used application. Many technologies for exome capture are commercially available; here we compare the performance of four of them: NimbleGen’s SeqCap EZ v3.0, Agilent’s SureSelect v4.0, Illumina’s TruSeq Exome, and Illumina’s Nextera Exome, all applied to the same human tumor DNA sample.

Results

Each capture technology was evaluated for its coverage of different exome databases, target coverage efficiency, GC bias, sensitivity in single nucleotide variant detection, sensitivity in small indel detection, and technical reproducibility. In general, all technologies performed well; however, our data demonstrated small, but consistent differences between the four capture technologies. Illumina technologies cover more bases in coding and untranslated regions. Furthermore, whereas most of the technologies provide reduced coverage in regions with low or high GC content, the Nextera technology tends to bias towards target regions with high GC content.

Conclusions

We show key differences in performance between the four technologies. Our data should help researchers who are planning exome sequencing to select appropriate exome capture technology for their particular application.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-449) contains supplementary material, which is available to authorized users.  相似文献   

20.

Background

Assembling genes from next-generation sequencing data is not only time consuming but computationally difficult, particularly for taxa without a closely related reference genome. Assembling even a draft genome using de novo approaches can take days, even on a powerful computer, and these assemblies typically require data from a variety of genomic libraries. Here we describe software that will alleviate these issues by rapidly assembling genes from distantly related taxa using a single library of paired-end reads: aTRAM, automated Target Restricted Assembly Method. The aTRAM pipeline uses a reference sequence, BLAST, and an iterative approach to target and locally assemble the genes of interest.

Results

Our results demonstrate that aTRAM rapidly assembles genes across distantly related taxa. In comparative tests with a closely related taxon, aTRAM assembled the same sequence as reference-based and de novo approaches taking on average < 1 min per gene. As a test case with divergent sequences, we assembled >1,000 genes from six taxa ranging from 25 – 110 million years divergent from the reference taxon. The gene recovery was between 97 – 99% from each taxon.

Conclusions

aTRAM can quickly assemble genes across distantly-related taxa, obviating the need for draft genome assembly of all taxa of interest. Because aTRAM uses a targeted approach, loci can be assembled in minutes depending on the size of the target. Our results suggest that this software will be useful in rapidly assembling genes for phylogenomic projects covering a wide taxonomic range, as well as other applications. The software is freely available http://www.github.com/juliema/aTRAM.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0515-2) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号