期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Detecting structural variations in the human genome using next generation sequencing

Xi R Kim TM Park PJ 《Briefings in functional genomics》2010,9(5-6):405-415

Structural variations are widespread in the human genome and can serve as genetic markers in clinical and evolutionary studies. With the advances in the next-generation sequencing technology, recent methods allow for identification of structural variations with unprecedented resolution and accuracy. They also provide opportunities to discover variants that could not be detected on conventional microarray-based platforms, such as dosage-invariant chromosomal translocations and inversions. In this review, we will describe some of the sequencing-based algorithms for detection of structural variations and discuss the key issues in future development. 相似文献

2.

Assessment of metagenomic assembly using simulated next generation sequencing data

Mende DR Waller AS Sunagawa S Järvelin AI Chan MM Arumugam M Raes J Bork P 《PloS one》2012,7(2):e31386

Due to the complexity of the protocols and a limited knowledge of the nature of microbial communities, simulating metagenomic sequences plays an important role in testing the performance of existing tools and data analysis methods with metagenomic data. We developed metagenomic read simulators with platform-specific (Sanger, pyrosequencing, Illumina) base-error models, and simulated metagenomes of differing community complexities. We first evaluated the effect of rigorous quality control on Illumina data. Although quality filtering removed a large proportion of the data, it greatly improved the accuracy and contig lengths of resulting assemblies. We then compared the quality-trimmed Illumina assemblies to those from Sanger and pyrosequencing. For the simple community (10 genomes) all sequencing technologies assembled a similar amount and accurately represented the expected functional composition. For the more complex community (100 genomes) Illumina produced the best assemblies and more correctly resembled the expected functional composition. For the most complex community (400 genomes) there was very little assembly of reads from any sequencing technology. However, due to the longer read length the Sanger reads still represented the overall functional composition reasonably well. We further examined the effect of scaffolding of contigs using paired-end Illumina reads. It dramatically increased contig lengths of the simple community and yielded minor improvements to the more complex communities. Although the increase in contig length was accompanied by increased chimericity, it resulted in more complete genes and a better characterization of the functional repertoire. The metagenomic simulators developed for this research are freely available. 相似文献

3.

Understanding genetic variation and function- the applications of next generation sequencing

Harrison RJ 《Seminars in cell & developmental biology》2012,23(2):230-236

Next generation sequencing (NGS) technology has had a transformatory effect upon population-level studies linking genetic variation to gene function. In this review, I briefly describe recent studies that have used top-down genome scanning and population genetic approaches to identify loci under recent selection, as well as some examples of how large NGS datasets can be deployed to detect the total level of deleterious, neutral and advantageous variation present in standing genetic variation. I then explore studies that have used some of these approaches to study gene function along with advances in sequencing populations under selection, QTL mapping techniques and emerging methodologies utilising targeted capture and NGS. 相似文献

4.

Single nucleotide polymorphism analysis of Korean native chickens using next generation sequencing data

Dong-Won Seo Jae-Don Oh Shil Jin Ki-Duk Song Hee-Bok Park Kang-Nyeong Heo Younhee Shin Myunghee Jung Junhyung Park Cheorun Jo Hak-Kyo Lee Jun-Heon Lee 《Molecular biology reports》2015,42(2):471-477

相似文献

5.

An integrative framework for the identification of double minute chromosomes using next generation sequencing data

Matthew?Hayes Email author Jing?Li 《BMC genetics》2015,16(Z2):S1

Background

Double minute chromosomes are circular fragments of DNA whose presence is associated with the onset of certain cancers. Double minutes are lethal, as they are highly amplified and typically contain oncogenes. Locating double minutes can supplement the process of cancer diagnosis, and it can help to identify therapeutic targets. However, there is currently a dearth of computational methods available to identify double minutes. We propose a computational framework for the idenfication of double minute chromosomes using next-generation sequencing data. Our framework integrates predictions from algorithms that detect DNA copy number variants, and it also integrates predictions from algorithms that locate genomic structural variants. This information is used by a graph-based algorithm to predict the presence of double minute chromosomes.

Results

Using a previously published copy number variant algorithm and two structural variation prediction algorithms, we implemented our framework and tested it on a dataset consisting of simulated double minute chromosomes. Our approach uncovered double minutes with high accuracy, demonstrating its plausibility.

Conclusions

Although we only tested the framework with three programs (RDXplorer, BreakDancer, Delly), it can be extended to incorporate results from programs that 1) detect amplified copy number and from programs that 2) detect genomic structural variants like deletions, translocations, inversions, and tandem repeats.The software that implements the framework can be accessed here: https://github.com/mhayes20/DMFinder

相似文献

6.

Reporting protein identification data: the next generation of guidelines

Bradshaw RA Burlingame AL Carr S Aebersold R 《Molecular & cellular proteomics : MCP》2006,5(5):787-788

相似文献

7.

A novel method for the multiplexed target enrichment of MinION next generation sequencing libraries using PCR-generated baits

Timokratis Karamitros Gkikas Magiorkinis 《Nucleic acids research》2015,43(22):e152

The enrichment of targeted regions within complex next generation sequencing libraries commonly uses biotinylated baits to capture the desired sequences. This method results in high read coverage over the targets and their flanking regions. Oxford Nanopore Technologies recently released an USB3.0-interfaced sequencer, the MinION. To date no particular method for enriching MinION libraries has been standardized. Here, using biotinylated PCR-generated baits in a novel approach, we describe a simple and efficient way for multiplexed enrichment of MinION libraries, overcoming technical limitations related with the chemistry of the sequencing-adapters and the length of the DNA fragments. Using Phage Lambda and Escherichia coli as models we selectively enrich for specific targets, significantly increasing the corresponding read-coverage, eliminating unwanted regions. We show that by capturing genomic fragments, which contain the target sequences, we recover reads extending targeted regions and thus can be used for the determination of potentially unknown flanking sequences. By pooling enriched libraries derived from two distinct E. coli strains and analyzing them in parallel, we demonstrate the efficiency of this method in multiplexed format. Crucially we evaluated the optimal bait size for large fragment libraries and we describe for the first time a standardized method for target enrichment in MinION platform. 相似文献

8.

Analysis of the transcriptome of differentiating and non-differentiating preadipocytes from rats and humans by next generation sequencing

F Birzele S Fässler H Neubauer T Hildebrandt BS Hamilton 《Molecular and cellular biochemistry》2012,369(1-2):175-181

相似文献

9.

The next generation of microarray research: applications in evolutionary and ecological genomics 总被引：1，自引：0，他引：1

Shiu SH Borevitz JO 《Heredity》2008,100(2):141-149

Microarray technology is one of the key developments in recent years that has propelled biological research into the post-genomic era. With the ability to assay thousands to millions of features at the same time, microarray technology has fundamentally changed how biological questions are addressed, from examining one or a few genes to a collection of genes or the whole genome. This technology has much to offer in the study of genome evolution. After a brief introduction on the technology itself, we then focus on the use of microarrays to examine genome dynamics, to uncover novel functional elements in genomes, to unravel the evolution of regulatory networks, to identify genes important for behavioral and phenotypic plasticity, and to determine microbial community diversity in environmental samples. Although there are still practical issues in using microarrays, they will be alleviated by rapid advances in array technology and analysis methods, the availability of many genome sequences of closely related species and flexibility in array design. It is anticipated that the application of microarray technology will continue to better our understanding of evolution and ecology through the examination of individuals, populations, closely related species or whole microbial communities. 相似文献

10.

An insight into the salivary transcriptome and proteome of the soft tick and vector of epizootic bovine abortion, Ornithodoros coriaceus

Francischetti IM Meng Z Mans BJ Gudderra N Hall M Veenstra TD Pham VM Kotsyfakis M Ribeiro JM 《Journal of Proteomics》2008,71(5):493-512

相似文献

11.

An improved protocol for sequencing of repetitive genomic regions and structural variations using mutagenesis and next generation sequencing

B Sipos T Massingham AM Stütz N Goldman 《PloS one》2012,7(8):e43359

The rise of Next Generation Sequencing (NGS) technologies has transformed de novo genome sequencing into an accessible research tool, but obtaining high quality eukaryotic genome assemblies remains a challenge, mostly due to the abundance of repetitive elements. These also make it difficult to study nucleotide polymorphism in repetitive regions, including certain types of structural variations. One solution proposed for resolving such regions is Sequence Assembly aided by Mutagenesis (SAM), which relies on the fact that introducing enough random mutations breaks the repetitive structure, making assembly possible. Sequencing many different mutated copies permits the sequence of the repetitive region to be inferred by consensus methods. However, this approach relies on molecular cloning in order to isolate and amplify individual mutant copies, making it hard to scale-up the approach for use in conjunction with high-throughput sequencing technologies. To address this problem, we propose NG-SAM, a modified version of the SAM protocol that relies on PCR and dilution steps only, coupled to a NGS workflow. NG-SAM therefore has the potential to be scaled-up, e.g. using emerging microfluidics technologies. We built a realistic simulation pipeline to study the feasibility of NG-SAM, and our results suggest that under appropriate experimental conditions the approach might be successfully put into practice. Moreover, our simulations suggest that NG-SAM is capable of reconstructing robustly a wide range of potential target sequences of varying lengths and repetitive structures. 相似文献

12.

BubbleTree: an intuitive visualization to elucidate tumoral aneuploidy and clonality using next generation sequencing data

Wei Zhu Michael Kuziora Todd Creasy Zhongwu Lai Christopher Morehouse Xiang Guo Yinong Sebastian Dong Shen Jiaqi Huang Jonathan R. Dry Feng Xue Liyan Jiang Yihong Yao Brandon W. Higgs 《Nucleic acids research》2016,44(4):e38

Tumors are characterized by properties of genetic instability, heterogeneity, and significant oligoclonality. Elucidating this intratumoral heterogeneity is challenging but important. In this study, we propose a framework, BubbleTree, to characterize the tumor clonality using next generation sequencing (NGS) data. BubbleTree simultaneously elucidates the complexity of a tumor biopsy, estimating cancerous cell purity, tumor ploidy, allele-specific copy number, and clonality and represents this in an intuitive graph. We further developed a three-step heuristic method to automate the interpretation of the BubbleTree graph, using a divide-and-conquer strategy. In this study, we demonstrated the performance of BubbleTree with comparisons to similar commonly used tools such as THetA2, ABSOLUTE, AbsCN-seq and ASCAT, using both simulated and patient-derived data. BubbleTree outperformed these tools, particularly in identifying tumor subclonal populations and polyploidy. We further demonstrated BubbleTree''s utility in tracking clonality changes from patients’ primary to metastatic tumor and dating somatic single nucleotide and copy number variants along the tumor clonal evolution. Overall, the BubbleTree graph and corresponding model is a powerful approach to provide a comprehensive spectrum of the heterogeneous tumor karyotype in human tumors. BubbleTree is R-based and freely available to the research community (https://www.bioconductor.org/packages/release/bioc/html/BubbleTree.html). 相似文献

13.

CapR: revealing structural specificities of RNA-binding protein target recognition using CLIP-seq data

Tsukasa Fukunaga Haruka Ozaki Goro Terai Kiyoshi Asai Wataru Iwasaki Hisanori Kiryu 《Genome biology》2014,15(1):R16

RNA-binding proteins (RBPs) bind to their target RNA molecules by recognizing specific RNA sequences and structural contexts. The development of CLIP-seq and related protocols has made it possible to exhaustively identify RNA fragments that bind to RBPs. However, no efficient bioinformatics method exists to reveal the structural specificities of RBP–RNA interactions using these data. We present CapR, an efficient algorithm that calculates the probability that each RNA base position is located within each secondary structural context. Using CapR, we demonstrate that several RBPs bind to their target RNA molecules under specific structural contexts. CapR is available at https://sites.google.com/site/fukunagatsu/software/capr. 相似文献

14.

Erratum to: Single nucleotide polymorphism analysis of Korean native chickens using next generation sequencing data

Dong-Won Seo Jae-Don Oh Shil Jin Ki-Duk Song Hee-Bok Park Kang-Nyeong Heo Younhee Shin Myunghee Jung Junhyung Park Cheorun Jo Hak-Kyo Lee Jun-Heon Lee 《Molecular biology reports》2015,42(2):567-567

相似文献

15.

Rapid evaluation and quality control of next generation sequencing data with FaQCs

Chien-Chi Lo Patrick S G Chain 《BMC bioinformatics》2014,15(1)

Background

Next generation sequencing (NGS) technologies that parallelize the sequencing process and produce thousands to millions, or even hundreds of millions of sequences in a single sequencing run, have revolutionized genomic and genetic research. Because of the vagaries of any platform’s sequencing chemistry, the experimental processing, machine failure, and so on, the quality of sequencing reads is never perfect, and often declines as the read is extended. These errors invariably affect downstream analysis/application and should therefore be identified early on to mitigate any unforeseen effects.

Results

Here we present a novel FastQ Quality Control Software (FaQCs) that can rapidly process large volumes of data, and which improves upon previous solutions to monitor the quality and remove poor quality data from sequencing runs. Both the speed of processing and the memory footprint of storing all required information have been optimized via algorithmic and parallel processing solutions. The trimmed output compared side-by-side with the original data is part of the automated PDF output. We show how this tool can help data analysis by providing a few examples, including an increased percentage of reads recruited to references, improved single nucleotide polymorphism identification as well as de novo sequence assembly metrics.

Conclusion

FaQCs combines several features of currently available applications into a single, user-friendly process, and includes additional unique capabilities such as filtering the PhiX control sequences, conversion of FASTQ formats, and multi-threading. The original data and trimmed summaries are reported within a variety of graphics and reports, providing a simple way to do data quality control and assurance.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0366-2) contains supplementary material, which is available to authorized users. 相似文献

16.

Identification and analysis of mouse non-coding RNA using transcriptome data

Yuhui Zhao Wanfei Liu Jingyao Zeng Shoucheng Liu Xinyu Tan Hasanawad Aljohi Songnian Hu 《中国科学：生命科学英文版》2016,59(6):589-603

相似文献

17.

EXP-PAC: providing comparative analysis and storage of next generation gene expression data

Church PC Goscinski A Lefèvre C 《Genomics》2012,100(1):8-13

Microarrays and more recently RNA sequencing has led to an increase in available gene expression data. How to manage and store this data is becoming a key issue. In response we have developed EXP-PAC, a web based software package for storage, management and analysis of gene expression and sequence data. Unique to this package is SQL based querying of gene expression data sets, distributed normalization of raw gene expression data and analysis of gene expression data across experiments and species. This package has been populated with lactation data in the international milk genomic consortium web portal (http://milkgenomics.org/). Source code is also available which can be hosted on a Windows, Linux or Mac APACHE server connected to a private or public network (http://mamsap.it.deakin.edu.au/~pcc/Release/EXP_PAC.html). 相似文献

18.

Synthesis of atrial natriuretic peptides and studies on structural factors in tissue specificity 总被引：4，自引：0，他引：4

M Sugiyama H Fukumi R T Grammer K S Misono Y Yabe Y Morisawa T Inagami 《Biochemical and biophysical research communications》1984,123(1):338-344

Two atrial natriuretic peptides, containing 25 amino acid residues, ANF IV, and 21 amino acid residues, ANF V, were synthesized by a solid phase method and oxidized with K3Fe(CN)6 to form a disulfide bridge. Synthetic ANF IV exhibited a natriuretic activity with an ED50 70 times higher than that of synthetic ANF V, whereas the longer peptide was only 2.5 times more potent in chick rectal smooth muscle relaxant activity. Both peptides inhibited norepinephrine-induced contraction of rabbit aorta. The shorter peptide, ANF V, was 300 times less efficient than the longer peptide, ANF IV. It is proposed that the carboxy-terminal of ANF IV seems to have a modulating effect on receptor affinity in kidney and vascular tissue. 相似文献

19.

The structural basis for specificity of substrate and recruitment peptides for cyclin-dependent kinases

Brown NR Noble ME Endicott JA Johnson LN 《Nature cell biology》1999,1(7):438-443

Progression through the eukaryotic cell cycle is driven by the orderly activation of cyclin-dependent kinases (CDKs). For activity, CDKs require association with a cyclin and phosphorylation by a separate protein kinase at a conserved threonine residue (T160 in CDK2). Here we present the structure of a complex consisting of phosphorylated CDK2 and cyclin A together with an optimal peptide substrate, HHASPRK. This structure provides an explanation for the specificity of CDK2 towards the proline that follows the phosphorylatable serine of the substrate peptide, and the requirement for the basic residue in the P+3 position of the substrate. We also present the structure of phosphorylated CDK2 plus cyclin A3 in complex with residues 658-668 from the CDK2 substrate p107. These residues include the RXL motif required to target p107 to cyclins. This structure explains the specificity of the RXL motif for cyclins. 相似文献

20.

Analysis of the substrate specificity of tyrosylprotein sulfotransferase using synthetic peptides 总被引：4，自引：0，他引：4

C Niehrs M Kraft R W Lee W B Huttner 《The Journal of biological chemistry》1990,265(15):8525-8532

Tyrosylprotein sulfotransferase (TPST) catalyzes the sulfation of proteins at tyrosine residues. We have analyzed the substrate specificity of TPST from bovine adrenal medulla with a novel assay, using synthetic peptides as substrates. The peptides were modeled after the known, or putative, tyrosine sulfation sites of the cholecystokinin precursor, chromogranin B (secretogranin I) and vitronectin, as well as the tyrosine phosphorylation sites of alpha-tubulin and pp60src. Varying the sequence of these peptides, we found that (i) the apparent Km of peptides with multiple tyrosine sulfation sites decreased exponentially with the number of sites; (ii) acidic amino acids were the major determinant for tyrosine sulfation, acidic amino acids adjacent to the tyrosine being more important than distant ones; (iii) a carboxyl terminally located tyrosine residue may be sulfated. Moreover, TPST catalyzed the sulfation of a peptide corresponding to the tyrosine autophosphorylation site of pp60v-src (Tyr-416) but not of a peptide corresponding to the non-autophosphorylation site of pp60c-src (Tyr-527). These results experimentally define structural determinants for the substrate specificity of TPST and show that this enzyme and certain autophosphorylating tyrosine kinases have overlapping substrate specificities in vitro. 相似文献