Similar literature
 20 similar documents found.
1.
Since the completion of the bovine sequencing projects, a substantial number of genetic variations such as single nucleotide polymorphisms have become available across the cattle genome. Recently, cataloguing such genetic variations has been accelerated using massively parallel sequencing technology. However, most of the recent studies have concentrated on European Bos taurus cattle breeds, resulting in a severe lack of knowledge about valuable native cattle genetic resources worldwide. Here, we present the first whole-genome sequencing results for an endangered Korean native cattle breed, Chikso, using the Illumina HiSeq 2000 sequencing platform. The genome of a Chikso bull was sequenced to approximately 25.3-fold coverage with 98.8% of the bovine reference genome sequence (UMD 3.1) covered. In total, 5,874,026 single nucleotide polymorphisms and 551,363 insertion/deletions were identified across all 29 autosomes and the X-chromosome, of which 45% and 75% were previously unknown, respectively. Most of the variations (92.7% of single nucleotide polymorphisms and 92.9% of insertion/deletions) were located in intergenic and intron regions. A total of 16,273 single nucleotide polymorphisms causing missense mutations were detected in 7,111 genes throughout the genome, which could potentially contribute to variation in economically important traits in Chikso. This study provides a valuable resource for further investigations of the genetic mechanisms underlying traits of interest in cattle, and for the development of improved genomics-based breeding tools.

2.
《遗传学报》2021,48(11):961-971
In plants, transposable element (TE)-triggered mutants are important resources for functional genomic studies. However, conventional approaches for genome-wide identification of TE insertion sites are costly and laborious. This study developed a novel, rapid, and high-throughput TE insertion site identification workflow based on next-generation sequencing and named it Transposable Element Amplicon Sequencing (TEAseq). Using TEAseq, we systematically profiled the Dissociation (Ds) insertion sites in 1606 independent Ds insertional mutants in advanced backcross generations with K17 as the background. Ac-containing individuals were excluded to eliminate potential somatic insertions. We characterized 35,696 germinal Ds insertions tagging 10,323 genes, representing approximately 23.3% of the total genes in the maize genome. The insertion sites were concentrated in chromosomal hotspots around the ancestral Ds loci, and insertions occurred preferentially in gene body regions. Furthermore, we mapped a loss-of-function AGL2 gene using a bulked segregant RNA-sequencing assay and proved that AGL2 is essential for seed development. We additionally established an open-access database named MEILAM for easy access to Ds insertional mutations. Overall, our results provide an efficient workflow for TE insertion identification and rich sequence-indexed mutant resources for maize functional genomic studies.

3.
《Biologicals》2014,42(1):34-41
Massively parallel sequencing (MPS)-based virus detection has potential regulatory applications. We studied the ability of one of these approaches, based on degenerate oligonucleotide primer (DOP)-polymerase chain reaction (PCR), to detect viral sequences in cell lines known to express viral genes or particles. DOP-PCR was highly sensitive for the detection of small quantities of isolated viral sequences. Detected viral sequences included nodavirus, bracovirus, and endogenous retroviruses in High Five cells, porcine circovirus type 1 and porcine endogenous retrovirus in PK15 cells, human T-cell leukemia virus 1 in MJ cells, human papillomavirus 18 in HeLa cells, human herpesvirus 8 in BCBL-1 cells, and Epstein–Barr virus in Raji cells. Illumina sequencing (for which primers were most efficiently added using PCR) provided greater sensitivity for virus detection than Roche 454 sequencing. Analyzing nucleic acids extracted both directly from samples and from capsid-enriched preparations provided useful information. Although these methods have limitations, the results indicate significant promise for the combination of nonspecific PCR and MPS in identifying contaminants in clinical and biological samples, including cell lines and reagents used to produce vaccines and therapeutic products.

4.
5.
Although pioneering sequencing projects have shed light on the boxer and poodle genomes, a number of challenges need to be met before the sequencing and annotation of the dog genome can be considered complete. Here, we present the DNA sequence of the Jindo dog genome, sequenced to 45-fold average coverage using Illumina massively parallel sequencing technology. A comparison of the sequence to the reference boxer genome led to the identification of 4 675 437 single nucleotide polymorphisms (SNPs, including 3 346 058 novel SNPs), 71 642 indels and 8131 structural variations. Of these, 339 non-synonymous SNPs and 3 indels are located within coding sequences (CDS). In particular, 3 non-synonymous SNPs and a 26-bp deletion occur in the TCOF1 locus, implying that the difference observed in craniofacial morphology between Jindo and boxer dogs might be influenced by those variations. Through the annotation of the Jindo olfactory receptor gene family, we found 2 unique olfactory receptor genes and 236 olfactory receptor genes harbouring non-synonymous homozygous SNPs that are likely to affect smelling capability. In addition, we determined the DNA sequence of the Jindo dog mitochondrial genome and identified Jindo dog-specific mtDNA genotypes. These Jindo genome data improve our understanding of dog genomic architecture and will be a very valuable resource for investigating not only dog genetics and genomics but also human and dog disease genetics and comparative genomics.

6.
7.
Flexibility and low cost make genotyping‐by‐sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI‐MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference‐free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.

8.
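Using samples from 20 Chinese Han individuals, this article established a stable and accurate cloning and sequencing method for the full-length sequences of the HLA-A and HLA-B genes, obtaining 4.2 kb sequences for 10 HLA-A alleles and 3.7 kb sequences for 6 HLA-B alleles. The sequences cover all exons, all introns, the 5′ promoter region and the 3′ untranslated region (3′UTR) of both genes. A*1153 is a novel allele identified in this study. The intron sequence of B*151101 and the 5′ promoter and 3′UTR sequences of 5 HLA-A and 2 HLA-B alleles are reported here for the first time internationally, and the remaining alleles all extend the full-length sequences released in the IMGT/HLA database. This study is the first to determine, in Chinese Han individuals, the polymorphism patterns of the upstream 5′ promoter and downstream 3′UTR regions of the HLA-A and HLA-B genes that are not covered by the IMGT/HLA database. In the extended 5′ promoter region of HLA-A, 26 SNPs and one 3-bp (AAA/-) insertion/deletion were found, and 14 SNPs were found in the extended 3′UTR region; in the extended 5′ promoter region of HLA-B, 5 SNPs and one 1-bp (T/-) insertion/deletion were found, and 8 SNPs were found in the extended 3′UTR region. Phylogenetic tree analysis of the 5′ promoters, exons and 3′UTRs of the two genes showed that the evolutionary relationship between the regulatory regions and the exons differs between the two genes: in HLA-A, with the exception of A*24020101, the flanking regulatory regions of the alleles are relatively tightly linked to the exons, whereas in HLA-B, recombination events have occurred relatively frequently between the flanking regulatory regions and the exons.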

9.
The advent of massively parallel sequencing of immunopurified chromatin and its determinants has provided new avenues for researchers to map epigenome-wide changes, and there is tremendous interest in uncovering regulatory signatures to answer fundamental questions about chromatin structure and function. Indeed, the rapid development of large genome annotation projects has seen a resurgence in chromatin immunoprecipitation (ChIP)-based protocols, which are used to distinguish protein interactions, coupled with large-scale sequencing (Seq) to precisely map epigenome-wide interactions. Despite the great advances in our understanding of chromatin-modifying complexes and their determinants, the development of ChIP-Seq technologies also poses specific demands on the integration of data for visualization, manipulation and analysis. In this article we discuss some of the considerations for experimental design planning, quality control, and bioinformatic analysis. The key aspects of post-sequencing analysis are the identification of regions of interest, differentiation between biological conditions and the characterization of sequence differences for chromatin modifications. We provide an overview of best-practice approaches with background information and considerations for integrative analysis of ChIP-Seq experiments.

10.
Although per-base sequencing costs have decreased during recent years, library preparation for targeted massively parallel sequencing remains constrained by high reagent cost, limited design flexibility, and protocol complexity. To address these limitations, we previously developed Hi-Plex, a polymerase chain reaction (PCR) massively parallel sequencing strategy for screening panels of genomic target regions. Here, we demonstrate that Hi-Plex applied with hybrid adapters can generate a library suitable for sequencing with both the Ion Torrent and the TruSeq chemistries and that adjusting primer concentrations improves coverage uniformity. These results expand Hi-Plex capabilities as an accurate, affordable, flexible, and rapid approach for various genetic screening applications.

11.
Coastal microbial mats are small-scale and largely closed ecosystems in which a plethora of different functional groups of microorganisms are responsible for the biogeochemical cycling of the elements. Coastal microbial mats play an important role in coastal protection and morphodynamics through stabilization of the sediments and by initiating the development of salt-marshes. Little is known about the bacterial and especially archaeal diversity and how it contributes to the ecological functioning of coastal microbial mats. Here, we analyzed three different types of coastal microbial mats that are located along a tidal gradient and can be characterized as marine (ST2), brackish (ST3) and freshwater (ST3) systems. The mats were sampled during three different seasons and subjected to massively parallel tag sequencing of the V6 region of the 16S rRNA genes of Bacteria and Archaea. Sequence analysis revealed that the mats are among the most diverse marine ecosystems studied so far and contain several novel taxa at levels ranging from class to species. The differences in diversity between the mat types were far more pronounced than the changes between seasons at one location. The archaeal community of these mats has not been studied before, and it showed a strong response to a short period of drought during summer, resulting in a massive increase in halobacterial sequences, whereas the bacterial community was barely affected. We concluded that the community composition and the microbial diversity were intrinsic to the mat type and depend on the location along the tidal gradient, indicating a relation with salinity.

12.
We show that single strand conformation polymorphism (SSCP) analysis, using the mutation detection enhancement (MDE™) matrix, is efficient at detecting sequence polymorphisms in PCR amplicons. Four independent wheat genomic fragments were amplified from two contrasting templates and sequenced. The allelic fragments differed at 1–6 single nucleotide positions, but MDE-SSCP was able to unequivocally distinguish each allelic pair. The approach is therefore considered a powerful way of identifying single nucleotide polymorphisms (SNPs) without extensive amplicon sequencing.

13.
Next-generation sequencing enables the generation of high-resolution, high-throughput data on the sequence structure of any genome at a rapidly declining cost. This opens opportunities for population-based genetic and genomic analyses. In many applications, whole-genome sequencing or re-sequencing is unnecessary or prohibited by budget limits. Reduced Representation Genome Sequencing (RRGS), which sequences only a small proportion of the genome of interest, has been proposed to deal with these situations. Several forms of RRGS have been proposed and implemented in the literature. When applied to plant or crop species, the current RRGS protocols share a key drawback: a high proportion (up to 60%) of the sequence reads generated may not be of genomic origin but instead derive from chloroplast DNA or rRNA genes, leaving the sequencing experiment with exceptionally low efficiency. Here we recommend and discuss the design of optimized simplified genomic DNA and bisulfite sequencing strategies, which may greatly improve the efficiency of sequencing experiments by reducing the share of undesirable reads to less than 10% of all sequence reads. The optimized RAD-seq and RRBS-seq methods are potentially useful for sequence variant screening and genotyping in large plant/crop populations.

14.
Since 2002, blooms of Alexandrium catenella sensu Fraga et al. (2015) and paralytic shellfish toxicity events have occurred almost yearly in Osaka Bay, Japan. To better understand the triggers for recurring A. catenella blooms in Osaka Bay, the phytoplankton community was monitored during the spring seasons of 2012–2015. Monitoring was performed using a massively parallel sequencing (MPS)-based technique on amplicon sequences of the 18S rRNA gene. Dense blooms of A. catenella occurred every year except in 2012; however, there was no significant correlation with the environmental parameters investigated. Plankton community diversity decreased before and in the middle of the A. catenella blooms, suggesting that the decline in diversity could be an indicator of bloom occurrence. The yearly abundance pattern of A. catenella cells obtained by morphology-based counting coincided with the relative sequence abundances, which supports the effectiveness of MPS-based phytoplankton monitoring.

15.
A new approach for optically sequencing ensembles of single DNA molecules using DNA polymerase to mediate the consecutive incorporation of fluorochrome-labeled nucleotides into an array of large single DNA molecules is presented. The approach utilizes cycles of labeled fluorochrome addition, detection to count incorporations, and bleaching to reset the counter. These additions are imaged and analyzed to estimate the number of labeled additions and to correlate them on a per-locus basis along DNA backbones. Initial studies used precisely labeled polymerase chain reaction products to aid the development and validation of simple models of fluorochrome point spread functions within the imaging system. In complementary studies, nucleotides labeled with the fluorochrome R110 were incorporated into surface-elongated lambda DNA, and fluorescent signals corresponding to the addition of R110-dUTP were counted and assigned precise loci along DNA backbones. The labeled DNAs were then subjected to photobleaching and to a second cycle of addition of R110-labeled nucleotides; this second round of additions was correlated with the first to establish strings of addition histories among the ensemble of largely double-stranded templates. These results confirm the basic operational validity of this approach and point the way to the development of a practical system for optical sequencing.

16.
The informational content of RNA sequencing is currently far from being completely explored. Most of the analyses focus on processing tables of counts or finding isoform deconvolution via exon junctions. This article presents a comparison of several techniques that can be used to estimate differential expression of exons or small genomic regions of expression, based on their coverage function shapes. The problem is defined as finding the differentially expressed exons between two samples using local expression profile normalization and statistical measures to spot the differences between two profile shapes. Initial experiments have been done using synthetic data, and real data modified with synthetically created differential patterns. Then, 160 pipelines (5 types of generator × 4 normalizations × 8 difference measures) are compared. As a result, the best analysis pipelines are selected based on linearity of the differential expression estimation and the area under the ROC curve. These platform-independent techniques have been implemented in the Bioconductor package rnaSeqMap. They point out the exons with differential expression or internal splicing, even if the counts of reads may not show this. The areas of application include significant difference searches, splicing identification algorithms and finding suitable regions for QPCR primers.
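The idea of comparing coverage profile shapes, as opposed to read counts, can be illustrated with a minimal sketch. It assumes one simple normalization (scaling each profile to unit sum) and one simple difference measure (the L1 distance); both are stand-ins chosen for illustration and are not claimed to be among the four normalizations or eight measures the abstract compares. The function names `normalize` and `shape_distance` are hypothetical and are not part of rnaSeqMap.

```python
def normalize(profile):
    """Scale a per-base coverage profile so its values sum to 1.

    Assumption: unit-sum scaling stands in for the abstract's
    'local expression profile normalization'.
    """
    total = sum(profile)
    if total == 0:
        return [0.0] * len(profile)
    return [value / total for value in profile]


def shape_distance(profile_a, profile_b):
    """L1 distance between two normalized coverage profiles.

    0.0 means identical shapes; 2.0 means the profiles do not overlap.
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same region")
    norm_a = normalize(profile_a)
    norm_b = normalize(profile_b)
    return sum(abs(a - b) for a, b in zip(norm_a, norm_b))


# Same shape at a tenfold different depth: the distance is zero,
# although a comparison of raw counts would flag a difference.
same_shape = shape_distance([1, 2, 1], [10, 20, 10])

# Coverage concentrated at opposite ends of the region.
opposite = shape_distance([4, 0], [0, 4])
```

Under these assumptions, two exons with equal total counts but differently distributed coverage still receive a nonzero distance, which is the kind of case the abstract says count tables may miss.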

17.
Bacterial assemblages from subsurface (100 m depth), meso- (200-1000 m depth) and bathy-pelagic (below 1000 m depth) zones at 10 stations along a North Atlantic Ocean transect from 60°N to 5°S were characterized using massively parallel pyrotag sequencing of the V6 region of the 16S rRNA gene (V6 pyrotags). In a dataset of more than 830,000 pyrotags, we identified 10,780 OTUs of which 52% were singletons. The singletons accounted for less than 2% of the OTU abundance, whereas the 100 and 1000 most abundant OTUs represented 80% and 96% respectively of all recovered OTUs. Non-metric Multi-Dimensional Scaling and Canonical Correspondence Analysis of all the OTUs excluding the singletons revealed a clear clustering of the bacterial communities according to the water masses. More than 80% of the 1000 most abundant OTUs corresponded to Proteobacteria of which 55% were Alphaproteobacteria, mostly composed of the SAR11 cluster. Gammaproteobacteria increased with depth and included a relatively large number of OTUs belonging to Alteromonadales and Oceanospirillales. The bathypelagic zone showed higher taxonomic evenness than the overlying waters, although bacterial diversity was remarkably variable. Both abundant and low-abundance OTUs were responsible for the distinct bacterial communities characterizing the major deep-water masses. Taken together, our results reveal that deep-water masses act as bio-oceanographic islands for bacterioplankton, leading to water mass-specific bacterial communities in the deep waters of the Atlantic.

18.
Massively parallel signature sequencing (MPSS) is one of the newest tools available for conducting in-depth expression profiling. MPSS is an open-ended platform that analyses the level of expression of virtually all genes in a sample by counting the number of individual mRNA molecules produced from each gene. There is no requirement that genes be identified and characterised prior to conducting an experiment. MPSS has a routine sensitivity at a level of a few molecules of mRNA per cell, and the datasets are in a digital format that simplifies the management and analysis of the data. Therefore, of the various microarray and non-microarray technologies currently available, MPSS provides many advantages for generating the type of complete datasets that will help to facilitate hypothesis-driven experiments in the era of digital biology.

19.
Accurate identification of sparse heterozygous single-nucleotide variants (SNVs) is a critical challenge for identifying the causative mutations in mouse genetic screens, human genetic diseases and cancer. Causal DNA variants that occur at such low rates are overwhelmed by false-positive calls that arise from a range of technical and biological sources. We describe a strategy using whole-exome capture, massively parallel DNA sequencing and computational analysis, which identifies with a low false-positive rate the majority of heterozygous and homozygous SNVs arising de novo with a frequency of one nucleotide substitution per megabase in progeny of N-ethyl-N-nitrosourea (ENU)-mutated C57BL/6j mice. We found that by applying a strategy of filtering raw SNV calls against known and platform-specific variants we could call true SNVs with a false-positive rate of 19.4 per cent and an estimated false-negative rate of 21.3 per cent. These error rates are small enough to enable calling a causative mutation from both homozygous and heterozygous candidate mutation lists with little or no further experimental validation. The efficacy of this approach is demonstrated by identifying the causative mutation in the Ptprc gene in a lymphocyte-deficient strain and in 11 other strains with immune disorders or obesity, without the need for meiotic mapping. Exome sequencing of first-generation mutant mice revealed hundreds of unphenotyped protein-changing mutations, 52 per cent of which are predicted to be deleterious, which now become available for breeding and experimental analysis. We show that exome sequencing data alone are sufficient to identify induced mutations. This approach transforms genetic screens in mice, establishes a general strategy for analysing rare DNA variants and opens up a large new source for experimental models of human disease.
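The filtering step described above, removing raw SNV calls that match known or platform-specific variants, can be sketched as a set difference. This is a minimal illustration under stated assumptions, not the authors' pipeline: the `Variant` record, the `filter_candidates` function and the example coordinates are all hypothetical, and a real workflow would read calls from variant files and apply further quality filters.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    """Hypothetical minimal variant record: position plus alleles."""
    chrom: str
    pos: int
    ref: str
    alt: str


def filter_candidates(raw_calls, known_variants, platform_artifacts):
    """Return raw calls found in neither exclusion list, in input order."""
    excluded = set(known_variants) | set(platform_artifacts)
    return [call for call in raw_calls if call not in excluded]


# Made-up example data, for illustration only.
raw = [
    Variant("1", 100, "A", "G"),
    Variant("1", 200, "C", "T"),
    Variant("2", 50, "G", "A"),
]
known = [Variant("1", 200, "C", "T")]      # e.g. a catalogued strain variant
artifacts = [Variant("2", 50, "G", "A")]   # e.g. a recurrent platform error

candidates = filter_candidates(raw, known, artifacts)
```

Matching on the full (chromosome, position, reference, alternate) tuple is a design choice of this sketch: a different substitution at an already catalogued position would still be kept as a candidate.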

20.