Similar articles
20 similar articles found.
1.

Background

NGS data contain many machine-induced errors. The most advanced error-correction methods depend heavily on the selection of solid k-mers. A solid k-mer is a k-mer that occurs frequently in NGS reads; all other k-mers are called weak k-mers. A solid k-mer is unlikely to contain errors, while a weak k-mer most likely does. An intensively investigated problem is to find a good frequency cutoff f0 to balance the numbers of solid and weak k-mers. Once the cutoff is determined, a more challenging but less-studied problem is to (i) remove a small subset of solid k-mers that are likely to contain errors, and (ii) add a small subset of weak k-mers that are likely to contain no errors to the remaining set of solid k-mers. Identifying these two subsets of k-mers can improve correction performance.

Results

We propose using a Gamma distribution to model the frequencies of erroneous k-mers and a mixture of Gaussian distributions to model those of correct k-mers, and we combine the two to determine f0. To identify the two special subsets of k-mers, we use the z-score of a k-mer, which measures how many standard deviations the k-mer’s frequency lies from the mean. These statistically solid k-mers are then used to construct a Bloom filter for error correction. Tested on both real and synthetic NGS data sets, our method is markedly superior to state-of-the-art methods.

Conclusion

The z-score is adequate to distinguish solid k-mers from weak k-mers, and is particularly useful for pinpointing solid k-mers that have very low frequency. Applying the z-score to k-mers can markedly improve error-correction accuracy.
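
The idea of scoring k-mers by z-score can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say which population the mean and standard deviation are taken over, so the sketch assumes a k-mer is compared with its one-substitution neighbours observed in the reads. The function names, the cutoff `f0` as a plain integer, and the threshold `z_cut` are all hypothetical, and the Gamma/Gaussian fitting and the Bloom filter are left out.

```python
from collections import Counter
from statistics import mean, pstdev


def count_kmers(reads, k):
    """Count every substring of length k across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def neighbours(kmer, alphabet="ACGT"):
    """Yield every k-mer that differs from `kmer` by one substitution."""
    for i, base in enumerate(kmer):
        for other in alphabet:
            if other != base:
                yield kmer[:i] + other + kmer[i + 1:]


def local_zscore(kmer, counts):
    """z-score of a k-mer's frequency within its observed neighbourhood.

    Assumption: the reference population is the k-mer itself plus those
    one-substitution neighbours that actually occur in the reads.
    """
    group = [counts[kmer]] + [counts[n] for n in neighbours(kmer) if n in counts]
    if len(group) < 2:
        return 0.0  # no observed neighbours, nothing to compare against
    mu = mean(group)
    sigma = pstdev(group)
    if sigma == 0:
        return 0.0  # all frequencies equal
    return (counts[kmer] - mu) / sigma


def select_solid(counts, f0, z_cut):
    """Hypothetical refinement of a frequency cutoff with the z-score.

    A k-mer at or above f0 is kept unless its z-score is strongly negative;
    a k-mer below f0 is rescued if its z-score is strongly positive.
    """
    solid = set()
    for kmer, freq in counts.items():
        z = local_zscore(kmer, counts)
        if (freq >= f0 and z > -z_cut) or (freq < f0 and z >= z_cut):
            solid.add(kmer)
    return solid
```

In this sketch a low-frequency k-mer can still be selected when it clearly dominates its neighbourhood, which is the case the conclusion highlights; the thresholds would need to be tuned on real data.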

2.
3.
Two repeated DNA sequences isolated from a partial genomic DNA library of Helianthus annuus, pHaS13 and pHaS211, were shown to represent portions of the int gene of a Ty3/gypsy retroelement and of the RNase-H gene of a Ty1/copia retroelement, respectively. Southern blotting patterns obtained by hybridizing the two probes to BglII- or DraI-digested genomic DNA from different Helianthus species showed pHaS13 and pHaS211 were parts of dispersed repeats at least 8 and 7 kb in length, respectively, that were conserved in all species studied. Comparable hybridization patterns were obtained in all species with pHaS13. By contrast, the patterns obtained by hybridizing pHaS211 clearly differentiated annual species from perennials. The frequencies of pHaS13- and pHaS211-related sequences in different species were 4.3 × 10^4 to 1.3 × 10^5 copies and 9.9 × 10^2 to 8.1 × 10^3 copies per picogram of DNA, respectively. The frequency of pHaS13-related sequences varied widely within annual species, while no significant difference was observed among perennial species. Conversely, the frequency variation of pHaS211-related sequences was as large within annual species as within perennials. Sequences of both families were found to be dispersed along the length of all chromosomes in all species studied. However, Ty3/gypsy-like sequences were localized preferentially at the centromeric regions, whereas Ty1/copia-like sequences were less represented or absent around the centromeres and plentiful at the chromosome ends. These findings suggest that the two sequence families played a role in Helianthus genome evolution and species divergence, evolved independently in the same genomic backgrounds and in annual or perennial species, and acquired different possible functions in the host genomes.

4.
The determination of kLa by a gas balance method coupled with sulphite oxidation is compared for three kinds of processes (stirred tank, bubble column and fixed-bed column reactors) with a gassing-in and with a classical chemical sulphite oxidation method. The mathematical relations required for the determination of the kLa value are detailed. In coalescing gas-liquid conditions, the values calculated by the three methods are shown to be comparable. The gas balance method is more rapid than either the steady-state gassing-in or the chemical sulphite reaction rate measurement methods. It is also well adapted for three-phase systems (gas-liquid-solid) in which the non-coalescing effects of sulphite solution are reduced by solid interferences.
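
The abstract does not reproduce the relations it mentions, so the following Python sketch only illustrates the general textbook form of a gas balance estimate. It assumes the oxygen transfer rate (OTR) is the difference between oxygen entering and leaving in the gas stream per unit liquid volume, and that kLa equals the OTR divided by the driving force (saturation concentration minus dissolved oxygen). With sulphite oxidation the dissolved oxygen is commonly taken as close to zero, which is the default used here. All function names, parameter names and units are assumptions for illustration.

```python
def oxygen_transfer_rate(q_in, y_in, q_out, y_out, liquid_volume):
    """Oxygen transfer rate from a gas-phase balance.

    q_in, q_out: molar gas flow rates in and out (mol/h)
    y_in, y_out: oxygen mole fractions in the inlet and outlet gas
    liquid_volume: liquid volume (L)
    Returns the OTR in mol O2 per litre per hour.
    """
    if liquid_volume <= 0:
        raise ValueError("liquid_volume must be positive")
    return (q_in * y_in - q_out * y_out) / liquid_volume


def kla_from_gas_balance(otr, c_star, c_liquid=0.0):
    """Volumetric mass transfer coefficient kLa (1/h).

    otr: oxygen transfer rate (mol/L/h)
    c_star: oxygen saturation concentration (mol/L)
    c_liquid: dissolved oxygen concentration (mol/L); assumed 0 when
              sulphite consumes oxygen as fast as it dissolves
    """
    driving_force = c_star - c_liquid
    if driving_force <= 0:
        raise ValueError("c_star must exceed c_liquid")
    return otr / driving_force
```

Because the estimate needs only inlet and outlet gas measurements at steady state, it is consistent with the abstract's remark that the gas balance method is faster than following a dissolved oxygen or reaction rate curve.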

5.
A revision of Penstemon sect. Saccanthera subsect. Serrulati includes a new species (P. salmonensis), a new variety (P. triphyllus var. infernalis), and the elevation of a subspecies to species (P. curtiflorus), bringing the total number of species to eight, which are keyed and described, complete with nomenclature and type citations.

6.
A genetic transformation system has been developed for callus cells of Crataegus aronia using Agrobacterium tumefaciens. Callus culture was established from internodal stem segments incubated on Murashige and Skoog (MS) medium supplemented with 5 mg l−1 indole-3-butyric acid (IBA) and 0.5 mg l−1 6-benzyladenine (BA). To optimize the callus culture system with respect to callus growth and coloration, different types and concentrations of plant growth regulators were tested. The best average fresh weight of red-colored callus was obtained on MS medium supplemented with 2 mg l−1 2,4-dichlorophenoxyacetic acid (2,4-D) and 1.5 mg l−1 kinetin (Kin) (callus maintenance medium). Callus cells were co-cultivated with Agrobacterium harboring the binary plasmid pCAMBIA1302 carrying the mgfp5 and hygromycin phosphotransferase (hptII) genes conferring green fluorescent protein (GFP) activity and hygromycin resistance, respectively. Putative transgenic calli were obtained 4 weeks after incubation of the co-cultivated explants on maintenance medium supplemented with 50 mg l−1 hygromycin. Molecular analysis confirmed the integration of the transgenes in transformed callus. To our knowledge, this is the first report of an Agrobacterium-mediated transformation system in Crataegus aronia.

7.
Studying Pneumocystis has proven to be a challenge from the perspective of propagating a significant amount of the pathogen in a facile manner. The study of several fungal pathogens has been aided by the use of invertebrate model hosts. Our efforts to infect the invertebrate larvae Galleria mellonella with Pneumocystis proved futile since P. murina neither caused disease nor was able to proliferate within G. mellonella. It did, however, show that the pathogen could be rapidly cleared from the host.

8.
Tropilaelaps mercedesae is a serious ectoparasite of Apis mellifera in China. The aim of this study was to investigate the infestation rates and intensity of T. mercedesae in A. mellifera in China, and to explore the relative importance of climate, district, management practices and beekeeper characteristics that are assumed to be associated with the intensity of T. mercedesae. Of the 410 participating apiaries, 379 apiaries were included in analyses of seasonal infestation rates and 352 apiaries were included in multivariable regression analysis. The highest infestation rate (86.3%) of T. mercedesae was encountered in autumn, followed by summer (66.5%), spring (17.2%) and winter (14.8%). In autumn, 28.9% (93) of the infested apiaries were in the north (including the northeast and northwest of China), 71.1% (229) were in the central and south (including east, southeast and southwest China), and 306 apiaries (82.9%) were co-infested by both T. mercedesae and Varroa. Multivariable regression analysis showed that geographical location, season, royal jelly collection and Varroa infestation were the factors that influence the intensity of T. mercedesae. The influence of beekeeper’s education, time of beekeeping, operation size, and hive migration on the intensity of T. mercedesae was not statistically significant. This study provides information on the link between the environment and the parasite and could lead to better timing and methods of control.

9.

Background  

Non-biological factors give rise to unwanted variations in cDNA microarray data. There are many normalization methods designed to remove such variations. However, to date there have been few published systematic evaluations of these techniques for removing variations arising from dye biases in the context of downstream, higher-order analytical tasks such as classification.

10.
11.

Background  

Calcium signaling plays a prominent role in plants for coordinating a wide range of developmental processes and responses to environmental cues. Stimulus-specific generation of intracellular calcium transients, decoding of calcium signatures, and transformation of the signal into cellular responses are integral modules of the transduction process. Several hundred proteins with functions in calcium signaling circuits have been identified, and the number of downstream targets of calcium sensors is expected to increase. We previously identified a novel, calmodulin-binding nuclear protein, IQD1, which stimulates glucosinolate accumulation and plant defense in Arabidopsis thaliana. Here, we present a comparative genome-wide analysis of a new class of putative calmodulin target proteins in Arabidopsis and rice.

12.
The maT clade of transposons is a group of transposable elements intermediate in sequence and predicted protein structure to mariner and Tc transposons, with a distribution thus far limited to a few invertebrate species. We present evidence, based on searches of publicly available databases, that the nematode Caenorhabditis briggsae has several maT-like transposons, which we have designated as CbmaT elements, dispersed throughout its genome. We also describe two additional transposon sequences that probably share their evolutionary history with the CbmaT transposons. One resembles a foldback variant of a CbmaT element, with long (380-bp) inverted terminal repeats (ITRs) that show a high degree (71%) of identity to CbmaT1. The other, which shares only the 26-bp ITR sequences with one of the CbmaT variants, is present in eight nearly identical copies, but does not have a transposase gene and may therefore be cross-mobilised by a CbmaT transposase. Using PCR-based mobility assays, we show that CbmaT1 transposons are capable of excising from the C. briggsae genome. CbmaT1 excised approximately 500 times less frequently than Tcb1 in the reference strain AF16, but both CbmaT1 and Tcb1 excised at extremely high frequencies in the HK105 strain. The HK105 strain also exhibited a high frequency of spontaneous induction of unc-22 mutants, suggesting that it may be a mutator strain of C. briggsae.

13.
The B subfamily of ATP-binding cassette (ABC) proteins (ABCB) plays a vital role in auxin efflux. However, no systematic study has been done in apple. In this study, we performed genome-wide identification and expression analyses of the ABCB family in Malus domestica for the first time. We identified a total of 25 apple ABCBs that were divided into three clusters based on the phylogenetic analysis. Most ABCBs within the same cluster demonstrated a similar exon–intron organization. Additionally, the digital expression profiles of ABCB genes shed light on their functional divergence. ABCB1 and ABCB19 are two well-studied auxin efflux carrier genes, and we found that their expression levels are higher in young shoots of M106 than in young shoots of M9. Since young shoots are the main source of auxin synthesis and auxin efflux is involved in tree height control, this suggests that ABCB1 and ABCB19 may also take part in auxin efflux and tree height control in apple.

14.
Ogataea parapolymorpha sp. n. (NRRL YB-1982, CBS 12304, type strain), the ascosporic state of Candida parapolymorpha, is described. The species appears homothallic, assimilates methanol as is typical of most Ogataea species and forms hat-shaped ascospores in asci that become deliquescent. O. parapolymorpha is closely related to Ogataea angusta and Ogataea polymorpha. The three species can be resolved from gene sequence analyses but are unresolved from fermentation and growth reactions that are typically used for yeast identification. On the basis of multiple isolates, O. angusta is known only from California, USA, in association with Drosophila and Aulacigaster flies, and O. parapolymorpha is predominantly associated with insect frass from trees in the eastern USA, whereas O. polymorpha has been isolated from various substrates in the USA, Brazil, Spain and Costa Rica.

15.
Seol E, Jung Y, Lee J, Cho C, Kim T, Rhee Y, Lee S. Plant Cell Reports, 2008, 27(7): 1197-1206.
Notocactus scopa cv. Soonjung was subjected to in planta Agrobacterium tumefaciens-mediated transformation with vacuum infiltration, pin-pricking, and a combination of the two methods. The pin-pricking combined with vacuum infiltration (20-30 cmHg for 15 min) resulted in a transformation efficiency of 67-100%, and the expression of the uidA and nptII genes was detected in transformed cactus. The established in planta transformation technique generated a transgenic cactus with higher transformation efficiency, shortened selection process, and stable gene expression via asexual reproduction. All of the results showed that the in planta transformation method utilized in the current study provided an efficient and time-saving procedure for the delivery of genes into the cactus genome, and that this technique can be applied to other asexually reproducing succulent plant species.

16.
17.
18.

Background  

Molecular genetic maps provide a means to link heritable traits with underlying genome sequence variation. Several genetic maps have been constructed for Brassica species, yet to date, there has been no simple means to compare this information or to associate mapped traits with the genome sequence of the related model plant, Arabidopsis.

19.
Genome sequence analysis of Xanthomonas oryzae pv. oryzae has revealed a cluster of 12 ORFs that are closely related to the gum gene cluster of Xanthomonas campestris pv. campestris. The gum gene cluster of X. oryzae encodes proteins involved in xanthan production; however, there is little experimental evidence supporting this. In this study, biochemical analyses of xanthan produced by a defined set of X. oryzae gum mutant strains allowed us to preliminarily assign functions to most of the gum gene products: biosynthesis of the pentasaccharide repeating unit for GumD, GumM, GumH, GumK, and GumI, xanthan polymerization and transport for GumB, GumC, GumE, and GumJ, and modification of the pentasaccharide repeating unit for GumF, GumG, and GumL. In addition, we found that the exopolysaccharides are essential but not specific for the virulence of X. oryzae. Sang-Yoon Kim and Jeong-Gu Kim contributed equally to this work.

20.
This study examines the interactions that occur between Saccharomyces cerevisiae and Oenococcus oeni strains during the process of winemaking. Various yeast/bacteria pairs were studied by applying a sequential fermentation strategy which simulated the natural winemaking process. First, four yeast strains were tested in the presence of one bacterial strain, leading to the inhibition of the bacterial component. The extent of inhibition varied widely from one pair to another and closely depended on the specific yeast strain chosen. Inhibition was correlated to weak bacterial growth rather than a reduction in the bacterial malolactic activity. Three of the four yeast strains were then grown with another bacterial strain. Contrary to the first results, this led to bacterial stimulation, thus highlighting the importance of the bacterial strain. The biochemical profile of the four yeast-fermented media exhibited slight variations in ethanol, SO2 and fatty acids produced, as well as in assimilable nitrogen consumed. These parameters were not the only factors responsible for the malolactic fermentation inhibition observed with the first bacterial strain. The stimulation of the second has not been reported before in such conditions and remains unexplained.
