首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 4 毫秒
1.
Detecting genomic structural variants from high-throughput sequencing data is a complex and unresolved challenge. We have developed a statistical learning approach, based on Random Forests, that integrates prior knowledge about the characteristics of structural variants and leads to improved discovery in high-throughput sequencing data. The implementation of this technique, forestSV, offers high sensitivity and specificity coupled with the flexibility of a data-driven approach.  相似文献   

2.
Comprehensive discovery of structural variation (SV) from whole genome sequencing data requires multiple detection signals including read-pair, split-read, read-depth and prior knowledge. Owing to technical challenges, extant SV discovery algorithms either use one signal in isolation, or at best use two sequentially. We present LUMPY, a novel SV discovery framework that naturally integrates multiple SV signals jointly across multiple samples. We show that LUMPY yields improved sensitivity, especially when SV signal is reduced owing to either low coverage data or low intra-sample variant allele frequency. We also report a set of 4,564 validated breakpoints from the NA12878 human genome. https://github.com/arq5x/lumpy-sv.  相似文献   

3.
Apolipoprotein(apo) C-II DNA, RNA and protein from a patient with a familial deficiency of apoC-II were evaluated and compared to normal individuals. No major defect of the apoC-II gene could be detected by Southern blot hybridization. Northern and slot blot analyses of total liver RNA documented normal levels of a normal sized apoC-II mRNA. Immunohistochemical studies of the liver of the apoC-II deficient patient revealed a normal to slightly elevated intracellular content of the C-II apolipoprotein. Plasma apoC-II was 3 to 5% of normal apoC-II levels and exhibited abnormal electrophoretic mobility on two dimensional gel electrophoresis and immunoblotting. We postulate that at the molecular level, the deficiency of apoC-II in the plasma of this patient results from a structural defect in the coding portion of the apoC-II gene leading to either defective secretion of cellular apoC-II or increased catabolism of a structurally defective apoC-II in plasma.  相似文献   

4.
Structural variation (SV) plays a fundamental role in genome evolution and can underlie inherited or acquired diseases such as cancer. Long-read sequencing technologies have led to improvements in the characterization of structural variants (SVs), although paired-end sequencing offers better scalability. Here, we present dysgu, which calls SVs or indels using paired-end or long reads. Dysgu detects signals from alignment gaps, discordant and supplementary mappings, and generates consensus contigs, before classifying events using machine learning. Additional SVs are identified by remapping of anomalous sequences. Dysgu outperforms existing state-of-the-art tools using paired-end or long-reads, offering high sensitivity and precision whilst being among the fastest tools to run. We find that combining low coverage paired-end and long-reads is competitive in terms of performance with long-reads at higher coverage values.  相似文献   

5.
Photochemical uncaging of bio-active molecules was introduced in 1977, but since then, there has been no substantial improvement in the properties of generic caging chromophores. We have developed a new chromophore, nitrodibenzofuran (NDBF) for ultra-efficient uncaging of second messengers inside cells. Photolysis of a NDBF derivative of EGTA (caged calcium) is about 16-160 times more efficient than photolysis of the most widely used caged compounds (the quantum yield of photolysis is 0.7 and the extinction coefficient is 18,400 M(-1) cm(-1)). Ultraviolet (UV)-laser photolysis of NDBF-EGTA:Ca(2+) rapidly released Ca(2+) (rate of 20,000 s(-1)) and initiated contraction of skinned guinea pig cardiac muscle. NDBF-EGTA has a two-photon cross-section of approximately 0.6 GM and two-photon photolysis induced localized Ca(2+)-induced Ca(2+) release from the sarcoplasmic recticulum of intact cardiac myocytes. Thus, the NDBF chromophore has great promise as a generic and photochemically efficient protecting group for both one- and two-photon uncaging in living cells.  相似文献   

6.
7.
8.
Next-generation sequencing (NGS) technologies have revolutionised the analysis of genomic structural variants (SVs), providing significant insights into SV de novo formation based on analyses of rearrangement breakpoint junctions. The short DNA reads generated by NGS, however, have also created novel obstacles by biasing the ascertainment of SVs, an aspect that we refer to as the 'short-read dilemma'. For example, recent studies have found that SVs are often complex, with SV formation generating large numbers of breakpoints in a single event (multi-breakpoint SVs) or structurally polymorphic loci having multiple allelic states (multi-allelic SVs). This complexity may be obscured in short reads, unless the data is analysed and interpreted within its wider genomic context. We discuss how novel approaches will help to overcome the short-read dilemma, and how integration of other sources of information, including the structure of chromatin, may help in the future to deepen the understanding of SV formation processes.  相似文献   

9.
10.
11.
A variant of the MM glycoprotein (glycophorin A) was isolated from erythrocyte membranes of two individual donors, a mother (L.G.) and daughter (V.W.). This glycoprotein was found to be a carbohydrate variant in which, for both donors, certain O-glycosidically linked saccharides retained the core structure consisting of NeuAc(alpha 2,3)Gal(beta 1,3)GalNAc that is common to all O-linked saccharides of the MN glycoproteins, and, in addition, contained substituents, of varying chain lengths, on the primary carbinol of GalNAc. These saccharides were released from the polypeptide by beta-elimination in the presence of sodium borohydride, and aspects of their structure were investigated by glycosidase digestion and periodate oxidation. Thus, the smallest variant structure was deduced to be NeuAc(alpha 2,3)Gal(beta 1,3)[GlcNAc(beta 1,6)]H2GalNAc. The 6-O-linked GlcNAc appears to serve as the focus of further chain elongation reactions, involving alternate additions of Gal and GlcNAc residues and leading to the formation of several homologous structures. Two such structures, NeuAc(alpha 2,3)Gal(beta 1,3)[GlcNAc(beta 1,?) Gal(beta 1,3/4)GlcNAc(beta 1,6)]H2GalNAc and NeuAc(alpha 2,3) Gal(beta 1,3)[Gal(beta 1,3/4)GlcNAc(beta 1,6)]H2GalNAc were the predominant species present. A larger saccharide was also isolated and its partial sequence was determined to be Gal(beta 1,3/4)GlcNAc(beta 1,?)[Gal(beta 1,3/4)Glc-NAc(beta 1,?)] Gal(beta 1,3/4)GlcNAc(beta 1,6)[NeuAc(alpha 2,3)Gal-(beta 1,3)]H2GalNAc. Because the peptide portion of these glycoproteins contains two methionine residues, it was possible to isolate two CNBr glycopeptides from separate regions of the molecule, and to assess the distribution of these variant structures in the polypeptide. The saccharides were linked to about 2-3 Ser and/or Thr residues in the donor LG glycoprotein and one of the attachment sites was located within the CNBr glycooctapeptide representing the NH2 terminus. Considerable heterogeneity in saccharide structure was documented for this site, and it is likely that such heterogeneity occurs also at other sites. The variant saccharides bear structural similarities to the core region of O-linked saccharides of certain blood group-active mucins and ovarian cyst secretions, and to the outer sequences of N-linked carbohydrate units (I-, i-active) of the major glycoprotein of human erythrocytes, band 3. The structures of the variant saccharides suggest that they may be potential precursors of H blood group-active carbohydrates, present in varying degrees of maturity, and attached to an integral protein of erythrocytes.  相似文献   

12.
Influenza-specific cytotoxic T cells restricted by HLA-A3 recognize differences between HLA-A3 antigens that are serologically indistinguishable. To examine whether this differential recognition had a structural basis, we have compared the structures of HLA-A3 molecules from Epstein Barr virus-transformed peripheral blood lymphocytes of two individuals, E1 and M17. M17 was representative of the majority of HLA-A3-bearing individuals, whereas E1 was a variant distinguished by cytotoxic T cells. Peptide map comparisons revealed a small number of differences when particular amino acids were used to radiolabel the A3 molecules. Sequence analysis and comparison of the results with the prototypic HLA-A3 sequence localized the variability to a tryptic peptide spanning residues 147-157. To obtain the complete amino acid sequence for the E1 and M17 A3 molecules in the 147-157 region, the CNBr fragments beginning at residue 139 were isolated and sequenced. Two amino acid differences were detected between the HLA-A3 molecule of the CTL-defined variant E1 and that of M17. At position 152, Glu in donor M17 had changed to Va1 in donor E1, and at position 156, Leu in donor M17 had changed to Gln in donor E1. The finding that A3 molecules from E1 are altered in the region between residues 147 and 157 is consistent with studies on HLA-A2 variants and Kb mutants showing that this region of class I molecules is important for CTL recognition but not for recognition by serologic reagents.  相似文献   

13.
In contrast to the major form of human growth hormone the 20,000-dalton (20K) variant of the hormone produced no decrease in either serum glucose of free fatty acids one hour after injection into fasted, hypophysectomized rats. Furthermore, the variant caused no rise in serum free fatty acids after 5 hours. Invitro experiments utilizing epididymal adipose tissue from hypophysectomized rats indicated that 20K was unable to accelerate glucose utilization as measured by glucose uptake and CO2 formation. The data show that this form, even though growth promoting, lacks some of the metabolic properties attributed to growth hormone. We conclude that an insulin-like effect is not necessarily a prerequisite for growth promoting activity.  相似文献   

14.
After characterization of a novel odorant-binding protein (OBP) variant isolated from the rat nasal mucus, the corresponding cDNA was cloned by RT-PCR. Recombinant OBP-1F, the sequence of which is close to that of previously reported rat OBP-1, has been secreted by the yeast Pichia pastoris at a concentration of 80 mg.L-1 in a form identical to the natural protein as shown by MS, N-terminal sequencing and CD. We observed that, in contrast with porcine OBP-1, purified recombinant OBP-1F is a homodimer exhibiting two disulfide bonds (C44-C48 and C63-C155), a pairing close to that of hamster aphrodisin. OBP-1F interacts with fluorescent probe 1-aminoanthracene (1-AMA) with a dissociation constant of 0.6 +/- 0. 3 microM. Fluorescence experiments revealed that 1-AMA was displaced efficiently by molecules including usual solvents such as EtOH and dimethylsulfoxide. Owing to the large OBP-1F amounts expressed, we set up a novel biomimetic assay (volatile-odorant binding assay) to study the uptake of airborne odorants without radiolabelling and attempted to understand the odorant capture by OBP in the nasal mucus under natural conditions. The assay permitted observations on the binding of airborne odorants of different chemical structures and odors (2-isobutyl-3-methoxypyrazine, linalool, isoamyl acetate, 1-octanal, 1-octanol, dimethyl disulfide and methyl thiobutyrate). Uptake of airborne odorants in nearly physiological conditions strengthens the role of OBP as volatile hydrophobic odorant carriers in the mucus of the olfactory epithelium through the aqueous barrier towards the chemo-sensory cells.  相似文献   

15.
《Genomics》2023,115(2):110568
It has recently been shown that structural variants (SV) can have a higher impact on gene expression variation compared to single nucleotide variants (SNV) in different plant species. Additionally, SV were associated with phenotypic variation in several crops. However, compared to the established SV detection based on short-read sequencing, less approaches were described for linked-read based SV calling. We therefore evaluated the performance of six linked-read SV callers compared to an established short-read SV caller based on simulated linked-reads in tetraploid potato. The objectives of our study were to i) compare the performance of SV callers based on linked-read sequencing to short-read sequencing, ii) examine the influence of SV type, SV length, haplotype incidence (HI), as well as sequencing coverage on the SV calling performance in the tetraploid potato genome, and iii) evaluate the accuracy of detecting insertions by linked-read compared to short-read sequencing. We observed high break point resolutions (BPR) detecting short SV and slightly lower BPR for large SV. Our observations highlighted the importance of short-read signals provided by Manta and LinkedSV to detect short SV. Manta and NAIBR performed well for detecting larger deletions, inversions, and duplications. Detected large SV were weakly influenced by the HI. Furthermore, we illustrated that large insertions can be assembled by Novel-X. Our results suggest the usage of the short-read and linked-read SV callers Manta, NAIBR, LinkedSV, and Novel-X based on at least 90x linked-read sequencing coverage to ensure the detection of a broad range of SV in the tetraploid potato genome.  相似文献   

16.
We have previously selected structural variants of the Kk antigen from a (C3 × D2)F1 T-cell lymphoma. Those mutants were identified by the loss of certain epitopes defined by monoclonal antibodies. The variant Kk molecule from HK13.S3 cells is no longer recognized by 40% of the trinitrophenyl-specific, Kk-restricted cytotoxic T lymphocytes. Here we report on the primary structure of the altered Kk molecules from the cell line HK13.S3. Comparison with the parental Kk reveals a single base pair exchange, GCG to GTG, that results in an alanine to valine exchange in position 40 of the protein. This observation emphasizes that minor structural alterations in class I molecules may have a strong effect on the H-2-restricted T-cell response.  相似文献   

17.
Read-depths (RDs) are frequently used in identifying structural variants (SVs) from sequencing data. For existing RD-based SV callers, it is difficult for them to determine breakpoints in single-nucleotide resolution due to the noisiness of RD data and the bin-based calculation. In this paper, we propose to use the deep segmentation model UNet to learn base-wise RD patterns surrounding breakpoints of known SVs. We integrate model predictions with an RD-based SV caller to enhance breakpoints in single-nucleotide resolution. We show that UNet can be trained with a small amount of data and can be applied both in-sample and cross-sample. An enhancement pipeline named RDBKE significantly increases the number of SVs with more precise breakpoints on simulated and real data. The source code of RDBKE is freely available at https://github.com/yaozhong/deepIntraSV.  相似文献   

18.
Under the selective pressure of protease inhibitor therapy, patients infected with human immunodeficiency virus (HIV) often develop drug-resistant HIV strains. One of the first drug-resistant mutations to arise in the protease, particularly in patients receiving indinavir or ritonavir treatment, is V82A, which compromises the binding of these and other inhibitors but allows the virus to remain viable. To probe this drug resistance, we solved the crystal structures of three natural substrates and two commercial drugs in complex with an inactive drug-resistant mutant (D25N/V82A) HIV-1 protease. Through structural analysis and comparison of the protein-ligand interactions, we found that Val82 interacts more closely with the drugs than with the natural substrate peptides. The V82A mutation compromises these interactions with the drugs while not greatly affecting the substrate interactions, which is consistent with previously published kinetic data. Coupled with our earlier observations, these findings suggest that future inhibitor design may reduce the probability of the appearance of drug-resistant mutations by targeting residues that are essential for substrate recognition.  相似文献   

19.
Water is essential for life. This is a trivial fact but has profound implications since the forming of life on the early Earth required water. The sources of water and the related amount of delivery depend not only on the conditions on the early Earth itself but also on the evolutionary history of the solar system. Thus we ask where and when water formed in the solar nebula-the precursor of the solar system. In this paper we explore the chemical mechanics for water formation and its expected abundance. This is achieved by studying the parental cloud core of the solar nebula and its gravitational collapse. We have identified two different sources of water for the region of Earth's accretion. The first being the sublimation of the icy mantles of dust grains formed in the parental cloud. The second source is located in the inner region of the collapsing cloud core - the so-called hot corino with a temperature of several hundred Kelvin. There, water is produced efficiently in the gas phase by reactions between neutral molecules. Additionally, we analyse the dependence of the production of water on the initial abundance ratio between carbon and oxygen.  相似文献   

20.
Molecular Biology Reports - Rice landraces are vital genetic resources for agronomic and quality traits but the undeniable collection of Kerala landraces remains poorly delineated. To effectively...  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号