首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 875 毫秒
1.
Bi-PROF     
The use of next generation sequencing has expanded our view on whole mammalian methylome patterns. In particular, it provides a genome-wide insight of local DNA methylation diversity at single nucleotide level and enables the examination of single chromosome sequence sections at a sufficient statistical power. We describe a bisulfite-based sequence profiling pipeline, Bi-PROF, which is based on the 454 GS-FLX Titanium technology that allows to obtain up to one million sequence stretches at single base pair resolution without laborious subcloning. To illustrate the performance of the experimental workflow connected to a bioinformatics program pipeline (BiQ Analyzer HT) we present a test analysis set of 68 different epigenetic marker regions (amplicons) in five individual patient-derived xenograft tissue samples of colorectal cancer and one healthy colon epithelium sample as a control. After the 454 GS-FLX Titanium run, sequence read processing and sample decoding, the obtained alignments are quality controlled and statistically evaluated. Comprehensive methylation pattern interpretation (profiling) assessed by analyzing 102-104 sequence reads per amplicon allows an unprecedented deep view on pattern formation and methylation marker heterogeneity in tissues concerned by complex diseases like cancer.  相似文献   

2.
The amino acid sequence of 6.5k-arginine/glutamate rich polypeptide (6.5k-AGRP) from the seeds of sponge gourd (Luffa cylindrica) has been determined. The 6.5k-AGRP consists of a 47-residue polypeptide chain containing two disulfide bonds, and a molecular mass calculated to be 5695 Da, which fully coincides with a value of [Μ + H] + = m/z 5693.39 obtained by matrix-assisted laser desorption ionization time of flight mass spectrometry (MALDI-TOF MS). The mass spectrometric evidence indicated that 6.5k-AGRP is also present partially truncated at the C-terminus. In our preparations, approximately half of the polypeptide molecules have the C-terminal sequence Arg-Arg-Glu-Val-Asp; the other half lack Val-Asp and end with the glutamic acid, making a total of 45 residues in the polypeptide chain. The two disulfide bonds connect Cys12 to Cys33 and Cys16 to Cys29. Comparison of the amino acid sequence of 6.5k-AGRP with those of the other known proteins included in the PIR protein sequence database showed that it is related to the amino acid sequence of the N-terminal region encoded by the first exon of the cocoa (Theobroma cacao) and cotton seeds vicilin genes, sharing a characteristic two Cys-Xaa-Xaa-Xaa-Cys motif.  相似文献   

3.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1 and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8.The ITS2 Database9 presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11 accurately reannotated10. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12 (direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14 search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16 and ProfDistS17 for multiple sequence-structure alignment calculation and Neighbor Joining18 tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.  相似文献   

4.
The complete amino acid sequence of the CNBr fragment comprising residues 229–284 of the murine major histocompatibility complex antigen H-2Db has been determined using radiochemical methodology. The sequence was determined by N-terminal sequence analysis of the intact CNBr fragment and by sequence determinations of peptides derived from this fragment by trypsin and staphylococcal V8 protease cleavage. In addition to the amino acid assignments for H-2Db, it was possible to assign the linkage position of the third N-linked glycosyl unit to the asparagine at residue 256. Additional amino acid sequence assignments have also been made for three other CNBr fragments that span residues 99–138, 139–228, and 308–331 of the H-2Db molecule. The total protein sequence information available (222 of 338 residues) agrees in every comparable position with the protein sequence derived from the cDNA clone (pH203) isolated by Reyes and co-workers (1982b), which strongly suggests that this clone encodes H-2Db. Combination of the protein sequence with that deduced from the cDNA clone provides the complete H-2Db protein sequence. Comparison of this sequence with other available protein sequence information for murine class I molecules has revealed protein sequences that may be unique to either K or D region molecules.Abbreviations used in this paper HPLC high performance liquid chromatography - V8 Staphylococcus aureus V8 protease - MHC major histocompatibility complex  相似文献   

5.
6.
An octapeptide of the adipokinetic hormone (AKH) peptide family is identified in the corpora cardiaca of the stink bug, Nezara viridula, by ESI-MSN (electrospray ionization multistage MS). This is the second AKH in N. viridula and it has a hydroxyproline residue at position 6, whereas the major AKH (known as Panbo-RPCH) has Pro as the sixth amino acid residue. The correct sequence assignment of [Hyp6]-Panbo-RPCH is confirmed by retention time and MS spectra of the synthetic peptide. Various extraction procedures were followed to ascertain whether the hydroxylation is an artefact of extraction, or whether it is due to a true post-translational modification at the prohormone level. The proline hydroxylation is unique for invertebrate neuropeptides, while it has been described in the vertebrate gonadotropin-releasing hormone (GnRH). The current finding is another piece of evidence that AKH and GnRH form a peptide superfamily and are closely related evolutionarily. Biologically, [Hyp6]-Panbo-RPCH is active in vivo as an AKH, causing hyperlipaemia in the stink bug at low doses, indicating again that it is an endogenous, mature and functional hormone in this insect species.  相似文献   

7.
8.
Six of the eight transfer RNAs coded by bacteriophage T4 are synthesized via three dimeric precursor molecules. The sequences of two of these have been determined. Both of these precursors give rise to equimolar amounts of the cognate tRNA molecules in vivo. In contrast, even in wild-type infections, tRNAIle is present in ≤ 30% the amount of tRNAThr, with which it is processed from a common dimeric precursor.We have now determined the sequence of this dimer. In addition to the nucleotides present in tRNAThr and tRNAIle, it contains nine precursor-specific residues, located at the 5′ and 3′ termini and at the interstitial junction of the two tRNA sequences. While the three dimers share the majority of structural features in common, pre-tRNAThr + Ile is the only case in which an encoded tRNA 3′ -C-C-A terminus is present in the interstitial region.The processing of this dimer in various biosynthetic mutants has been analyzed in vivo and in vitro and shown to be anomalous in several respects. These results suggest that the apparent underproduction of tRNAIle can be explained by a novel processing pathway that generates a metabolically unstable tRNAIle product. Data from DNA sequence analysis of the T4 tRNA gene cluster (Fukada & Abelson, 1980) support the conclusion that the asymmetric maturation of this precursor is a consequence of the unique disposition of the -C-C-A sequence. These results argue that gene expression can be modulated at the level of RNA processing. The biological significance of this phenomenon is discussed in relation to evidence that tRNAIle has a unique physiological role.  相似文献   

9.
The lengths of open reading frame (ORF)100 and ORF29-TrnCGCA, the intronic sequence of rps16 and the transcribed spacer of TrnTUGU-TrnLUAA in chloroplast from different lines of cytoplasmic male sterility (CMS) rice were studied using indica types, japonica types and common wild rice as controls. The results show that the lengths of ORF100 and ORF29-TrnCGCA in CMS lines are similar to those of typical indica. The sequences of the rps16 intron and the TrnTUGU-TrnLUAA spacer in sporophyte sterile types (wild-abortive type, Yinshui type and K type) are almost the same, and they also share a molecular marker of GTTGAG at nucleotide positions 220–225 in the rps16 intron. Therefore, it is speculated that the source of these three types is the same. In contrast, a gametophyte sterile type, Yuetai A does not contain such a GTTGAG sequence in the rps16 intron and has a unique G at position 595, which may works as a molecular marker distinguishing the sporophyte sterile type from the gametophyte sterile type. Based on the observation that CMS rice has much lower cytoplasmic polymorphism than indica, japonica and wild rice, it is concluded that CMS rice lack cytoplasm diversity. Therefore, it is important to introduce new sources of cytoplasm into hybrid rice.  相似文献   

10.
A protein, which consists of a defined sequence of amino-acids, can be converted into another protein of equal length by changes in the sequence of nucleotides in the segment of DNA by which it is coded. It is convenient to term this segment a gene. Most proteins are 200–400 amino-acids long, so that most genes, if defined in this way, are represented by 600–1,200, or about 103, nucleotides. The evolution of protein molecules may appear to be limited by the need for each step in the sequence to be at a selective advantage. Maynard Smith1 has compared this with a word game in which, using exclusively meaningful words, one word has to be changed to another by altering one letter at a time.  相似文献   

11.
Codon-Anticodon Binding in tRNA<Superscript>phe</Superscript>   总被引:5,自引:0,他引:5  
THE anticodon loop of tRNAphe of baker's yeast has the sequence (5′ to 3′) AY A A MeG U MeC. The unusual base Y, adjacent to the anticodon (AA MeG), is the only nucleotide in this tRNA which fluoresces at room temperature and because it absorbs to the red of all other bases, the excitation energy is localized on it exclusively. (7-Methyl guanine is another base in tRNAphe which fluoresces in these conditions but its emission is so weak that it can only be observed in tRNAphe from which Y has been excised.) The fluorescence spectrum undergoes a small blue shift in the presence of the complementary codon1 and we report now the use of this shift to determine the association constants for this binding at several temperatures. The results suggest a simple thermodynamic model for the codon recognition step during protein synthesis.  相似文献   

12.
13.
14.
In this paper, we described the synthesis of three octapeptides analogues which are obtained by the incorporation of photochemical 4-thiouracil (s4U) probes selectively activables. This probe has been incorporated at the 1 and/or 3 positions in KPGEPGPK sequence since it has been demonstrated that those positions could be mutated without major anti-thrombotic activity modifications. These analogues could be used to identify a new receptor for type III collagen named Type III Collagen Binding Protein (TIIICBP).  相似文献   

15.
A high activity creatine amidinohydrolase (creatinase) from Arthrobacter nicotianae 02181 (a strain newly isolated from soil which may utilize creatinine as the unique organic source) was purified, characterized and the creatinase gene was cloned and analyzed in this study. Cells were cultivated under optimized condition for enzyme yield and creatinase was purified by the DEAE-cellulose and hydroxylapatite (HA) chromatography. The creatinase was found to be a dimmer formed by two identical subunit of 46.4 kDa, and the specific activity of the purified creatinase reached 124.44 U/mg protein, which was about 13 folds of the maximum value ever reported. The enzyme was found to be most active at 37 °C (pH 7.0), and it was found to be relatively stable bellow 45 °C around pH 7.0 by fluorescence spectroscopy and circular dichroism (CD) analysis. The activity of this creatinase could be significantly inhibited by Cu2+, Hg2+, Fe3+and SDS, and it could be improved by Ca2+ and NaN3.The creatinase gene was cloned by the consensus-degenerate hybrid oligonucleotide primers (CODEHOP) PCR and the genome walking method. Nucleotide sequence analysis of this gene revealed an open reading frame (ORF) of 1254 base pair (bp) encoding a 417 amino acid (aa) protein. The primary amino acid sequence alignment search in the database revealed a moderate homology between the deduced amino acid sequence and other creatinase. The sequence has been submitted to Genbank with the accession number EU004199.  相似文献   

16.
Earlier immunological experiments with a synthetic 36‐residue peptide (75‐110) from Influenza hemagglutinin have been shown to elicit anti‐peptide antibodies (Ab) which could cross‐react with the parent protein. In this article, we have studied the conformational features of a short antigenic (Ag) peptide (98YPYDVPDYASLRS110) from Influenza hemagglutinin in its free and antibody (Ab) bound forms with molecular dynamics simulations using GROMACS package and OPLS‐AA/L all‐atom force field at two different temperatures (293 K and 310 K). Multiple simulations for the free Ag peptide show sampling of ordered conformations and suggest different conformational preferences of the peptide at the two temperatures. The free Ag samples a conformation crucial for Ab binding (β‐turn formed by “DYAS” sequence) with greater preference at 310 K while, it samples a native‐like conformation with relatively greater propensity at 293 K. The sequence “DYAS” samples β‐turn conformation with greater propensity at 310 K as part of the hemagglutinin protein also. The bound Ag too samples the β‐turn involving “DYAS” sequence and in addition it also samples a β‐turn formed by the sequence “YPYD” at its N‐terminus, which seems to be induced upon binding to the Ab. Further, the bound Ag displays conformational flexibility at both 293 K and 310 K, particularly at terminal residues. The implications of these results for peptide immunogenicity and Ag–Ab recognition are discussed. Proteins 2015; 83:1352–1367. © 2015 Wiley Periodicals, Inc.  相似文献   

17.
The EcoRV restriction endonuclease cleaves DNA at its recognition sequence at least a million times faster than at any other DNA sequence. The only cofactor it requires for activity is Mg2+: but in binding to DNA in the absence of Mg2+, the EcoRV enzyme shows no specificity for its recognition site. Instead, the reason why EcoRV cuts one DNA sequence faster than any other is that the rate of cleavage is controlled by the binding of Mg2+ to EcoRV-DNA complexes: the complex at the recognition site has a high affinity for Mg2+, while the complexes at other DNA sequences have low affinities for Mg2+. The structures of the EcoRV endonuclease, and of its complexes with either 8pecific or non-specific DNA, have been solved by X-ray crystallography. In the specific complex, the protein interacts with the bases in the recognition sequence and the DNA takes up a highly distorted structure. In the non-specific complex with an unrelated DNA sequence, there are virtually no interactions with the bases and the DNA retains a B-like structure. Since the free energy changes for the formation of specific and non-specific complexes are the same, the energy from the specific interactions balances that required for the distortion of the DNA. The distortion inserts the phosphate at the scissile bond into the active site of the enzyme, where it forms part of the binding site for Mg2+. Without this distortion, the EcoRV–DNA complex would be unable to bind Mg2+ and thus unable to cleave DNA. The specificity of the EcoRV restriction enzyme is therefore governed, not by DNA binding as such, but by its ability to organize the structure of the DNA to which it is bound.  相似文献   

18.
19.
Extensive research has provided ample evidences suggesting that protein folding in the cell is a co-translational process1-5. However, the exact pathway that polypeptide chain follows during co-translational folding to achieve its functional form is still an enigma. In order to understand this process and to determine the exact conformation of the co-translational folding intermediates, it is essential to develop techniques that allow the isolation of RNCs carrying nascent chains of predetermined sizes to allow their further structural analysis.SecM (secretion monitor) is a 170 amino acid E. coli protein that regulates expression of the downstream SecA (secretion driving) ATPase in the secM-secA operon6. Nakatogawa and Ito originally found that a 17 amino acid long sequence (150-FSTPVWISQAQGIRAGP-166) in the C-terminal region of the SecM protein is sufficient and necessary to cause stalling of SecM elongation at Gly165, thereby producing peptidyl-glycyl-tRNA stably bound to the ribosomal P-site7-9. More importantly, it was found that this 17 amino acid long sequence can be fused to the C-terminus of virtually any full-length and/or truncated protein thus allowing the production of RNCs carrying nascent chains of predetermined sizes7. Thus, when fused or inserted into the target protein, SecM stalling sequence produces arrest of the polypeptide chain elongation and generates stable RNCs both in vivo in E. coli cells and in vitro in a cell-free system. Sucrose gradient centrifugation is further utilized to isolate RNCs.The isolated RNCs can be used to analyze structural and functional features of the co-translational folding intermediates. Recently, this technique has been successfully used to gain insights into the structure of several ribosome bound nascent chains10,11. Here we describe the isolation of bovine Gamma-B Crystallin RNCs fused to SecM and generated in an in vitro translation system.  相似文献   

20.
Complementary (c)DNA coding for an insect yolk protein, the egg-specific protein of the silkworm Bombyx mori was cloned and the nucleotide sequence determined. The sequence covers the entire coding region of 1,677 base pairs with 5′ and 3′ noncoding regions (21 and 115 base pairs, respectively). The deduced amino acid sequence of the egg-specific protein consists of 559 amino acid residues. The NH2-terminal 18 amino acid sequence is enriched in hydrophobic amino acids and assumed to be a signal peptide. A sequence, Asn-X-Thr, a potential N-linked glycosylation site, is found at positions 191 to 193. A serine-rich domain is localized in the region from 63 to 90, in which phosphorylation takes place. Cys His motif in 405 to 415 is analogous to a proposed metal binding sequence. Lys132-Asn133 and Arg228-Asp229 are probably the sites cleaved by the egg-specific protein protease that appears during embryogenesis. The derived amino acid sequence has no appreciable homology to other sequenced proteins.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号