Similar articles
 Found 20 similar articles (search time: 34 ms)
1.
Motivation: The nucleotide sequencing process produces not only the sequence of nucleotides, but also associated quality values. Quality values provide valuable information, but are primarily used only for trimming sequences and generally ignored in subsequent analyses. Results: This article describes how the scoring schemes of standard alignment algorithms can be modified to take into account quality values to produce improved alignments and statistically more accurate scores. A prototype implementation is also provided, and used to post-process a set of BLAST results. Quality-adjusted alignment is a natural extension of standard alignment methods, and can be implemented with only a small constant factor performance penalty. The method can also be applied to related methods including heuristic search algorithms like BLAST and FASTA. Availability: Software is available at http://malde.org/~ketil/qaa. Contact: ketil.malde{at}imr.no Supplementary information: Supplementary data are available at Bioinformatics online. Associate Editor: Limsoon Wong
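As a rough illustration of the idea in item 1, the sketch below folds a Phred quality value into a match/mismatch score by taking the expected score over the possible true bases. Only the Phred relation Q = -10 log10(p) is standard; the function names, the default scores, and the assumption that a sequencing error is equally likely to be any of the other three bases are hypothetical choices made here, not the scoring scheme of the cited article.

```python
def phred_to_error(q):
    """Convert a Phred quality value Q to an error probability p = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def expected_score(called, ref, q, match=1.0, mismatch=-1.0):
    """Expected substitution score of a called base against a reference base.

    Assumption (illustrative only): if the call is wrong, the true base is
    one of the other three nucleotides with equal probability.
    """
    e = phred_to_error(q)
    if called == ref:
        # Call is right with probability 1 - e; any error yields a mismatch.
        return (1 - e) * match + e * mismatch
    # Call differs from ref: with probability e/3 the true base equals ref.
    return (1 - e) * mismatch + e * (match / 3 + 2 * mismatch / 3)


if __name__ == "__main__":
    for q in (10, 20, 40):
        print(q, expected_score("A", "A", q), expected_score("A", "C", q))
```

Under these assumptions a low-quality match contributes less than a high-quality one, and a low-quality mismatch is penalised slightly less, which is the qualitative behaviour the abstract describes.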

2.
3.
4.
Conservation of sequence in recombination signal sequence spacers.   Cited by: 16 (self-citations: 2; citations by others: 14)
D A Ramsden  K Baetz    G E Wu 《Nucleic acids research》1994,22(10):1785-1796

5.
In this paper, we used correlation analysis to quantify the correlations between the hydrophobicity sequence and accessibility sequence of 26 alpha-helix bundle membrane proteins and 119 transmembrane helices. The statistical significance of these correlations was also assessed. A slight positive correlation was found in the alpha-helix bundle membrane proteins due to the contribution of extra-membranous domains. No correlation was found in the transmembrane domains.

6.
Plant Molecular Biology -

7.
Li L  Ly M  Linhardt RJ 《Molecular bioSystems》2012,8(6):1613-1625
Proteoglycans (PGs) are among the most structurally complex biomacromolecules in nature. They are present in all animal cells and frequently exert their critical biological functions through interactions with protein ligands and receptors. PGs consist of a core protein to which one or multiple, heterogeneous, and polydisperse glycosaminoglycan (GAG) chains are attached. Proteins, including the protein core of PGs, are now routinely sequenced either directly using proteomics or indirectly using molecular biology through their encoding DNA. The sequencing of the GAG component of PGs poses a considerably more difficult challenge because of the relatively underdeveloped state of glycomics and because the control of their biosynthesis in the endoplasmic reticulum and the Golgi is poorly understood and not believed to be template driven. Recently, the GAG chain of the simplest PG has been suggested to have a defined sequence based on its top-down Fourier transform mass spectral sequencing. This review examines the advances made over the past decade in the sequencing of GAG chains and the challenges the field faces in sequencing complex PGs having critical biological functions in developmental biology and pathogenesis.

8.
Publicly available cDNA sequence data of Citrullus lanatus were searched for simple sequence repeats (SSRs). Nineteen microsatellites were identified and primer pairs were designed to amplify those loci. Primers were evaluated for their ability to detect polymorphisms within a set of several watermelon varieties and local landraces, C. colocynthis, and interspecific hybrids. Eighteen polymorphic SSR loci were identified. These polymorphic loci can be used for varietal identification and other uses.
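The abstract in item 8 does not say which tool or thresholds were used for the SSR search. A minimal sketch of one common approach, scanning for perfect tandem repeats with a regular-expression backreference, might look like the following; the function name `find_ssrs` and the minimum repeat counts are hypothetical choices for illustration.

```python
import re


def _is_reducible(motif):
    """True if motif is itself a repeat of a shorter unit (e.g. 'ATAT', 'AAA')."""
    n = len(motif)
    return any(
        n % k == 0 and motif == motif[:k] * (n // k)
        for k in range(1, n)
    )


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    min_repeats maps motif length to the minimum number of tandem copies;
    the defaults below are illustrative, not taken from the cited study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5, 4: 4}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if _is_reducible(motif):
                continue  # skip homopolymers and motifs like ATAT
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return sorted(hits)


if __name__ == "__main__":
    demo = "GGC" + "AT" * 7 + "CCG" + "GAA" * 5 + "C"
    print(find_ssrs(demo))
```

Primers would then be designed in the sequence flanking each reported start position; that step is outside the scope of this sketch.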

9.
Speed sequence     

10.
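Interactions between spliced introns and the corresponding mRNA sequences play a very important role in the regulation of gene expression. Based on the ribonucleoprotein gene sequences of 27 species, the Smith-Waterman local alignment method was used to obtain the best-matching segments between exon junction sequences and the corresponding intron sequences, and the distribution of matching frequency along the exon junction sequences and the sequence features of the matched segments were analyzed. In some lower eukaryotes, the matching frequency in the EJC-binding region was found to be markedly lower than in other regions, and in all species the sequence composition of the EJC-binding region showed relatively low structural order. The distributions of mean length and pairing rate of the best-matching segments are the same as the binding characteristics of siRNA and miRNA. It is inferred that the EJC and introns both compete and cooperate when binding to exon sequences, and that the central part of the intron sequence plays an important role in the regulation of gene expression.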
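For readers unfamiliar with the Smith-Waterman local alignment named in item 10, a minimal sketch with a linear gap penalty follows. The scoring parameters are hypothetical defaults, and the sketch scores identical bases as matches; a study of intron-exon pairing would presumably score complementary bases instead (for example by reverse-complementing one sequence first), which is an assumption and is not done here.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment.

    Scores are illustrative defaults, not those of the cited study.
    """
    rows, cols = len(a) + 1, len(b) + 1
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    # Fill the score matrix; cells are floored at 0 so alignments are local.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(0, H[i - 1][j - 1] + sub,
                          H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell until a zero-score cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        sub = match if a[i - 1] == b[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(smith_waterman("TTTACGTACGGG", "CCACGTACAA"))
```

The returned segment pair corresponds to what the abstract calls the best-matching segment; its length and fraction of matched positions can be read directly from the two aligned strings.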

11.
12.
13.
We investigated protein sequence/structure correlation by constructing a space of protein sequences, based on methods developed previously for constructing a space of protein structures. The space is constructed by using a representation of the amino acids as vectors of 10 property factors that encode almost all of their physical properties. Each sequence is represented by a distribution of overlapping sequence fragments. A distance between any two sequences can be calculated. By attaching a weight to each factor, intersequence distances can be varied. We optimize the correlation between corresponding distances in the sequence and structure spaces. The optimal correlation between the sequence and structure spaces is significantly better than that which results from correlating randomly generated sequences, having the overall composition of the database, with the structure space. However, sets of randomly generated sequences, each of which approximates the composition of the real sequence it replaces, produce correlations with the structure space that are as good as that observed for the actual protein sequences. A connection is proposed with previous studies of the protein folding code. It is shown that the most important property factors for the correlation of the sequence and structure spaces are related to helix/bend preference, side chain bulk, and beta-structure preference.

14.
N-terminal sequence of actin   Cited by: 2 (self-citations: 0; citations by others: 2)
R E Alving  K Laki 《Biochemistry》1966,5(8):2597-2601

15.
《California medicine》1969,110(3):257-258

16.
17.
18.
S-antigen (S-Ag) is a soluble, highly antigenic protein, the administration of which induces autoimmune uveitis. This protein is found in the retina and pineal. Retinal S-Ag from three species has been sequenced. In this study rat pineal S-Ag was sequenced. Clones were isolated from a rat pineal lambda gt11 cDNA library by probing with a 300 bp fragment of mouse retinal S-Ag cDNA containing the 5'-coding region. The largest clone isolated (RPS-118; 1364 bp) contained the entire coding sequence. Comparison of the rat pineal and mouse retinal S-Ag nucleotide sequences indicated a high homology (95%). The deduced amino acid sequence was found to contain 403 residues (≅ 44 992 Da). Comparison of the rat pineal and mouse retinal S-Ag amino acid sequences also revealed high homology (97%). The similarity of both the nucleotide and amino acid sequences of rat pineal and mouse retinal S-Ag indicates that expression of the S-Ag gene in both tissues is similar. Further analysis of the rat pineal S-Ag sequence indicated that it contained essentially the same major uveitopathogenic region of S-Ag present in bovine retina; minor uveitopathogenic sites were somewhat different. As is true of retinal S-Ag, rat pineal S-Ag contains the same consensus phosphoryl-binding site present in many GTP/GDP-binding proteins and a homologous sequence found in the C-terminus of alpha-transducin. These sequences may play a role in the action of pineal S-Ag in transmembrane signal transduction.

19.
A 1.1-kb human DNA fragment (ARSH1) capable of functioning as a putative origin of replication in yeast cells has been characterized both by in situ hybridization to human metaphase chromosomes and by DNA sequencing. Our hybridization studies show a preferential localization of ARSH1 in chromosome regions 1p34-36 and 2q34-37. DNA sequence analysis indicates that in addition to the consensus sequence required for ARS function in yeast cells, nuclear matrix-associated DNA motifs are also present in the 1.1-kb fragment. These results suggest that ARSH1 sequences may serve as points of anchorage to the nuclear matrix for chromosomes 1 and 2.

20.
A bovine oviductal fluid catalase (OFC) which preferentially binds to the acrosome surface of some mammalian spermatozoa has recently been purified. The objectives of this study were to clone the OFC, obtain the full-length cDNA and protein sequence, and determine which characteristics of the protein are associated with the binding of the enzyme to the sperm surface. Northern blot analysis revealed low levels of catalase mRNA in bovine oviducts and uterus compared to the liver and kidney. Screening of a cow oviduct cDNA library yielded a full-length cDNA of 2282 bp, with an open reading frame of 1581 bp coding for a deduced protein of 526 amino acids (59 789 Da). The deduced protein contained four potential N-glycosylation sites and many potential O-glycosylation sites. The OFC protein exhibited high identity with catalase from other bovine tissues, as well as with catalases from human fibroblast and kidney, and with rat liver catalase. The amino acid sequence of OFC was about 99% homologous to that of bovine liver catalase. However, OFC possesses an extended carboxyl terminus of 20 amino acids that is not present in liver catalase. This result is supported by the lower mobility of OFC compared to liver catalase when both proteins are subjected to SDS-PAGE. Mol. Reprod. Dev. 51:265–273, 1998. © 1998 Wiley-Liss, Inc.
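The figures quoted in item 20 are internally consistent if the 1581 bp open reading frame is counted with its stop codon: 1581 / 3 = 527 codons, one of which terminates translation, leaving 526 amino acids. The small check below makes that arithmetic explicit; the function name and the assumption that the reported ORF length includes the stop codon are ours, not stated in the abstract.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp base pairs.

    Assumes the ORF length is a whole number of codons; if includes_stop
    is True, the final codon is a stop codon and encodes no residue.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    print(orf_protein_length(1581))  # ORF length reported in the abstract
```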
