首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Short protein repeats, frequently with a length between 20 and 40 residues, represent a significant fraction of known proteins. Many repeats appear to possess high amino acid substitution rates and thus recognition of repeat homologues is highly problematic. Even if the presence of a certain repeat family is known, the exact locations and the number of repetitive units often cannot be determined using current methods. We have devised an iterative algorithm based on optimal and sub-optimal score distributions from profile analysis that estimates the significance of all repeats that are detected in a single sequence. This procedure allows the identification of homologues at alignment scores lower than the highest optimal alignment score for non-homologous sequences. The method has been used to investigate the occurrence of eleven families of repeats in Saccharomyces cerevisiae, Caenorhabditis elegans and Homo sapiens accounting for 1055, 2205 and 2320 repeats, respectively. For these examples, the method is both more sensitive and more selective than conventional homology search procedures. The method allowed the detection in the SwissProt database of more than 2000 previously unrecognised repeats belonging to the 11 families. In addition, the method was used to merge several repeat families that previously were supposed to be distinct, indicating common phylogenetic origins for these families.  相似文献   

2.
3.
Richters O  Peixoto TP 《PloS one》2011,6(4):e18384
Non-centralized recommendation-based decision making is a central feature of several social and technological processes, such as market dynamics, peer-to-peer file-sharing and the web of trust of digital certification. We investigate the properties of trust propagation on networks, based on a simple metric of trust transitivity. We investigate analytically the percolation properties of trust transitivity in random networks with arbitrary in/out-degree distributions, and compare with numerical realizations. We find that the existence of a non-zero fraction of absolute trust (i.e. entirely confident trust) is a requirement for the viability of global trust propagation in large systems: The average pair-wise trust is marked by a discontinuous transition at a specific fraction of absolute trust, below which it vanishes. Furthermore, we perform an extensive analysis of the Pretty Good Privacy (PGP) web of trust, in view of the concepts introduced. We compare different scenarios of trust distribution: community- and authority-centered. We find that these scenarios lead to sharply different patterns of trust propagation, due to the segregation of authority hubs and densely-connected communities. While the authority-centered scenario is more efficient, and leads to higher average trust values, it favours weakly-connected "fringe" nodes, which are directly trusted by authorities. The community-centered scheme, on the other hand, favours nodes with intermediate in/out-degrees, in detriment of the authorities and its "fringe" peers.  相似文献   

4.
Violations of transitivity under fitness maximization   总被引:1,自引:0,他引:1       下载免费PDF全文
We present a novel demonstration that violations of transitive choice can result from decision strategies that maximize fitness. Our results depend on how the available options, including options not currently chosen, influence a decision-maker's expectations about the future. In particular, they depend on how the presence of an option may act as an insurance against a run of bad luck in the future.  相似文献   

5.
Although molecular markers and DNA sequence data are now available for many crop species, our ability to identify genetic variation associated with functional or adaptive diversity is still limited. In this study, our aim was to quantify and characterize diversity in a panel of cultivated and wild sorghums (Sorghum bicolor), establish genetic relationships, and, simultaneously, identify selection signals that might be associated with sorghum domestication. We assayed 98 simple sequence repeat (SSR) loci distributed throughout the genome in a panel of 104 accessions comprising 73 landraces (i.e., cultivated lines) and 31 wild sorghums. Evaluation of SSR polymorphisms indicated that landraces retained 86% of the diversity observed in the wild sorghums. The landraces and wilds were moderately differentiated (F st=0.13), but there was little evidence of population differentiation among racial groups of cultivated sorghums (F st=0.06). Neighbor-joining analysis showed that wild sorghums generally formed a distinct group, and about half the landraces tended to cluster by race. Overall, bootstrap support was low, indicating a history of gene flow among the various cultivated types or recent common ancestry. Statistical methods (Ewens-Watterson test for allele excess, lnRH, and F st) for identifying genomic regions with patterns of variation consistent with selection gave significant results for 11 loci (approx. 15% of the SSRs used in the final analysis). Interestingly, seven of these loci mapped in or near genomic regions associated with domestication-related QTLs (i.e., shattering, seed weight, and rhizomatousness). We anticipate that such population genetics-based statistical approaches will be useful for re-evaluating extant SSR data for mining interesting genomic regions from germplasm collections.Electronic Supplementary Material Supplementary material is available for this article at  相似文献   

6.
The 57-bp tandem repeats located in the Mycobacterium tuberculosis leuA gene code for the alpha-isopropylmalate synthase (alpha-IPMS). It is unique to this pathogen. It was previously demonstrated that the leuA-coding sequence Rv3710, containing the tandem repeats, can be translated to an active alpha-IPMS. The objective of the present study was to investigate the significance and effect of the two 57-bp tandem repeats upon gene expression and the general properties of alpha-IPMS. The putative M. tuberculosis H37Rv leuA gene with and without the tandem repeats was cloned by PCR and expressed in an Escherichia coli host. The enzyme product was studied for general properties, comparing that from a native leuA gene containing two repeats and that from the 57-bp tandem repeats deletion mutant. Upon deletion of the two 57-bp tandem repeats, the expression level of leuA from M. tuberculosis H37Rv was comparable with that of the native form. The general properties of the two types of enzymes were similar. They were both functional with the same range of optimal temperature and optimal pH for activity and with similar enzyme stability. Deletion of the repeats had no detectable effect on leuA expression level or the general properties of the enzyme product.  相似文献   

7.
We have explored the possible role of SSR density in genome to generate biological information. In our study, we have checked the SSR (simple sequence repeats) status in virulent and non virulent genes of enteric bacteria to see whether the SSRs distribution contributes to virulence. The genome, plasmid and virulent genes sequences in fasta format were downloaded from NCBI GenBank and VFDB. The sequences were subjected to SSR analysis using software tool ssr.exe. The resulting data was pasted in excel sheet and further analyzed for percentage of each type of SSR. Higher nucleotide repeats have been observed in our study. Overall high density of SSRs can enhance antigenic variance of the pathogen population in a strategy that counteracts the host immune response. Frequency of A and T repeats is higher in the chromosome, plasmid and the virulence genes. However, in dinucleotide repeats the frequencies of GC/CG repeats are higher in genome, whereas plasmid has more of AT/TA repeats. Genome has trinucleotide repeats having predominantly G and C whereas plasmid has trinucleotide repeats having predominantly A and T. The repeat number obtained and percentage of repeats is higher in virulence genes as compared to other gene families. Due to the presence of this large number of SSRs, the organism has an enormous potential for generating this genomic and phenotypic diversity.  相似文献   

8.
MOTIVATION: Tandem peptide repeats play a key role in self-assembly and aggregation processes. A notable example is the occurrence of tandem peptide repeats in prionic proteins and their role in the aggregation process that leads to the formation of the prion. One of the structural characteristics that is evident from the comparison of mammalian and yeast prion proteins is the presence of aromatic residues in their tandem repeats. These residues are accompanied by glycine residues before and/or after the aromatic amino acid. Such aromatic-glycine conjugates are also present in the tandem repeats of the large family of the bacterial ice nucleation proteins. To study the significance of such aromatic-glycine occurrences, a global analysis of all the aromatic octapeptide repeats in the Swiss-Prot and TrEMBL databases was conducted. The search pattern was formulated to compare the number of conjugates of each of the 20 natural amino acids before or after the different aromatic residues. RESULTS: The presence of aromatic-glycine conjugates appears to be significantly higher than aromatic conjugates to any other amino acid. Furthermore, all the six various combination of glycine occurrences before or after the three aromatic residues are present. No such pattern was observed for any other amino acid. The significance of the findings is being discussed in the context of the physicochemical properties of aromatic-glycine conjugates and its possible role in the facilitation of aggregates formation.  相似文献   

9.
The genetic structure of the population of Alia (Sicily, Italy) was analyzed using 15 short tandem repeats: TPOX, D2S1338, D3S1358, FIBRA, D5S818, CSF1PO, D7S820, D8S1179, TH01, VWA, D13S317, D16S539, D18S51, D19S433, and D21S11. Two of these markers, D2S1338 and D19S433, have never before been used in research on population genetics and only recently have they been put to use in forensic medicine. Results of the analysis underline the genetic isolation of the Alia population and show it to be a recent bottleneck as a consequence of a cholera epidemic in 1837. While comparing the Alia population with other populations from Sicily, a genetic heterogeneity within Sicily was uncovered, thus confirming previous results obtained from the analysis of classical markers. This heterogeneity underlines the existence of genetic boundaries within the island. Comparisons with other Italian, Mediterranean, and European populations highlight the differentiation of the Sicilian population, reflecting the presence of a genetic boundary that separates Sicily from northern and central Italy and from the western Mediterranean basin.  相似文献   

10.
Staphylococci are Gram-positive bacteria which play an important role in infectious disease and are major causes of communityacquired and hospital-acquired infections. Strains of Staphylococcus aureus are reported as genomically and phenotypically highly heterogeneous; hence in-silico based comparison of genomic data on simple sequence repeats may provide valuable information for understanding the pathogenicity and control measures. This study determined the distribution of a specific group of Simple Sequence Repeats (SSRs), in genome sequences of six Staphylococcus strains (Staphylococcus aureus COL, S.aureus MRSA252, S.aureus MSSA476, S.aureus Mu50, S.aureus MW2, S.aureus N315) and plasmid sequences of four Staphylococcus strains (Staphylococcus aureus COL pT181, Staphylococcus aureus MSSA pSAS, Staphylococcus aureus VRSAp, Staphylococcus aureus, Staphylococcus aureus pN315 DNA) downloaded from the GenBank database for identifying abundance, distribution and composition of SSRs. The data obtained in the present study shows that (i) a large number of tandem repeats are distributed throughout the genome and plasmid sequences. (ii) Number of mononucleotide SSRs decreased rapidly with increase in size of repeat unit. (iii) Total frequency of SSRs in plasmid regions is less than genomic regions. (iv) In all investigated strains, ratios of AT/TA repeats are dominating over GC/CG repeats in genomics as well as plasmid sequences, and (v) Dinucleotide combination of AT is dominated in all the six Staphylococcus genome sequences.  相似文献   

11.
By conventional genetic methods, including pulse-field gel electrophoresis and multilocus sequence typing, most pathogenic, cholera toxin-positive O1 and O139 isolates of Vibrio cholerae cannot be distinguished. We evaluated relationships among 173 V. cholerae isolates collected between 1992 and 2007 from different geographic areas in India by analyzing five variable number of tandem repeat (VNTR) loci. Each VNTR locus was highly variable, with between 5 and 19 alleles. eburst analysis revealed four large groups of genetically related isolates. Two groups contained genotypes of isolates with the O139 serogroup (which emerged for the first time in epidemic form in 1992), with the other two groups containing O1 strains. In subsequent analysis, it was possible to track the spread of specific genotypes across time and space. Our data highlight the utility of the methodology as an epidemiologic tool for assessing spread of isolates in both epidemic and endemic settings.  相似文献   

12.
13.
Repetitive DNA sequences play paramount biological roles, such as gene variation and regulatory functions on gene expressions. Until now, detection of various kinds of DNA repeats accurately is still an open problem. In this article, we propose a new method and a visualization tool for detecting DNA repeats in a 2D plane of location and frequency by using optimized moving window spectral analysis. The spectrogram can display the general distribution of repetitive sequences while showing the repeat period, length and location without any prior knowledge. Experimental results demonstrate that our method is accurate and robust even under the condition of excessive mutating and interleaving. AVAILABILITY: Available on http://www.hy8.com/~tec/sw01/omwsa01.zip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

14.
Gene(s) for the autosomal dominant endocrine cancer syndromes, multiple endocrine neoplasia type 2A (MEN2A), multiple endocrine neoplasia type 2B (MEN2B), and familial medullary thyroid carcinoma (MTC1) all map to the pericentromeric region of chromosome 10. Predictive testing for the inheritance of mutant alleles in individuals at risk for these disorders has been limited by the availability of highly informative and closely linked flanking markers. We describe the development of eight new markers, including two PCR-based dinucleotide repeat polymorphisms and six RFLPs that flank the disease loci. One of the dinucleotide repeat markers (sJRH-1) derives from the RBP3 locus on 10q11.2 and has a PIC of .88. The other dinucleotide repeat (sTCL-1) defines a new locus, D10S176, that maps by in situ hybridization to 10p11.2 and has a PIC of .68. We have constructed a new genetic linkage map of the pericentromeric region of chromosome 10, on the basis of 13 polymorphisms at six loci, which places the MEN2A locus between the dinucleotide repeat markers, with odds of 5,750:1 over the next most likely position. Using this set of markers, predictive genetic testing of 130 at-risk individuals from six families segregating MEN2A revealed that 95% were jointly informative with flanking markers, representing a significant improvement in genetic testing capabilities.  相似文献   

15.
DNA from ground squirrels of the Citellus genus (Rodentia, Sciuridae) were analysed by centrifugation in the presence of CsCl followed by digestion by restriction endonucleases. Digestion of DNA of two species C. undulatus and C. fulvus by 10 of the 16 restriction endonucleases used led to formation of electrophoretically discrete fragments that are multiple to 330 b.p. in length which points out the tandem organization of repetitive sequences similar to the satellite DNA of many mammal species. However, upon centrifugation we failed to reveal a satellite band in these species; hence the tandem repeats refer to the class of cryptic satellites in the ground squirrels and do not differ in base composition from the remaining part of DNA. The main fraction of the genome was revealed in the form of discrete fragments by cleavage with HindIII and AluI. Both of these restriction endonucleases were used for comparative analysis of DNA of 12 Citellus species. It has been shown that DNA of all species can be digested by HindIII and yields a series of fragments that are multiple to 330-30 b.p. in length and the total content of which varies from species to species within 4-22%. The fraction of the tandem repeats does not correlate with the systematic position of species nor with the amount of heterochromatin in the chromosomes. AluI cuts the DNA of 11 species yielding 110 and 220 b.p. fragments compared to only 60 and 280 b.p. in the DNA of C. dauricus. Under HindIII digestion we can also reveal the tandem repeats in marmot, which is phylogenetically close to the Citellus of the Marmota genus, but they have another periodicity--180 b.p. We propose that the age of ground squirrels repeats is 2-3 million years and they are significantly younger than the marmot repeats.  相似文献   

16.
17.
Most models of animal foraging and consumer choice assume that individuals make choices based on the absolute value of items and are therefore ‘economically rational’. However, frequent violations of rationality by animals, including humans, suggest that animals use comparative valuation rules. Are comparative valuation strategies a consequence of the way brains process information, or are they an intrinsic feature of biological decision-making? Here, we examine the principles of rationality in an organism with radically different information-processing mechanisms: the brainless, unicellular, slime mould Physarum polycephalum. We offered P. polycephalum amoebas a choice between food options that varied in food quality and light exposure (P. polycephalum is photophobic). The use of an absolute valuation rule will lead to two properties: transitivity and independence of irrelevant alternatives (IIA). Transitivity is satisfied if preferences have a consistent, linear ordering, while IIA states that a decision maker''s preference for an item should not change if the choice set is expanded. A violation of either of these principles suggests the use of comparative rather than absolute valuation rules. Physarum polycephalum satisfied transitivity by having linear preference rankings. However, P. polycephalum''s preference for a focal alternative increased when a third, inferior quality option was added to the choice set, thus violating IIA and suggesting the use of a comparative valuation process. The discovery of comparative valuation rules in a unicellular organism suggests that comparative valuation rules are ubiquitous, if not universal, among biological decision makers.  相似文献   

18.
The aim of this study was to develop a cost-effective genotyping method using high-quality DNA for human identification. A total of 21 short tandem repeats (STRs) and amelogenin were selected, and fluorescent fragments at 22 loci were simultaneously amplified in a single-tube reaction using locus-specific primers with 24-base universal tails and four fluorescent universal primers. Several nucleotide substitutions in universal tails and fluorescent universal primers enabled the detection of specific fluorescent fragments from the 22 loci. Multiplex polymerase chain reaction (PCR) produced intense FAM-, VIC-, NED-, and PET-labeled fragments ranging from 90 to 400 bp, and these fragments were discriminated using standard capillary electrophoretic analysis. The selected 22 loci were also analyzed using two commercial kits (the AmpFLSTR Identifiler Kit and the PowerPlex ESX 17 System), and results for two loci (D19S433 and D16S539) were discordant between these kits due to mutations at the primer binding sites. All genotypes from the 100 samples were determined using 2.5 ng of DNA by our method, and the expected alleles were completely recovered. Multiplex 22-locus genotyping using four fluorescent universal primers effectively reduces the costs to less than 20% of genotyping using commercial kits, and our method would be useful to detect silent alleles from commercial kit analysis.  相似文献   

19.
20.
We have evaluated 271 accessions corresponding to 118 European cultivars, 96 from Spain, 16 from Italy, four from France and two from Portugal with the following objectives: (1) to provide a European database based on reference simple sequence repeats (SSRs) and (2) to define a core collection. A set of 24 highly polymorphic SSRs were used for the genetic analysis. Two main clusters were identified using a model-based Bayesian procedure, which correspond to Spanish and Italian cultivar clusters, with the latter showing a higher genetic diversity. An additional genetic substructure was observed among five different groups of cultivars. A core collection with a minimum of 37 cultivars was selected. We provided a database including 132 European accessions with unique genotypes evaluated with 24 SSRs as a reference for distinction, registering and traceability. Finally, we found that a core collection based on 14% of the total accessions conserves all allelic diversity.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号