共查询到20条相似文献,搜索用时 296 毫秒
1.
Background
Tandem repeat variation in protein-coding regions will alter protein length and may introduce frameshifts. Tandem repeat variants are associated with variation in pathogenicity in bacteria and with human disease. We characterized tandem repeat polymorphism in human proteins, using the UniGene database, and tested whether these were associated with host defense roles. 相似文献2.
Background
Polymorphic tandem repeat typing is a new generic technology which has been proved to be very efficient for bacterial pathogens such as B. anthracis, M. tuberculosis, P. aeruginosa, L. pneumophila, Y. pestis. The previously developed tandem repeats database takes advantage of the release of genome sequence data for a growing number of bacteria to facilitate the identification of tandem repeats. The development of an assay then requires the evaluation of tandem repeat polymorphism on well-selected sets of isolates. In the case of major human pathogens, such as S. aureus, more than one strain is being sequenced, so that tandem repeats most likely to be polymorphic can now be selected in silico based on genome sequence comparison.Results
In addition to the previously described general Tandem Repeats Database, we have developed a tool to automatically identify tandem repeats of a different length in the genome sequence of two (or more) closely related bacterial strains. Genome comparisons are pre-computed. The results of the comparisons are parsed in a database, which can be conveniently queried over the internet according to criteria of practical value, including repeat unit length, predicted size difference, etc. Comparisons are available for 16 bacterial species, and the orthopox viruses, including the variola virus and three of its close neighbors.Conclusions
We are presenting an internet-based resource to help develop and perform tandem repeats based bacterial strain typing. The tools accessible at http://minisatellites.u-psud.fr now comprise four parts. The Tandem Repeats Database enables the identification of tandem repeats across entire genomes. The Strain Comparison Page identifies tandem repeats differing between different genome sequences from the same species. The "Blast in the Tandem Repeats Database" facilitates the search for a known tandem repeat and the prediction of amplification product sizes. The "Bacterial Genotyping Page" is a service for strain identification at the subspecies level.3.
Background
Amino acid tandem repeats are found in nearly one-fifth of human proteins. Abnormal expansion of these regions is associated with several human disorders. To gain further insight into the mutational mechanisms that operate in this type of sequence, we have analyzed a large number of mutation variants derived from human expressed sequence tags (ESTs).Results
We identified 137 polymorphic variants in 115 different amino acid tandem repeats. Of these, 77 contained amino acid substitutions and 60 contained gaps (expansions or contractions of the repeat unit). The analysis showed that at least about 21% of the repeats might be polymorphic in humans. We compared the mutations found in different types of amino acid repeats and in adjacent regions. Overall, repeats showed a five-fold increase in the number of gap mutations compared to adjacent regions, reflecting the action of slippage within the repetitive structures. Gap and substitution mutations were very differently distributed between different amino acid repeat types. Among repeats containing gap variants we identified several disease and candidate disease genes.Conclusion
This is the first report at a genome-wide scale of the types of mutations occurring in the amino acid repeat component of the human proteome. We show that the mutational dynamics of different amino acid repeat types are very diverse. We provide a list of loci with highly variable repeat structures, some of which may be potentially involved in disease. 相似文献4.
Lucía Albornos Ignacio Martín Rebeca Iglesias Teresa Jiménez Emilia Labrador Berta Dopico 《BMC plant biology》2012,12(1):1-21
Background
Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats.Results
ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development.Conclusions
We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found. 相似文献5.
6.
Perseus I Missirlis Carri-Lyn R Mead Stefanie L Butland BF Francis Ouellette Rebecca S Devon Blair R Leavitt Robert A Holt 《BMC bioinformatics》2005,6(1):145
Background
To date, 35 human diseases, some of which also exhibit anticipation, have been associated with unstable repeats. Anticipation has been reported in a number of diseases in which repeat expansion may have a role in etiology. Despite the growing importance of unstable repeats in disease, currently no resource exists for the prioritization of repeats. Here we present Satellog, a database that catalogs all pure 1–16 repeat unit satellite repeats in the human genome along with supplementary data. Satellog analyzes each pure repeat in UniGene clusters for evidence of repeat polymorphism. 相似文献7.
Background
Birds have smaller average genome sizes than other tetrapod classes, and it has been proposed that a relatively low frequency of repeating DNA is one factor in reduction of avian genome sizes.Results
DNA repeat arrays in the sequenced portion of the chicken (Gallus gallus) autosomes were quantified and compared with those in human autosomes. In the chicken 10.3% of the genome was occupied by DNA repeats, in contrast to 44.9% in human. In the chicken, the percentage of a chromosome occupied by repeats was positively correlated with chromosome length, but even the largest chicken chromosomes had repeat densities much lower than those in human, indicating that avoidance of repeats in the chicken is not confined to minichromosomes. When 294 simple sequence repeat types shared between chicken and human genomes were compared, mean repeat array length and maximum repeat array length were significantly lower in the chicken than in human.Conclusions
The fact that the chicken simple sequence repeat arrays were consistently smaller than arrays of the same type in human is evidence that the reduction in repeat array length in the chicken has involved numerous independent evolutionary events. This implies that reduction of DNA repeats in birds is the result of adaptive evolution. Reduction of DNA repeats on minichromosomes may be an adaptation to permit chiasma formation and alignment of small chromosomes. However, the fact that repeat array lengths are consistently reduced on the largest chicken chromosomes supports the hypothesis that other selective factors are at work, presumably related to the reduction of cell size and consequent advantages for the energetic demands of flight. 相似文献8.
Alena Zablotskaya Hilde Van Esch Kevin J. Verstrepen Guy Froyen Joris R. Vermeesch 《BMC medical genomics》2018,11(1):123
Background
The etiology of more than half of all patients with X-linked intellectual disability remains elusive, despite array-based comparative genomic hybridization, whole exome or genome sequencing. Since short read massive parallel sequencing approaches do not allow the detection of larger tandem repeat expansions, we hypothesized that such expansions could be a hidden cause of X-linked intellectual disability.Methods
We selectively captured over 1800 tandem repeats on the X chromosome and characterized them by long read single molecule sequencing in 3 families with idiopathic X-linked intellectual disability.Results
In male DNA samples, full tandem repeat length sequences were obtained for 88–93% of the targets and up to 99.6% of the repeats with a moderate guanine-cytosine content. Read length and analysis pipeline allow to detect cases of >?900?bp tandem repeat expansion. In one family, one repeat expansion co-occurs with down-regulation of the neighboring MIR222 gene. This gene has previously been implicated in intellectual disability and is apparently linked to FMR1 and NEFH overexpression associated with neurological disorders.Conclusions
This study demonstrates the power of single molecule sequencing to measure tandem repeat lengths and detect expansions, and suggests that tandem repeat mutations may be a hidden cause of X-linked intellectual disability.9.
Bird CP Stranger BE Liu M Thomas DJ Ingle CE Beazley C Miller W Hurles ME Dermitzakis ET 《Genome biology》2007,8(6):R118-12
Background
Gene regulation is considered one of the driving forces of evolution. Although protein-coding DNA sequences and RNA genes have been subject to recent evolutionary events in the human lineage, it has been hypothesized that the large phenotypic divergence between humans and chimpanzees has been driven mainly by changes in gene regulation rather than altered protein-coding gene sequences. Comparative analysis of vertebrate genomes has revealed an abundance of evolutionarily conserved but noncoding sequences. These conserved noncoding (CNC) sequences may well harbor critical regulatory variants that have driven recent human evolution.Results
Here we identify 1,356 CNC sequences that appear to have undergone dramatic human-specific changes in selective pressures, at least 15% of which have substitution rates significantly above that expected under neutrality. The 1,356 'accelerated CNC' (ANC) sequences are enriched in recent segmental duplications, suggesting a recent change in selective constraint following duplication. In addition, single nucleotide polymorphisms within ANC sequences have a significant excess of high frequency derived alleles and high F ST values relative to controls, indicating that acceleration and positive selection are recent in human populations. Finally, a significant number of single nucleotide polymorphisms within ANC sequences are associated with changes in gene expression. The probability of variation in an ANC sequence being associated with a gene expression phenotype is fivefold higher than variation in a control CNC sequence.Conclusion
Our analysis suggests that ANC sequences have until very recently played a role in human evolution, potentially through lineage-specific changes in gene regulation. 相似文献10.
A high frequency of length polymorphisms in repeated sequences adjacent to Alu sequences 总被引:9,自引:7,他引:9
下载免费PDF全文
![点击此处可从《American journal of human genetics》网站下载免费的PDF全文](/ch/ext_images/free.gif)
We describe a new class of DNA length polymorphism that is due to a variation in the number of tandem repeats associated with Alu sequences (Alu sequence-related polymorphisms). The polymerase chain reaction was used to selectively amplify a (TTA)n repeat identified in the 3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) reductase gene from genomic DNA of 41 human subjects, and the size of the amplified products was determined by gel electrophoresis. Seven alleles were found that differed in size by integrals of three nucleotides. The allele frequencies ranged from 1.5% to 52%, and the overall heterozygosity index was 62%. The polymorphic TTA repeat was located adjacent to a repetitive sequence of the Alu family. A homology search of human genomic DNA sequences for the trinucleotide TTA (at least five members in length) revealed tandem repeats in six other genes. Three of the six (TTA)n repeats were located adjacent to Alu sequences, and two of the three (in the genes for beta-tubulin and interleukin-1 alpha) were found to be polymorphic in length. Tandemly repetitive sequences found in association with Alu sequences may be frequent sites of length polymorphism that can be used as genetic markers for gene mapping or linkage analysis. 相似文献
11.
12.
Zunyan Dai Audrey C Papp Danxin Wang Heather Hampel Wolfgang Sadee 《BMC medical genomics》2008,1(1):1-18
Background
Variants in numerous genes are thought to affect the success or failure of cancer chemotherapy. Interindividual variability can result from genes involved in drug metabolism and transport, drug targets (receptors, enzymes, etc), and proteins relevant to cell survival (e.g., cell cycle, DNA repair, and apoptosis). The purpose of the current study is to establish a flexible, cost-effective, high-throughput genotyping platform for candidate genes involved in chemoresistance and -sensitivity, and treatment outcomes.Methods
We have adopted SNPlex for genotyping 432 single nucleotide polymorphisms (SNPs) in 160 candidate genes implicated in response to anticancer chemotherapy.Results
The genotyping panels were applied to 39 patients with chronic lymphocytic leukemia undergoing flavopiridol chemotherapy, and 90 patients with colorectal cancer. 408 SNPs (94%) produced successful genotyping results. Additional genotyping methods were established for polymorphisms undetectable by SNPlex, including multiplexed SNaPshot for CYP2D6 SNPs, and PCR amplification with fluorescently labeled primers for the UGT1A1 promoter (TA)nTAA repeat polymorphism.Conclusion
This genotyping panel is useful for supporting clinical anticancer drug trials to identify polymorphisms that contribute to interindividual variability in drug response. Availability of population genetic data across multiple studies has the potential to yield genetic biomarkers for optimizing anticancer therapy. 相似文献13.
Evan M Mathenge Gedion O Misiani David O Oulo Lucy W Irungu Paul N Ndegwa Tom A Smith Gerry F Killeen Bart GJ Knols 《Malaria journal》2005,4(1):1-6
Background
IL-1β and IL-1RA levels are higher in the serum of cerebral malaria patients than in patients with mild malaria. Recently, the level of IL1B expression was reported to be influenced by a polymorphism in the promoter of IL1, IL1B -31C>T.Methods
To examine whether polymorphisms in IL1B and IL1RA influence the susceptibility to cerebral malaria, IL1B -31C>T, IL1B 3953C>T, and IL1RA variable number of tandem repeat (VNTR) were analysed in 312 Thai patients with malaria (109 cerebral malaria and 203 mild malaria patients).Results
In this population, IL1B -31C>T and IL1RA VNTRwere detected, while IL1B 3953C>T (i.e., IL1B 3953T) was not observed in the polymorphism screening for 32 patients. Further analyses for IL1B -31C>T and IL1RA VNTR in 110 cerebral malaria and 206 mild malaria patients showed no significant association of these polymorphisms with cerebral malaria.Conclusion
The present results suggest that IL1B -31C>T and IL1RA VNTR polymorphisms do not play a crucial role in susceptibility or resistance to cerebral malaria. 相似文献14.
Background
A fundamental question in comparative genomics concerns the identification of mechanisms that underpin chromosomal change. In an attempt to shed light on the dynamics of mammalian genome evolution, we analyzed the distribution of syntenic blocks, evolutionary breakpoint regions, and evolutionary breakpoints taken from public databases available for seven eutherian species (mouse, rat, cattle, dog, pig, cat, and horse) and the chicken, and examined these for correspondence with human fragile sites and tandem repeats.Results
Our results confirm previous investigations that showed the presence of chromosomal regions in the human genome that have been repeatedly used as illustrated by a high breakpoint accumulation in certain chromosomes and chromosomal bands. We show, however, that there is a striking correspondence between fragile site location, the positions of evolutionary breakpoints, and the distribution of tandem repeats throughout the human genome, which similarly reflect a non-uniform pattern of occurrence.Conclusion
These observations provide further evidence that certain chromosomal regions in the human genome have been repeatedly used in the evolutionary process. As a consequence, the genome is a composite of fragile regions prone to reorganization that have been conserved in different lineages, and genomic tracts that do not exhibit the same levels of evolutionary plasticity. 相似文献15.
A clustering method for repeat analysis in DNA sequences 总被引:1,自引:0,他引:1
Background
A computational system for analysis of the repetitive structure of genomic sequences is described. The method uses suffix trees to organize and search the input sequences; this data structure has been used previously for efficient computation of exact and degenerate repeats.Results
The resulting software tool collects all repeat classes and outputs summary statistics as well as a file containing multiple sequences (multi fasta), that can be used as the target of searches. Its use is demonstrated here on several complete microbial genomes, the entire Arabidopsis thaliana genome, and a large collection of rice bacterial artificial chromosome end sequences.Conclusions
We propose a new clustering method for analysis of the repeat data captured in suffix trees. This method has been incorporated into a system that can find repeats in individual genome sequences or sets of sequences, and that can organize those repeats into classes. It quickly and accurately creates repeat databases from small and large genomes. The associated software (RepeatFinder), should prove helpful in the analysis of repeat structure for both complete and partial genome sequences. 相似文献16.
17.
Natural selection of protein structural and functional properties: a single nucleotide polymorphism perspective
下载免费PDF全文
![点击此处可从《Genome biology》网站下载免费的PDF全文](/ch/ext_images/free.gif)
Background
The rates of molecular evolution for protein-coding genes depend on the stringency of functional or structural constraints. The Ka/Ks ratio has been commonly used as an indicator of selective constraints and is typically calculated from interspecies alignments. Recent accumulation of single nucleotide polymorphism (SNP) data has enabled the derivation of Ka/Ks ratios for polymorphism (SNP A/S ratios).Results
Using data from the dbSNP database, we conducted the first large-scale survey of SNP A/S ratios for different structural and functional properties. We confirmed that the SNP A/S ratio is largely correlated with Ka/Ks for divergence. We observed stronger selective constraints for proteins that have high mRNA expression levels or broad expression patterns, have no paralogs, arose earlier in evolution, have natively disordered regions, are located in cytoplasm and nucleus, or are related to human diseases. On the residue level, we found higher degrees of variation for residues that are exposed to solvent, are in a loop conformation, natively disordered regions or low complexity regions, or are in the signal peptides of secreted proteins. Our analysis also revealed that histones and protein kinases are among the protein families that are under the strongest selective constraints, whereas olfactory and taste receptors are among the most variable groups.Conclusion
Our study suggests that the SNP A/S ratio is a robust measure for selective constraints. The correlations between SNP A/S ratios and other variables provide valuable insights into the natural selection of various structural or functional properties, particularly for human-specific genes and constraints within the human lineage. 相似文献18.
Yuanguang Meng Zhiqiang Wu Xiaoyun Yin Yali Zhao Meixia Chen Yiling Si Jie Yang Xiaobing Fu Weidong Han 《BMC cell biology》2009,10(1):1-13
Background
As a key player in suppression of colon tumorigenesis, Adenomatous Polyposis Coli (APC) has been widely studied to determine its cellular functions. However, inconsistencies of commercially available APC antibodies have limited the exploration of APC function. APC is implicated in spindle formation by direct interactions with tubulin and microtubule-binding protein EB1. APC also interacts with the actin cytoskeleton to regulate cell polarity. Until now, interaction of APC with the third cytoskeletal element, intermediate filaments, has remained unexamined.Results
We generated an APC antibody (APC-M2 pAb) raised against the 15 amino acid repeat region, and verified its reliability in applications including immunoprecipitation, immunoblotting, and immunofluorescence in cultured cells and tissue. Utilizing this APC-M2 pAb, we immunoprecipitated endogenous APC and its binding proteins from colon epithelial cells expressing wild-type APC. Using Liquid Chromatography Tandem Mass Spectrometry (LC-MS/MS), we identified 42 proteins in complex with APC, including β-catenin and intermediate filament (IF) proteins lamin B1 and keratin 81. Association of lamin B1 with APC in cultured cells and human colonic tissue was verified by co-immunoprecipitation and colocalization. APC also colocalized with keratins and remained associated with IF proteins throughout a sequential extraction procedure.Conclusion
We introduce a versatile APC antibody that is useful for cell/tissue immunostaining, immunoblotting and immunoprecipitation. We also present evidence for interactions between APC and IFs, independent of actin filaments and microtubules. Our results suggest that APC associates with all three major components of the cytoskeleton, thus expanding potential roles for APC in the regulation of cytoskeletal integrity. 相似文献19.
Kathrin Reichwald Chris Lauber Indrajit Nanda Jeanette Kirschner Nils Hartmann Susanne Schories Ulrike Gausmann Stefan Taudien Markus B Schilhabel Karol Szafranski Gernot Glöckner Michael Schmid Alessandro Cellerino Manfred Schartl Christoph Englert Matthias Platzer 《Genome biology》2009,10(2):R16-17
Background
The annual fish Nothobranchius furzeri is the vertebrate with the shortest known life span in captivity. Fish of the GRZ strain live only three to four months under optimal laboratory conditions, show explosive growth, early sexual maturation and age-dependent physiological and behavioral decline, and express aging related biomarkers. Treatment with resveratrol and low temperature significantly extends the maximum life span. These features make N. furzeri a promising new vertebrate model for age research.Results
To contribute to establishing N. furzeri as a new model organism, we provide a first insight into its genome and a comparison to medaka, stickleback, tetraodon and zebrafish. The N. furzeri genome contains 19 chromosomes (2n = 38). Its genome of between 1.6 and 1.9 Gb is the largest among the analyzed fish species and has, at 45%, the highest repeat content. Remarkably, tandem repeats comprise 21%, which is 4-12 times more than in the other four fish species. In addition, G+C-rich tandem repeats preferentially localize to centromeric regions. Phylogenetic analysis based on coding sequences identifies medaka as the closest relative. Genotyping of an initial set of 27 markers and multi-locus fingerprinting of one microsatellite provides the first molecular evidence that the GRZ strain is highly inbred.Conclusions
Our work presents a first basis for systematic genomic and genetic analyses aimed at understanding the mechanisms of life span determination in N. furzeri. 相似文献20.
Liljedahl U Lind L Kurland L Berglund L Kahan T Syvänen AC 《BMC cardiovascular disorders》2004,4(1):16-7