共查询到20条相似文献,搜索用时 0 毫秒
1.
《Trends in genetics : TIG》2021,37(10):903-918
2.
M J Sternberg 《Protein engineering》1990,4(1):45-47
The extent of inter-species sequence identity in single-spanning transmembrane regions of integral membrane proteins was evaluated. The sequences of the 32 human transmembrane regions were compared with the respective rodent homologues. The identity between homologous transmembrane regions ranged from 32 to 100%, compared with a mean value of 14% identity between unrelated transmembrane sections. On average the identity between homologous transmembrane regions is slightly higher than for the rest of the chain. These values suggest that, in general, there are structural and/or functional constraints on the transmembrane regions beyond the simple requirement to act as a passive, nonpolar, connecting region across the cell membrane. Although there is limited experimental evidence available, the three transmembrane regions (CD2 antigen, MHC class I and ICAM-1) with particularly low values of inter-species identity (less than 50%) are probably not involved in an interaction with another transmembrane section in the same cell. 相似文献
3.
Ananyo Choudhury Scott Hazelhurst Ayton Meintjes Ovokeraye Achinike-Oduaran Shaun Aron Junaid Gamieldien Mahjoubeh Jalali Sefid Dashti Nicola Mulder Nicki Tiffin Michèle Ramsay 《BMC genomics》2014,15(1)
Background
Population differentiation is the result of demographic and evolutionary forces. Whole genome datasets from the 1000 Genomes Project (October 2012) provide an unbiased view of genetic variation across populations from Europe, Asia, Africa and the Americas. Common population-specific SNPs (MAF > 0.05) reflect a deep history and may have important consequences for health and wellbeing. Their interpretation is contextualised by currently available genome data.Results
The identification of common population-specific (CPS) variants (SNPs and SSV) is influenced by admixture and the sample size under investigation. Nine of the populations in the 1000 Genomes Project (2 African, 2 Asian (including a merged Chinese group) and 5 European) revealed that the African populations (LWK and YRI), followed by the Japanese (JPT) have the highest number of CPS SNPs, in concordance with their histories and given the populations studied. Using two methods, sliding 50-SNP and 5-kb windows, the CPS SNPs showed distinct clustering across large genome segments and little overlap of clusters between populations. iHS enrichment score and the population branch statistic (PBS) analyses suggest that selective sweeps are unlikely to account for the clustering and population specificity. Of interest is the association of clusters close to recombination hotspots. Functional analysis of genes associated with the CPS SNPs revealed over-representation of genes in pathways associated with neuronal development, including axonal guidance signalling and CREB signalling in neurones.Conclusions
Common population-specific SNPs are non-randomly distributed throughout the genome and are significantly associated with recombination hotspots. Since the variant alleles of most CPS SNPs are the derived allele, they likely arose in the specific population after a split from a common ancestor. Their proximity to genes involved in specific pathways, including neuronal development, suggests evolutionary plasticity of selected genomic regions. Contrary to expectation, selective sweeps did not play a large role in the persistence of population-specific variation. This suggests a stochastic process towards population-specific variation which reflects demographic histories and may have some interesting implications for health and susceptibility to disease.Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-437) contains supplementary material, which is available to authorized users. 相似文献4.
Schneeberger RG Zhang K Tatarinova T Troukhan M Kwok SF Drais J Klinger K Orejudos F Macy K Bhakta A Burns J Subramanian G Donson J Flavell R Feldmann KA 《Functional & integrative genomics》2005,5(4):240-253
Mobile insertion elements such as transposons and T-DNA generate useful genetic variation and are important tools for functional genomics studies in plants and animals. The spectrum of mutations obtained in different systems can be highly influenced by target site preferences inherent in the mechanism of DNA integration. We investigated the target site preferences of Agrobacterium T-DNA insertions in the chromosomes of the model plant Arabidopsis thaliana. The relative frequencies of insertions in genic and intergenic regions of the genome were calculated and DNA composition features associated with the insertion site flanking sequences were identified. Insertion frequencies across the genome indicate that T-strand integration is suppressed near centromeres and rDNA loci, progressively increases towards telomeres, and is highly correlated with gene density. At the gene level, T-DNA integration events show a statistically significant preference for insertion in the 5 and 3 flanking regions of protein coding sequences as well as the promoter region of RNA polymerase I transcribed rRNA gene repeats. The increased insertion frequencies in 5 upstream regions compared to coding sequences are positively correlated with gene expression activity and DNA sequence composition. Analysis of the relationship between DNA sequence composition and gene activity further demonstrates that DNA sequences with high CG-skew ratios are consistently correlated with T-DNA insertion site preference and high gene expression. The results demonstrate genomic and gene-specific preferences for T-strand integration and suggest that DNA sequences with a pronounced transition in CG- and AT-skew ratios are preferred targets for T-DNA integration.Electronic Supplementary Material Supplementary material is available for this article at .This revised version was published online in March 2005 with corrections to Dr. Tatarinovas name. 相似文献
5.
6.
Correlation between sequence conservation and the genomic context after gene duplication 总被引:4,自引:0,他引:4 下载免费PDF全文
A key complication in comparative genomics for reliable gene function prediction is the existence of duplicated genes. To study the effect of gene duplication on function prediction, we analyze orthologs between pairs of genomes where in one genome the orthologous gene has duplicated after the speciation of the two genomes (i.e. inparalogs). For these duplicated genes we investigate whether the gene that is most similar on the sequence level is also the gene that has retained the ancestral gene-neighborhood. Although the majority of investigated cases show a consistent pattern between sequence similarity and gene-neighborhood conservation, a substantial fraction, 29–38%, is inconsistent. The observation of inconsistency is not the result of a chance outcome owing to a lack of divergence time between inparalogs, but rather it seems to be the result of a chance outcome caused by very similar rates of sequence evolution of both inparalogs relative to their ortholog. If one-to-one orthologous relationships are required, it is advisable to combine contextual information (i.e. gene-neighborhood in prokaryotes and co-expression in eukaryotes) with protein sequence information to predict the most probable functional equivalent ortholog in the presence of inparalogs. 相似文献
7.
Jacqueline Batley Rebecca Mogg David Edwards Helen O'Sullivan Keith J. Edwards 《Molecular breeding : new strategies in plant improvement》2003,11(2):111-120
Information on genetic relationships among individuals is of importanceto maize breeders for line and hybrid development. Estimates on the geneticsimilarity of breeding materials is best obtained using DNA markers. SingleNucleotide Polymorphisms and small insertions/deletions are both emerging as anew generation of markers, due to their abundance and amenability to fullyautomated genotyping. Application of SNPs, for example in genetic mappingprojects or breeding programs, involves the analysis of a large number ofsamples, and therefore requires rapid, inexpensive, and highly automatedmethodsto genotype the sequence variants. Towards this, we have applied a highthroughput Single Nucleotide Primer Extension assay to assess 23 polymorphicbase variations at five microsatellite loci in 22 inbred maize lines, as wellasin a mapping population of two of the inbred lines. Using a MegaBACE automatedgenotyper, we are able to assay more than 1248 (96 × 13) samples in asingle 75 minute run. The SNuPE method successfully genotyped the base pairvariations of interest in all maize lines. It was also found that primerscontaining polymorphisms themselves could be used to genotype the samples. Thistechnique allows the rapid production of valuable high throughput informationongenetic relationships among maize varieties. We further demonstrate the utilityof this method, using it to successfully map one of the microsatellite loci. 相似文献
8.
Localization of high level of sequence conservation and divergence regions in cotton 总被引:1,自引:0,他引:1
Wang K Zhang W Cao Y Zhang Z Zheng D Zhou B Guo W Zhang T 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2012,124(7):1173-1182
In a previous study, we observed that the variations in chromosome size are due to uneven expansion and contraction by comparing
the structures and sizes of a pair of homoeologous high-resolution cytogenetic maps of chromosomes 12A and 12D in tetraploid
cotton. To reveal the variation at the sequence level, in the present paper, we sequenced two pairs of homoeologous bacterial
artificial chromosomes derived from high- to low-variable genomic regions. Comparisons of their sequence variations confirmed
that the highly conserved and divergent sequences existed in the distal and pericentric regions, e.g., high- and low-variable
genome size regions in these two pairs of cotton homoeologous chromosomes. Sequence analysis also confirmed that the differential
accumulation of Gossypium retrotransposable gypsy-like element (Gorge3) accounted for the main contributions for the size difference between the pericentric regions. By fluorescence in situ hybridization
analysis, we found that Gorge3 has a bias distribution in the AT/A proximal regions and is associated with the heterochromatin along the chromosomes in the entire Gossypium genome. These results indicate that, between AT/A and DT/D genomes, the distal and pericentric regions usually possess high level of sequence conservation and divergence, respectively,
in cotton. 相似文献
9.
Recently, Fryxell and Moon (2005) examined methylation-dependent transition rates (5mC deamination rates), which were calculated by the difference between the CpG transition and GpC transition rates, using 4,437 transition mutations in CpG or GpC dinucleotides. They concluded that 5mC deamination rates were highly dependent on local GC content but not on local sequence lengths over which GC content was calculated or the genomic regions where the mutations occurred. Here, we reexamined these statements by using 292,216 CpG-->TpG/CpA and GpC-->GpT/ApC mutations, an increase of 66 times as much data. Contrary to Fryxell and Moon's conclusions, our analysis indicated that 5mC deamination rates in the human genome were dependent on both the local sequence length and the genomic region. Some explanations for their conclusions were provided. 相似文献
10.
Background
Complete sequencing and annotation of the 96.2 kb Bacillus anthracis plasmid, pXO2, predicted 85 open reading frames (ORFs). Bacillus cereus and Bacillus thuringiensis isolates that ranged in genomic similarity to B. anthracis, as determined by amplified fragment length polymorphism (AFLP) analysis, were examined by PCR for the presence of sequences similar to 47 pXO2 ORFs. 相似文献11.
Aakrosh Ratan Yu Zhang Vanessa M Hayes Stephan C Schuster Webb Miller 《BMC bioinformatics》2010,11(1):130
Background
The most common application for the next-generation sequencing technologies is resequencing, where short reads from the genome of an individual are aligned to a reference genome sequence for the same species. These mappings can then be used to identify genetic differences among individuals in a population, and perhaps ultimately to explain phenotypic variation. Many algorithms capable of aligning short reads to the reference, and determining differences between them have been reported. Much less has been reported on how to use these technologies to determine genetic differences among individuals of a species for which a reference sequence is not available, which drastically limits the number of species that can easily benefit from these new technologies. 相似文献12.
Background
Rhesus macaque (Macaca mulatta) is the most widely used nonhuman primate animal in biomedical research. A global map of genetic variations in rhesus macaque is valuable for both evolutionary and functional studies. 相似文献13.
14.
Woynarowski JM 《Biochimica et biophysica acta》2002,1587(2-3):300-308
Cellular DNA is not a uniform target for DNA-reactive drugs. At the nucleotide level, drugs recognize and bind short motifs of a few base pairs. The location of drug adducts at the genomic level depends on how these short motifs are distributed in larger domains. This aspect, referred to as region specificity, may be critical for the biological outcome of drug action. Recent studies demonstrated that certain minor groove binding (MGB) drugs, such as bizelesin, produce region-specific lesions in cellular DNA. Bizelesin binds mainly T(A/T)(4)A sites, which are on average scarce, but occasionally cluster in distinct minisatellite regions (200-1000 bp of approximately 85-100% AT), herein referred to as AT islands. Bizelesin-targeted AT islands are likely to function as strong matrix attachment regions (MARs), domains that organize DNA loops on the nuclear matrix. Distortion of MAR-like AT islands may be a basis for the observed inhibition of new replicon initiation and the extreme lethality of bizelesin adducts (<10 adducts/cell for cell growth inhibition). Hence, long AT-islands represent a novel class of critical targets for anticancer drugs. The AT island paradigm illustrates the potential of the concept of regional targeting as an essential component of the rational design of new sequence-specific DNA-reactive drugs. 相似文献
15.
Dirk S. Schmeller Bernd Gruber Bianca Bauch Kaire Lanno Eduardas Budrys Valerija Babij Rimvydas Juškaitis Marek Sammul Zoltan Varga Klaus Henle 《Biodiversity and Conservation》2008,17(14):3607-3622
The Convention on Biodiversity (CBD) commits its signatories to the identification and monitoring of biodiversity. The European
Union has implemented this commitment into its legislation. Despite the legal requirement resources are scarce, requiring
a prioritization of conservation actions, including e.g. monitoring. Red lists are currently the most prominent tool for priority
setting in applied conservation, despite the fact that they were not developed for that purpose. Therefore, it is hardly surprising
that they do not always reflect actual conservation needs. As a response, the concept of national responsibility as a complementary
tool was developed during the last two decades. The existing methods are country specific and mainly incomparable on an international
scale. Here, we present a newly developed method, which is applicable to any taxonomic group, adjustable to different geographic
scales, with little data requirements and clear categorizations. We apply the new method to over 1,000 species in several
countries of different size and report on the applicability of our method and discuss problems that derive from the currently
available data. Our method has several major advantages compared to currently available methods. It is applicable to any geographic
range, allows automatization, given database availability, and is readily adjustable to future data improvements. It further
has comparably low data demands by exploiting one of the most commonly available information on biodiversity, i.e. distribution
maps. We believe that our method allows the allocation of the limited resources in nature conservation in the most sensible
way, e.g. the sharing of monitoring duties, effectively selecting networks of protected areas, improving knowledge on biodiversity,
and closing information gaps in many species groups. 相似文献
16.
Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. 相似文献
17.
Dirk S. Schmeller Bianca Bauch Bernd Gruber Rimvydas Juškaitis Eduardas Budrys Valerija Babij Kaire Lanno Marek Sammul Zoltan Varga Klaus Henle 《Biodiversity and Conservation》2008,17(14):3623-3630
Red lists serve as the most prominent tool for priority setting in applied conservation, even though they were not originally
designed for this task. Hence, threat status does not always reflect actual conservation needs and can be very different from
actual conservation priorities. Therefore, red lists may at best be a suboptimal tool for setting conservation priorities
in a country or region. As a response, a range of alternative or complementary tools have been developed, with approaches,
methods, and parameters such as population decline, population center etc. used, differing widely among countries. One recent
development is the combination of conservation status with a measure of the international importance of a population in a
focal region for the global survival of a species. Here, we provide a new method that integrates the two concepts while keeping
them conceptually separate. The main benefit of this method is that it can be applied across variable geographical scales
such as regions, countries, and even continents. Furthermore, it allows for better recommendations for applied conservation
and conservation policy development than the two concepts in isolation. Our method, if applied internationally, would allow
for a standardized priority setting in species conservation, would be highly comparable between countries, and would lead
to a more efficient use of the limited financial and human resources for monitoring and conservation of biodiversity. 相似文献
18.
We reported previously that the haploid genome of standard strains of laboratory mice contains approximately 70 copies of an amplified long genomic sequence, designated ALGS, that includes a retroposon of the gene for elongation factor 2 (MER). The length of each repeating unit is more than 60 kb, and the sequence of the unit is highly conserved among the repeats. In the present study, Southern blot analysis of the genomes of wild rodents demonstrated that the ALGS is present in all subspecies of Mus musculus and is abundant in M. spicilegus, whereas it is absent in M. spretus as well as in Rattus and other closely related genera. This result indicates that the amplification occurred after the species differentiation with the genus Mus and at least prior to the differentiation of subspecies of M. musculus. To locate chromosomal positions of the ALGS, in situ hybridization was carried out with laboratory strains and wild mice. It appears that the ALGS is located in the centromeric regions of most chromosomes in laboratory mice, M. musculus and M. spicilegus, whereas no positive signals were observed with M. spretus, in accordance with the results from the Southern blotting analysis. 相似文献
19.
E D Lamperti J M Kittelberger T F Smith L Villa-Komaroff 《Nucleic acids research》1992,20(11):2741-2747
We describe evidence that DNA sequences from vectors used for cloning and sequencing have been incorporated accidentally into eukaryotic entries in the GenBank database. These incorporations were not restricted to one type of vector or to a single mechanism. Many minor instances may have been the result of simple editing errors, but some entries contained large blocks of vector sequence that had been incorporated by contamination or other accidents during cloning. Some cases involved unusual rearrangements and areas of vector distant from the normal insertion sites. Matches to vector were found in 0.23% of 20,000 sequences analyzed in GenBank Release 63. Although the possibility of anomalous sequence incorporation has been recognized since the inception of GenBank and should be easy to avoid, recent evidence suggests that this problem is increasing more quickly than the database itself. The presence of anomalous sequence may have serious consequences for the interpretation and use of database entries, and will have an impact on issues of database management. The incorporated vector fragments described here may also be useful for a crude estimate of the fidelity of sequence information in the database. In alignments with well-defined ends, the matching sequences showed 96.8% identity to vector; when poorer matches with arbitrary limits were included, the aggregate identity to vector sequence was 94.8%. 相似文献
20.
Background: Allopolyploidization is one of the major evolutionary modesof plant speciation. Recent interest in studying allopolyploidshas provided significant novel insights into the mechanismsof allopolyploid formation. Compelling evidence indicates thatgenetic and/or epigenetic changes have played significant rolesin shaping allopolyploids, but rates and modes of the changescan be very different among various species. Triticale (x Triticosecale)is an artificial species that has been used to study the evolutionarycourse of complex allopolyploids due to its recent origin andavailability of a highly diversified germplasm pool. Scope: This review summarizes recent genomics studies implemented inhexaploid and octoploid triticales and discusses the mechanismsof the changes and compares the major differences between genomicchanges in triticale and other allopolyploid species. Conclusions: Molecular studies have indicated extensive non-additive sequencechanges or modifications in triticale, and the degree of variationappears to be higher than in other allopolyploid species. Thedata indicate that at least some sequence changes are non-random,and appear to be a function of genome relations, ploidy levelsand sequence types. Specifically, the rye parental genome demonstrateda higher level of changes than the wheat genome. The frequencyof lost parental bands was much higher than the frequency ofgained novel bands, suggesting that sequence modification and/orelimination might be a major force causing genome variationin triticale. It was also shown that 68 % of the total changesoccurred immediately following wide hybridization, but beforechromosome doubling. Genome evolution following chromosome doublingoccurred more slowly at a very low rate and the changes weremainly observed in the first five or so generations. The datasuggest that cytoplasm and relationships between parental genomesare key factors in determining the direction, amount, timingand rate of genomic sequence variation that occurred duringinter-generic allopolyploidization in this system. 相似文献