期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Non-coding sequence retrieval system for comparative genomic analysis of gene regulatory elements

Sung Tae Doh Yunyu Zhang Matthew H Temple Li Cai 《BMC bioinformatics》2007,8(1):94

相似文献

2.

Detecting non-coding selective pressure in coding regions

Chen H Blanchette M 《BMC evolutionary biology》2007,7(Z1):S9

相似文献

3.

Congruent evolution of different classes of non-coding DNA in prokaryotic genomes

Rogozin IB Makarova KS Natale DA Spiridonov AN Tatusov RL Wolf YI Yin J Koonin EV 《Nucleic acids research》2002,30(19):4264-4271

Prokaryotic genomes are considered to be 'wall-to-wall' genomes, which consist largely of genes for proteins and structural RNAs, with only a small fraction of the genomic DNA allotted to intergenic regions, which are thought to typically contain regulatory signals. The majority of bacterial and archaeal genomes contain 6-14% non-coding DNA. Significant positive correlations were detected between the fraction of non-coding DNA and inter- and intra-operonic distances, suggesting that different classes of non-coding DNA evolve congruently. In contrast, no correlation was found between any of these characteristics of non-coding sequences and the number of genes or genome size. Thus, the non-coding regions and the gene sets in prokaryotes seem to evolve in different regimes. The evolution of non-coding regions appears to be determined primarily by the selective pressure to minimize the amount of non-functional DNA, while maintaining essential regulatory signals, because of which the content of non-coding DNA in different genomes is relatively uniform and intra- and inter-operonic non-coding regions evolve congruently. In contrast, the gene set is optimized for the particular environmental niche of the given microbe, which results in the lack of correlation between the gene number and the characteristics of non-coding regions. 相似文献

4.

Detection and Partial Characterization of Milk vetch dwarf virus Isolates from Faba Bean (Vicia faba L.) in Yunnan Province, China

Safaa G. Kumari Brendan Rodoni Heinrich-Josef Vetten Mai Hlaing Loh Angela Freeman Joop van Leur Shiying Bao Xiaoming Wang 《Journal of Phytopathology》2010,158(1):35-39

A virus disease of faba bean ( Vicia faba L.) in China, characterized by leaf yellowing and rolling and plant stunting, was shown to be caused by a virus of the genus Nanovirus based on serological reactions to nanovirus-specific monoclonal antibodies and the generation of polymerase chain reaction amplicons using nanovirus-specific primers. To identify the faba bean-infecting nanovirus, regions of the DNA components encoding the master replication initiator protein and capsid protein of two nanovirus isolates from China were cloned, sequenced and compared with those of other members of the genus Nanovirus . The two Chinese virus isolates shared nucleotide sequence identities ranging from 95 to 98% with the type isolate of Milk vetch dwarf virus (MDV) from Japan. They were thus identified as isolates of MDV, a virus so far known to cause important diseases of legumes in Japan. This is the first record of MDV-infecting faba bean in China. 相似文献

5.

Mutational signatures of de-differentiation in functional non-coding regions of melanoma genomes

SC Parker J Gartner I Cardenas-Navia X Wei H Ozel Abaan SS Ajay NF Hansen L Song UK Bhanot JK Killian Y Gindin RL Walker PS Meltzer JC Mullikin TS Furey GE Crawford SA Rosenberg Y Samuels EH Margulies 《PLoS genetics》2012,8(8):e1002871

Much emphasis has been placed on the identification, functional characterization, and therapeutic potential of somatic variants in tumor genomes. However, the majority of somatic variants lie outside coding regions and their role in cancer progression remains to be determined. In order to establish a system to test the functional importance of non-coding somatic variants in cancer, we created a low-passage cell culture of a metastatic melanoma tumor sample. As a foundation for interpreting functional assays, we performed whole-genome sequencing and analysis of this cell culture, the metastatic tumor from which it was derived, and the patient-matched normal genomes. When comparing somatic mutations identified in the cell culture and tissue genomes, we observe concordance at the majority of single nucleotide variants, whereas copy number changes are more variable. To understand the functional impact of non-coding somatic variation, we leveraged functional data generated by the ENCODE Project Consortium. We analyzed regulatory regions derived from multiple different cell types and found that melanocyte-specific regions are among the most depleted for somatic mutation accumulation. Significant depletion in other cell types suggests the metastatic melanoma cells de-differentiated to a more basal regulatory state. Experimental identification of genome-wide regulatory sites in two different melanoma samples supports this observation. Together, these results show that mutation accumulation in metastatic melanoma is nonrandom across the genome and that a de-differentiated regulatory architecture is common among different samples. Our findings enable identification of the underlying genetic components of melanoma and define the differences between a tissue-derived tumor sample and the cell culture created from it. Such information helps establish a broader mechanistic understanding of the linkage between non-coding genomic variations and the cellular evolution of cancer. 相似文献

6.

The impact of recombination on nucleotide substitutions in the human genome

Duret L Arndt PF 《PLoS genetics》2008,4(5):e1000071

Unraveling the evolutionary forces responsible for variations of neutral substitution patterns among taxa or along genomes is a major issue for detecting selection within sequences. Mammalian genomes show large-scale regional variations of GC-content (the isochores), but the substitution processes at the origin of this structure are poorly understood. We analyzed the pattern of neutral substitutions in 1 Gb of primate non-coding regions. We show that the GC-content toward which sequences are evolving is strongly negatively correlated to the distance to telomeres and positively correlated to the rate of crossovers (R²=47%). This demonstrates that recombination has a major impact on substitution patterns in human, driving the evolution of GC-content. The evolution of GC-content correlates much more strongly with male than with female crossover rate, which rules out selectionist models for the evolution of isochores. This effect of recombination is most probably a consequence of the neutral process of biased gene conversion (BGC) occurring within recombination hotspots. We show that the predictions of this model fit very well with the observed substitution patterns in the human genome. This model notably explains the positive correlation between substitution rate and recombination rate. Theoretical calculations indicate that variations in population size or density in recombination hotspots can have a very strong impact on the evolution of base composition. Furthermore, recombination hotspots can create strong substitution hotspots. This molecular drive affects both coding and non-coding regions. We therefore conclude that along with mutation, selection and drift, BGC is one of the major factors driving genome evolution. Our results also shed light on variations in the rate of crossover relative to non-crossover events, along chromosomes and according to sex, and also on the conservation of hotspot density between human and chimp. 相似文献

7.

Terminal regions of mammal chromosomes: plasticity and role in evolution

Zhdanova NS Rubtsov NB Minina IuM 《Genetika》2007,43(7):873-886

The review considers data on the composition, organization, and functional significance of terminal regions in mammalian chromosomes, including telomeres and subtelomeric regions. Because of specific structure, features of DNA replication, and characteristic localization in somatic and meiotic cells, these regions are hot spots for many events associated with genome functioning in mammals. Instability of these regions is of special interest. Evidence suggesting that instability of chromosomal regions containing telomeric DNA is a factor of chromosome evolution is discussed. The association of size and structure of telomeric regions with replicative aging and cell immortalization is considered. The review deals in detail with classical and alternative mechanisms of telomere size control, the significance of changes in telomeric region length in ontogeny, oncotransformation, and evolution. The issues related to telomere destabilization and the role of this process in chromosome rearrangement formation and chromosome evolution are discussed. The origin of telomere repeats in interstitial chromosome sites, including regions of evolutionary fusions-fissions is given special consideration. The possible role of ribosomal repeats and mechanisms similar to ALT (alternative lengthening of telomeres) in telomere reorganization in some taxa are discussed. 相似文献

8.

Reassortment and concerted evolution in banana bunchy top virus genomes

下载免费PDF全文

Hu JM Fu HC Lin CH Su HJ Yeh HH 《Journal of virology》2007,81(4):1746-1761

The nanovirus Banana bunchy top virus (BBTV) has six standard components in its genome and occasionally contains components encoding additional Rep (replication initiation protein) genes. Phylogenetic network analysis of coding sequences of DNA 1 and 3 confirmed the two major groups of BBTV, a Pacific and an Asian group, but show evidence of web-like phylogenies for some genes. Phylogenetic analysis of 102 major common regions (CR-Ms) from all six components showed a possible concerted evolution within the Pacific group, which is likely due to recombination in this region. The CR-M of additional Rep genes is close to that of DNA 1 and 2. Comparison of tree topologies constructed with DNA 1 and DNA 3 coding sequences of 14 BBTV isolates showed distinct phylogenetic histories based on Kishino-Hasegawa and Shimodaira-Hasegawa tests. The results of principal component analysis of amino acid and codon usages indicate that DNA 1 and 3 have a codon bias different from that of all other genes of nanoviruses, including all currently known additional Rep genes of BBTV, which suggests a possible ancient genome reassortment event between distinctive nanoviruses. 相似文献

9.

Intrachromosomal rearrangements in avian genome evolution: evidence for regions prone to breakpoints

Skinner BM Griffin DK 《Heredity》2012,108(1):37-41

It is generally believed that the organization of avian genomes remains highly conserved in evolution as chromosome number is constant and comparative chromosome painting demonstrated there to be very few interchromosomal rearrangements. The recent sequencing of the zebra finch (Taeniopygia guttata) genome allowed an assessment of the number of intrachromosomal rearrangements between it and the chicken (Gallus gallus) genome, revealing a surprisingly high number of intrachromosomal rearrangements. With the publication of the turkey (Meleagris gallopavo) genome it has become possible to describe intrachromosomal rearrangements between these three important avian species, gain insight into the direction of evolutionary change and assess whether breakpoint regions are reused in birds. To this end, we aligned entire chromosomes between chicken, turkey and zebra finch, identifying syntenic blocks of at least 250 kb. Potential optimal pathways of rearrangements between each of the three genomes were determined, as was a potential Galliform ancestral organization. From this, our data suggest that around one-third of chromosomal breakpoint regions may recur during avian evolution, with 10% of breakpoints apparently recurring in different lineages. This agrees with our previous hypothesis that mechanisms of genome evolution are driven by hotspots of non-allelic homologous recombination. 相似文献

10.

Structure and evolution of human sub-telomeric regions

Vergnaud G 《Journal de la Société de Biologie》1999,193(1):35-40

Recent progress in the field of human genome analysis has led to the development of new concepts in the definition of subtelomeric domains. Analysis of DNA sequences from human and yeast chromosome ends have shown that short stretches of degenerate TTAGGG are found at a distance from the telomeric repeats. These stretches define a boundary between two structurally different regions. The distal domain is characterised by numerous, short segments of interrupted homology to many other human telomeric regions and to a number of ESTs. The proximal domain shows much longer uninterrupted homology to a few chromosome ends. This domain evolved quickly within primates at least, as demonstrated by the detailed study of locus DNF92 which spread very recently in humans from 17 qter to at least ten other chromosome ends. At the different sites, presence-absence polymorphisms are observed within humans. The region remained single locus at the paralogous site in higher primates. Conversely, a human and orangutan single locus telomeric domain occupies multiple chromosome ends in chimpanzee. Balanced translocation is the likely mechanism through which the spreading occurred. Some members of the olfactory receptor gene family show a similar behaviour: multiple telomeric locations, and presence-absence polymorphism. Strikingly, the set of chromosome ends occupied by the two regions is identical, except for the two ancestral sites. Moreover, the relative frequency of detection of the region at the different sites indicates some kind of competition between the two regions. Consequently, these two regions represent major new tools to investigate recent human genome evolution and human genome diversity in different populations. 相似文献

11.

Rapid evolution and complex structural organization in genomic regions harboring multiple prolamin genes in the polyploid wheat genome 总被引：5，自引：0，他引：5

Gao S Gu YQ Wu J Coleman-Derr D Huo N Crossman C Jia J Zuo Q Ren Z Anderson OD Kong X 《Plant molecular biology》2007,65(1-2):189-203

Genes encoding wheat prolamins belong to complicated multi-gene families in the wheat genome. To understand the structural complexity of storage protein loci, we sequenced and analyzed orthologous regions containing both gliadin and LMW-glutenin genes from the A and B genomes of a tetraploid wheat species, Triticum turgidum ssp. durum. Despite their physical proximity to one another, the gliadin genes and LMW-glutenin genes are organized quite differently. The gliadin genes are found to be more clustered than the LMW-glutenin genes which are separated from each other by much larger distances. The separation of the LMW-glutenin genes is the result of both the insertion of large blocks of repetitive DNA owing to the rapid amplification of retrotransposons and the presence of genetic loci interspersed between them. Sequence comparisons of the orthologous regions reveal that gene movement could be one of the major factors contributing to the violation of microcolinearity between the homoeologous A and B genomes in wheat. The rapid sequence rearrangements and differential insertion of repetitive DNA has caused the gene islands to be not conserved in compared regions. In addition, we demonstrated that the i-type LMW-glutenin originated from a deletion of 33-bps in the 5′ coding region of the m-type gene. Our results show that multiple rounds of segmental duplication of prolamin genes have driven the amplification of the ω-gliadin genes in the region; such segmental duplication could greatly increase the repetitive DNA content in the genome depending on the amount of repetitive DNA present in the original duplicate region. Electronic supplementary material The online version of this article (doi:) contains supplementary material, which is available to authorized users. Shuangcheng Gao and Yong Qiang Gu contributed equally to the work. 相似文献

12.

Genome-wide mapping of matrix attachment regions in Drosophila melanogaster

Rashmi U Pathak Arumugam Srinivasan Rakesh K Mishra 《BMC genomics》2014,15(1)

相似文献

13.

The evolution of retrotransposon regulatory regions and its consequences on the Drosophila melanogaster and Homo sapiens host genomes

Marie Fablet Rita Rebollo Christian Bimont Cristina Vieira 《Gene》2007,390(1-2):84

It has now been established that transposable elements (TEs) make up a variable, but significant proportion of the genomes of all organisms, from Bacteria to Vertebrates. However, in addition to their quantitative importance, there is increasing evidence that TEs also play a functional role within the genome. In particular, TE regulatory regions can be viewed as a large pool of potential promoter sequences for host genes. Studying the evolution of regulatory region of TEs in different genomic contexts is therefore a fundamental aspect of understanding how a genome works. In this paper, we first briefly describe what is currently known about the regulation of TE copy number and activity in genomes, and then focus on TE regulatory regions and their evolution. We restrict ourselves to retrotransposons, which are the most abundant class of eukaryotic TEs, and analyze their evolution and the subsequent consequences for host genomes. Particular attention is paid to much-studied representatives of the Vertebrates and Invertebrates, Homo sapiens and Drosophila melanogaster, respectively, for which high quality sequenced genomes are available. 相似文献

14.

Integrating convolution and self-attention improves language model of human genome for interpreting non-coding regions at base-resolution

Meng Yang Lichao Huang Haiping Huang Hui Tang Nan Zhang Huanming Yang Jihong Wu Feng Mu 《Nucleic acids research》2022,50(14):e81

Interpretation of non-coding genome remains an unsolved challenge in human genetics due to impracticality of exhaustively annotating biochemically active elements in all conditions. Deep learning based computational approaches emerge recently to help interpret non-coding regions. Here, we present LOGO (Language of Genome), a self-attention based contextualized pre-trained language model containing only two self-attention layers with 1 million parameters as a substantially light architecture that applies self-supervision techniques to learn bidirectional representations of the unlabelled human reference genome. LOGO is then fine-tuned for sequence labelling task, and further extended to variant prioritization task via a special input encoding scheme of alternative alleles followed by adding a convolutional module. Experiments show that LOGO achieves 15% absolute improvement for promoter identification and up to 4.5% absolute improvement for enhancer-promoter interaction prediction. LOGO exhibits state-of-the-art multi-task predictive power on thousands of chromatin features with only 3% parameterization benchmarking against the fully supervised model, DeepSEA and 1% parameterization against a recent BERT-based DNA language model. For allelic-effect prediction, locality introduced by one dimensional convolution shows improved sensitivity and specificity for prioritizing non-coding variants associated with human diseases. In addition, we apply LOGO to interpret type 2 diabetes (T2D) GWAS signals and infer underlying regulatory mechanisms. We make a conceptual analogy between natural language and human genome and demonstrate LOGO is an accurate, fast, scalable, and robust framework to interpret non-coding regions for global sequence labeling as well as for variant prioritization at base-resolution. 相似文献

15.

Pattern and rate of indel evolution inferred from whole chloroplast intergenic regions in sugarcane, maize and rice.

Kyoko Yamane Kentaro Yano Taihachi Kawahara 《DNA research》2006,13(5):197-204

Microstructural changes such as insertions and deletions (=indels) are a major driving force in the evolution of non-coding DNA sequences. To better understand the mechanisms by which indel mutations arise, as well as the molecular evolution of non-coding regions, the number and pattern of indels and nucleotide substitutions were compared in the whole chloroplast genomes. Comparisons were made for a total of over 38 kb non-coding DNA sequences from 126 intergenic regions in two data sets representing species with different divergence times: sugarcane and maize and Oryza sativa var. indica and japonica. The main findings of this study are: (i) Approximately half of all indels are single nucleotide indels. This observation agrees with previous studies in various organisms. (ii) The distribution and number of indels was different between two data sets, and different patterns were observed for tandem repeat and non-repeat indels. (iii) Distribution pattern of tandem repeat indels showed statistically significant bias towards A/T-rich. (iv) The rate of indel mutation was estimated to be approximately 0.8 +/- 0.04 x 10(-9) per site per year, which was similar to previous estimates in other organisms. (v) The frequencies of nucleotide substitutions and indels were significantly lower in inverted repeat (IR). 相似文献

16.

Terminal regions of mammal chromosomes: Plasticity and role in evolution

N. S. Zhdanova N. B. Rubtsov Yu. M. Minina 《Russian Journal of Genetics》2007,43(7):721-732

The review considers data on the composition, organization, and functional significance of terminal regions in mammalian chromosomes, including telomeres and subtelomeric regions. Because of specific structure, features of DNA replication, and characteristic localization in somatic and meiotic cells, these regions are hot spots for many events associated with genome functioning in mammals. Instability of these regions is of special interest. Evidence suggesting that instability of chromosomal regions containing telomeric DNA is a factor of chromosome evolution is discussed. The association of size and structure of telomeric regions with replicative aging and cell immortalization is considered. The review deals in detail with classical and alternative mechanisms of telomere size control, the significance of changes in telomeric region length in ontogeny, oncotransformation, and evolution. The issues related to telomere destabilization and the role of this process in chromosome rearrangement formation and chromosome evolution are discussed. The origin of telomere repeats in interstitial chromosome sites, including regions of evolutionary fusions-fissions is given special consideration. The possible role of ribosomal repeats and mechanisms similar to ALT (alternative lengthening of telomeres) in telomere reorganization in some taxa are discussed. 相似文献

17.

Rapid genome evolution revealed by comparative sequence analysis of orthologous regions from four triticeae genomes 总被引：23，自引：0，他引：23

Gu YQ Coleman-Derr D Kong X Anderson OD 《Plant physiology》2004,135(1):459-470

Bread wheat (Triticum aestivum) is an allohexaploid species, consisting of three subgenomes (A, B, and D). To study the molecular evolution of these closely related genomes, we compared the sequence of a 307-kb physical contig covering the high molecular weight (HMW)-glutenin locus from the A genome of durum wheat (Triticum turgidum, AABB) with the orthologous regions from the B genome of the same wheat and the D genome of the diploid wheat Aegilops tauschii (Anderson et al., 2003; Kong et al., 2004). Although gene colinearity appears to be retained, four out of six genes including the two paralogous HMW-glutenin genes are disrupted in the orthologous region of the A genome. Mechanisms involved in gene disruption in the A genome include retroelement insertions, sequence deletions, and mutations causing in-frame stop codons in the coding sequences. Comparative sequence analysis also revealed that sequences in the colinear intergenic regions of these different genomes were generally not conserved. The rapid genome evolution in these regions is attributable mainly to the large number of retrotransposon insertions that occurred after the divergence of the three wheat genomes. Our comparative studies indicate that the B genome diverged prior to the separation of the A and D genomes. Furthermore, sequence comparison of two distinct types of allelic variations at the HMW-glutenin loci in the A genomes of different hexaploid wheat cultivars with the A genome locus of durum wheat indicates that hexaploid wheat may have more than one tetraploid ancestor. 相似文献

18.

Molecular evolution of poxviruses

I. V. Babkin S. N. Shchelkunov 《Russian Journal of Genetics》2008,44(8):895-908

Previous restriction fragment length polymorphism analysis divided variola virus (VARV) strains into two subtypes, one of which included West African and South American isolates. This allowed a dating to be introduced for the first time in estimation of the VARV evolution rate. The results were used to analyze the molecular evolution of the total family Poxviridae. Comparisons of the known nucleotide sequences were performed for the extended conserved central genome region in 42 orthopoxvirus strains and for the eight genes of multisubunit RNA polymerase in 65 viruses belonging to various genera of the family Poxviridae. Using the Bayesian dating method, the mutation accumulation rate of poxviruses was estimated at (1.7–8.8) × 10^?6 nucleotide substitutions per site per year. Computations showed that the modern poxvirus genera started diverging from an ancestral virus more than 200 thousand years ago and that an ancestor of the genus Orthopoxvirus emerged 131 ± 45 thousand years ago. The other genera of mammalian poxviruses with a low GC content diverged approximately 110–90 thousand years ago. The independent evolution of VARV started 3.4 ± 0.8 thousand years ago. It was shown with the example of VARV and the monkeypox virus (MPXV) that divergent evolution of these orthopoxviruses started and the West African subtypes of VARV and MPXV were formed as geographical conditions changed to allow isolation of West African animals from other African regions. 相似文献

19.

Statistical method for predicting protein coding regions in nucleic acid sequences 总被引：1，自引：0，他引：1

Fichant G.; Gautier C. 《Bioinformatics (Oxford, England)》1987,3(4):287-295

Protein coding regions of a genome fragment can be mathematicallypredicted by studying variations in the statistical propertiesor by searching the signals characteristic of the junctionsbetween the coding and non-coding regions. We propose here anew statistical method using correspondence analysis. This methoddoes not use any reference codon set but takes into accountthe codon usage homogeneity along the studied genome fragment.Comparison with previously published methods especially the‘codon usage method’ of Staden has been made, andtwo examples are presented here. Applications to analysis ofprokaryotic operon and eukaryotic split genes are also discussed.Use of the method has also shown two structures not previouslydescribed: i) in the human prt gene, a strong triplet structureexists in a non-coding region; ii) in the human tp-a codon usageis not uniform between the different exons Received on September 25, 1986 相似文献

20.

Sorghum chlorotic spot virus binds to potyvirus cylindrical inclusions in tobacco leaf cells

《Journal of Ultrastructure and Molecular Structure Research》1989,102(1):47-52

Nicotiana benthamiana can be doubly infected with either potato virus Y or tobacco etch virus and sorghum chlorotic spot virus (SCSV). Immunogold labeling showed that cylindrical inclusions of either potyvirus bind virions of the unrelated rod-shaped furovirus SCSV. Not all cells in doubly infected N. benthamiana plants contained both viruses. In cells infected by the potyviruses but not by SCSV, cylindrical inclusions did not label with the antiserum to SCSV. Numbers of cells infected with SCSV did not increase in doubly infected plants compared to those in plants infected with SCSV alone. Systemic infection of N. benthamiana by either potyvirus was not prevented by SCSV infections. This provides further evidence that unrelated rod-shaped viruses can bind to potyvirus cylindrical inclusion bodies, and that this phenomenon is not limited to graminaceous hosts. 相似文献