期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Finding keywords for intergenic and gene regions for human genome

Qiao YH Liu JL Zhang CG Zeng Y 《Nucleosides, nucleotides & nucleic acids》2005,24(3):191-198

The analysis of functionally related sequences for conserved patterns is important for further research of different functional regions. This paper presents an analysis of genes and intergenic sequences from the point of view of linguistics analysis, where gene and intergenic regions are regarded as two different subjects written in the four-letter alphabet [A, C, G, T] and high-frequency simple sequences are taken as keywords. A measurement alpha[l(tau)] was introduced to describe the relative repeat ratio of simple sequences. Cutoff values were found for keywords selection. After eliminating "noise," 87 short sequences were selected as keywords for intergenic regions and 76 for gene regions. 相似文献

2.

Correlation between the size of the intergenic regulatory region, the status of cytosine methylation of rRNA genes and nucleolar expression in wheat 总被引：4，自引：0，他引：4

Ravinder Sardana Michael O'Dell Richard Flavell 《Molecular & general genetics : MGG》1993,236(2-3):155-162

Summary A large number of wheat rRNA genes are methylated at all the CCGG sites that are present in the intergenic regions. A smaller number of rRNA genes are not methylated at one or more CCGG sites. A subset of genes was found unmethylated at a specific CCGG site just downstream of the array of 135 by A repeats in the intergenic region. In all the genotypes studied, the rDNA loci with larger intergenic regions between their genes also possess a greater number of rRNA genes that are unmethylated at one or more CCGG sites in the intergenic regions than do the loci with shorter intergenic regions. In four genotypes (for which data were available), rDNA loci with longer intergenic regions had larger secondary constrictions on metaphase chromosomes, a measure of relative locus activity, than the loci with shorter intergenic regions. The results have been integrated into a model for the control of rDNA expression based on correlations between cytosine methylation patterns and the number of upstream 135 by repeats in intergenic regions. According to this model the 135 by repeats play a part in the control of gene activity by binding a protein(s) that is in limiting supply, thereby predisposing the neighbouring gene to become active preferentially. 相似文献

3.

NOVEL AND RAPIDLY DIVERGING INTERGENIC SEQUENCES BETWEEN TANDEM REPEATS OF THE LUCIFERASE GENES IN SEVEN DINOFLAGELLATE SPECIES1

Liyun Liu J. Woodland Hastings 《Journal of phycology》2006,42(1):96-103

相似文献

4.

Realistic artificial DNA sequences as negative controls for computational genomics

Juan Caballero Arian F. A. Smit Leroy Hood Gustavo Glusman 《Nucleic acids research》2014,42(12):e99

A common practice in computational genomic analysis is to use a set of ‘background’ sequences as negative controls for evaluating the false-positive rates of prediction tools, such as gene identification programs and algorithms for detection of cis-regulatory elements. Such ‘background’ sequences are generally taken from regions of the genome presumed to be intergenic, or generated synthetically by ‘shuffling’ real sequences. This last method can lead to underestimation of false-positive rates. We developed a new method for generating artificial sequences that are modeled after real intergenic sequences in terms of composition, complexity and interspersed repeat content. These artificial sequences can serve as an inexhaustible source of high-quality negative controls. We used artificial sequences to evaluate the false-positive rates of a set of programs for detecting interspersed repeats, ab initio prediction of coding genes, transcribed regions and non-coding genes. We found that RepeatMasker is more accurate than PClouds, Augustus has the lowest false-positive rate of the coding gene prediction programs tested, and Infernal has a low false-positive rate for non-coding gene detection. A web service, source code and the models for human and many other species are freely available at http://repeatmasker.org/garlic/. 相似文献

5.

酵母基因上游序列中潜在的转录正调控位点分析 总被引：3，自引：0，他引：3

王秀荷张静《生物化学与生物物理进展》2005,32(10):953-958

前期研究表明,高效转录酵母基因内含子在序列长度、寡核苷酸使用、以及位置分布等方面都有着区别于低转录内含子的特征 . 进一步观察发现：上游基因间区域的序列长度与基因转录频率也有与内含子序列相同的现象,转录频率高的上游基因间序列一般都比转录频率低的长 . 对高效转录和低效转录上游基因间序列的寡核苷酸使用频率进行统计比较分析,抽提出高转录基因上游区可能的转录正调控元件 . 与酵母的所有非编码序列比较,这些可能的正调控元件基本上也是过表达的 (over-represented) ,其中多数和实验所得的一些位点特征相吻合 . 这些元件富含 G 、 C ,这与内含子中可能的正调控元件在碱基组成上有一定的互补性 . 从这些特征看,高效转录基因上游的序列结构确实有利于基因的转录 . 相似文献

6.

Target genes of microsatellite sequences in head and neck squamous cell carcinoma: Mononucleotide repeats are not detected

Y Wang X Liu Y Li 《Gene》2012,506(1):195-201

Microsatellite instability (MSI) is detected in a wide variety of tumors. It is thought that mismatch repair gene mutation or inactivation is the major cause of MSI. Microsatellite sequences are predominantly distributed in intergenic or intronic DNA. However, MSI is found in the exonic sequences of some genes, causing their inactivation. In this report, we searched GenBank for candidate genes containing potential MSI sequences in exonic regions. Twenty seven target genes were selected for MSI analysis. Instability was found in 70% of these genes (14/20) with head and neck squamous cell carcinoma (HNSCC). Interestingly, no instability was detected in mononucleotide repeats in genes or in intergenic sequences. We conclude that instability of mononucleotide repeats is a rare event in HNSCC. High MSI phenotype in young HNSCC patients is limited to noncoding regions only. MSI percentage in HNSCC tumor is closely related to the repeat type, repeat location and patient's age. 相似文献

7.

Nucleotide sequence and transcription of a human tRNA gene cluster with four genes 总被引：1，自引：0，他引：1

Y N Chang I L Pirtle R M Pirtle 《Gene》1986,48(1):165-174

相似文献

8.

Tubulin genes of the African trypanosome Trypanosoma brucei rhodesiense:nucleotide sequence of a 3.7-kb fragment containing genes for alpha and beta tubulins 总被引：19，自引：0，他引：19

B E Kimmel S Samson J Wu R Hirschberg L R Yarbrough 《Gene》1985,35(3):237-248

相似文献

9.

Nucleotide sequence of the rat gamma-crystallin gene region and comparison with an orthologous human region 总被引：5，自引：0，他引：5

J T den Dunnen J W van Neck F P Cremers N H Lubsen J G Schoenmakers 《Gene》1989,78(2):201-213

The sequences of a 51-kb region containing the cluster of five rat gamma-crystallin-coding genes (CRYG) and of a 7-kb region surrounding the sixth rat CRYG gene were determined. Approximately 78% of the total sequence represents intergenic DNA. We also sequenced 22 kb of DNA from the human CRYG gene cluster. All CRYG genes are associated with CpG-rich regions. The sequence similarity between the human and rat gene regions drops sharply (to 65%) in intronic and 3'-flanking regions but decreases only gradually in the 5'-flanking region. Highly conserved regions (greater than 80%) are found as far upstream as 1.5 kb. Overall intergenic distances are conserved. The human region contains much more repetitive DNA (24% vs. 10%) but less simple-sequence (sps) DNA (0.7% vs. 4%) than the rat region. Almost all repeats and spsDNA elements are located in the intergenic region. The location of repetitive and spsDNA differs between the orthologous regions and these elements were probably inserted after the evolutionary separation of rat and man. The Alu repeats in man and the B3 repeats in the rat are close copies of their respective consensus sequences and bordered by virtually perfect repeats. In contrast, the B1 and B2 repeats in the rat have diverged considerably from the consensus sequence and the surrounding direct repeats are usually imperfect. Thus the dispersion of the B1 and B2 repeats in the rat probably preceded that of the B3 repeats. Within the rat genomic region the spacing of Z-DNA elements is surprisingly regular, they are located about 12 kb apart. A search for putative matrix-associated regions suggests that the rat CRYG gene cluster is organized into two chromosomal domains. 相似文献

10.

Nucleotide sequence analysis of soybean small heat shock protein genes belonging to two different multigene families 总被引：10，自引：0，他引：10

E Raschke G Baumann F Sch?ffl 《Journal of molecular biology》1988,199(4):549-557

相似文献

11.

Short ultraconserved promoter regions delineate a class of preferentially expressed alternatively spliced transcripts

Christian Rdelsperger Sebastian Khler Marcel H. Schulz Thomas Manke Sebastian Bauer Peter N. Robinson 《Genomics》2009,94(5):308-316

相似文献

12.

Identification of multiple sites suitable for insertion of foreign genes in herpes simplex virus genomes

Tomomi Morimoto Jun Arii Hiroomi Akashi Yasushi Kawaguchi 《Microbiology and immunology》2009,53(3):155-161

相似文献

13.

Conserved sequences and transcription of the hsp70 gene family in Trypanosoma brucei. 总被引：32，自引：9，他引：23

下载免费PDF全文

D J Glass R I Polvere L H Van der Ploeg 《Molecular and cellular biology》1986,6(12):4657-4666

相似文献

14.

Complete chloroplast genome sequences of Solanum bulbocastanum, Solanum lycopersicum and comparative analyses with other Solanaceae genomes

Daniell H Lee SB Grevich J Saski C Quesada-Vargas T Guda C Tomkins J Jansen RK 《TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik》2006,112(8):1503-1518

Despite the agricultural importance of both potato and tomato, very little is known about their chloroplast genomes. Analysis of the complete sequences of tomato, potato, tobacco, and Atropa chloroplast genomes reveals significant insertions and deletions within certain coding regions or regulatory sequences (e.g., deletion of repeated sequences within 16S rRNA, ycf2 or ribosomal binding sites in ycf2). RNA, photosynthesis, and atp synthase genes are the least divergent and the most divergent genes are clpP, cemA, ccsA, and matK. Repeat analyses identified 33–45 direct and inverted repeats ≥30 bp with a sequence identity of at least 90%; all but five of the repeats shared by all four Solanaceae genomes are located in the same genes or intergenic regions, suggesting a functional role. A comprehensive genome-wide analysis of all coding sequences and intergenic spacer regions was done for the first time in chloroplast genomes. Only four spacer regions are fully conserved (100% sequence identity) among all genomes; deletions or insertions within some intergenic spacer regions result in less than 25% sequence identity, underscoring the importance of choosing appropriate intergenic spacers for plastid transformation and providing valuable new information for phylogenetic utility of the chloroplast intergenic spacer regions. Comparison of coding sequences with expressed sequence tags showed considerable amount of variation, resulting in amino acid changes; none of the C-to-U conversions observed in potato and tomato were conserved in tobacco and Atropa. It is possible that there has been a loss of conserved editing sites in potato and tomato.Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users. 相似文献

15.

Apparent gene conversion between beta-tubulin genes yields multiple regulatory pathways for a single beta-tubulin polypeptide isotype. 总被引：12，自引：7，他引：5

下载免费PDF全文

K F Sullivan J T Lau D W Cleveland 《Molecular and cellular biology》1985,5(9):2454-2465

相似文献

16.

Identification of gene expression elements in Histomonas meleagridis using splinkerette PCR, a variation of ligated adaptor PCR

Lynn EC Beckstead RB 《The Journal of parasitology》2012,98(1):135-141

相似文献

17.

Does the high gene density in the sponge NK homeobox gene cluster reflect limited regulatory capacity?

Fahey B Larroux C Woodcroft BJ Degnan BM 《The Biological bulletin》2008,214(3):205-217

相似文献

18.

Expansion and divergence of the GH locus between spider monkey and chimpanzee

Revol De Mendoza A Esquivel Escobedo D Martínez Dávila I Saldaña H 《Gene》2004,336(2):185-193

Growth hormone (GH) has been previously described as showing distinct evolutionary stories between primates and other mammals. A burst of changes and successive amplification events took place in the primate lineage giving rise to a multigene family in the three Anthropoidea lineages. Polymerase chain reaction (PCR) was used to obtain the genes and the intergenic regions comprising the GH loci of the spider monkey (Ateles geoffroyi), a New-World primate, and of the chimpanzee (Pan troglodytes), an ape. The intergenic sequences of both species were screened by hybridization to detect copies of the Alu family, which have been implicated in the formation of the human GH locus. The GH locus of the spider monkey contains at least six GH-related genes, four of them were cloned. Likewise, five short intergenic sequences of approximately 3 kb were amplified and cloned. On the other hand, in the chimpanzee four new placental lactogen (PL) genes as well as four intergenic regions were amplified. Consequently, in this ape, six genes (two GHs, previously obtained, and four PLs) are clustered, separated by intergenic sequences of different lengths (two short ones of about 5 kb, and at least two long ones between 9 and 13 kb). The presence of Alu sequences within the intergenic regions of both GH loci corroborates the current hypothesis that they acted as a driving force for the locus expansion. GH sequence comparisons reveal that several gene-conversion events might have occurred during the formation of this genome region, which has undergone independent evolution in the three Anthropoidea branches. To establish the GH's evolutionary history may prove to be a difficult task due to these gene-conversion events. 相似文献

19.

The beta globin gene cluster of the prosimian primate Galago crassicaudatus: nucleotide sequence determination of the 41-kb cluster and comparative sequence analyses.

D A Tagle M J Stanhope D R Siemieniak P Benson M Goodman J L Slightom 《Genomics》1992,13(3):741-760

The nucleotide sequence of the beta globin gene cluster of the prosimian Galago crassicaudatus has been determined. A total sequence spanning 41,101 bp contains and links together previously published sequences of the five galago beta-like globin genes (5'-epsilon-gamma-psi eta-delta-beta-3'). A computer-aided search for middle interspersed repetitive sequences identified 10 LINE (L1) elements, including a 5' truncated repeat that is orthologous to the full-length L1 element found in the human epsilon-gamma intergenic region. SINE elements that were identified included one Alu type I repeat, four Alu type II repeats, and two methionine tRNA-derived Monomer (type III) elements. Alu type II and Monomer sequences are unique to the galago genome. Structural analyses of the cluster sequence reveals that it is relatively A+T rich (about 62%) and regions with high G+C content are associated primarily with globin coding regions. Comparative analyses with the beta globin cluster sequences of human, rabbit, and mouse reveal extensive sequence homologies in their genic regions, but only human, galago, and rabbit sequences share extensive intergenic sequence homologies. Divergence analyses of aligned intergenic and flanking sequences from orthologous human, galago, and rabbit sequences show a gradation in the rate of nucleotide sequence evolution along the cluster where sequences 5' of the epsilon globin gene region show the least sequence divergence and sequences just 5' of the beta globin gene region show the greatest sequence divergence. 相似文献

20.

Identification of the transcriptional promoters in the proximal regions of human microRNA genes

Long YS Deng GF Sun XS Yi YH Su T Zhao QH Liao WP 《Molecular biology reports》2011,38(6):4153-4157

相似文献