期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Distribution and characterization of simple sequence repeats in Gossypium raimondii genome

Changsong Zou Cairui Lu Youping Zhang Guoli Song 《Bioinformation》2012,8(17):801-806

Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton. 相似文献

2.

Coevolution between simple sequence repeats (SSRs) and virus genome size

X Zhao Y Tian R Yang H Feng Q Ouyang Y Tian Z Tan M Li Y Niu J Jiang G Shen R Yu 《BMC genomics》2012,13(1):435

ABSTRACT: BACKGROUND: Relationship between the level of repetitiveness in genomic sequence and genome size has been investigated by making use of complete prokaryotic and eukaryotic genomes, but relevant studies have been rarely made in virus genomes. RESULTS: In this study, a total of 257 viruses were examined, which cover 90% of genera. The results showed that simple sequence repeats (SSRs) is strongly, positively and significantly correlated with genome size. Certain repeat class is distributed in a certain range of genome sequence length. Mono-, di- and tri- repeats are widely distributed in all virus genomes, tetra- SSRs as a common component consist in genomes which more than 100 kb in size; in the range of genome < 100 kb, genomes containing penta- and hexa- SSRs are not more than 50%. Principal components analysis (PCA) indicated that dinucleotide repeat affects the differences of SSRs most strongly among virus genomes. Results showed that SSRs tend to accumulate in larger virus genomes; and the longer genome sequence, the longer repeat units. CONCLUSIONS: We conducted this research standing on the height of the whole virus. We concluded that genome size is an important factor in affecting the occurrence of SSRs; hosts are also responsible for the variances of SSRs content to a certain degree. 相似文献

3.

De novo assembly and characterization of the complete chloroplast genome of radish (Raphanus sativus L.)

Young-Min Jeong Won-Hyung Chung Jeong-Hwan Mun Namshin Kim Hee-Ju Yu 《Gene》2014

Radish (Raphanus sativus L.) is an edible root vegetable crop that is cultivated worldwide and whose genome has been sequenced. Here we report the complete nucleotide sequence of the radish cultivar WK10039 chloroplast (cp) genome, along with a de novo assembly strategy using whole genome shotgun sequence reads obtained by next generation sequencing. The radish cp genome is 153,368 bp in length and has a typical quadripartite structure, composed of a pair of inverted repeat regions (26,217 bp each), a large single copy region (83,170 bp), and a small single copy region (17,764 bp). The radish cp genome contains 87 predicted protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence analysis revealed the presence of 91 simple sequence repeats (SSRs) in the radish cp genome. 相似文献

4.

SSRscanner: a program for reporting distribution and exact location of simple sequence repeats 总被引：1，自引：0，他引：1

下载免费PDF全文

Anwar T Khan AU 《Bioinformation》2006,1(3):89-91

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence.

Availability 相似文献

5.

Analysis of simple sequence repeats (SSRs)dynamics in fungus Fusarium graminearum 总被引：1，自引：0，他引：1

Singh R Sheoran S Sharma P Chatrath R 《Bioinformation》2011,5(10):402-404

The abundance and inherent potential for variations in simple sequence repeats (SSRs) or microsatellites resulted in valuable source for genetic markers in eukaryotes. We describe the organization and abundance of SSRs in fungus Fusarium graminearum (causative agent for Fusarium head blight or head scab of wheat). We identified 1705 SSRs of various nucleotide repeat motifs in the sequence database of F. graminearum. It is observed that mononucleotide repeats (62%) were most abundant followed by di- (20%) and trinucleotide repeats (14%). It is noted that tetra-, penta- and hexanucleotide repeats accounted for only 4% of SSRs. The estimated frequency of Class I SSRs (perfect repeats ≥20 nucleotides) was one SSR per 124.5 kb, whereas the frequency of Class II (perfect repeats >10 nucleotides and ≫20 nucleotides) was one SSR per 25.6 kb. The dynamics of SSRs will be a powerful tool for taxonomic, phylogenetic, genome mapping and population genetic studies as SSR based markers show high levels of allelic variation, codominant inheritance and ease of analysis. 相似文献

6.

Incidence,complexity and diversity of simple sequence repeats across potexvirus genomes

Chaudhary Mashhood Alam Avadhesh Kumar Singh Choudhary Sharfuddin Safdar Ali 《Gene》2014

An in-silico analysis of simple sequence repeats (SSRs) in genomes of 32 species of potexviruses was performed wherein a total of 691 SSRs and 33 cSSRs were observed. Though SSRs were present in all the studied genomes their incident frequency ranged from 11 to 30 per genome. Further, 10 potexvirus genomes possessed no cSSRs when extracted at a dMAX of 10 and wherein present, the highest frequency was 3. SSR and cSSR incidence, relative density and relative abundance were non-significantly correlated with genome size and GC content suggesting an ongoing evolutionary and adaptive phase of the virus species. SSRs present primarily ranged from mono- to tri-nucleotide repeat motifs with a greatly skewed distribution across the coding and non-coding regions. Present work is an effort for the undergoing compilation and analysis of incidence, distribution and variation of the viral repeat sequences to understand their evolutionary and functional relevance. 相似文献

7.

Map and analysis of microsatellites in the genome of <Emphasis Type="Italic">Populus</Emphasis>: The first sequenced perennial plant

下载免费PDF全文

Li ShuXian Yin TongMing 《中国科学C辑(英文版)》2007,50(5):690-699

We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated that SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites. 相似文献

8.

Simple sequence repeats in proteins and their significance for network evolution

Hancock JM Simon M 《Gene》2005,345(1):113-118

相似文献

9.

Characterization of genome-wide simple sequence repeats and application in interspecific genetic map integration in kiwifruit

Chunyan Liu Qiong Zhang Xiaohong Yao Caihong Zhong Chunlin Yan Hongwen Huang 《Tree Genetics & Genomes》2016,12(2):21

Simple sequence repeats (SSRs) have been widely used in the construction of linkage maps, quantitative trait loci (QTLs) mapping, and marker-assisted selection (MAS). The availability of the sequenced Actinidia chinensis (kiwifruit) genome allows for the inexpensive and efficient development of microsatellite markers. In this study, a total of 49,067 SSRs were identified and characterized in the genome sequences of kiwifruit. Dinucleotide repeats are the most abundant SSRs, with the AG/TC motif accounting for 44.2 % of all SSRs in the genome. Fifty-five newly derived SSRs, together with 46 previously available SSRs, were integrated into linkage maps of an interspecific kiwifruit population. In addition, eight sex-linked SSR markers (including one previously published SSR) were mapped in the sex-related region on the LG25, suggesting that recombination is partially suppressed to maintain dioecy in kiwifruit. The SSRs developed from this study are a valuable resource for kiwifruit genetics and will contribute to the use of MAS in early sex determination of dioecious plant breeding. 相似文献

10.

Map and analysis of microsatellites in the genome of Populus: The first sequenced perennial plant

LI ShuXian & YIN TongMing 《中国科学：生命科学英文版》2007,50(5):690-699

Environmental Sciences Division, Oak Ridge National Laboratory, TN, USA We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated tr at SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites. 相似文献

11.

In-silico analysis of simple and imperfect microsatellites in diverse tobamovirus genomes

Chaudhary Mashhood Alam Avadhesh Kumar Singh Choudhary Sharfuddin Safdar Ali 《Gene》2013

An in-silico analysis of simple sequence repeats (SSRs) in 30 species of tobamoviruses was done. SSRs (mono to hexa) were present with variant frequency across species. Compound microsatellites, primarily of variant motifs accounted for up to 11.43% of the SSRs. Motif duplications were observed for A, T, AT, and ACA repeats. (AG)–(TC) was the most prevalent SSR-couple. SSRs were differentially localized in the coding region with ~ 54% on the 128 kDa protein while 20.37% was exclusive to 186 kDa protein. Characterization of such variations is important for elucidating the origin, sequence variations, and structure of these widely used, but incompletely understood sequences. 相似文献

12.

Mapping and analysis of simple sequence repeats in the Arabidopsis thaliana genome

Tamanna A Khan AU 《Bioinformation》2005,1(2):64-68

Simple sequence repeats (SSRs) are becoming standard DNA markers for plant genome analysis and are being used as markers in marker assisted breeding. And hence because of its great significance we have initiated this study to analyze complete genome of Arabidopsis thaliana for the prevalence of mono-, di-, tri-, tetra-, penta- and hexa- mer repeats in the coding and non-coding regions of the chromosome and to map their exact position on the sequence. We have developed a program that can search a repeat of any length, its exact position on the chromosome and also its frequency of occurrence in the genome. Analysis of the results reveal that maximum number of repeats were found in chromosome 1 followed by chromosome 2 and 4 whereas, chromosome 3 and 5 contain relatively less number of these repeats. Among the SSRs, hexamers and dimers were more predominant in the chromosomes. Overall data showed that Chromosome 5 has minimum number of repeats. The abundance or rarity of various simple repeats in different chromosomes is not explained by nucleotide composition of sequence or potential repeated motifs to form alternative DNA structures. This suggests that in addition to nucleotide composition of repeat motifs, characteristic DNA replication / repair / recombination machinery might play an important role in genesis of repeats. The positional information is given at www.geocities.com/amubioinfo/ARD. This positional information can help Arabidopsis researchers to identify new polymorphisms in chromosomal regions of interest based on the SSRs that map in the area. 相似文献

13.

Simple sequence repeats in organellar genomes of rice: frequency and distribution in genic and intergenic regions

Rajendrakumar P Biswal AK Balachandran SM Srinivasarao K Sundaram RM 《Bioinformatics (Oxford, England)》2007,23(1):1-4

MOTIVATION: Simple sequence repeats (SSRs) are abundant across genomes. However, the significance of SSRs in organellar genomes of rice has not been completely understood. The availability of organellar genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. RESULTS: We have analyzed SSRs in mitochondrial and chloroplast genomes of rice. We identified 2528 SSRs in the mitochondrial genome and average 870 SSRs in the chloroplast genomes. About 8.7% of the mitochondrial and 27.5% of the chloroplast SSRs were observed in the genic region. Dinucleotides were the most abundant repeats in genic and intergenic regions of the mitochondrial genome while mononucleotides were predominant in the chloroplast genomes. The rps and nad gene clusters of mitochondria had the maximum repeats, while the rpo and ndh gene clusters of chloroplast had the maximum repeats. We identified SSRs in both organellar genomes and validated in different cultivars and species. 相似文献

14.

Characterization of the chloroplast genome sequence of oil palm (Elaeis guineensis Jacq.)

P. Uthaipaisanwong J. Chanprasert J.R. Shearman D. Sangsrakru T. Yoocha N. Jomchai C. Jantasuriyarat S. Tragoonrung S. Tangphatsornruang 《Gene》2012

Oil palm (Elaeis guineensis Jacq.) is an economically important crop, which is grown for oil production. To better understand the molecular basis of oil palm chloroplasts, we characterized the complete chloroplast (cp) genome sequence obtained from 454 pyrosequencing. The oil palm cp genome is 156,973 bp in length consisting of a large single-copy region of?85,192 bp flanked on each side by inverted repeats of 27,071 bp with a small single-copy region of 17,639 bp joining the?repeats. The genome contains 112 unique genes: 79 protein-coding genes, 4 ribosomal RNA genes and 29 tRNA genes. By aligning the cp?genome sequence with oil palm cDNA sequences, we observed 18 non-silent and 10 silent RNA editing events among 19 cp protein-coding genes. Creation of an initiation codon by RNA editing in rpl2 has been reported in several monocots and was also found in the oil palm cp genome. Fifty common chloroplast protein-coding genes from 33 plant taxa were used to construct ML and MP?phylogenetic trees. Their topologies are similar and strongly support for the position of E. guineensis as the sister of closely related species Phoenix dactylifera in Arecaceae (palm families) of monocot subtrees. 相似文献

15.

棉属四倍体AD1与二倍体A2、D5基因组的同源SSR分析

孙高飞何守朴潘兆娥杜雄明《遗传》2015,37(2):192-203

SSRs(Simple sequence repeats)是一类广泛存在于动植物基因组的DNA短串联重复序列,是重要的基因组分子标记。比较不同基因组同源SSR的差异,有利于了解相近物种间的进化过程。文章使用雷蒙德氏棉基因组(D₅)、亚洲棉基因组(A₂)全基因组序列和陆地棉(AD₁)的限制性酶切基因组测序数据,进行全基因组SSR扫描,比较了A组和D组的SSR分布情况,通过识别3个基因组之间的同源SSR,比较它们之间同源SSR重复序列的差异。结果发现,A组和D组同源SSR的分布规律非常相似,但A组与AD组的同源SSR保守性比D组与AD组同源SSR的保守性强。与AD组同源SSR相比,A组中重复序列长度增长的SSR数量约为长度缩短的SSR数量的5倍,在D组中这一比值约为3倍。可以推测,四倍体AD组在与A组、D组的平行进化过程中,由于基因组融合,导致SSR的重复序列长度变化速率与二倍体A、D组有差异,同时这种差异可能导致了AD组SSR重复序列长度在进化过程中与二倍体相比有变短的趋势。文章首次对3个棉花基因组的同源SSR进行了系统地比较,发现了同源SSR在棉属四倍体基因组和二倍体基因组中的显著差异,为进一步揭示棉属基因组的进化规律提供了基础。相似文献

16.

Simple sequence repeats as advantageous mutators in evolution 总被引：3，自引：0，他引：3

Kashi Y King DG 《Trends in genetics : TIG》2006,22(5):253-259

相似文献

17.

Genome-wide analysis of simple sequence repeats in the model medicinal mushroom Ganoderma lucidum

Jun Qian Haibin XuJingyuan Song Jiang XuYingjie Zhu Shilin Chen 《Gene》2013

Simple sequence repeats (SSRs) or microsatellites are one of the most popular sources of genetic markers and play a significant role in gene function and genome organization. We identified SSRs in the genome of Ganoderma lucidum and analyzed their frequency and distribution in different genomic regions. We also compared the SSRs in G. lucidum with six other Agaricomycetes genomes: Coprinopsis cinerea, Laccaria bicolor, Phanerochaete chrysosporium, Postia placenta, Schizophyllum commune and Serpula lacrymans. Based on our search criteria, the total number of SSRs found ranged from 1206 to 6104 and covered from 0.04% to 0.15% of the fungal genomes. The SSR abundance was not correlated with the genome size, and mono- to tri-nucleotide repeats outnumbered other SSR categories in all of the species examined. In G. lucidum, a repertoire of 2674 SSRs was detected, with mono-nucleotides being the most abundant. SSRs were found in all genomic regions and were more abundant in non-coding regions than coding regions. The highest SSR relative abundance was found in introns (108 SSRs/Mb), followed by intergenic regions (84 SSRs/Mb). A total of 684 SSRs were found in the protein-coding sequences (CDSs) of 588 gene models, with 81.4% of them being tri- or hexa-nucleotides. After scanning for InterPro domains, 280 of these genes were successfully annotated, and 215 of them could be assigned to Gene Ontology (GO) terms. SSRs were also identified in 28 bioactive compound synthesis-related gene models, including one 3-hydroxy-3-methylglutaryl-CoA reductase (HMGR), three polysaccharide biosynthesis genes and 24 cytochrome P450 monooxygenases (CYPs). Primers were designed for the identified SSR loci, providing the basis for the future development of SSR markers of this medicinal fungus. 相似文献

18.

Genome-Wide Comparative Analyses of Microsatellites in Papaya

Jianping Wang Cuixia Chen Jong-Kuk Na Qingyi Yu Shaobin Hou Robert E. Paull Paul H. Moore Maqsudul Alam Ray Ming 《Tropical plant biology》2008,1(3-4):278-292

Microsatellites, or simple sequence repeats (SSRs), are highly polymorphic and universally distributed in eukaryotes. SSRs have been used extensively as sequence tagged markers in genetic studies. Recently, the functional and evolutionary importance of SSRs has received considerable attention. Here we report the mining and characterization of the SSRs in papaya genome. We analyzed SSRs from 277.4 Mb of whole genome shotgun (WGS) sequences, 51.2 Mb bacterial artificial chromosome (BAC) end sequences (BES), and 13.4 Mb expressed sequence tag (EST) sequences. The papaya SSR density was one SSR per 0.7 kb of DNA sequence in the WGS, which was higher than that in BES and EST sequences. SSR abundance was dramatically reduced as the repeat length increased. According to SSR motif length, dinucleotide repeats were the most common motif in class I, whereas hexanucleotides were the most copious in class II SSRs. The tri- and hexanucleotide repeats of both classes were greater in EST sequences compared to genomic sequences. In class I SSR, AT and AAT were the most frequent motifs in BES and WGS sequences. By contrast, AG and AAG were the most abundant in EST sequences. For SSR marker development, 9,860 primer pairs were surveyed for amplification and polymorphism. Successful amplification and polymorphic rates were 66.6% and 17.6%, respectively. The highest polymorphic rates were achieved by AT, AG, and ATG motifs. The genome wide analysis of microsatellites revealed their frequency and distribution in papaya genome, which varies among plant genomes. This complete set of SSRs markers throughout the genome will assist diverse genetic studies in papaya and related species. 相似文献

19.

High GC content of simple sequence repeats in Herpes simplex virus type 1 genome

Ouyang Q Zhao X Feng H Tian Y Li D Li M Tan Z 《Gene》2012,499(1):37-40

The presence, locations and composition of simple sequence repeats (SSRs) in Herpes simplex virus type 1 (HSV-1) genome were extracted and analyzed by using the software Imperfect Microsatellite Extractor (IMEx). There were 663 mon-, 502 di-, 184 tri-, 20 tetra-, 4 penta- and 4 hexanucleotide SSRs that were observed in different distribution between coding and noncoding regions in the HSV-1 genome. G/C, GC/CG, and (GGC)(n) were predominant in mononucleotide, dinucletide, trinucleotide repeats respectively. Indeed, the results showed that GC content in simple sequence repeats was notably higher than that in entire HSV-1 genome. Our data might be helpful for studying the pathogenesis, genome structure and evolution of HSV-1. 相似文献

20.

Simple sequence repeats in different genome sequences of Shigella and comparison with high GC and AT-rich genomes.

Ashraf Hosseini Suvidya H Ranade Indira Ghosh Pramod Khandekar 《DNA sequence》2008,19(3):167-176

Simple sequence repeats (SSRs) are omnipresent in prokaryotes and eukaryotes, and are found anywhere in the genome in both protein encoding and noncoding regions. In present study the whole genome sequences of seven chromosomes (Shigella flexneri 2a str301 and 2457T, Shigella sonnei, Escherichia coli k12, Mycobacterium tuberculosis, Mycobacterium leprae and Staphylococcus saprophyticus) have downloaded from the GenBank database for identifying abundance, distribution and composition of SSRs and also to determine difference between the tandem repeats in real genome and randomness genome (using sequence shuffling tool) of the organisms included in this study. The data obtained in the present study show that: (i) tandem repeats are widely distributed throughout the genomes; (ii) SSRs are differentially distributed among coding and noncoding regions in investigated Shigella genomes; (iii) total frequency of SSRs in noncoding regions are higher than coding regions; (iv) in all investigated chromosomes ratio of Trinucleotide SSRs in real genomes are much higher than randomness genomes and Di nucleotide SSRs are lower; (v) Ratio of total and mononucleotide SSRs in real genome is higher than randomness genomes in E. coli K12, S. flexneri str 301 and S. saprophyticus, while it is lower in S. flexneri str 2457T, S.sonnei and M. tuberculosis and it is approximately same in M. leprae; (vi) frequency of codon repetitions are vary considerably depending on the type of encoded amino acids. 相似文献