首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
柑橘EST-SSR分子标记分析   总被引:25,自引:0,他引:25  
江东  钟广炎  洪棋斌 《遗传学报》2006,33(4):345-353
对来源于甜橙(Citrus sinensis Osbeck)、枳壳(Poncirus trifoliata Raf.)和其他柑橘非冗余EST数据库的38124条单-基因(Unigene)序列进行了简单重复序列SSRs(Simple Sequence Repeat)搜索,所分析的柑橘非冗余核酸序列总长23.29Mb,从中获得了8218条SSR,其中包括单碱基重复4913条(59.8%),2碱基重复1419条(17.3%),3碱基重复1709条(20.8%),4碱基重复114条(1.39%),5碱基重复23条(0.28%),6碱基重复40条(0.49%)。大约每2.8kb长度的单-基因序列中即存在1个SSR,即平均4.6个单-基因中存在1个SSR。随碱基重复单元(motif)的不同,SSR的最大长度在40-105之间,全部重复序列的平均长度为20.9bp。各种SSR(1-,2-,3-,4-,5-,6-核苷酸重复)的发生频率在甜橙和枳壳间非常接近。其中单碱基重复序列是最丰富的重复单元,其次为3碱基重复。在所得的SSR的重复单元中,富含A碱基的重复单元的分布占据优势地位,出现的频率与密度均较高,而富含CG碱基的重复单元出现频率和密度较低。用25对EST-SSR引物对6个柑橘品种的多样性进行了PCR检测,结果表明,所有25对引物在6个柑橘品种间均扩增到多样性条带,证实通过柑橘EST数据库的发掘能够高效地筛选到基因水平的SSR标记。  相似文献   

2.
Chen M  Tan Z  Zeng G 《Bioinformation》2011,6(4):171-172
Simple sequence repeats (SSRs) are ubiquitous short tandem repeats, which are associated with various regulatory mechanisms and have been found in viral genomes. Herein, we develop MfSAT (Multi-functional SSRs Analytical Tool), a new powerful tool which can fast identify SSRs in multiple short viral genomes and then automatically calculate the numbers and proportions of various SSR types (mono-, di-, tri-, tetra-, penta- and hexanucleotide repeats). Furthermore, it also can detect codon repeats and report the corresponding amino acid.  相似文献   

3.
A genome-wide sequence search was conducted to identify simple sequence repeat (SSR) loci in phylloxera, Daktulosphaira vitifoliae, a major grape pest throughout the world. Collectively, 1524 SSR loci containing mono-, di-, tri-, tetra-, penta-, and hexanucleotide motifs were identified. Among them, trinucleotide repeats were the most abundant in the phylloxera genome (34.4%), followed by hexanucleotide (20.4%) and dinucleotide (19.6%) repeats. Mono-, tetra- and pentanucleotide repeats were found at a frequency of 1.3, 11.2 and 12.9%, respectively. The abundance and inherent variations in SSRs provide valuable information for developing molecular markers. The high levels of allelic variation and codominant features of SSRs make this marker system a useful tool for genotyping, diversity assessment and population genetic studies of reproductive characteristics of phylloxera in agricultural and natural populations.  相似文献   

4.
Simple sequence repeats (SSRs) have been widely used in maize genetics and breeding, because they are co-dominant, easy to score, and highly abundant. In this study, we used whole-genome sequences from 16 maize inbreds and 1 wild relative to determine SSR abundance and to develop a set of high-density polymorphic SSR markers. A total of 264 658 SSRs were identified across the 17 genomes, with an average of 135 693 SSRs per genome. Marker density was one SSR every of 15.48 kb. (C/G)n, (AT)n, (CAG/CTG)n, and (AAAT/ATTT)n were the most frequent motifs for mono, di-, tri-, and tetra-nucleotide SSRs, respectively. SSRs were most abundant in intergenic region and least frequent in untranslated regions, as revealed by comparing SSR distributions of three representative resequenced genomes. Comparing SSR sequences and e-polymerase chain reaction analysis among the 17 tested genomes created a new database, including 111 887 SSRs, that could be develop as polymorphic markers in silico. Among these markers, 58.00, 26.09, 7.20, 3.00, 3.93, and 1.78% of them had mono, di-, tri-, tetra-, penta-, and hexa-nucleotide motifs, respectively. Polymorphic information content for 35 573 polymorphic SSRs out of 111 887 loci varied from 0.05 to 0.83, with an average of 0.31 in the 17 tested genomes. Experimental validation of polymorphic SSR markers showed that over 70% of the primer pairs could generate the target bands with length polymorphism, and these markers would be very powerful when they are used for genetic populations derived from various types of maize germplasms that were sampled for this study.  相似文献   

5.
In the present study, 3217 UniGene sequences of Neurospora crassa downloaded from the National Center for Biotechnology Information (NCBI) were mined for the identification of microsatellites or simple sequence repeats (SSRs). A total of 287 SSRs detected gives density of 1SSR/14.6 kb of 4187.86 kb sequences mined suggests that only 250 (7.8%) of sequences contained SSRs. Depending on the repeat units, the length of SSRs ranged from 14 to 17 bp for mono-, 14 to 48 bp for di-, 18 to 90 bp for tri-, 24 to 48 bp for tetra-, 30 for penta- and 42 to 48 bp for hexa-nucleotide repeats. Tri-nucleotide repeats were the most frequent repeat type (88.8%) followed by di-nucleotide repeats (5.9%). An attempt was also made with the help of bioinformatics approach to find out primer pairs for identified SSRs and primers were found only for 239 sequences. But, this part needs experimental validation. Annotation of SSRs containing sequences was also carried out.  相似文献   

6.
Microsatellites or simple sequence repeats (SSRs) are distributed across both prokaryotic and eukaryotic genomes and have been widely used for genetic studies and molecular marker-assisted breeding in crops. Though an ordered draft sequence of hexaploid bread wheat have been announced, the researches about systemic analysis of SSRs for wheat still have not been reported so far. In the present study, we identified 364,347 SSRs from among 10,603,760 sequences of the Chinese spring wheat (CSW) genome, which were present at a density of 36.68 SSR/Mb. In total, we detected 488 types of motifs ranging from di- to hexanucleotides, among which dinucleotide repeats dominated, accounting for approximately 42.52% of the genome. The density of tri- to hexanucleotide repeats was 24.97%, 4.62%, 3.25% and 24.65%, respectively. AG/CT, AAG/CTT, AGAT/ATCT, AAAAG/CTTTT and AAAATT/AATTTT were the most frequent repeats among di- to hexanucleotide repeats. Among the 21 chromosomes of CSW, the density of repeats was highest on chromosome 2D and lowest on chromosome 3A. The proportions of di-, tri-, tetra-, penta- and hexanucleotide repeats on each chromosome, and even on the whole genome, were almost identical. In addition, 295,267 SSR markers were successfully developed from the 21 chromosomes of CSW, which cover the entire genome at a density of 29.73 per Mb. All of the SSR markers were validated by reverse electronic-Polymerase Chain Reaction (re-PCR); 70,564 (23.9%) were found to be monomorphic and 224,703 (76.1%) were found to be polymorphic. A total of 45 monomorphic markers were selected randomly for validation purposes; 24 (53.3%) amplified one locus, 8 (17.8%) amplified multiple identical loci, and 13 (28.9%) did not amplify any fragments from the genomic DNA of CSW. Then a dendrogram was generated based on the 24 monomorphic SSR markers among 20 wheat cultivars and three species of its diploid ancestors showing that monomorphic SSR markers represented a promising source to increase the number of genetic markers available for the wheat genome. The results of this study will be useful for investigating the genetic diversity and evolution among wheat and related species. At the same time, the results will facilitate comparative genomic studies and marker-assisted breeding (MAS) in plants.  相似文献   

7.
The abundance and inherent potential for variations in simple sequence repeats (SSRs) or microsatellites resulted in valuable source for genetic markers in eukaryotes. We describe the organization and abundance of SSRs in fungus Fusarium graminearum (causative agent for Fusarium head blight or head scab of wheat). We identified 1705 SSRs of various nucleotide repeat motifs in the sequence database of F. graminearum. It is observed that mononucleotide repeats (62%) were most abundant followed by di- (20%) and trinucleotide repeats (14%). It is noted that tetra-, penta- and hexanucleotide repeats accounted for only 4% of SSRs. The estimated frequency of Class I SSRs (perfect repeats ≥20 nucleotides) was one SSR per 124.5 kb, whereas the frequency of Class II (perfect repeats >10 nucleotides and ≫20 nucleotides) was one SSR per 25.6 kb. The dynamics of SSRs will be a powerful tool for taxonomic, phylogenetic, genome mapping and population genetic studies as SSR based markers show high levels of allelic variation, codominant inheritance and ease of analysis.  相似文献   

8.
Microsatellites (simple sequence repeats, SSRs) are important genetic markers in tree breeding and conservation. Here we utilized high-throughput 454 sequencing technology to mine microsatellites from masson pine (MP) genomic DNA. First, we analyzed the characteristics of SSRs in all nonredundant MP reads (genome survey sequences, GSSs) and compared them with loblolly pine (LP) GSSs and BACs (bacterial artificial chromosome clone sequences), and three other nonconiferous species GSSs. Second, a set of MP GSS–SSR primer pairs were designed. There were extremely low overall GSS–SSR densities (28 SSR/Mb) in MP when compared with LP (48 SSR/Mb) and the other species. AT, AAT, AAAT, and AAAAAT were the richest motifs in di-, tri-, tetra-, and hexanucleotides, respectively. Two hundred forty GSS–SSR primer pairs were designed in total, and 20 novel polymorphic markers were identified using three populations (two natural and one clonal seed orchard) as evaluating samples. These markers should be useful for future MP population genetics studies.  相似文献   

9.
叶城沙蜥Phrynocephalus axillaris是我国特有的一种小型爬行动物,广泛分布于新疆塔里木盆地、吐鲁番-哈密盆地和甘肃敦煌盆地。本研究利用Roche 454 GS FLX高通量测序技术进行叶城沙蜥微卫星位点筛选,获得了91 190条高质量序列。用Krait搜索微卫星位点,共得到1~6个碱基重复类型的完美型微卫星序列29 890个。不同类型微卫星中,单碱基重复类型数目最多,有14 630个,占总数的48. 95%,其次是二碱基,约占28. 60%,四碱基、三碱基、五碱基和六碱基分别占10. 73%、10. 48%、0. 92%和0. 32%。二碱基微卫星中AC重复类型数量最多,三碱基、四碱基、五碱基和六碱基中分别是ATC、AAAT、AAAAT和AATCCC。叶城沙蜥完美型微卫星中数量最多的11种重复拷贝类型分别为C、A、AC、AG、AAAT、ATC、AT、AAT、ATAG、AGG和AAC。本研究深化了对叶城沙蜥基因组的了解,并为以后开发和筛选大量高质量微卫星标记提供了数据支持,也为利用微卫星标记研究叶城沙蜥种群遗传结构和谱系地理模式奠定了基础。  相似文献   

10.
Sweet orange (Citrus sinensis) is one of the major cultivated and most-consumed citrus species. With the goal of enhancing the genomic resources in citrus, we surveyed, developed and characterized microsatellite markers in the ≈347 Mb sequence assembly of the sweet orange genome. A total of 50,846 SSRs were identified with a frequency of 146.4 SSRs/Mbp. Dinucleotide repeats are the most frequent repeat class and the highest density of SSRs was found in chromosome 4. SSRs are non-randomly distributed in the genome and most of the SSRs (62.02%) are located in the intergenic regions. We found that AT-rich SSRs are more frequent than GC-rich SSRs. A total number of 21,248 SSR primers were successfully developed, which represents 89 SSR markers per Mb of the genome. A subset of 950 developed SSR primer pairs were synthesized and tested by wet lab experiments on a set of 16 citrus accessions. In total we identified 534 (56.21%) polymorphic SSR markers that will be useful in citrus improvement. The number of amplified alleles ranges from 2 to 12 with an average of 4 alleles per marker and an average PIC value of 0.75. The newly developed sweet orange primer sequences, their in silico PCR products, exact position in the genome assembly and putative function are made publicly available. We present the largest number of SSR markers ever developed for a citrus species. Almost two thirds of the markers are transferable to 16 citrus relatives and may be used for constructing a high density linkage map. In addition, they are valuable for marker-assisted selection studies, population structure analyses and comparative genomic studies of C. sinensis with other citrus related species. Altogether, these markers provide a significant contribution to the citrus research community.  相似文献   

11.
We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated that SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites.  相似文献   

12.
Environmental Sciences Division, Oak Ridge National Laboratory, TN, USA We mapped and analyzed the microsatellites throughout 284295605 base pairs of the unambiguously assembled sequence scaffolds along 19 chromosomes of the haploid poplar genome. Totally, we found 150985 SSRs with repeat unit lengths between 2 and 5 bp. The established microsatellite physical map demonstrated tr at SSRs were distributed relatively evenly across the genome of Populus. On average, These SSRs occurred every 1883 bp within the poplar genome and the SSR densities in intergenic regions, introns, exons and UTRs were 85.4%, 10.7%, 2.7% and 1.2%, respectively. We took di-, tri-, tetra-and pentamers as the four classes of repeat units and found that the density of each class of SSRs decreased with the repeat unit lengths except for the tetranucleotide repeats. It was noteworthy that the length diversification of microsatellite sequences was negatively correlated with their repeat unit length and the SSRs with shorter repeat units gained repeats faster than the SSRs with longer repeat units. We also found that the GC content of poplar sequence significantly correlated with densities of SSRs with uneven repeat unit lengths (tri-and penta-), but had no significant correlation with densities of SSRs with even repeat unit lengths (di-and tetra-). In poplar genome, there were evidences that the occurrence of different microsatellites was under selection and the GC content in SSR sequences was found to significantly relate to the functional importance of microsatellites.  相似文献   

13.
To identify EST-SSR molecular markers, 41,986 cattle UniGene sequences from NCBI were mined for analyzing SSRs. A total of 1,831 SSRs were identified from 1,666 ESTs, which represented an average density of 19.88 kb per SSR. The frequency of EST-SSRs was 4.0%. The dinucleotide repeat motif was the most abundant SSR, accounting for 54%, followed by 22%, 13%, 7% and 4%, respec-tively, for tri-, hexa-, penta- and tetra-nucleotide repeats. Depending upon the length of the repeat unit, the length of microsatellites varied from 14 to 86 bp. Among the di- and tri-nucleotide repeats, AC/TG (57%) and AGC (12%) were the most abundant type. Annotation of EST-SSRs was also carried out. Three hundred primer pairs were randomly designed using Prime Premier 5.0 program and Oligo 5.0 for further experimental validation.  相似文献   

14.
Simple sequence repeat (SSR) markers are widely used in many plant and animal genomes due to their abundance, hypervariability, and suitability for high-throughput analysis. Development of SSR markers using molecular methods is time consuming, laborious, and expensive. Use of computational approaches to mine ever-increasing sequences such as expressed sequence tags (ESTs) in public databases permits rapid and economical discovery of SSRs. Most of such efforts to date focused on mining SSRs from monocotyledonous ESTs. In this study, we have computationally mined and examined the abundance of SSRs in more than 1.54 million ESTs belonging to 55 dicotyledonous species. The frequency of ESTs containing SSRs among species ranged from 2.65% to 16.82%. Dinucleotide repeats were found to be the most abundant followed by tri- or mono-nucleotide repeats. The motifs A/T, AG/GA/CT/TC, and AAG/AGA/GAA/CTT/TTC/TCT were the predominant mono-, di-, and tri-nucleotide SSRs, respectively. Most of the mononucleotide SSRs contained 15-25 repeats, whereas the majority of the di- and tri-nucleotide SSRs contained 5-10 repeats. The comprehensive SSR survey data presented here demonstrates the potential of in silico mining of ESTs for rapid development of SSR markers for genetic analysis and applications in dicotyledonous crops.  相似文献   

15.
A sequence search of swine expressed sequence tags (EST) data in GenBank identified over 100 sequence files which contained a microsatellite repeat or simple sequence repeat (SSR). Most of these repeat motifs were dinucleotide (CA/GT) repeats; however, a number of tri-, tetra-, penta- and hexa-nucleotide repeats were also detected. An initial assessment of six dinucleotide and 14 higher-order repeat markers indicated that only dinucleotide markers yielded a sufficient number of informative markers (100% vs. 14% for dinucleotide and higher order repeats, respectively). Primers were designed for an additional 50 di- and one tri-nucleotide SSRs. Overall, 42 markers were polymorphic in the US Meat Animal Research Center (MARC) reference population, 17 markers were uninformative and 12 primer pairs failed to satisfactorily amplify genomic DNA. A comparison of di-nucleotide repeat vs. markers with repeat motifs of three to six bases demonstrated that 72% of dinucleotide markers were informative relative to only 7% of other repeat motifs. The difference was the result of a much higher percentage of monomorphic markers in the three to six base repeat motif markers than in the dinucleotide markers (64% vs. 14%). Either higher order repeat motifs are less polymorphic in the porcine genome or our selection criteria for repeat length of more than 17 contiguous bases was too low. The mapped microsatellite markers add to the porcine genetic map and provide valuable links between the porcine and human genome.  相似文献   

16.
Ouyang Q  Zhao X  Feng H  Tian Y  Li D  Li M  Tan Z 《Gene》2012,499(1):37-40
The presence, locations and composition of simple sequence repeats (SSRs) in Herpes simplex virus type 1 (HSV-1) genome were extracted and analyzed by using the software Imperfect Microsatellite Extractor (IMEx). There were 663 mon-, 502 di-, 184 tri-, 20 tetra-, 4 penta- and 4 hexanucleotide SSRs that were observed in different distribution between coding and noncoding regions in the HSV-1 genome. G/C, GC/CG, and (GGC)(n) were predominant in mononucleotide, dinucletide, trinucleotide repeats respectively. Indeed, the results showed that GC content in simple sequence repeats was notably higher than that in entire HSV-1 genome. Our data might be helpful for studying the pathogenesis, genome structure and evolution of HSV-1.  相似文献   

17.
Development of genomic resources in any crop is the pre-requisite for the construction of linkage map and implementation of molecular breeding strategies to develop superior cultivars. Large number of molecular markers are required to enrich the scanty information available in horsegram (Macrotyloma uniflorum).We employed the next-generation Illumina sequencing platform to develop a large number of microsatellite markers in this species. Of the total 23,305 potential SSRs motifs, 5755 primers were designed. Of these, 1425, 1310, 856, 1276, and 888 were of di-, tri-, tetra-, penta-, and hexa-nucleotide repeats respectively. Thirty polymorphic SSR primers and 24 morphological traits were used in 360 horsegram accessions to detect the genetic diversity and population structure. Thirty primers amplified 170 polymorphic alleles with an average of 5.6 alleles per primer having size 80 to 380 bp. The polymorphism information content (PIC) ranged from 0.15 to 0.76 with an average of 0.50, suggesting that SSR markers used in the study were polymorphic and suitable for characterization of horsegram germplasm. Dendrogram-based on Jaccard’s similarity coefficient and neighbor-joining tree grouped the horsegram accessions into two major clusters. Similarly, STRUCTURE analysis assigned genotypes into two gene pools namely Himalayan origin and Southern India. Diversity analysis based on 24 agro-morphological traits also suggested the presence of high level of diversity among the accessions.  相似文献   

18.
Microsatellites or simple sequence repeats (SSRs) are among the genetic markers most widely utilized in research. This includes applications in numerous fields such as genetic conservation, paternity testing, and molecular breeding. Though ordered draft genome assemblies of camels have been announced, including for the Arabian camel, systemic analysis of camel SSRs is still limited. The identification and development of informative and robust molecular SSR markers are essential for marker assisted breeding programs and paternity testing. Here we searched and compared perfect SSRs with 1–6 bp nucleotide motifs to characterize microsatellites for draft genome sequences of the Camelidae. We analyzed and compared the occurrence, relative abundance, relative density, and guanine-cytosine (GC) content in four taxonomically different camelid species: Camelus dromedarius, C. bactrianus, C. ferus, and Vicugna pacos. A total of 546762, 544494, 547974, and 437815 SSRs were mined, respectively. Mononucleotide SSRs were the most frequent in the four genomes, followed in descending order by di-, tetra-, tri-, penta-, and hexanucleotide SSRs. GC content was highest in dinucleotide SSRs and lowest in mononucleotide SSRs. Our results provide further evidence that SSRs are more abundant in noncoding regions than in coding regions. Similar distributions of microsatellites were found in all four species, which indicates that the pattern of microsatellites is conserved in family Camelidae.  相似文献   

19.
We screened for simple sequence repeats (SSRs) found in ESTs derived from an EST-database development project ('Marine Genomics Europe' Network of Excellence). Different motifs of di-, tri-, tetra-, penta- and hexanucleotide SSRs were evaluated for variation in length and position in the expressed sequences, relative abundance and distribution in gilthead sea bream (Sparus aurata). We found 899 ESTs that harbor 997 SSRs (4.94%). On average, one SSR was found per 2.95 kb of EST sequence and the dinucleotide SSRs are the most abundant accounting for 47.6% of the total number. EST-SSRs were used as template for primer design. 664 primer pairs could be successfully identified and a subset of 206 pairs of primers was synthesized, PCR-tested and visualized on ethidium bromide stained agarose gels. The main objective was to further assess the potential of EST-SSRs as informative markers and investigate their cross-species amplification in sixteen teleost fish species: seven sparid species and nine other species from different families. Approximately 78% of the primer pairs gave PCR products of expected size in gilthead sea bream, and as expected, the rate of successful amplification of sea bream EST-SSRs was higher in sparids, lower in other perciforms and even lower in species of the Clupeiform and Gadiform orders. We finally determined the polymorphism and the heterozygosity of 63 markers in a wild gilthead sea bream population; fifty-eight loci were found to be polymorphic with the expected heterozygosity and the number of alleles ranging from 0.089 to 0.946 and from 2 to 27, respectively. These tools and markers are expected to enhance the available genetic linkage map in gilthead sea bream, to assist comparative mapping and genome analyses for this species and further with other model fish species and finally to help advance genetic analysis for cultivated and wild populations and accelerate breeding programs.  相似文献   

20.
烟草EST-SSR位点分析   总被引:10,自引:0,他引:10  
利用MISA软件对烟草EST公共数据库中的简单重复序列(SSRs)进行了分析。结果表明,在133523条EST序列中,共获得81757条SSR序列,SSRs之间的距离约为0.92 kb。其中,六碱基重复丰度最大,占60.3%,而单碱基、三碱基、四碱基、二碱基和五碱基重复丰度分别为20.0%、11.0%、4.2%、2.8%和1.7%。在单碱基、二碱基、三碱基和四碱基重复模体中,丰度最大的分别是A/T、AG、AAG和AAAT,而CG在编码区内丰度很低。用CAP3软件进行冗余分析表明,在这6种类型的重复模体中,冗余与非冗余的烟草EST之间没有显著差异。在得到的SSR序列中随机选择10个序列设计引物,在7个烟草品种中进行PCR扩增。结果表明,10对引物全部扩增出PCR产物,其中8对引物扩增出预期片段。用这8组扩增出预期片段的PCR产物进行变性PAGE凝胶电泳检测,结果表明,其中有4对引物(EB4、EB5、EB6和EB8)扩增出多态性条带。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号