首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
We studied microsatellite frequency and distribution in 21.76-Mb random genomic sequences, 0.67-Mb BAC sequences from the Z chromosome, and 6.3-Mb EST sequences of Bombyx mori. We mined microsatellites of >/=15 bases of mononucleotide repeats and >/=5 repeat units of other classes of repeats. We estimated that microsatellites account for 0.31% of the genome of B. mori. Microsatellite tracts of A, AT, and ATT were the most abundant whereas their number drastically decreased as the length of the repeat motif increased. In general, tri- and hexanucleotide repeats were overrepresented in the transcribed sequences except TAA, GTA, and TGA, which were in excess in genomic sequences. The Z chromosome sequences contained shorter repeat types than the rest of the chromosomes in addition to a higher abundance of AT-rich repeats. Our results showed that base composition of the flanking sequence has an influence on the origin and evolution of microsatellites. Transitions/transversions were high in microsatellites of ESTs, whereas the genomic sequence had an equal number of substitutions and indels. The average heterozygosity value for 23 polymorphic microsatellite loci surveyed in 13 diverse silkmoth strains having 2-14 alleles was 0.54. Only 36 (18.2%) of 198 microsatellite loci were polymorphic between the two divergent silkworm populations and 10 (5%) loci revealed null alleles. The microsatellite map generated using these polymorphic markers resulted in 8 linkage groups. B. mori microsatellite loci were the most conserved in its immediate ancestor, B. mandarina, followed by the wild saturniid silkmoth, Antheraea assama.  相似文献   

2.
Exploiting dinucleotide microsatellites conserved among mammalian species   总被引:3,自引:0,他引:3  
Dinucleotide microsatellites are useful for gene mapping projects. Depending upon definition of conservation, published estimates of dinucleotide microsatellite conservation levels vary dramatically (30% to 100%). This study focused on well-characterized genes that contain microsatellites in the human genome. The objective was to examine the feasibility of developing microsatellite markers within genes on the basis of the assumption of microsatellite conservation across distantly related species. Eight genes (Gamma-actin, carcinoembryonic antigen, apolipoprotein A-II, cardiac beta myosin heavy chain, laminin B2 chain, MHC class I CD8 alpha chain, c-reactive protein, and retinoblastoma susceptibility protein) containing large dinucleotide repeat units (N ≥ 15), complete genomic structure information, and homologous gene sequences in a second species were selected. Heterologous primers were designed from conserved exon sequences flanking a microsatellite motif. PCR products from bovine and porcine genomic DNA were tested for the presence of microsatellite sequences by Southern blot hybridization with biotin-labeled (CA)12 oligonucleotides. Fragments containing microsatellites were cloned and sequenced. Homology was verified by sequence comparisons between human and corresponding bovine or porcine fragments. Four of sixteen (25%) cross-amplified PCR products contained dinucleotide repetitive sequences with repeat unit lengths of 5 to 23. Two dinucleotide repetitive sequences showed microsatellite length polymorphism, and an additional sequence displayed single-strand conformational polymorphism. Results from this study suggest that exploitation of conserved microsatellite sequences is a useful approach for developing specific genetic markers for comparative mapping purposes. Received: 7 July 1995 / Accepted: 28 September 1995  相似文献   

3.
A bovine genomic phagemid library was constructed with randomly sheared DNA. Enrichment of this single-stranded DNA library with CA or GT primers resulted in 45% positive clones. The 14% of positive clones with (CA · GT)>12, and not containing flanking repetitive elements, were sequenced, and the efficiency of marker production was compared with random M13 bacteriophage libraries. Primer sequences and genotyping information are presented for 390 informative bovine microsatellite markers. The genomic frequency for 11 tri- and tetranucleotide repeats was estimated by hybridization to a lambda genomic library. Only GCT, GGT, and GGAT were estimated to have a frequency of >100 per genome. Enrichment of the phagemid library for these repeats failed to provide a viable source of microsatellite markers in the bovine. Comparison of map interval lengths between 100 markers from the enriched library prepared from randomly sheared DNA and M13 bacteriophage libraries prepared from Mbo1 restriction digests suggested no bias in skeletal genomic coverage based on source of small insert DNA. In conclusion, enrichment of the bovine phagemid library provides a sufficient source of microsatellites so that small repeat lengths and flanking repetitive sequences common in the bovine can be eliminated, resulting in a high percentage of informative markers.The nucleotide sequence data reported in this paper have been submitted to GenBank and have been assigned the accession numbers U25689 and U25690.  相似文献   

4.
通过对桉树属(Eucalyptus)的10000条EST序列进行分析,在其中的1499条序列上共发现1775个微卫星重复序列。含有微卫星的EST序列约占序列总数的15%。此外,还发现桉树EST序列所含微卫星长度的变异速率与重复单元长度呈负相关;微卫星的丰度与重复单元长度也呈负相关(三碱基重复微卫星除外)。在桉树EST序列中,重复单元长度为三碱基的微卫星最为丰富。三碱基重复单元微卫星的过度富集可能是由于遗传密码选择所致。在微卫星的丰度及长度变异方面,桉树EST序列与杨树(Populus trichocarpa)基因组注释的转录序列随重复单元长度的变化呈现出相同的规律,但桉树EST序列中微卫星频率及三碱基重复微卫星的含量显著偏低,推测含微卫星的基因表达丰度极有可能低于不含微卫星的基因。通过对发现的所有微卫星位点进行引物设计,并对设计的引物进行PCR检测,结果表明所设计的引物具有极高的扩增成功率。  相似文献   

5.
桉树EST序列中微卫星含量及相关特征   总被引:6,自引:0,他引:6  
通过对桉树属(Eucalyptus)的10 000条EST序列进行分析, 在其中的1 499条序列上共发现1 775个微卫星重复序列。含有微卫星的EST序列约占序列总数的15%。此外, 还发现桉树EST序列所含微卫星长度的变异速率与重复单元长度呈负相关; 微卫星的丰度与重复单元长度也呈负相关(三碱基重复微卫星除外)。在桉树EST序列中, 重复单元长度为三碱基的微卫星最为丰富。三碱基重复单元微卫星的过度富集可能是由于遗传密码选择所致。在微卫星的丰度及长度变异方面, 桉树EST序列与杨树(Populus trichocarpa)基因组注释的转录序列随重复单元长度的变化呈现出相同的规律, 但桉树EST序列中微卫星频率及三碱基重复微卫星的含量显著偏低, 推测含微卫星的基因表达丰度极有可能低于不含微卫星的基因。通过对发现的所有微卫星位点进行引物设计, 并对设计的引物进行PCR检测, 结果表明所设计的引物具有极高的扩增成功率。  相似文献   

6.
Efforts to construct a genetic linkage map of channel catfish have involved identification of random genomic microsatellite markers, as well as anchored Type I loci (expressed genes) from channel catfish. To identify Type I markers we constructed a directional cDNA library from brain tissue to obtain expressed catfish sequences that could be used for single nucleotide polymorphism (SNP) marker development. These cDNA sequences surprisingly contained a high proportion of microsatellites (about 14%) in noncoding regions of expressed sequence tags (ESTs), many of which were not associated with known sequences. To further identify cDNAs with microsatellites and reduce the number of sequencing reactions needed for marker development, we enriched this library for repeat sequences and sequenced clones from both directions. A total of 1644 clones from seven repeat-enriched captures (CA, GT, CT, GA, MTT, TAG, and TAC) were sequenced from both ends, and 795 nonredundant clones were assembled. Thirty-seven percent of the clones contained microsatellites in the trimmed sequence. After assembly in the TIGR Catfish Gene Index (CfGI), 154 contigs matched known vertebrate genes and 92 contigs contained microsatellites. When BLAST-matched orthologues were available for similarity alignments, 28% of these contigs contained repeats in the 5'-UTR, 72% contained repeats in the 3'-UTR, and 8% contained repeats at both ends. Using biotinylated repeat oligonucleotides coupled with streptavidin-coated magnetic beads, and rapid, single-pass hybridization, we were able to enrich our plasmid library greater than two-fold for repeat sequences and increase the ability to link these ESTs with known sequences greater than six-fold.  相似文献   

7.
Microsatellites, or simple sequence repeats (SSRs), have become the markers of choice for genetic studies with many crop species including wheat. Currently an international effort is underway to enrich the repertoire of available sequence tagged microsatellite site (STMS) markers in wheat. As a part of this effort, we have sequenced 43 clones obtained from a microsatellite-enriched wheat genomic library; 34 clones contained 41 different microsatellites. These microsatellites (mono-, di-, tri- nucleotide repeats) were classified as 19 simple perfect, 18 simple imperfect and 4 compound imperfect types. Dinucleotide repeats were the most abundant (70%). Primer pairs for only 16 microsatellites could be designed, since the flanking sequences of the others were either too short or were otherwise not suitable for designing the microsatellite specific primers. Microsatellite loci of the expected size and polymorphism were successfully amplified from 15 of these 16 primer pairs using three wheat varieties. 14 loci detected by 12 out of the 15 functional primer pairs were assigned to 11 specific chromosomes. An erratum to this article is available at .  相似文献   

8.
Efforts to construct a genetic linkage map of channel catfish have involved identification of random genomic microsatellite markers, as well as anchored Type I loci (expressed genes) from channel catfish. To identify Type I markers we constructed a directional cDNA library from brain tissue to obtain expressed catfish sequences that could be used for single nucleotide polymorphism (SNP) marker development. These cDNA sequences surprisingly contained a high proportion of microsatellites (about 14%) in noncoding regions of expressed sequence tags (ESTs), many of which were not associated with known sequences. To further identify cDNAs with microsatellites and reduce the number of sequencing reactions needed for marker development, we enriched this library for repeat sequences and sequenced clones from both directions. A total of 1644 clones from seven repeat-enriched captures (CA, GT, CT, GA, MTT, TAG, and TAC) were sequenced from both ends, and 795 nonredundant clones were assembled. Thirty-seven percent of the clones contained microsatellites in the trimmed sequence. After assembly in the TIGR Catfish Gene Index (CfGI), 154 contigs matched known vertebrate genes and 92 contigs contained microsatellites. When BLAST-matched orthologues were available for similarity alignments, 28% of these contigs contained repeats in the 5'-UTR, 72% contained repeats in the 3'-UTR, and 8% contained repeats at both ends. Using biotinylated repeat oligonucleotides coupled with streptavidin-coated magnetic beads, and rapid; single-pass hybridization, we were able to enrich our plasmid library greater than two-fold for repeat sequences and increase the ability to link these ESTs with known sequences greater than six-fold.  相似文献   

9.
Previously isolated tomato (Lycopersicon esculentum) microsatellite markers were mainly clustered in the centromeric heterochromatin and not located in euchromatic regions. To achieve a more-uniform distribution of microsatellite markers for genome mapping purposes, a set of tomato microsatellite markers containing dinucleotide simple sequence repeats were developed by screening genomic libraries enriched for single-copy sequences, and screening the tomato EST database. The tomato microsatellites isolated in these ways were characterized by combinations of different types of repeated motifs and they were polymorphic in a set of L. esculentum varieties detecting up to four alleles. A total of 20 markers were placed on the genetic map of tomato. Interestingly, all markers isolated from genomic libraries enriched for single-copy sequences by PstI-pre-digestion mapped into the centromeric regions. The majority of markers derived from EST sequences contained predominantly AT microsatellites and were located in euchromatic regions. Received: 22 December 2000 / Accepted: 4 May 2001  相似文献   

10.
11.
Microsatellites, a special class of repetitive DNA, have become one of the most popular genetic markers. The progress of various genome projects has made it possible to study the genomic distribution of microsatellites and to evaluate the potential influence of several parameters on their genesis. We report the distribution of dinucleotide microsatellites in the genome of Drosophila melanogaster. When considering only microsatellites with five or more repeat units, the average length of dinucleotide repeats in D. melanogaster is 6.7 repeats. We tested a wide range of parameters which could potentially influence microsatellite density, and we did not detect a significant influence of recombination rate, number of exons, or total length of coding sequence. In concordance with the neutral expectation for the origin of microsatellites, a significant positive correlation between AT content and (AT/TA)n microsatellite density was detected. While this pattern may indicate that microsatellite genesis is a random process, we also found evidence for a nonrandom distribution of microsatellites. Average microsatellite density was higher on the X chromosome, but extreme heterogeneity was observed between different genomic regions. Such a clumping of microsatellites was also evident on a more local scale, as 38.9% of the contiguous sequences analyzed showed a deviation from a random distribution of microsatellites.  相似文献   

12.
A sequence search of swine expressed sequence tags (EST) data in GenBank identified over 100 sequence files which contained a microsatellite repeat or simple sequence repeat (SSR). Most of these repeat motifs were dinucleotide (CA/GT) repeats; however, a number of tri-, tetra-, penta- and hexa-nucleotide repeats were also detected. An initial assessment of six dinucleotide and 14 higher-order repeat markers indicated that only dinucleotide markers yielded a sufficient number of informative markers (100% vs. 14% for dinucleotide and higher order repeats, respectively). Primers were designed for an additional 50 di- and one tri-nucleotide SSRs. Overall, 42 markers were polymorphic in the US Meat Animal Research Center (MARC) reference population, 17 markers were uninformative and 12 primer pairs failed to satisfactorily amplify genomic DNA. A comparison of di-nucleotide repeat vs. markers with repeat motifs of three to six bases demonstrated that 72% of dinucleotide markers were informative relative to only 7% of other repeat motifs. The difference was the result of a much higher percentage of monomorphic markers in the three to six base repeat motif markers than in the dinucleotide markers (64% vs. 14%). Either higher order repeat motifs are less polymorphic in the porcine genome or our selection criteria for repeat length of more than 17 contiguous bases was too low. The mapped microsatellite markers add to the porcine genetic map and provide valuable links between the porcine and human genome.  相似文献   

13.
Oil camellia trees are important woody plants for the production of high-quality cooking oil. On the contrary to their economic importance, their genetic and genomic resources are very limited, which greatly hamper the genetic studies on oil camellia trees. Microsatellites or simple sequence repeats (SSRs) have great value in many aspects of genetic analyses due to their high polymorphism and codominant inheritance. In this study, we report the large-scale development and characterization of SSR markers derived from genomic sequences of Camellia chekiangoleosa by high-throughput pyrosequencing technology. A total of 1,091,393 genomic shotgun reads were generated using Roche 454 FLX sequencer, the average read length was 319 bp, and the total sequence throughput was 347.9 Mb. These sequences were assembled into 35,315 contigs with total length of 14.8 Mb and the N50 contig size of 770 bp. By analyzing with microsatellite (MISA), a total of 5,844 perfect microsatellites were detected from the assembled sequences. Among them, tetranucleotide repeats were found to be the most frequent microsatellites in the genome of C. chekiangoleosa, and all the dominant repeat motifs for different types of SSRs were detected to be rich in A/T. Experimental analysis with 900 SSR primer pairs revealed that 66 % of them succeeded in PCR amplification. Further investigation with 345 SSR primer pairs showed that a relatively high percentage of primers amplified polymorphic loci (31.9 %). Experimental data also revealed that, overall, long microsatellite repeats (>20 bp) were more variable than the short ones (<20 bp) in the genome of oil camellia tree.  相似文献   

14.
Cattle microsatellite clones (136) were isolated from cosmid (10) and plasmid (126) libraries and sequenced. The dinucleotide repeats were studied in each of these sequences and compared with dinucleotide repeats found in other vertebrate species where information was available. The distribution in cattle was similar to that described for other mammals, such as rat, mouse, pig, or human. A major difference resides in the number of sequences present in the bovine genome, which seemed at best one-third as large as in other species. Oligonucleotide primers (117 pairs) were synthesized, and a PCR product of expected size was obtained for 88 microsatellite sequences (75%). Synteny or chromosome assignment was searched for each locus with PCR amplification on a panel of 36 hamster/bovine somatic cell hybrids. Of our bovine microsatellites, eighty-six could be assigned to synteny groups of chromosomes. In addition, 10 other microsatellites—HEL 5, 6, 9, 11, 12, 13 (Kaukinen and Varvio 1993), HEL 4, 7, 14, 15—as well as the microsatellite found in the -casein gene (Fries et al. 1990) were mapped on the hybrids. Microsatellite polymorphism was checked on at leat 30 unrelated animals of different breeds. Almost all the autosomal and X Chr microsatellites displayed polymorphism, with the number of alleles varying between two and 44. We assume that these microsatellites could be very helpful in the construction of a primary public linkage map of the bovine genome, with an aim of finding markers for Economic Trait Loci (ETL) in cattle.  相似文献   

15.
During a search of polymorphic microsatellites for bovine genome mapping, we found that microsatellites often occur as tails of artiodactyl C-A retroposon elements. In this element, C (85bp) is a tRNA derivative, while A (117bp) is of unknown origin. The A element also occurs as dimer element with a connecting 27bp linker sequence comprising hexanucleotide CACTTT repeats. In 10 clones (45% of those selected deliberately for dinucleotide repeats), the microsatellite motif is associated with the C-A retroposon. In 50% of 44 database artiodactyl C-A sequences, the element also has a microsatellite tail. The microsatellite is usually a simple (CA)n repeat, but in some cases it is an apparent derivative of the linker sequence CACTTT. All but one of 33 database dimer elements have trinucleotide repeat tails (AGC)n, n = 1-9. Microsatellites, retroposons, and their truncated versions (C and/or A) often occur as clusters. We derived the consensus sequence (202bp) of the C-A element, and designed four primers for inter-SINE amplification with the aim of finding SINEmorph polymorphisms. The method is potentially powerful for rapidly producing polymorphic markers for artiodactyl genome mapping.  相似文献   

16.
In this study, an in silico approach was developed to identify homologies existing between livestock microsatellite flanking sequences and GenBank nucleotide sequences. Initially, 1955 bovine, 1570 porcine and 1121 chicken microsatellites were downloaded and the flanking sequences were compared with the nr and dbEST databases of GenBank. A total of 74 bovine, 44 porcine and 37 chicken microsatellite flanking sequences passed our criteria and had at least one significant match to human genomic sequence, genes/expressed sequence tags (ESTs) or both. GenBank annotation and BLAT searches of the UCSC human genome assembly revealed that 38 bovine, 13 porcine and 17 chicken microsatellite flanking sequences were highly similar to known human genes. Map locations were available for 67 bovine, 44 porcine and 21 chicken microsatellite flanking sequences, providing useful links in the comparative maps of humans and livestock. In support of our approach, 112 alignments with both microsatellite and match mapping information were located in the expected chromosomal regions based on previously reported syntenic relationships. The development of this in silico mapping approach has significantly increased the number of genes and EST sequences anchored to the bovine, porcine and chicken genome maps and the number of links in various human-livestock comparative maps.  相似文献   

17.
In the last decade microsatellites have become one of the most useful genetic markers used in a large number of organisms due to their abundance and high level of polymorphism. Microsatellites have been used for individual identification, paternity tests, forensic studies and population genetics. Data on microsatellite abundance comes preferentially from microsatellite enriched libraries and DNA sequence databases. We have conducted a search in GenBank of more than 16,000 Schistosoma mansoni ESTs and 42,000 BAC sequences. In addition, we obtained 300 sequences from CA and AT microsatellite enriched genomic libraries. The sequences were searched for simple repeats using the RepeatMasker software. Of 16,022 ESTs, we detected 481 (3%) sequences that contained 622 microsatellites (434 perfect, 164 imperfect and 24 compounds). Of the 481 ESTs, 194 were grouped in 63 clusters containing 2 to 15 ESTs per cluster. Polymorphisms were observed in 16 clusters. The 287 remaining ESTs were orphan sequences. Of the 42,017 BAC end sequences, 1,598 (3.8%) contained microsatellites (2,335 perfect, 287 imperfect and 79 compounds). The 1,598 BAC end sequences 80 were grouped into 17 clusters containing 3 to 17 BAC end sequences per cluster. Microsatellites were present in 67 out of 300 sequences from microsatellite enriched libraries (55 perfect, 38 imperfect and 15 compounds). From all of the observed loci 55 were selected for having the longest perfect repeats and flanking regions that allowed the design of primers for PCR amplification. Additionally we describe two new polymorphic microsatellite loci.  相似文献   

18.
Microsatellites, as the tracts of repetitive DNA, are an essential constituent of the plant genome that holds important evolutionary significance, and have been extensively used to develop molecular makers for genetic analysis. To understand the microsatellite dynamics of quinoa genome and its relatives, in this study we performed a genome‐wide analysis of microsatellites in five Amaranthaceae species using available genome sequences. The results demonstrated that the microsatellites of the five Amaranthaceae species were characterised by relatively high proportions of mono‐, di‐ and trinucleotide repeats with A/T rich motifs, implying conservative organisation and composition of microsatellites in this family. Furthermore, a significant negative correlation between microsatellite frequencies and GC contents (r = ?.87) were observed. In total, 533,961 (89.57%) and 542,601 (89.86%) microsatellite loci could be used to develop simple sequence repeat (SSR) molecular markers, of which 7,178 were found to be polymorphic between the two sequenced quinoa cultivars, QQ74 and Real Blanca, through in silico PCR analysis. Finally, 15 SSR markers were randomly selected to validate their polymorphism across 12 quinoa accessions by wet‐lab PCR amplification. The newly developed genome‐wide SSR markers provide a useful resource for population genetics, gene mapping and molecular breeding studies in quinoa and beyond.  相似文献   

19.
The growing number of rice microsatellite markers warrants a comprehensive comparison of allelic variability between the markers developed using different methods, with various sequence repeat motifs, and from coding and non-coding portions of the genome. We have performed such a comparison over a set of 323 microsatellite markers; 194 were derived from genomic library screening and 129 were derived from the analysis of rice-expressed sequence tags (ESTs) available in public DNA databases. We have evaluated the frequency of polymorphism between parental pairs of six inter- subspecific crosses and one inter-specific cross widely used for mapping in rice. Microsatellites derived from genomic libraries detected a higher level of polymorphism than those derived from ESTs contained in the GenBank database (83.8% versus 54.0%). Similarly, the other measures of genetic variability [the number of alleles per locus, polymorphism information content (PIC), and allele size ranges] were all higher in genomic library-derived microsatellites than in their EST-database counterparts. The highest overall degree of genetic diversity was seen in GA-containing microsatellites of genomic library origin, while the most conserved markers contained CCG- or CAG-trinucleotide motifs and were developed from GenBank sequences. Preferential location of specific motifs in coding versus non-coding regions of known genes was related to observed levels of microsatellite diversity. A strong positive correlation was observed between the maximum length of a microsatellite motif and the standard deviation of the molecular-weight of amplified fragments. The reliability of molecular weight standard deviation (SDmw) as an indicator of genetic variability of microsatellite loci is discussed. Received: 5 May 1999 / Accepted: 16 August 1999  相似文献   

20.
Studies on microsatellite distribution and divergence in related genomes contribute towards understanding of genome evolution in eukaryotes. Despite the availability of whole genome sequences of four rice genomes, occurrence and significance of microsatellites in the rice genome has remained a relatively unexplored area of research. We have aligned genomes of two rice subspecies i.e. indica and japonica to understand the trends of microsatellite conservation and divergence in the rice genome. Nearly 62% of the indica microsatellites were also found in the japonica genome. Occurrence of microsatellites showed a negative association with that of retrotransposons. Microsatellites repeat unit length and sequence showed direct influence on the microsatellite locus length. Further, microsatellite allele length was also influenced by the sequence characteristics of the neighbouring regions. CCG repeats were most conserved microsatellite sequences across the different syntenic regions in the two rice genomes and often showed association with CpG islands. Our study suggested that microsatellite distribution is not only governed by a balance between replication slippage and point mutations as proposed earlier, but also by the microsatellite motif sequence and characteristics of microsatellite neighbouring regions in the genome. Thus, this study is likely to prove an important reference for understanding the process of microsatellite evolution and dynamics in the two rice subspecies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号