首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 281 毫秒
1.
Zhikong scallop (Chlamys farreri Jones et Preston, 1904) is one of the most commercially important bivalves in China, but research on its genome is underdeveloped. In this study, we constructed the first Zhikong scallop fosmid library, and analyzed the fosmid end sequences to provide a preliminary assessment of the genome. The library consists of 133,851 clones with an average insert size of about 40 kb, amounting to 4.3 genome equivalents. Fosmid stability assays indicate that Zhikong scallop DNA was stable during propagation in the fosmid system. Library screening with two genes and seven microsatellite markers yielded between two and eight positive clones, and none of those tested was absent from the library. End-sequencing of 480 individual clones generated 828 sequences after trimming, with an average sequence length of 624 bp. BLASTN searches of the nr and EST databases of GenBank and BLASTX searches of the nr database resulted in 213 (25.72%) and 44 (5.31%) significant hits (E < e−5), respectively. Repetitive sequences analysis resulted in 375 repeats, accounting for 15.84% of total length, which were composed of interspersed repetitive sequences, tandem repeats, and low-complexity sequences. The fosmid library, in conjunction with the fosmid end sequences, will serve as a useful resource for physical mapping and positional cloning, and provide a better understanding of the Zhikong scallop genome.  相似文献   

2.
Huan Gao  Jie Kong 《DNA sequence》2005,16(6):426-436
Through two-time sequencing randomly in Fenneropenaeus chinensis, 2,597,000 bp cumulative length random genomic sequences about occupying 1.23 per thousand of the entire genome are obtained, in which the length of the first time sequencing is 884,000 bp, by cutting the genome DNA with Sau3AI enzyme, and the second is 1,713,000 bp by breaking the genome DNA with the physical method, ultrasonic. Using tandem repeat finder (TRF) soft to analyze the sequences, 4,588 tandem repeats are found, in which the number of microsatellites (1-6 bp) is 3,888, and 700 for minisatellites ( >or= 7 bp). The cumulative length of repeats is 305,555 bp, accounting for 11.72% of total cumulative sequence length, in which the cumulative length of microsatellites is 232,979 bp, accounting for 8.97% of total sequence length, and greater than those of other organisms, such as human and mosquito, etc. The dinucleotide repeat type is dominant in which the dominant repeat class is AT. The second abundant repeat type is trinucleotide, of which the dominant repeat class is AAT. Interestingly, of all of repeat types, the repeat numbers and repeat classes of primer number repeat types, such as pentanucleotide, heptanucleotide, elevennucleotide, etc. are less than those of repeat types beside them. The phenomena may involve the genesis and the evolution of microsatellites and minisatellites.  相似文献   

3.
Survey of human and rat microsatellites   总被引:44,自引:0,他引:44  
Length variations in simple sequence tandem repeats (microsatellite DNA polymorphisms) are finding increasing usage in mammalian genetics. Although every variety of short tandem repeat that has been tested has been shown to exhibit length polymorphisms, little information on the relative abundance of the different repeat motifs has been collected. In this report, summaries of GenBank searches for all possible human and rat microsatellites ranging from mononucleotide to tetranucleotide repeats are presented. In humans, the five most abundant microsatellites with total lengths for the runs of repeats of greater than or equal to 20 nucleotides contained repeat sequences of A, AC, AAAN, AAN, and AG, in order of decreasing abundance, where N is C, G, or T. These five groups comprised about 76% of all microsatellites. Many other human simple sequence repeats were found at low frequency. In the 745 kb of human genomic DNA surveyed, one microsatellite of greater than or equal to 20 nucleotides in length was found, on average, every 6 kb. Only 12% of the human microsatellites had total lengths greater than or equal to 40 nucleotides. Roughly 80% of the A, AAN, and AAAN microsatellites and 50% of the AT microsatellites, but few of the other human microsatellites, were found to be associated with interspersed, repetitive Alu elements. In rats, the five most abundant microsatellites contained AC, AG, A, AAAN, and AAGG sequences, respectively. Rat microsatellites were generally longer than human microsatellites, with 43% of the rat sequences greater than or equal to 40 nucleotides.  相似文献   

4.
We studied the structure, organization and relationship of repetitive DNA sequences in the genome of the scallop, Pecten maximus, a bivalve that is important both commercially and in marine ecology. Recombinant DNA libraries were constructed after partial digestion of genomic DNA from scallop with PstI and ApaI restriction enzymes. Clones containing repetitive DNA were selected by hybridisation to labelled DNA from scallop, oyster and mussel; colonies showing strong hybridisation only to scallop were selected for analysis and sequencing. Six non-homologous tandemly repeated sequences were identified in the sequences, and Southern hybridisation with all repeat families to genomic DNA digests showed characteristic ladders of hybridised bands. Three families had monomer lengths around 40 bp while three had repeats characteristic of the length wrapping around one (170 bp), or two (326 bp) nucleosomes. In situ hybridisation to interphase nuclei showed each family had characteristic numbers of clusters indicating contrasting arrangements. Two of the repeats had unusual repetitions of bases within their sequence, which may relate to the nature of microsatellites reported in bivalves. The study of these rapidly evolving sequences is valuable to understand an important source of genomic diversity, has the potential to provide useful markers for population studies and gives a route to identify mechanisms of DNA sequence evolution.  相似文献   

5.
微卫星(Microsatellite)是一类由2-6个核苷酸经多次单位串联组成的高度变异重复DNA序列(Schlotterer and Tautz,1992)。它具有按照孟德尔方式分离、突变快、多态信息含量丰富、呈共显性遗传等特点,其核心序列在同一物种中具有保守性,因此,可以根据微卫星的侧翼序列设计合适的引  相似文献   

6.
德国小蠊全基因组中微卫星分布规律   总被引:3,自引:0,他引:3  
【目的】分析德国小蠊 Blattella germanica 全基因组中微卫星的数量和分布规律,并对外显子中含有微卫星的基因进行功能注释。【方法】使用微卫星搜索软件查找德国小蠊基因组中微卫星的数量、重复次数以及所有微卫星的位置信息,编写Python脚本对微卫星进行定位,并通过Blast2Go和KASS程序对外显子中含有微卫星的基因进行功能注释。【结果】共找到1~6碱基重复类型的微卫星序列604 386个,总长度15 301 255 bp,约占全基因组序列(约2.04 Gb)的0.75%,分布频率为1/3.37 kb,微卫星序列的长度主要在12~60个碱基长度范围内。不同类型的微卫星中,三碱基(226 876)重复类型微卫星数量最多,占微卫星总数的37.54%;四碱基(150 355)重复类型次之,占微卫星总数的24.88%;其余依次是单碱基(141 167)、二碱基(60 877)、五碱基(21 570)和六碱基(3 541)重复类型,分别占微卫星总数的23.36%, 10.07%, 3.57%和0.59%。出现最多的重复拷贝类别有:ATT, AAT, A, T, AAAT, ATTT和AT,共411 789个微卫星,占微卫星总数的68.13%,这7种类别的微卫星数量均大于30 000个。共有2 372个微卫星在外显子上,它们分别位于1 481个基因上。GO功能注释结果表明,其中434条归类于细胞组分(cellular component),402条归类于分子功能(molecular function),660条归类于生物学过程(biological process)。KEGG通路分析结果表明,与新陈代谢相关的基因最多(380个),其次是与机体系统相关的(276个),与遗传信息进程相关的基因最少(92个)。【结论】本研究为进一步系统深入分析德国小蠊微卫星功能及微卫星分子标记筛选打下了基础。  相似文献   

7.
Oil camellia trees are important woody plants for the production of high-quality cooking oil. On the contrary to their economic importance, their genetic and genomic resources are very limited, which greatly hamper the genetic studies on oil camellia trees. Microsatellites or simple sequence repeats (SSRs) have great value in many aspects of genetic analyses due to their high polymorphism and codominant inheritance. In this study, we report the large-scale development and characterization of SSR markers derived from genomic sequences of Camellia chekiangoleosa by high-throughput pyrosequencing technology. A total of 1,091,393 genomic shotgun reads were generated using Roche 454 FLX sequencer, the average read length was 319 bp, and the total sequence throughput was 347.9 Mb. These sequences were assembled into 35,315 contigs with total length of 14.8 Mb and the N50 contig size of 770 bp. By analyzing with microsatellite (MISA), a total of 5,844 perfect microsatellites were detected from the assembled sequences. Among them, tetranucleotide repeats were found to be the most frequent microsatellites in the genome of C. chekiangoleosa, and all the dominant repeat motifs for different types of SSRs were detected to be rich in A/T. Experimental analysis with 900 SSR primer pairs revealed that 66 % of them succeeded in PCR amplification. Further investigation with 345 SSR primer pairs showed that a relatively high percentage of primers amplified polymorphic loci (31.9 %). Experimental data also revealed that, overall, long microsatellite repeats (>20 bp) were more variable than the short ones (<20 bp) in the genome of oil camellia tree.  相似文献   

8.
鳄龟科和平胸龟科线粒体控制区序列分析和结构比较   总被引:3,自引:0,他引:3  
颜亮  张雁  汪宁  张莉  聂刘旺 《动物学研究》2008,29(2):127-133
本文参照龟类近缘种的线粒体DNA(mitochondrial DNA,mtDNA)控制区(control region,CR)及邻接序列,设计了二对特异引物,采用PCR和测序技术,获得了大鳄龟(Macroclemys temminckii)、小鳄龟(Chelydra serpentina)和平胸龟(Platysternon megacephalum)mtDNA CR区序列,其长度分别为1062bp、1124bp和1119bp;A T的含量分别为68.93%、69.34%和69.44%。序列分析显示,三种龟CR区3'末端均存在丰富的微卫星序列,其中大鳄龟和小鳄龟各有一段2bp的TA序列分别重复20和15次;小鳄龟另有一段5bp的TATAT序列重复13次;平胸龟则是一段10bp的AGTATGTTAT序列重复4次和一段17bp的GTTGTTATATAACATAT序列重复13次。本文还结合GenBank中已发表的其他6种龟鳖类动物的控制区序列,探讨了龟鳖类动物微卫星序列的类型及分布,结果表明:9种龟鳖类动物都存在丰富的微卫星序列,且微卫星所在位置及序列存在很大差异。  相似文献   

9.
Microsatellite polymorphisms are invaluable for mapping vertebrate genomes. In order to estimate the occurrence of microsatellites in the rabbit genome and to assess their feasibility as markers in rabbit genetics, a survey on the presence of all types of mononucleotide, dinucleotide, trinucleotide and tetranucleotide repeats, with a length of about 20 bp or more, was conducted by searching the published rabbit DNA sequences in the EMBL nucleotide database (version 32). A total of 181 rabbit microsatellites could be extracted from the present database. The estimated frequency of microsatellites in the rabbit genome was one microsatellite for every 2–3 kb of DNA. Dinucleotide repeats constituted the prevailing class of microsatellites, followed by trinucleotide, mononucleotide and tetranucleotide repeats, respectively. The average length of the microsatellites, as found in the database, was 26, 23, 23 and 22 bp for mono-, di-, tri- and tetranucleotide repeats, respectively. The most common repeat motif was AG, followed by A, AC, AGG and CCG. This group comprised about 70% of all extracted rabbit microsatellites. About 61% of the microsatellites were found in non-coding regions of genes, whereas 15% resided in (protein) coding regions. A significant fraction of rabbit microsatellites (about 22%) was found within interspersed repetitive DNA sequences.  相似文献   

10.
The Pacific oyster (Crassostrea gigas) is globally distributed and is one of the most commercially and ecologically important marine organisms. However, little is known about the genome of this species. In this study, a C. gigas fosmid library was constructed that contains 459,936 clones with an average insert size of approximately 40 kb, representing 22.34-fold haploid genome equivalents. End sequencing generated 90,240 fosmid end sequences (FESs) with an average length of 384.27 base pairs (bp), covering approximately 2.58% of the Pacific oyster genome. The FESs were subsequently assembled and annotated, resulting in 6332 sequences with predicted open reading frames≥300 and 1,189,100 bp repeats. Furthermore, a total of 3200 microsatellite repeats were identified, and dinucleotide repeats were found to occur most abundantly, with AG and AAT being the most abundant repeat class of dinucleotides and trinucleotides. We also found that the repeat number was generally negatively proportional to the repeat element length. Microsatellites composition between the transcribed sequences and genomic sequences was shown to be different. Point mutations of microsatellite were non-random and underwent strong selection stress. Overall, a comprehensive sequence resource for the Pacific oyster was created, including annotated transposable elements, tandem repeats, protein coding sequences and microsatellites. These initial findings will serve as resources for further in-depth studies of physical mapping, gene discovery, microsatellite marker developing and evolution studies.  相似文献   

11.
12.
MOTIVATION: Microsatellites, also known as simple sequence repeats, are the tandem repeats of nucleotide motifs of the size 1-6 bp found in every genome known so far. Their importance in genomes is well known. Microsatellites are associated with various disease genes, have been used as molecular markers in linkage analysis and DNA fingerprinting studies, and also seem to play an important role in the genome evolution. Therefore, it is of importance to study distribution, enrichment and polymorphism of microsatellites in the genomes of interest. For this, the prerequisite is the availability of a computational tool for extraction of microsatellites (perfect as well as imperfect) and their related information from whole genome sequences. Examination of available tools revealed certain lacunae in them and prompted us to develop a new tool. RESULTS: In order to efficiently screen genome sequences for microsatellites (perfect as well as imperfect), we developed a new tool called IMEx (Imperfect Microsatellite Extractor). IMEx uses simple string-matching algorithm with sliding window approach to screen DNA sequences for microsatellites and reports the motif, copy number, genomic location, nearby genes, mutational events and many other features useful for in-depth studies. IMEx is more sensitive, efficient and useful than the available widely used tools. IMEx is available in the form of a stand-alone program as well as in the form of a web-server. AVAILABILITY: A World Wide Web server and the stand-alone program are available for free access at http://203.197.254.154/IMEX/ or http://www.cdfd.org.in/imex.  相似文献   

13.
香蕉EST-SSRs标记的开发与应用   总被引:10,自引:0,他引:10  
王静毅  陈业渊  刘伟良  武耀廷 《遗传》2008,30(7):933-940
从NCBI搜索的2 282条香蕉EST中, 发掘出含有SSR的EST序列110条, 共有122个SSR位点, 检出率为5.3%。SSR位点可分为37种重复单元, 平均长度为20 bp, 其中二、三核苷酸重复单元的SSR占主导地位, 分别占总SSR的33.1%和47.6%。GA和GAA是二、三核苷酸中的优势重复类型, 分别占二、三核苷酸重复类型的75.7%和36.0%; 其他重复类型所占比例均不足10%, 而四核苷酸重复类型最少, 为4.0%。设计的63对EST-SSRs引物中, 有41对EST-SSRs引物对巴西蕉基因组DNA能扩增出产物, 占总引物数的65.1%。应用进一步筛选出的重复性好、多态性高的19对引物对49个香蕉品种(系)进行PCR扩增。每对引物扩增的多态性带数目为4~12个, 平均7.58个; 引物多态信息量变化范围为0.3572~0.8744, 平均0.7324。在相似系数为0.63的水平可将49个品种聚为2个类群:一类为含B基因组香蕉品种; 另一类为不含B基因组的香蕉品种, 表明EST-SSR引物可以应用于香蕉品种资源分类的研究。  相似文献   

14.
Trace sequences from the 2X alpaca genome sequencing effort were examined to identify simple sequence repeats (microsatellites) for genetic studies. A total of 6,685 repeat-containing sequences were downloaded from GenBank, processed, and assembled into contigs representing an estimated 4,278 distinct sequences. This sequence set contained 2,290 sequences of length > 100 nucleotides that contained microsatellites of length > or = 14 dinucleotide or 10 trinucleotide repeats with purity equal to 100%. An additional 13 sequences contained a GC microsatellite of length > or = 12 repeats (purity = 100%) were also obtained. Primer pairs for amplification of 1,516 putative loci are presented. Amplification of genomic DNA from alpaca and llama by PCR was demonstrated for 14 primer sets including one from each of the microsatellite repeat types. Comparative chromosomal location for the alpaca markers was predicted in the bovine genome by BLAT searches against assembly 4.0 of the bovine whole genome sequence. A total of 634 markers (41.8%) returned BLAT hits with score > 100 and Identity > 85%, with the majority assignable to unique locations. We show that microsatellites are abundant and easily identified within the alpaca genome sequence. These markers will provide a valuable resource for further genetic studies of the alpaca and related species.  相似文献   

15.
In the present study, 3217 UniGene sequences of Neurospora crassa downloaded from the National Center for Biotechnology Information (NCBI) were mined for the identification of microsatellites or simple sequence repeats (SSRs). A total of 287 SSRs detected gives density of 1SSR/14.6 kb of 4187.86 kb sequences mined suggests that only 250 (7.8%) of sequences contained SSRs. Depending on the repeat units, the length of SSRs ranged from 14 to 17 bp for mono-, 14 to 48 bp for di-, 18 to 90 bp for tri-, 24 to 48 bp for tetra-, 30 for penta- and 42 to 48 bp for hexa-nucleotide repeats. Tri-nucleotide repeats were the most frequent repeat type (88.8%) followed by di-nucleotide repeats (5.9%). An attempt was also made with the help of bioinformatics approach to find out primer pairs for identified SSRs and primers were found only for 239 sequences. But, this part needs experimental validation. Annotation of SSRs containing sequences was also carried out.  相似文献   

16.
Ames D  Murphy N  Helentjaris T  Sun N  Chandler V 《Genetics》2008,179(3):1693-1704
Using the compiled human genome sequence, we systematically cataloged all tandem repeats with periods between 20 and 2000 bp and defined two subsets whose consensus sequences were found at either single-locus tandem repeats (slTRs) or multilocus tandem repeats (mlTRs). Parameters compiled for these subsets provide insights into mechanisms underlying the creation and evolution of tandem repeats. Both subsets of tandem repeats are nonrandomly distributed in the genome, being found at higher frequency at many but not all chromosome ends and internal clusters of mlTRs were also observed. Despite the integral role of recombination in the biology of tandem repeats, recombination hotspots colocalized only with shorter microsatellites and not the longer repeats examined here. An increased frequency of slTRs was observed near imprinted genes, consistent with a functional role, while both slTRs and mlTRs were found more frequently near genes implicated in triplet expansion diseases, suggesting a general instability of these regions. Using our collated parameters, we identified 2230 slTRs as candidates for highly informative molecular markers.  相似文献   

17.
表达序列标签(EST)是发掘Ⅰ型微卫星标记的重要资源。研究运用生物信息学方法,从草鱼头肾组织3027条EST序列中搜索到322个微卫星位点,占整个EST数据库的10.6%。其中,二核苷酸重复位点151个(46.9%),三核苷酸重复位点137个(42.5%),四、五、六核苷酸重复位点较少;在二核苷酸重复位点中,AC/GT重复位点最为丰富,占二核苷酸重复位点总数的50.3%,AG/CT重复次之,占二核苷酸重复位点总数的40.4%,AT和GC重复较少。10个微卫星位点的多态性检测结果显示,4个位点在草鱼测试群体中呈多态性,多态性位点的平均多态信息含量(PIC)和平均遗传杂合度(H)分别为0.5236和0.5441,其中,2个多态性位点的PIC值大于0.5,呈现高度多态性特征。Ⅰ型微卫星标记将为草鱼遗传连锁图谱构建和QTL分析提供有效的基因分子标记。    相似文献   

18.
A BamHI DNA fragment of 301 bp corresponding to the main repeating unit of 5S rRNA was isolated from barley genomic DNA. The primary nucleotide sequence of this fragment was determined and a high level of homology was found between coding sequences of 5S rRNA genes of barley, wheat and rye. At the same time, spacer's nucleotide sequences of different species of cereals were changed dramatically. At least two types of 5S rRNA tandem repeats of 301 and 450 bp were found in barley genome. Polymorphism for restriction fragment length in 5S rRNA repeats allowed to discriminate between all barley varieties used in this work.  相似文献   

19.
Mining functional microsatellites in legume unigenes   总被引:1,自引:0,他引:1  
Highly polymorphic and transferable microsatellites (SSRs) are important for comparative genomics, genome analysis and phylogenetic studies. Development of novel species-specific microsatellite markers remains a costly and labor-intensive project. Therefore, interest has been shifted from genomic to genic markers owing to their high inter-species transferability as they are developed from conserved coding regions of the genome. This study concentrates on comparative analysis of genic microsatellites in nine important legume (Arachis hypogaea, Cajanus cajan, Cicer arietinum, Glycine max, Lotus japonicus, Medicago truncatula, Phaseolus vulgaris, Pisum sativum and Vigna unguiculata) and two model plant species (Oryza sativa and Arabidopsis thaliana). Screening of a total of 228090 putative unique sequences spanning 219610522 bp using a microsatellite search tool, MISA, identified 12.18% of the unigenes containing 36248 microsatellite motifs excluding mononucleotide repeats. Frequency of legume unigene-derived SSRs was one SSR in every 6.0 kb of analyzed sequences. The trinucleotide repeats were predominant in all the unigenes with the exception of C. cajan, which showed prevalence of dinucleotide repeats over trinucleotide repeats. Dinucleotide repeats along with trinucleotides counted for more than 90% of the total microsatellites. Among dinucleotide and trinucleotide repeats, AG and AAG motifs, respectively, were the most frequent. Microsatellite positive chickpea unigenes were assigned Gene Ontology (GO) terms to identify the possible role of unigenes in various molecular and biological functions. These unigene based microsatellite markers will prove valuable for recording allelic variance across germplasm collections, gene tagging and searching for putative candidate genes.  相似文献   

20.
中国明对虾基因组小卫星重复序列分析   总被引:4,自引:0,他引:4  
高焕  孔杰 《动物学报》2005,51(1):101-107
通过对中国明对虾基因组随机DNA片断的测序 ,我们获得了总长度约 6 4 10 0 0个碱基的基因组DNA序列 ,从中共找到 172 0个重复序列。其中 ,小卫星序列的数目为 398个 ,占重复序列总数目的 2 3 14 %。这些小卫星序列的重复单位长度为 7- 16 5个碱基 ,集中分布于 7- 2 1个碱基范围内 ,其中以重复单位长度为 12个碱基的重复序列数目最多 ,为 5 8个 ,占小卫星重复序列总数目的 14 5 7%。不同拷贝数目所对应的重复序列的数目情况为 :拷贝数目为 2的重复单位所组成的重复序列数目最多 ,为 137个 ;其次是拷贝数目为 3的重复序列 ,为12 2个 ,且随着拷贝数目的增加 ,由其所组成的重复序列的数目呈递减的趋势。其中一部分序列见GeneBank数据库 ,登录号为AY6 990 72 -AY6 990 76。 398个重复序列分别由 398种重复单位所组成 ,因而小卫星重复序列的类型很多 ,我们初步分成三类 :两种碱基组成类别、三种碱基组成类别和四种碱基组成类别 ,并进一步根据各个重复序列中所含有的碱基种类的数量从大到小排列这些碱基而分成若干小类。从这些分类中可以看出 ,中国明对虾基因组中的小卫星整体上是富含A T的重复序列 ,并具有一定的“等级制度” ,揭示了其与微卫星重复序列之间的关系 ,即一部分小卫星重复序列可能起源于微卫星  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号