首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Human BAC ends quality assessment and sequence analyses   总被引:8,自引:0,他引:8  
Zhao S  Malek J  Mahairas G  Fu L  Nierman W  Venter JC  Adams MD 《Genomics》2000,63(3):321-332
End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering approximately 4.7% of the genome. Over 60% of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have highlighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research.  相似文献   

2.
Brachypodium is well suited as a model system for temperate grasses because of its compact genome and a range of biological features. In an effort to develop resources for genome research in this emerging model species, we constructed 2 bacterial artificial chromosome (BAC) libraries from an inbred diploid Brachypodium distachyon line, Bd21, using restriction enzymes HindIII and BamHI. A total of 73,728 clones (36,864 per BAC library) were picked and arrayed in 192,384-well plates. The average insert size for the BamHI and HindIII libraries is estimated to be 100 and 105 kb, respectively, and inserts of chloroplast origin account for 4.4% and 2.4%, respectively. The libraries individually represent 9.4- and 9.9-fold haploid genome equivalents with combined 19.3-fold genome coverage, based on a genome size of 355 Mb reported for the diploid Brachypodium, implying a 99.99% probability that any given specific sequence will be present in each library. Hybridization of the libraries with 8 starch biosynthesis genes was used to empirically evaluate this theoretical genome coverage; the frequency at which these genes were present in the library clones gave an estimated coverage of 11.6- and 19.6-fold genome equivalents. To obtain a first view of the sequence composition of the Brachypodium genome, 2185 BAC end sequences (BES) representing 1.3 Mb of random genomic sequence were compared with the NCBI GenBank database and the GIRI repeat database. Using a cutoff expectation value of E<10-10, only 3.3% of the BESs showed similarity to repetitive sequences in the existing database, whereas 40.0% had matches to the sequences in the EST database, suggesting that a considerable portion of the Brachypodium genome is likely transcribed. When the BESs were compared with individual EST databases, more matches hit wheat than maize, although their EST collections are of a similar size, further supporting the close relationship between Brachypodium and the Triticeae. Moreover, 122 BESs have significant matches to wheat ESTs mapped to individual chromosome bin positions. These BACs represent colinear regions containing the mapped wheat ESTs and would be useful in identifying additional markers for specific wheat chromosome regions.  相似文献   

3.
White clover (Trifolium repens L.) is a forage legume widely used in combination with grass in pastures because of its ability to fix nitrogen. We have constructed a bacterial artificial chromosome (BAC) library of an advanced breeding line of white clover. The library contains 37 248 clones with an average insert size of approximately 85 kb, representing an approximate 3-fold coverage of the white clover genome based on an estimated genome size of 960 Mb. The BAC library was pooled and screened by polymerase chain reaction (PCR) amplification using both white clover microsatellites and PCR-based markers derived from Medicago truncatula, resulting in an average of 6 hits per marker; this supports the estimated 3-fold genome coverage in this allotetraploid species. PCR-based screening of 766 clones with a multiplex set of chloroplast primers showed that only 0.5% of BAC clones contained chloroplast-derived inserts. The library was further evaluated by sequencing both ends of 724 of the clover BACs. These were analysed with respect to their sequence content and their homology to the contents of a range of plant gene, expressed sequence tag, and repeat element databases. Forty-three microsatellites were discovered in the BAC-end sequences (BESs) and investigated as potential genetic markers in white clover. The BESs were also compared with the partially sequenced genome of the model legume M. truncatula with the specific intention of identifying putative comparative-tile BACs, which represent potential regions of microsynteny between the 2 species; 14 such BACs were discovered. The results suggest that a large-scale BAC-end sequencing strategy has the potential to anchor a significant proportion of the genome of white clover onto the gene-space sequence of M. truncatula.  相似文献   

4.
Availability of the human genome sequence and high similarity between humans and pigs at the molecular level provides an opportunity to use a comparative mapping approach to piggy-BAC the human genome. In order to advance the pig genome sequencing initiative, sequence similarity between large-scale porcine BAC-end sequences (BESs) and human genome sequence was used to construct a comparatively-anchored porcine physical map that is a first step towards sequencing the pig genome. A total of 50,300 porcine BAC clones were end-sequenced, yielding 76,906 BESs after trimming with an average read length of 538 bp. To anchor the porcine BACs on the human genome, these BESs were subjected to BLAST analysis using the human draft sequence, revealing 31.5% significant hits (E < e?5). Both genic and non-genic regions of homology contributed to the alignments between the human and porcine genomes. Porcine BESs with unique homology matches within the human genome provided a source of markers spaced approximately 70 to 300 kb along each human chromosome. In order to evaluate the utility of piggy-BACing human genome sequences, and confirm predictions of orthology, 193 evenly spaced BESs with similarity to HSA3 and HSA21 were selected and then utilized for developing a high-resolution (1.22 Mb) comparative radiation hybrid map of SSC13 that represents a fusion of HSA3 and HSA21. Resulting RH mapping of SSC13 covers 99% and 97% of HSA3 and HSA21, respectively. Seven evolutionary conserved blocks were identified including six on HSA3 and a single syntenic block corresponding to HSA21. The strategy of piggy-BACing the human genome described in this study demonstrates that through a directed, targeted comparative genomics approach construction of a high-resolution anchored physical map of the pig genome can be achieved. This map supports the selection of BACs to construct a minimal tiling path for genome sequencing and targeted gap filling. Moreover, this approach is highly relevant to other genome sequencing projects.  相似文献   

5.
Characterizing the walnut genome through analyses of BAC end sequences   总被引:1,自引:0,他引:1  
Persian walnut (Juglans regia L.) is an economically important tree for its nut crop and timber. To gain insight into the structure and evolution of the walnut genome, we constructed two bacterial artificial chromosome (BAC) libraries, containing a total of 129,024 clones, from in vitro-grown shoots of J. regia cv. Chandler using the HindIII and MboI cloning sites. A total of 48,218 high-quality BAC end sequences (BESs) were generated, with an accumulated sequence length of 31.2?Mb, representing approximately 5.1% of the walnut genome. Analysis of repeat DNA content in BESs revealed that approximately 15.42% of the genome consists of known repetitive DNA, while walnut-unique repetitive DNA identified in this study constitutes 13.5% of the genome. Among the walnut-unique repetitive DNA, Julia SINE and JrTRIM elements represent the first identified walnut short interspersed element (SINE) and terminal-repeat retrotransposon in miniature (TRIM) element, respectively; both types of elements are abundant in the genome. As in other species, these SINEs and TRIM elements could be exploited for developing repeat DNA-based molecular markers in walnut. Simple sequence repeats (SSR) from BESs were analyzed and found to be more abundant in BESs than in expressed sequence tags. The density of SSR in the walnut genome analyzed was also slightly higher than that in poplar and papaya. Sequence analysis of BESs indicated that approximately 11.5% of the walnut genome represents a coding sequence. This study is an initial characterization of the walnut genome and provides the largest genomic resource currently available; as such, it will be a valuable tool in studies aimed at genetically improving walnut.  相似文献   

6.
Physical map of chickpea was developed for the reference chickpea genotype (ICC 4958) using bacterial artificial chromosome (BAC) libraries targeting 71,094 clones (~12× coverage). High information content fingerprinting (HICF) of these clones gave high-quality fingerprinting data for 67,483 clones, and 1,174 contigs comprising 46,112 clones and 3,256 singletons were defined. In brief, 574 Mb genome size was assembled in 1,174 contigs with an average of 0.49 Mb per contig and 3,256 singletons represent 407 Mb genome. The physical map was linked with two genetic maps with the help of 245 BAC-end sequence (BES)-derived simple sequence repeat (SSR) markers. This allowed locating some of the BACs in the vicinity of some important quantitative trait loci (QTLs) for drought tolerance and reistance to Fusarium wilt and Ascochyta blight. In addition, fingerprinted contig (FPC) assembly was also integrated with the draft genome sequence of chickpea. As a result, ~965 BACs including 163 minimum tilling path (MTP) clones could be mapped on eight pseudo-molecules of chickpea forming 491 hypothetical contigs representing 54,013,992 bp (~54 Mb) of the draft genome. Comprehensive analysis of markers in abiotic and biotic stress tolerance QTL regions led to identification of 654, 306 and 23 genes in drought tolerance “QTL-hotspot” region, Ascochyta blight resistance QTL region and Fusarium wilt resistance QTL region, respectively. Integrated physical, genetic and genome map should provide a foundation for cloning and isolation of QTLs/genes for molecular dissection of traits as well as markers for molecular breeding for chickpea improvement.  相似文献   

7.
Spartina species play an important ecological role on salt marshes. Spartina maritima is an Old-World species distributed along the European and North-African Atlantic coasts. This hexaploid species (2n = 6x = 60, 2C = 3,700 Mb) hybridized with different Spartina species introduced from the American coasts, which resulted in the formation of new invasive hybrids and allopolyploids. Thus, S. maritima raises evolutionary and ecological interests. However, genomic information is dramatically lacking in this genus. In an effort to develop genomic resources, we analysed 40,641 high-quality bacterial artificial chromosome-end sequences (BESs), representing 26.7 Mb of the S. maritima genome. BESs were searched for sequence homology against known databases. A fraction of 16.91 % of the BESs represents known repeats including a majority of long terminal repeat (LTR) retrotransposons (13.67 %). Non-LTR retrotransposons represent 0.75 %, DNA transposons 0.99 %, whereas small RNA, simple repeats and low-complexity sequences account for 1.38 % of the analysed BESs. In addition, 4,285 simple sequence repeats were detected. Using the coding sequence database of Sorghum bicolor, 6,809 BESs found homology accounting for 17.1 % of all BESs. Comparative genomics with related genera reveals that the microsynteny is better conserved with S. bicolor compared to other sequenced Poaceae, where 37.6 % of the paired matching BESs are correctly orientated on the chromosomes. We did not observe large macrosyntenic rearrangements using the mapping strategy employed. However, some regions appeared to have experienced rearrangements when comparing Spartina to Sorghum and to Oryza. This work represents the first overview of S. maritima genome regarding the respective coding and repetitive components. The syntenic relationships with other grass genomes examined here help clarifying evolution in Poaceae, S. maritima being a part of the poorly-known Chloridoideae sub-family.  相似文献   

8.
Coffee is one of the world’s most important agricultural commodities. Coffee belongs to the Rubiaceae family in the euasterid I clade of dicotyledonous plants, to which the Solanaceae family also belongs. Two bacterial artificial chromosome (BAC) libraries of a homozygous doubled haploid plant of Coffea canephora were constructed using two enzymes, HindIII and BstYI. A total of 134,827 high quality BAC-end sequences (BESs) were generated from the 73,728 clones of the two libraries, and 131,412 BESs were conserved for further analysis after elimination of chloroplast and mitochondrial sequences. This corresponded to almost 13 % of the estimated size of the C. canephora genome. 6.7 % of BESs contained simple sequence repeats, the most abundant (47.8 %) being mononucleotide motifs. These sequences allow the development of numerous useful marker sites. Potential transposable elements (TEs) represented 11.9 % of the full length BESs. A difference was observed between the BstYI and HindIII libraries (14.9 vs. 8.8 %). Analysis of BESs against known coding sequences of TEs indicated that 11.9 % of the genome corresponded to known repeat sequences, like for other flowering plants. The number of genes in the coffee genome was estimated at 41,973 which is probably overestimated. Comparative genome mapping revealed that microsynteny was higher between coffee and grapevine than between coffee and tomato or Arabidopsis. BESs constitute valuable resources for the first genome wide survey of coffee and provide new insights into the composition and evolution of the coffee genome.  相似文献   

9.
The initial strategy of the Corynebacterium glutamicum genome project was to sequence overlapping inserts of an ordered cosmid library. High-density colony grids of approximately 28 genome equivalents were used for the identification of overlapping clones by Southern hybridization. Altogether 18 contiguous genomic segments comprising 95 overlapping cosmids were assembled. Systematic shotgun sequencing of the assembled cosmid set revealed that only 2.84 Mb (86.6%) of the C. glutamicum genome were represented by the cosmid library. To obtain a complete genome coverage, a bacterial artificial chromosome (BAC) library of the C. glutamicum chromosome was constructed in pBeloBAC11 and used for genome mapping. The BAC library consists of 3168 BACs and represents a theoretical 63-fold coverage of the C. glutamicum genome (3.28 Mb). Southern screening of 2304 BAC clones with PCR-amplified chromosomal markers and subsequent insert terminal sequencing allowed the identification of 119 BACs covering the entire chromosome of C. glutamicum. The minimal set representing a 100% genome coverage contains 44 unique BAC clones with an average overlap of 22 kb. A total of 21 BACs represented linking clones between previously sequenced cosmid contigs and provided a valuable tool for completing the genome sequence of C. glutamicum.  相似文献   

10.
Availability of the human genome sequence and high similarity between humans and pigs at the molecular level provides an opportunity to use a comparative mapping approach to piggy-BAC the human genome. In order to advance the pig genome sequencing initiative, sequence similarity between large-scale porcine BAC-end sequences (BESs) and human genome sequence was used to construct a comparatively-anchored porcine physical map that is a first step towards sequencing the pig genome. A total of 50,300 porcine BAC clones were end-sequenced, yielding 76,906 BESs after trimming with an average read length of 538 bp. To anchor the porcine BACs on the human genome, these BESs were subjected to BLAST analysis using the human draft sequence, revealing 31.5% significant hits (E < e(-5)). Both genic and non-genic regions of homology contributed to the alignments between the human and porcine genomes. Porcine BESs with unique homology matches within the human genome provided a source of markers spaced approximately 70 to 300 kb along each human chromosome. In order to evaluate the utility of piggy-BACing human genome sequences, and confirm predictions of orthology, 193 evenly spaced BESs with similarity to HSA3 and HSA21 were selected and then utilized for developing a high-resolution (1.22 Mb) comparative radiation hybrid map of SSC13 that represents a fusion of HSA3 and HSA21. Resulting RH mapping of SSC13 covers 99% and 97% of HSA3 and HSA21, respectively. Seven evolutionary conserved blocks were identified including six on HSA3 and a single syntenic block corresponding to HSA21. The strategy of piggy-BACing the human genome described in this study demonstrates that through a directed, targeted comparative genomics approach construction of a high-resolution anchored physical map of the pig genome can be achieved. This map supports the selection of BACs to construct a minimal tiling path for genome sequencing and targeted gap filling. Moreover, this approach is highly relevant to other genome sequencing projects.  相似文献   

11.
Pigeonpea (Cajanus cajan), an important food legume crop in the semi-arid regions of the world and the second most important pulse crop in India, has an average crop productivity of 780 kg/ha. The relatively low crop yields may be attributed to non-availability of improved cultivars, poor crop husbandry and exposure to a number of biotic and abiotic stresses in pigeonpea growing regions. Narrow genetic diversity in cultivated germplasm has further hampered the effective utilization of conventional breeding as well as development and utilization of genomic tools, resulting in pigeonpea being often referred to as an ‘orphan crop legume’. To enable genomics-assisted breeding in this crop, the pigeonpea genomics initiative (PGI) was initiated in late 2006 with funding from Indian Council of Agricultural Research under the umbrella of Indo-US agricultural knowledge initiative, which was further expanded with financial support from the US National Science Foundation’s Plant Genome Research Program and the Generation Challenge Program. As a result of the PGI, the last 3 years have witnessed significant progress in development of both genetic as well as genomic resources in this crop through effective collaborations and coordination of genomics activities across several institutes and countries. For instance, 25 mapping populations segregating for a number of biotic and abiotic stresses have been developed or are under development. An 11X-genome coverage bacterial artificial chromosome (BAC) library comprising of 69,120 clones have been developed of which 50,000 clones were end sequenced to generate 87,590 BAC-end sequences (BESs). About 10,000 expressed sequence tags (ESTs) from Sanger sequencing and ca. 2 million short ESTs by 454/FLX sequencing have been generated. A variety of molecular markers have been developed from BESs, microsatellite or simple sequence repeat (SSR)-enriched libraries and mining of ESTs and genomic amplicon sequencing. Of about 21,000 SSRs identified, 6,698 SSRs are under analysis along with 670 orthologous genes using a GoldenGate SNP (single nucleotide polymorphism) genotyping platform, with large scale SNP discovery using Solexa, a next generation sequencing technology, is in progress. Similarly a diversity array technology array comprising of ca. 15,000 features has been developed. In addition, >600 unique nucleotide binding site (NBS) domain containing members of the NBS-leucine rich repeat disease resistance homologs were cloned in pigeonpea; 960 BACs containing these sequences were identified by filter hybridization, BES physical maps developed using high information content fingerprinting. To enrich the genomic resources further, sequenced soybean genome is being analyzed to establish the anchor points between pigeonpea and soybean genomes. In addition, Solexa sequencing is being used to explore the feasibility of generating whole genome sequence. In summary, the collaborative efforts of several research groups under the umbrella of PGI are making significant progress in improving molecular tools in pigeonpea and should significantly benefit pigeonpea genetics and breeding. As these efforts come to fruition, and expanded (depending on funding), pigeonpea would move from an ‘orphan legume crop’ to one where genomics-assisted breeding approaches for a sustainable crop improvement are routine.  相似文献   

12.
Sugarcane has become an increasingly important first-generation biofuel crop in tropical and subtropical regions. It has a large, complex, polyploid genome that has hindered the progress of genomic research and marker-assisted selection. Genetic mapping and ultimately genome sequence assembly require a large number of DNA markers. Simple sequence repeats (SSRs) are widely used in genetic mapping because of their abundance, high rates of polymorphism, and ease of use. The objectives of this study were to develop SSR markers for construction of a saturated genetic map and to characterize the frequency and distribution of SSRs in a polyploid genome. SSR markers were mined from expressed sequence tag (EST), reduced representation library genomic sequences, and bacterial artificial chromosome (BAC) sequences. A total of 5,675 SSR markers were surveyed in a segregating population. The overall successful amplification and polymorphic rates were 87.9 and 16.4%, respectively. The trinucleotide repeat motifs were most abundant, with tri- and hexanucleotide motifs being the most abundant for the ESTs. BAC and genomic SSRs were mostly AT-rich while the ESTs were relatively GC-rich due to codon bias. These markers were also aligned to the sorghum genome, resulting in 1,203 markers mapped in the sorghum genome. This set of SSRs conserved in sugarcane and sorghum would be the most informative for mapping quantitative trait loci in sugarcane and for comparative genomic analyses. This large collection of SSR markers is a valuable resource for sugarcane genomic research and crop improvement.  相似文献   

13.
A bacterial artificial chromosome library for sugarcane   总被引:10,自引:0,他引:10  
Modern cultivated sugarcane is a complex aneuploid polyploid with an estimated genome size of 3000 Mb. Although most traits in sugarcane show complex inheritance, a rust locus showing monogenic inheritance has been documented. In order to facilitate cloning of the rust locus, we have constructed a bacterial artificial chromosome (BAC) library for the cultivar R570. The library contains 103,296 clones providing 4.5 sugarcane genome equivalents. A random sampling of 240 clones indicated an average insert size of 130 kb allowing a 98% probability of recovering any specific sequence of interest. High-density filters were gridded robotically using a Genetix Q-BOT in a 4 × 4 double-spotted array on 22.5-cm2 filters. Each set of five filters provides a genome coverage of 4x with 18,432 clones represented per filter. Screening of the library with three different barley chloroplast gene probes indicated an exceptionally low chloroplast DNA content of less than 1%. To demonstrate the library’s potential for map-based cloning, single-copy RFLP sugarcane mapping probes anchored to nine different linkage groups and three different gene probes were used to screen the library. The number of positive hybridization signals resulting from each probe ranged from 8 to 60. After determining addresses of the signals, clones were evaluated for insert size and HindIII-fingerprinted. The fingerprints were then used to determine clone relationships and assemble contigs. For comparison with other monocot genomes, sugarcane RFLP probes were also used to screen a Sorghum bicolor BAC library and two rice BAC libraries. The rice and sorghum BAC clones were characterized for insert size and fingerprinted, and the results compared to sugarcane. The library was screened with a rust resistance RFLP marker and candidate BAC clones were subjected to RFLP fragment matching to identify those corresponding to the same genomic region as the rust gene. Received: 12 September 1998 / Accepted: 12 March 1999  相似文献   

14.
In an effort to increase the density of sequence-based markers for the horse genome we generated 9473 BAC end sequences (BESs) from the CHORI-241 BAC library with an average read length of 677 bp. BLASTN searches with the BESs revealed 4036 meaningful hits (E 相似文献   

15.
As part of a larger project to sequence the Populus genome and generate genomic resources for this emerging model tree, we constructed a physical map of the Populus genome, representing one of the few such maps of an undomesticated, highly heterozygous plant species. The physical map, consisting of 2802 contigs, was constructed from fingerprinted bacterial artificial chromosome (BAC) clones. The map represents approximately 9.4-fold coverage of the Populus genome, which has been estimated from the genome sequence assembly to be 485 ± 10 Mb in size. BAC ends were sequenced to assist long-range assembly of whole-genome shotgun sequence scaffolds and to anchor the physical map to the genome sequence. Simple sequence repeat-based markers were derived from the end sequences and used to initiate integration of the BAC and genetic maps. A total of 2411 physical map contigs, representing 97% of all clones assigned to contigs, were aligned to the sequence assembly (JGI Populus trichocarpa , version 1.0). These alignments represent a total coverage of 384 Mb (79%) of the entire poplar sequence assembly and 295 Mb (96%) of linkage group sequence assemblies. A striking result of the physical map contig alignments to the sequence assembly was the co-localization of multiple contigs across numerous regions of the 19 linkage groups. Targeted sequencing of BAC clones and genetic analysis in a small number of representative regions showed that these co-aligning contigs represent distinct haplotypes in the heterozygous individual sequenced, and revealed the nature of these haplotype sequence differences.  相似文献   

16.
Chinese hamster ovary (CHO) cells have frequently been used in biotechnology as a mammalian host cell platform for expressing genes of interest. Previously, we constructed a detailed physical chromosomal map of the CHO DG44 cell line by fluorescence in situ hybridization (FISH) imaging using 303 bacterial artificial chromosome (BAC) clones as hybridization probes (BAC-FISH). BAC-FISH results revealed that the two longest chromosomes were completely paired. However, other chromosomes featured partial deletions or rearrangements. In this study, we determined the end sequences of 303 BAC clones (BAC end sequences), which were used for BAC-FISH probes. Among 606 BAC-end sequences (BESs) (forward and reverse ends), 558 could be determined. We performed a comparison between all determined BESs and mouse genome sequences using NCBI BLAST. Among these 558 BESs, 465 showed high homology to mouse chromosomal sequences. We analyzed the locations of these BACs in chromosomes of the CHO DG44 cell line using a physical chromosomal map. From the obtained results, we investigated the regional similarities among CHO chromosomes (A–T) and mouse chromosomes (1–19 and sex) about 217 BESs (46.7% of 465 high homologous BESs). Twenty-three specific narrow regions in 13 chromosomes of the CHO DG44 cell line showed high homology to mouse chromosomes, but most of other regions did not show significant correlations with the mouse genome. These results contribute to accurate alignments of chromosomes of Chinese hamster and its genome sequence, analysis of chromosomal instability in CHO cells, and the development of target locations for gene and/or genome editing techniques.  相似文献   

17.
In this report we present the results of the analysis of approximately 2.7 Mb of genomic information for the American mink (Neovison vison) derived through BAC end sequencing. Our study, which encompasses approximately 1/1000th of the mink genome, suggests that simple sequence repeats (SSRs) are less common in the mink than in the human genome, whereas the average GC content of the mink genome is slightly higher than that of its human counterpart. The 2.7 Mb mink genomic dataset also contained 2,416 repeat elements (retroids and DNA transposons) occupying almost 31% of the sequence space. Among repeat elements, LINEs were over-represented and endogenous viruses (aka LTRs) under-represented in comparison to the human genome. Finally, we present a virtual map of the mink genome constructed with reference to the human and canine genome assemblies using a comparative genomics approach and incorporating over 200 mink BESs with unique hits to the human genome.  相似文献   

18.
A bacterial artificial chromosome (BAC) library containing a large genomlc DNA insert is an important tool for genome physical mapping, map-based cloning, and genome sequencing. To Isolate genes via a map-based cloning strategy and to perform physical mapping of the cotton genome, a high-quality BAC library containing large cotton DNA Inserts Is needed. We have developed a BAC library of the restoring line 0-613-2R for Isolating the fertility restorer (Rf1) gene and genomic research in cotton (Gossypium hirsutum L.). The BAC library contains 97 825 clones stored In 255 pieces of a 384-well mlcrotiter plate. Random samples of BACs digested with the Notl enzyme Indicated that the average Insert size Is approximately 130 kb, with a range of 80-275 kb, and 95.7% of the BAC clones in the library have an average insert size larger than 100 kb. Based on a cotton genome size of 2 250 Mb, library coverage is 5.7 × haploid genome equivalents. Four clones were selected randomly from the library to determine the stability of the BAC clones. There were no different fingerprints for 0 and 100 generations of each clone digested with Notl and Hlndiii enzymes. Thus, the atabiiity of a single BAC clone can be sustained at iesat for 100 generations. Eight simple sequence repeat (SSR) markers flanking the Rf; gene were chosen to screen the BAC library by pool using PCR method and 25 positive clones were identified with 3.1 positive clones per SSR marker.  相似文献   

19.
Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (WGS). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with McrBC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene‐enrichment strategy, we have compared assemblies using methyl‐filtered (MF) and unfiltered (UF) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences (CDS) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (BACs), 97.2% of sugarcane expressed sequence tags (ESTs), 92.7% of sugarcane RNA‐seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single‐nucleotide polymorphisms (SNPs) in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes was also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and for improvement of sugarcane as a biofuel crop.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号