首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
MOTIVATION: Copy number profiling methods aim at assigning DNA copy numbers to chromosomal regions using measurements from microarray-based comparative genomic hybridizations. Among the proposed methods to this end, Hidden Markov Model (HMM)-based approaches seem promising since DNA copy number transitions are naturally captured in the model. Current discrete-index HMM-based approaches do not, however, take into account heterogeneous information regarding the genomic overlap between clones. Moreover, the majority of existing methods are restricted to chromosome-wise analysis. RESULTS: We introduce a novel Segmental Maximum A Posteriori approach, SMAP, for DNA copy number profiling. Our method is based on discrete-index Hidden Markov Modeling and incorporates genomic distance and overlap between clones. We exploit a priori information through user-controllable parameterization that enables the identification of copy number deviations of various lengths and amplitudes. The model parameters may be inferred at a genome-wide scale to avoid overfitting of model parameters often resulting from chromosome-wise model inference. We report superior performances of SMAP on synthetic data when compared with two recent methods. When applied on our new experimental data, SMAP readily recognizes already known genetic aberrations including both large-scale regions with aberrant DNA copy number and changes affecting only single features on the array. We highlight the differences between the prediction of SMAP and the compared methods and show that SMAP accurately determines copy number changes and benefits from overlap consideration.  相似文献   

2.
Libraries constructed in bacterial artificial chromosome (BAC) vectors have become the choice for clone sets in high throughput genomic sequencing projects primarily because of their high stability. BAC libraries have been proposed as a source for minimally over-lapping clones for sequencing large genomic regions, and the use of BAC end sequences (i.e. sequences adjoining the insert sites) has been proposed as a primary means for selecting minimally overlapping clones for sequencing large genomic regions. For this strategy to be effective, high throughput methods for BAC end sequencing of all the clones in deep coverage BAC libraries needed to be developed. Here we describe a low cost, efficient, 96 well procedure for BAC end sequencing. These methods allow us to generate BAC end sequences from human and Arabidoposis libraries with an average read length of >450 bases and with a single pass sequencing average accuracy of >98%. Application of BAC end sequences in genomic sequen-cing is discussed.  相似文献   

3.
Kim DW  Choi SH  Kim RN  Kim SH  Paik SG  Nam SH  Kim DW  Kim A  Kang A  Park HS 《Génome》2010,53(9):658-666
The sequencing and comparative genomic analysis of LMBR1 loci in mammals or other species, including human, would be very important in understanding evolutionary genetic changes underlying the evolution of limb development. In this regard, comparative genomic annotation of the false killer whale LMBR1 locus could shed new light on the evolution of limb development. We sequenced two false killer whale BAC clones, corresponding to 156 kb and 144 kb, respectively, harboring the tightly linked RNF32, LMBR1, and NOM1 genes. Our annotation of the false killer whale LMBR1 gene showed that it consists of 17 exons (1473 bp), in contrast to 18 exons (1596 bp) in human, and it displays 93.1% and 95.6% nucleotide and amino acid sequence similarity, respectively, compared with the human gene. In particular, we discovered that exon 10, deleted in the false killer whale LMBR1 gene, is present only in primates, and this fact strongly implies that exon 10 might be crucial in determining primate-specific limb development. ZRS and TFBS sequences have been well conserved across 11 species, suggesting that these regions could be involved in an important function of limb development and limb patterning. The neighboring gene RNF32 showed several lineage-conserved exons, such as exons 2 through 9 conserved in eutherian mammals, exons 3 through 9 conserved in mammals, and exons 5 through 9 conserved in vertebrates. The other neighboring gene, NOM1, had undergone a substitution (ATG→GTA) at the start codon, giving rise to a 36 bp shorter N-terminal sequence compared with the human sequence. Our comparative analysis of the false killer whale LMBR1 genomic locus provides important clues regarding the genetic regions that may play crucial roles in limb development and patterning.  相似文献   

4.
Sugar beet (Beta vulgaris) is an important crop plant that accounts for 30% of the world's sugar production annually. The genus Beta is a distant relative of currently sequenced taxa within the core eudicotyledons; the genomic characterization of sugar beet is essential to make its genome accessible to molecular dissection. Here, we present comprehensive genomic information in genetic and physical maps that cover all nine chromosomes. Based on this information we identified the proposed ancestral linkage groups of rosids and asterids within the sugar beet genome. We generated an extended genetic map that comprises 1127 single nucleotide polymorphism markers prepared from expressed sequence tags and bacterial artificial chromosome (BAC) end sequences. To construct a genome-wide physical map, we hybridized gene-derived oligomer probes against two BAC libraries with 9.5-fold cumulative coverage of the 758 Mbp genome. More than 2500 probes and clones were integrated both in genetic maps and the physical data. The final physical map encompasses 535 chromosomally anchored contigs that contains 8361 probes and 22 815 BAC clones. By using the gene order established with the physical map, we detected regions of synteny between sugar beet (order Caryophyllales) and rosid species that involves 1400-2700 genes in the sequenced genomes of Arabidopsis, poplar, grapevine, and cacao. The data suggest that Caryophyllales share the palaeohexaploid ancestor proposed for rosids and asterids. Taken together, we here provide extensive molecular resources for sugar beet and enable future high-resolution trait mapping, gene identification, and cross-referencing to regions sequenced in other plant species.  相似文献   

5.
Qianxing Mo  Faming Liang 《Biometrics》2010,66(4):1284-1294
Summary ChIP‐chip experiments are procedures that combine chromatin immunoprecipitation (ChIP) and DNA microarray (chip) technology to study a variety of biological problems, including protein–DNA interaction, histone modification, and DNA methylation. The most important feature of ChIP‐chip data is that the intensity measurements of probes are spatially correlated because the DNA fragments are hybridized to neighboring probes in the experiments. We propose a simple, but powerful Bayesian hierarchical approach to ChIP‐chip data through an Ising model with high‐order interactions. The proposed method naturally takes into account the intrinsic spatial structure of the data and can be used to analyze data from multiple platforms with different genomic resolutions. The model parameters are estimated using the Gibbs sampler. The proposed method is illustrated using two publicly available data sets from Affymetrix and Agilent platforms, and compared with three alternative Bayesian methods, namely, Bayesian hierarchical model, hierarchical gamma mixture model, and Tilemap hidden Markov model. The numerical results indicate that the proposed method performs as well as the other three methods for the data from Affymetrix tiling arrays, but significantly outperforms the other three methods for the data from Agilent promoter arrays. In addition, we find that the proposed method has better operating characteristics in terms of sensitivities and false discovery rates under various scenarios.  相似文献   

6.
Detection and characterization of chimeric yeast artificial-chromosome clones.   总被引:11,自引:0,他引:11  
Methods for the construction of yeast artificial-chromosome (YAC) clones have been designed to isolate single, large (100-1000 kb) segments of chromosomal DNA. It is apparent from early experience with this cloning system that the major artifact in YAC clones involves the formation of YACs that contain two or more unrelated pieces of DNA. Such "chimeric" YACs are not easily recognized, particularly in libraries constructed from the total DNA of an organism. In some libraries, they have been found to constitute a major fraction of the clones. Here we discuss some of our experiences with chimeric YACs, with particular emphasis on the approaches that we have employed to detect such aberrant clones. In addition, we describe the detailed characterization of one chimeric YAC isolated from a library prepared from total human DNA. The organization of this clone indicates that it formed by in vivo recombination, presumably in yeast, between two Alu sequences located on unrelated segments of human DNA.  相似文献   

7.
Aberrant DNA methylation imprints in aborted bovine clones   总被引:1,自引:0,他引:1  
Genomic imprinting plays a very important role during development and its abnormality may heavily undermine the developmental potential of bovine embryos. Because of limited resources of the cow genome, bovine genomic imprinting, both in normal development and in somatic cell nuclear transfer (SCNT) cloning, is not well documented. DNA methylation is thought to be a major factor for the establishment of genomic imprinting. In our study, we determined the methylation status of differential methylated regions (DMRs) of four imprinted genes in four spontaneously aborted SCNT-cloned fetuses (AF). Firstly, abnormal methylation imprints were observed in each individual to different extents. In particular, Peg3 and MAOA were either seriously demethylated or showed aberrant methylation patterns in four aborted clones we tested, but Xist and Peg10 exhibited relatively better maintained methylation status in AF1 and AF4. Secondly, two aborted fetuses, AF2 and AF3 exhibited severe aberrant methylation imprints of four imprinted genes. Finally, MAOA showed strong heterogeneous methylation patterns of its DMR in normal somatic adult tissue, but largely variable methylation levels and relatively homogeneous methylation patterns in aborted cloned fetuses. Our data indicate that the aborted cloned fetuses exhibited abnormal methylation imprints, to different extent, in aborted clones, which partially account for the higher abortion and developmental abnormalities during bovine cloning.  相似文献   

8.
MOTIVATION: The identification of DNA copy number changes provides insights that may advance our understanding of initiation and progression of cancer. Array-based comparative genomic hybridization (array-CGH) has emerged as a technique allowing high-throughput genome-wide scanning for chromosomal aberrations. A number of statistical methods have been proposed for the analysis of array-CGH data. In this article, we consider a fused quantile regression model based on three motivations: (1) quantile regression may provide a more comprehensive picture for the ratio profile of copy numbers than the standard mean regression approach; (2) for simplicity, most available methods assume uniform spacing between neighboring clones, while incorporating the information of physical locations of clones may be helpful and (3) most current methods have a set of tuning parameters that must be carefully tuned, which introduces complexity to the implementation. RESULTS: We formulate the detection of regions of gains and losses in a fused regularized quantile regression framework, incorporating physical locations of clones. We derive an efficient algorithm that computes the entire solution path for the resulting optimization problem, and we propose a simple estimate for the complexity of the fitted model, which leads to convenient selection of the tuning parameter. Three published array-CGH datasets are used to demonstrate our approach. AVAILABILITY: R code are available at http://www.stat.lsa.umich.edu/~jizhu/code/cgh/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

9.
10.
T Matsuoka  H Kato  K Hashimoto  Y Kurosawa 《Gene》1991,107(1):27-35
Long-range physical mapping with rare-cutting restriction enzymes (rare cutters) is an important step for structural analysis of complex genomes. Combination of two types of DNA clones bearing the rare-cutter sites, linking clones and jumping clones (Fig. 1a), facilitates the physical mapping [Poustka et al., Nature 325 (1987) 353-355]. A step followed by the physical mapping is the cloning of the large (rare-cutter-generated) restriction fragment of interest. For facilitating this step, we devised a method to directly clone a long restriction fragment without constructing the whole genomic DNA library using the jumping clone as starting material. The short DNA segments of a jumping clone, which are derived from the 5' and 3' terminal regions of the large restriction fragment, are inserted into the yeast artificial chromosome plasmid (pYAC) vector, and then converted into single strands with T7 gene 6-encoded 5'----3' exonuclease. The total genomic DNA digested with the restriction enzyme is also treated with the exonuclease to convert the terminal regions of the restriction fragments into single strands. In the resulting products, only the fragment corresponding to the jumping clone can form hybrids with the just-mentioned, single-stranded DNAs, which are connected to the pYAC, and only this fragment is cloned in yeast. We describe the protocol of this method with Escherichia coli DNA as a model experiment. Judging from the cloning efficiency, this method could be applied to cloning single-copy regions of the human genome, provided a jumping clone is available. The instability of inserts in the pYAC vector is also discussed.  相似文献   

11.
Characterization of the segmental duplication LCR7-20 in the human genome   总被引:1,自引:0,他引:1  
Liu X  Li X  Li M  Acimovic YJ  Li Z  Scherer SW  Estivill X  Tsui LC 《Genomics》2004,83(2):262-269
Our previous study described the amplification of a genomic sequence containing exon 9 of CFTR in the human genome. Here we report that this CFTR sequence is part of a large duplicated sequence unit, provisionally named LCR7-20. Through successive screening of two human chromosome 7-specific cosmid libraries to construct a cosmid contig, we assembled two sequenced BAC clones into a single contig containing a prototypic LCR7-20 unit. Subsequent searches of existing human genome sequences identified additional six copies of LCR7-20-like sequences with more than 90% sequence homology. Additional genomic clones containing LCR7-20-like sequences were then isolated from total genomic BAC and PAC libraries. Restriction fragment analysis and limited sequencing data indicated that there could be around 30 copies of LCR7-20-like sequences in the human genome and that the average region of homology could extend over 120 kb. As indicated by fluorescence in situ hybridization analysis, LCR7-20-like sequences are dispersed on different chromosomes, mainly in the centromeric and pericentromeric regions, and some may exist in tandem copies. Our study also indicates that many genomic regions containing LCR7-20's either have been misassembled or are missing in current versions of the human genome sequence.  相似文献   

12.
Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based on depth of coverage data generated by NGS technology. In this work, we have used a novel way to represent the read count data as a two dimensional geometrical point. A key aspect of detecting the regions with CNVs, is to devise a proper segmentation algorithm that will distinguish the genomic locations having a significant difference in read count data. We have designed a new segmentation approach in this context, using convex hull algorithm on the geometrical representation of read count data. To our knowledge, most algorithms have used a single distribution model of read count data, but here in our approach, we have considered the read count data to follow two different distribution models independently, which adds to the robustness of detection of CNVs. In addition, our algorithm calls CNVs based on the multiple sample analysis approach resulting in a low false discovery rate with high precision.  相似文献   

13.
具有同源重叠区的酵母人工染色体(YAC)可以利用酵母细胞减数分裂进行同源重组,从而构建更大的人工染色体基因组,这对生命科学基础研究和生物技术应用研究有着非常重要的意义。本实验以两个含人免疫球蛋白κ链基因簇片段的YAC克隆为材料,通过酵母改型、异型接合、二倍体发孢、单孢子筛选和分子生物学鉴定等技术和方法,利用酵母菌减数分裂同源重组机制,构建了一条包含人的免疫球蛋白κ轻链32个Vκ基因、5个Jκ基因、Cκ基因、Eκ基因和κde基因的YAC重组体,长度约400kb。同时,本实验利用溶壁酶消化法获取单孢子重组体,代替了传统的显微分孢操作。使得利用酵母人工染色体减数分裂同源重组的技术更加简便可行。  相似文献   

14.
Isolating high-priority segments of genomes greatly enhances the efficiency of next-generation sequencing (NGS) by allowing researchers to focus on their regions of interest. For the 2010–11 DNA Sequencing Research Group (DSRG) study, we compared outcomes from two leading companies, Agilent Technologies (Santa Clara, CA, USA) and Roche NimbleGen (Madison, WI, USA), which offer custom-targeted genomic enrichment methods. Both companies were provided with the same genomic sample and challenged to capture identical genomic locations for DNA NGS. The target region totaled 3.5 Mb and included 31 individual genes and a 2-Mb contiguous interval. Each company was asked to design its best assay, perform the capture in replicates, and return the captured material to the DSRG-participating laboratories. Sequencing was performed in two different laboratories on Genome Analyzer IIx systems (Illumina, San Diego, CA, USA). Sequencing data were analyzed for sensitivity, specificity, and coverage of the desired regions. The success of the enrichment was highly dependent on the design of the capture probes. Overall, coverage variability was higher for the Agilent samples. As variant discovery is the ultimate goal for a typical targeted sequencing project, we compared samples for their ability to sequence single-nucleotide polymorphisms (SNPs) as a test of the ability to capture both chromosomes from the sample. In the targeted regions, we detected 2546 SNPs with the NimbleGen samples and 2071 with Agilent''s. When limited to the regions that both companies included as baits, the number of SNPs was ∼1000 for each, with Agilent and NimbleGen finding a small number of unique SNPs not found by the other.  相似文献   

15.
16.
Advances in biotechnology have resulted in large-scale studies of DNA methylation. A differentially methylated region (DMR) is a genomic region with multiple adjacent CpG sites that exhibit different methylation statuses among multiple samples. Many so-called “supervised” methods have been established to identify DMRs between two or more comparison groups. Methods for the identification of DMRs without reference to phenotypic information are, however, less well studied. An alternative “unsupervised” approach was proposed, in which DMRs in studied samples were identified with consideration of nature dependence structure of methylation measurements between neighboring probes from tiling arrays. Through simulation study, we investigated effects of dependencies between neighboring probes on determining DMRs where a lot of spurious signals would be produced if the methylation data were analyzed independently of the probe. In contrast, our newly proposed method could successfully correct for this effect with a well-controlled false positive rate and a comparable sensitivity. By applying to two real datasets, we demonstrated that our method could provide a global picture of methylation variation in studied samples. R source codes to implement the proposed method were freely available at http://www.csjfann.ibms.sinica.edu.tw/eag/programlist/ICDMR/ICDMR.html.  相似文献   

17.
The immunoglobulin heavy chain isotype switch is mediated by a DNA rearrangement involving specific genomic segments referred to as switch regions. Switch regions are composed of tandemly repeated simple sequences. The role of the tandemly repeated structure of switch regions in the switch recombination process is not understood. We mapped eight recombination sites--six in the gamma 1 and two in the gamma 3 tandem arrays. In addition, we obtained molecular clones representing three of the six gamma 1 rearrangements, and determined the nucleotide sequences of the recombination sites in each. In general, the rearrangements are confined to the tandem repeat units, and are not clustered in a particular portion of either the gamma 3 or gamma 1 switch region. Nucleotide sequence analysis of one of the recombinant clones, gamma M35, reveals evidence for a successive switch event wherein a recombination between S mu and S gamma 3 was followed by recombination 57 bp downstream with S gamma 1. gamma 1 sequence data from the molecular clones we obtained, together with similar data from other investigators regarding the gamma 1, gamma 2b, and gamma 2a switch regions, reveals that recombinations tend to occur at homologous positions of the respective gamma-unit repeats, adjacent to the elements AGCT and GGGG found in each. This finding suggests that the cutting and religation step of the recombination process is mediated by a recombinase common to the four gamma-isotypes.  相似文献   

18.
The recently developed technique for cloning genomic DNA fragments of several hundred kilobases or more into yeast artificial chromosomes (YACs) makes it possible to isolate gene families while preserving their structural integrity. We have analyzed five independent yeast clones identified by PCR screening using oligonucleotides derived from the adult human beta-globin gene. Analysis of the five clones containing YACs by conventional and pulsed-field gel electrophoresis revealed that all of the clones include a YAC with sequences from the adult beta-globin gene as expected. One of the clones contains multiple, unstable YACs. Two other clones carry single YACs in which there are at least two unrelated human genomic inserts. The remaining two clones contain single YACs, 150 and 220 kb in size, that contain the entire beta-globin gene family and flanking regions in a single, structurally intact genomic fragment. These should prove useful in future studies of the regulation of expression of genes in the beta-globin gene cluster.  相似文献   

19.
? Transgenomics is the process of introducing genomic clones from a donor species into a recipient species and then screening the resultant transgenic lines for phenotypes of interest. This method might allow us to find genes involved in the evolution of phenotypic differences between species as well as genes that have the potential to contribute to reproductive isolation: potential speciation genes. ? More than 1100 20-kbp genomic clones from Leavenworthia alabamica were moved into Arabidopsis thaliana by transformation. After screening a single primary transformant for each line, clones associated with mutant phenotypes were tested for repeatability and co-segregation. ? We found 84 clones with possible phenotypic effects, of which eight were repeatedly associated with the same phenotype. One clone, 11_11B, co-segregated with a short fruit phenotype. Further study showed that 11_11B affects seed development, with as much as one-third of the seeds aborted in some fruit. ? Transgenomics is a viable strategy for discovering genes of evolutionary interest. We identify methods to reduce false positives and false negatives in the future. 11_11B can be viewed as a potential speciation gene, illustrating the value of transgenomics for studying the molecular basis of reproductive isolation.  相似文献   

20.
 A foxtail millet-rice comparative genetic map was constructed using mapped rice RFLP markers and wheat genomic and cDNA clones with known map position in rice. About 74% and 37% of the cDNA and genomic clones, respectively, were transferable to foxtail millet, confirming that conservation at the DNA level is greatest in genic regions. A high degree of conserved colinearity was observed between the two genomes. Five entire foxtail millet chromosomes appear to be colinear with five entire rice chromosomes. The remaining four foxtail millet linkage groups each show colinearity with segments of two rice chromosomes. The rearrangements of rice chromosomes 3 and 10 to form foxtail millet chromosome IX, and 7 and 9 to form chromosome II are very similar to those required to form maize chromosomes 1 and 7 and sorghum linkage groups C and B, indicating Setaria’s clear taxonomic position within the subfamily of the Panicoideae. Received: 18 December 1996 / Accepted: 4 August 1997  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号