首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The accurate prediction of higher eukaryotic gene structures and regulatory elements directly from genomic sequences is an important early step in the understanding of newly assembled contigs and finished genomes. As more new genomes are sequenced, comparative approaches are becoming increasingly practical and valuable for predicting genes and regulatory elements. We demonstrate the effectiveness of a comparative method called pattern filtering; it utilizes synteny between two or more genomic segments for the annotation of genomic sequences. Pattern filtering optimally detects the signatures of conserved functional elements despite the stochastic noise inherent in evolutionary processes, allowing more accurate annotation of gene models. We anticipate that pattern filtering will facilitate sequence annotation and the discovery of new functional elements by the genetics and genomics communities.  相似文献   

2.
Gene transfer and gene mapping in mammalian cells in culture   总被引:1,自引:0,他引:1  
The ability to transfer mammalian genes parasexually has opened new possibilities for gene mapping and fine structure mapping and offers great potential for contributing to several aspects of mammalian biology, including gene expression and genetic engineering. The DNA transferred has ranged from whole genomes to single genes and smaller segments of DNA. The transfer of whole genomes by cell fusion forms cell hybrids, which has promoted the extensive mapping of human and mouse genes. Transfer, by cell fusion, of rearranged chromosomes has contributed significantly to determining close linkage and the assignment of genes to specific chromosomal regions. Transfer of single chromosomes has been achieved utilizing microcells fused to recipient cells. Metaphase chromosomes have been isolated and used to transfer single-to-multigenic DNA segments. DNA-mediated gene transfer, simulating bacterial transformation, has achieved transfer of single-copy genes. By utilizing DNA cleaved with restriction endonucleases, gene transfer is being empolyed as a bioassay for the purification of genes. Gene mapping and the fate of transferred genes can be examined now at the molecular level using sequence-specific probles. Recently, single genes have been cloned into eucaryotic and procaryotic vectors for transfer into mammalian cells. Moreover, recombinant libraries in which entire mammalian genomes are represented collectively are a rich new source of transferable genes. Methodology for transferring mammalian genetic information and applications for mapping mammalian genes is presented and prospects for the future discussed.  相似文献   

3.
Summary The ability to transfer mammalian genes parasexually has opened new possibilities for gene mapping and fine structure mapping and offers great potential for contributing to several aspects of mammalian biology, including gene expression and genetic engineering. The DNA transferred has ranged from whole genomes to single genes and smaller segments of DNA. The transfer of whole genomes by cell fusion forms cell hybrids, which has promoted the extensive mapping of human and mouse genes. Transfer, by cell fusion, of rearranged chromosomes has contributed significantly to determining close linkage and the assignment of genes to specific chromosomal regions. Transfer of single chromosomes has been achieved utilizing microcells fused to recipient cells. Metaphase chromosomes have been isolated and used to transfer single-to-multigenic DNA segments. DNA-mediated gene transfer, simulating bacterial transformation, has achieved transfer of single-copy genes. By utilizing DNA cleaved with restriction endonucleases, gene transfer is being employed as a bioassay for the purification of genes. Gene mapping and the fate of transferred genes can be examined now at the molecular level using sequence-specific probes. Recently, single genes have been clones into eucaryotic and procaryotic vectors for transfer into mammalian cells. Moreover, recombinant libraries in which entire mammalian genomes are represented collectively are a rich new source of transferable genes. Methodology for transferring mammalian genetic information and applications for mapping mammalian genes is presented and prospects for the future discussed. Presented in the symposium on Gene Transfer, Differentiation and Neoplasia in Plant and Animal Cells at the 30th Annual Meeting of the Tissue Culture Association, Seattle, Washington, June 10–14, 1979. This symposium was supported in part by Grant CA 26748 from the National Cancer Institute, DHEW, and Grant RD-67 from the American Cancer Society. Supported by NIH grants HD 05196 and GM 20454 and by MOD grants 1-485 and 1-692.  相似文献   

4.
Shi G  Peng MC  Jiang T 《PloS one》2011,6(6):e20892
The identification of orthologous genes shared by multiple genomes plays an important role in evolutionary studies and gene functional analyses. Based on a recently developed accurate tool, called MSOAR 2.0, for ortholog assignment between a pair of closely related genomes based on genome rearrangement, we present a new system MultiMSOAR 2.0, to identify ortholog groups among multiple genomes in this paper. In the system, we construct gene families for all the genomes using sequence similarity search and clustering, run MSOAR 2.0 for all pairs of genomes to obtain the pairwise orthology relationship, and partition each gene family into a set of disjoint sets of orthologous genes (called super ortholog groups or SOGs) such that each SOG contains at most one gene from each genome. For each such SOG, we label the leaves of the species tree using 1 or 0 to indicate if the SOG contains a gene from the corresponding species or not. The resulting tree is called a tree of ortholog groups (or TOGs). We then label the internal nodes of each TOG based on the parsimony principle and some biological constraints. Ortholog groups are finally identified from each fully labeled TOG. In comparison with a popular tool MultiParanoid on simulated data, MultiMSOAR 2.0 shows significantly higher prediction accuracy. It also outperforms MultiParanoid, the Roundup multi-ortholog repository and the Ensembl ortholog database in real data experiments using gene symbols as a validation tool. In addition to ortholog group identification, MultiMSOAR 2.0 also provides information about gene births, duplications and losses in evolution, which may be of independent biological interest. Our experiments on simulated data demonstrate that MultiMSOAR 2.0 is able to infer these evolutionary events much more accurately than a well-known software tool Notung. The software MultiMSOAR 2.0 is available to the public for free.  相似文献   

5.
CpG islands (CGIs) are CpG-rich regions compared to CpG-depleted bulk DNA of mammalian genomes and are generally regarded as the epigenetic regulatory regions in association with unmethylation, promoter activity and histone modifications. Accurate identification of CpG islands with epigenetic regulatory function in bulk genomes is of wide interest. Here, the common features of functional CGIs are identified using an average mutual information method to differentiate functional CGIs from the remaining CGIs. A new approach (CpG mutual information, CpG_MI) was further explored to identify functional CGIs based on the cumulative mutual information of physical distances between two neighboring CpGs. Compared to current approaches, CpG_MI achieved the highest prediction accuracy. This approach also identified new functional CGIs overlapping with gene promoter regions which were missed by other algorithms. Nearly all CGIs identified by CpG_MI overlapped with histone modification marks. CpG_MI could also be used to identify potential functional CGIs in other mammalian genomes, as the CpG dinucleotide contents and cumulative mutual information distributions are almost the same among six mammalian genomes in our analysis. It is a reliable quantitative tool for the identification of functional CGIs from bulk genomes and helps in understanding the relationships between genomic functional elements and epigenomic modifications.  相似文献   

6.
Since operons are unstable across Prokaryotes, it has been suggested that perhaps they re-combine in a conservative manner. Thus, genes belonging to a given operon in one genome might re-associate in other genomes revealing functional relationships among gene products. We developed a system to build networks of functional relationships of gene products based on their organization into operons in any available genome. The operon predictions are based on inter-genic distances. Our system can use different kinds of thresholds to accept a functional relationship, either related to the prediction of operons, or to the number of non-redundant genomes that support the associations. We also work by shells, meaning that we decide on the number of linking iterations to allow for the complementation of related gene sets. The method shows high reliability benchmarked against knowledge-bases of functional interactions. We also illustrate the use of Nebulon in finding new members of regulons, and of other functional groups of genes. Operon rearrangements produce thousands of high-quality new interactions per prokaryotic genome, and thousands of confirmations per genome to other predictions, making it another important tool for the inference of functional interactions from genomic context.  相似文献   

7.
Regulatory DNA elements, short genomic segments that regulate gene expression, have been implicated in developmental disorders and human disease. Despite this clinical urgency, only a small fraction of the regulatory DNA repertoire has been confirmed through reporter gene assays. The overall success rate of functional validation of candidate regulatory elements is low. Moreover, the number and diversity of datasets from which putative regulatory elements can be identified is large and rapidly increasing. We generated a flexible and user-friendly tool to integrate the information from different types of genomic datasets, e.g. ATAC-seq, ChIP-seq, conservation, aiming to increase the ease and success rate of functional prediction. To this end, we developed the EMERGE program that merges all datasets that the user considers informative and uses a logistic regression framework, based on validated functional elements, to set optimal weights to these datasets. ROC curve analysis shows that a combination of datasets leads to improved prediction of tissue-specific enhancers in human, mouse and Drosophila genomes. Functional assays based on this prediction can be expected to have substantially higher success rates. The resulting integrated signal for prediction of functional elements can be plotted in a build-in genome browser or exported for further analysis.  相似文献   

8.
Gene fusion and fission events are key mechanisms in the evolution of gene architecture, whose effects are visible in protein architecture when they occur in coding sequences. Until now, the detection of fusion and fission events has been performed at the level of protein sequences with a post facto removal of supernumerary links due to paralogy, and often did not include looking for events defined only in single genomes. We propose a method for the detection of these events, defined on groups of paralogs to compensate for the gene redundancy of eukaryotic genomes, and apply it to the proteomes of 12 fungal species. We collected an inventory of 1,680 elementary fusion and fission events. In half the cases, both composite and element genes are found in the same species. Per-species counts of events correlate with the species genome size, suggesting a random mechanism of occurrence. Some biological functions of the genes involved in fusion and fission events are slightly over- or under-represented. As already noted in previous studies, the genes involved in an event tend to belong to the same functional category. We inferred the position of each event in the evolution tree of the 12 fungal species. The event localization counts for all the segments of the tree provide a metric that depicts the “recombinational” phylogeny among fungi. A possible interpretation of this metric as distance in adaptation space is proposed.  相似文献   

9.
10.
Despite several decades of investigation, the organization of angiosperm genomes remained largely unknown until very recently. Data describing the sequence composition of large segments of genomes, covering hundreds of kilobases of contiguous sequence, have only become available in the past two years. Recent results indicate commonalities in the characteristics of many plant genomes, including in the structure of chromosomal components like telomeres and centromeres, and in the order and content of genes. Major differences between angiosperms have been associated mainly with repetitive DNAs, both gene families and mobile elements. Intriguing new studies have begun to characterize the dynamic three-dimensional structures of chromosomes and chromatin, and the relationship between genome structure and co-ordinated gene function.  相似文献   

11.
Gene arrangement into operons varies between bacterial species. Genes in a given system can be on one operon in some organisms and on several operons in other organisms. Existing theories explain why genes that work together should be on the same operon, since this allows for advantageous lateral gene transfer and accurate stoichiometry. But what causes the frequent separation into multiple operons of co-regulated genes that act together in a pathway? Here we suggest that separation is due to benefits made possible by differential regulation of each operon. We present a simple mathematical model for the optimal distribution of genes into operons based on a balance of the cost of operons and the benefit of regulation that provides 'just-when-needed' temporal order. The analysis predicts that genes are arranged such that genes on the same operon do not skip functional steps in the pathway. This prediction is supported by genomic data from 137 bacterial genomes. Our work suggests that gene arrangement is not only the result of random historical drift, genome re-arrangement and gene transfer, but has elements that are solutions of an evolutionary optimization problem. Thus gene functional order may be inferred by analyzing the operon structure across different genomes.  相似文献   

12.
The structural and functional analysis of mammalian genomes would benefit from the ability to isolate from multiple DNA samples any targeted chromosomal segment that is the size of an average human gene. A cloning technique that is based on transformation-associated recombination (TAR) in the yeast Saccharomyces cerevisiae satisfies this need. It is a unique tool to selectively recover chromosome segments that are up to 250 kb in length from complex genomes. In addition, TAR cloning can be used to characterize gene function and genome variation, including polymorphic structural rearrangements, mutations and the evolution of gene families, and for long-range haplotyping.  相似文献   

13.
An in silico comparative genomics approach was used to identify putative orthologs to genetically mapped genes from the mosquito, Aedes aegypti, in the Drosophila melanogaster and Anopheles gambiae genome databases. Comparative chromosome positions of 73 D. melanogaster orthologs indicated significant deviations from a random distribution across each of the five A. aegypti chromosomal regions, suggesting that some ancestral chromosome elements have been conserved. However, the two genomes also reflect extensive reshuffling within and between chromosomal regions. Comparative chromosome positions of A. gambiae orthologs indicate unequivocally that A. aegypti chromosome regions share extensive homology to the five A. gambiae chromosome arms. Whole-arm or near-whole-arm homology was contradicted with only two genes among the 75 A. aegypti genes for which orthologs to A. gambiae were identified. The two genomes contain large conserved chromosome segments that generally correspond to break/fusion events and a reciprocal translocation with extensive paracentric inversions evident within. Only very tightly linked genes are likely to retain conserved linear orders within chromosome segments. The D. melanogaster and A. gambiae genome databases therefore offer limited potential for comparative positional gene determinations among even closely related dipterans, indicating the necessity for additional genome sequencing projects with other dipteran species.  相似文献   

14.
15.
Mammalian bicistronic mRNA is a recently discovered mammalian gene structure. Several reported cases of mammalian bicistronic mRNA indicated that genes of this structure play roles in some important biological processes. However, a genome-wide computational identification of bicistronic mRNA in mammalian genome, such as human genome, is still lacking. Here we used a comparative genomics approach to identify the frequency of human bicistronic mRNA. We then validated the result by using a new support vector machine (SVM) model. We identified 43 human bicistronic mRNAs in 30 distinct genes. Our literature analysis shows that our method recovered 100 % (6/6) of the previously known bicistronic mRNAs which had been experimentally confirmed by other groups. Our graph theory-based analysis and GO analysis indicated that human bicistronic mRNAs are prone to produce different yet closely functionally related proteins. In addition, we also described and analyzed three different mechanisms of ORF fusion. Our method of identifying bicistronic mRNAs in human genome provides a model for the computational identification of characteristic gene structures in mammalian genomes. We anticipate that our data will facilitate further molecular characterization and functional study of human bicistronic mRNA.  相似文献   

16.
17.
Sandy P  Ventura A  Jacks T 《BioTechniques》2005,39(2):215-224
Silencing of gene expression by RNA interference (RNAi) has become a powerful tool for the functional annotation of the Caenorhabditis elegans and Drosophila melanogaster genomes. Recent advances in the design and delivery of targeting molecules now permit efficient and highly specific gene silencing in mammalian systems as well. RNAi offers a simple, fast, and cost-effective alternative to existing gene targeting technologies both in cell-based and in vivo settings. Synthetic small interfering RNA (siRNA) and retroviral short hairpin RNA (shRNA) libraries targeting thousands of human and mouse genes are publicly available for high-throughput genetic screens, and knockdown animals can be rapidly generated by lentivirus-mediated transgenesis. RNAi also holds great promise as a novel therapeutic approach. This review provides insight into the current gene silencing techniques in mammalian systems.  相似文献   

18.
Conserved synteny––the sharing of at least one orthologous gene by a pair of chromosomes from two species––can, in the strictest sense, be viewed as sequence conservation between chromosomes of two related species, irrespective of whether coding or non-coding sequence is examined. The recent sequencing of multiple vertebrate genomes indicates that certain chromosomal segments of considerable size are conserved in gene order as well as underlying non-coding sequence across all vertebrates. Some of these segments lost genes or non-coding sequence and/or underwent breakage only in teleost genomes, presumably because evolutionary pressure acting on these regions to remain intact were relaxed after an additional round of whole genome duplication. Random reporter insertions into zebrafish chromosomes combined with computational genome-wide analysis indicate that large chromosomal areas of multiple genes contain long-range regulatory elements, which act on their target genes from several gene distances away. In addition, computational breakpoint analyses suggest that recurrent evolutionary breaks are found in “fragile regions” or “hotspots”, outside of the conserved blocks of synteny. These findings cannot be accommodated by the random breakage model and suggest that this view of genome and chromosomal evolution requires substantial reassessment.  相似文献   

19.
Many genes are involved in mammalian cell apoptosis pathway. These apoptosis genes often contain characteristic functional domains, and can be classified into at least 15 functional groups, according to previous reports. Using an integrated bioinformatics platform for motif or domain search from three public mammalian proteomes (International Protein Index database for human, mouse, and rat), we systematically cataloged all of the proteins involved in mammalian apoptosis pathway. By localizing those proteins onto the genomes, we obtained a gene locus centric apoptosis gene catalog for human, mouse and rat.Further phylogenetic analysis showed that most of the apoptosis related gene loci are conserved among these three mammals. Interestingly, about one-third of apoptosis gene loci form gene clusters on mammal chromosomes, and exist in the three species, which indicated that mammalian apoptosis gene orders are also conserved. In addition, some tandem duplicated gene loci were revealed by comparing gene loci clusters in the three species. All data produced in this work were stored in a relational database and may be viewed at http://pcas.cbi.pku.edu.cn/database/apd.php.  相似文献   

20.
During evolution genes can produce more complex proteins by gene fusion or less complex proteins by gene fission. Considering proteins from 131 completely sequenced genomes from all three kingdoms of life, we identified 2869 groups of multi-domain proteins as a single protein in certain organisms and as two or more smaller proteins with equivalent domain architectures in other organisms. We found that fusion events are approximately four times more common than fission events, and we established that, in most cases, any particular fusion or fission event only occurred once during the course of evolution.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号