首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A genomic pattern of new gene origination is often dependent on a genomic method that can efficiently identify a statistically adequate number of recently originated genes. The heterochromatic regions have often been viewed as genomic deserts with low coding potential and thus a low flux of new genes. However, increasing reports revealed unexpected roles of heterochromatic regions in the evolution of genes and genomes. We identified recently retroposed genes that originated in heterochromatic regions in Drosophila, by developing microarray-based comparative genomic hybridization (CGH) with multiple species. This new gene family, named Ifc-2h, originated in the common ancestor of the clade of D. simulans, D. mauritiana, and D. sechellia. The sequence features and phylogenetic distribution indicated that Ifc-2h resulted from the retroposition from its parental gene, Infertile crescent (Ifc), and integrated into heterochromatic region of common ancester of the three sibling species 2 million years ago. Expression analysis revealed that Ifc-2h had developed a new expression pattern by recruiting a putative regulatory element from its target sequence. The distribution of indel variation in Ifc-2h of D. simulans and D. mauritiana revealed a significant sequence constraint, suggesting that the Ifc-2h gene may be functional. These analyses cast fresh insight into the evolution of heterochromatin and the origin of its coding regions. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Martin Kreitman]  相似文献   

2.
AGO proteins are universal effectors of eukaryotic small RNA-directed regulatory pathways. In this study, we used a comparative genomics approach to explore the AGO sub-family in the teleost clade. We identified five Ago homologues in teleost genomes, one more than encoded in other vertebrate clades. The additional teleost homologue was preserved most likely due to the differential retention of regulatory elements following the fish-specific genome duplication event that occurred approximately 350 million years ago. Analysis of all five Ago genomic loci in teleosts revealed that orthologues contain specific, conserved sequence elements in non-coding regions indicating that the teleost Ago paralogues are differentially regulated. This was supported by qRT-PCR analysis that showed differential expression of the zebrafish homologues across development and between adult tissues indicating stage and tissue-specific function of individual AGO proteins. Multiple sequence alignments showed not only that all teleost homologues possess critical residues for AGO function, but also that teleost homologues contain multiple orthologue-specific features, indicative of structural diversification. Notably, these are retained throughout the vertebrate lineage arguing these may be important for orthologue-specific functions.  相似文献   

3.
Microsatellites physically linked to expressed sequence tags (EST-SSRs) are an important resource for linkage mapping and comparative genomics, and data mining in publicly available EST databases is a common strategy for EST-SSR discovery. At present, many species lack species-specific EST sequence data needed for the efficient characterization of EST-SSRs. This paper describes the discovery and development of EST-SSRs for red drum (Sciaenops ocellatus), an estuarine-dependent sciaenid species of economic importance in the USA and elsewhere, using a phylogenetically informed, comparative genomics approach to primer design. The approach entailed comparing existing genomic resources from species closely allied phylogenetically to red drum, with resources from more distantly related outgroup species. By taking into account the degree to which flanking regions are conserved across taxa, the efficiency of PCR primer design was increased greatly. The amplification success rate for primers designed for red drum was 100?% when using EST libraries from confamilial species and 92?% when using an EST library from a species in the same suborder. The primers developed also amplified EST-SSRs in a wide range of perciform fishes, suggesting potential use in comparative genomics. This study demonstrates that EST-SSRs can be efficiently developed for an organism when limited species-specific data are available by exploiting genomic resources from well-studied species, even those at extended phylogenetic distances.  相似文献   

4.
Genes that show complex tissue-specific and temporal control by regulatory elements located outside their promoters present a considerable challenge to identify the sequences involved. The rapid accumulation of genomic sequence information for a number of species has enabled a comparative phylogenetic approach to find important regulatory elements. For some genes, which show a similar pattern of expression in humans and rodents, genomic sequence information for these two species may be sufficient. Others, such as the cystic fibrosis transmembrane conductance regulator (CFTR) gene, show significant divergence in expression patterns between mouse and human, necessitating phylogenetic approaches involving additional species. The ovine CFTR gene has a temporal and spatial expression pattern that is very similar to that of human CFTR. Comparative genomic sequence analysis of ovine and human CFTR identified high levels of homology between the core elements in several potential regulatory elements defined as DNase I hypersensitive sites in human CFTR. These data provide a case for the power of an artiodactyl genome to contribute to the understanding of human genetic disease.  相似文献   

5.
The accurate prediction of higher eukaryotic gene structures and regulatory elements directly from genomic sequences is an important early step in the understanding of newly assembled contigs and finished genomes. As more new genomes are sequenced, comparative approaches are becoming increasingly practical and valuable for predicting genes and regulatory elements. We demonstrate the effectiveness of a comparative method called pattern filtering; it utilizes synteny between two or more genomic segments for the annotation of genomic sequences. Pattern filtering optimally detects the signatures of conserved functional elements despite the stochastic noise inherent in evolutionary processes, allowing more accurate annotation of gene models. We anticipate that pattern filtering will facilitate sequence annotation and the discovery of new functional elements by the genetics and genomics communities.  相似文献   

6.
7.
The Human Genome Project stimulated the development of efficient strategies and relevant hardware for complete genome sequencing. The comparative genomic approach extends the possibilities of using the sequencing data to identify new genes or conserved regulatory regions by means of nucleotide sequence alignment of the particular regions of the mouse and human genomes, or to trace the evolutionary events resulting in the genome structure of modern mammals. The review focuses on the use of new molecular cytogenetic methods along with computer-aided analysis of the genomes in vertebrates. Several factors hindering data analysis are considered. The currently available information on gene evolution rate inferred from comparative genomic data is presented. The origin and evolution of the genomes of several species are discussed.  相似文献   

8.
9.
The Human Genome Project stimulated the development of efficient strategies and relevant hardware for complete genome sequencing. The comparative genomic approach extends the possibilities of using the sequencing data to identify new genes or conserved regulatory regions by means of nucleotide sequence alignment of the particular regions of the mouse and human genomes, or to trace the evolutionary events resulting in the genome structure of modern mammals. The review focuses on the use of new molecular cytogenetic methods along with computer-aided analysis of the genomes in vertebrates. Several factors hindering data analysis are considered. The currently available information on gene evolution rate inferred from comparative genomic data is presented. The origin and evolution of the genomes of several species are discussed.  相似文献   

10.
11.
12.
13.
Prakash A  Tompa M 《Nature biotechnology》2005,23(10):1249-1256
We have analyzed issues of reliability in studies in which comparative genomic approaches have been applied to the discovery of regulatory elements at a genome-wide level in vertebrates. We point out some potential problems with such studies, including difficulties in accurately identifying orthologous promoter regions. Many of these subtle analytical problems have become apparent only when studying the more complex vertebrate genomes. By determining motif reliability, we compared existing tools when applied to the discovery of vertebrate regulatory elements. We then used a statistical clustering method to produce a computational catalog of high quality putative regulatory elements from vertebrates, some of which are widely conserved among vertebrates and many of which are novel regulatory elements. The results provide a glimpse into the wealth of information that comparative genomics can yield and suggest the need for further improvement of genome-wide comparative computational techniques.  相似文献   

14.
Comparative genomics of multiple related species is a powerful methodology for the discovery of functional genomic elements, and its power should increase with the number of species compared. Here, we use 12 Drosophila genomes to study the power of comparative genomics metrics to distinguish between protein-coding and non-coding regions. First, we study the relative power of different comparative metrics and their relationship to single-species metrics. We find that even relatively simple multi-species metrics robustly outperform advanced single-species metrics, especially for shorter exons (≤240 nt), which are common in animal genomes. Moreover, the two capture largely independent features of protein-coding genes, with different sensitivity/specificity trade-offs, such that their combinations lead to even greater discriminatory power. In addition, we study how discovery power scales with the number and phylogenetic distance of the genomes compared. We find that species at a broad range of distances are comparably effective informants for pairwise comparative gene identification, but that these are surpassed by multi-species comparisons at similar evolutionary divergence. In particular, while pairwise discovery power plateaued at larger distances and never outperformed the most advanced single-species metrics, multi-species comparisons continued to benefit even from the most distant species with no apparent saturation. Last, we find that genes in functional categories typically considered fast-evolving can nonetheless be recovered at very high rates using comparative methods. Our results have implications for comparative genomics analyses in any species, including the human.  相似文献   

15.
The ability to align pairs of very large molecular sequences is essential for a range of comparative genomic studies. However, given the complexity of genomic sequences, it has been difficult to devise a systematic method that can align - even within the same species - pairs of large sequences. Most existing approaches typically attempt to align nucleotide sequences while ignoring valuable features contained within them, eg they filter out low-complexity regions and retroelements before aligning the sequences. However, features are then added post-alignment for visualisation and analysis purposes. We argue that repetitive elements and other features (such as genes, exons and regulatory elements) should be part of the alignment process. A hierarchical approach that aligns the biologically relevant features before aligning the detailed nucleotide sequences has a number of interesting characteristics: (1) features define 'alignment anchor points' that can guide meaningful nucleotide alignment; (2) features can be weighted; (3) a hierarchical approach would identify only meaningful regions to be aligned; (4) nucleotide sequences can be described as sequences of features and non-features, providing a natural mechanism to divide the sequences for processing; and (5) computational speed is significantly faster than other approaches. In this paper, we describe and discuss a feature-based approach to aligning large genome sequences. We refer to this as 'feature-based sequence alignment'.  相似文献   

16.
Combining high‐throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short‐read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long‐read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom‐made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage‐specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long‐read capture as a versatile approach for comparative genomic studies by generation of a cross‐species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes.  相似文献   

17.
Members of the family Vibrionaceae are generally found in marine and brackish environments, playing important roles in nutrient cycling. The Rumoiensis clade is an unconventional group in the genus Vibrio, currently comprising six species from different origins including two species isolated from non-marine environments. In this study, we performed comparative genome analysis of all six species in the clade using their complete genome sequences. We found that two non-marine species, Vibrio casei and Vibrio gangliei, lacked the genes responsible for algal polysaccharide degradation, while a number of glycoside hydrolase genes were enriched in these two species. Expansion of insertion sequences was observed in V. casei and Vibrio rumoiensis, which suggests ongoing genomic changes associated with niche adaptations. The genes responsible for the metabolism of glucosylglycerate, a compound known to play a role as compatible solutes under nitrogen limitation, were conserved across the clade. These characteristics, along with genes encoding species-specific functions, may reflect the habit expansion which has led to the current distribution of Rumoiensis clade species. Genome analysis of all species in a single clade give us valuable insights into the genomic background of the Rumoiensis clade species and emphasize the genomic diversity and versatility of Vibrionaceae.  相似文献   

18.
The explosion in genomic sequence available in public databases has resulted in an unprecedented opportunity for computational whole genome analyses. A number of promising comparative-based approaches have been developed for gene finding, regulatory element discovery and other purposes, and it is clear that these tools will play a fundamental role in analysing the enormous amount of new data that is currently being generated. The synthesis of computationally intensive comparative computational approaches with the requirement for whole genome analysis represents both an unprecedented challenge and opportunity for computational scientists. We focus on a few of these challenges, using by way of example the problems of alignment, gene finding and regulatory element discovery, and discuss the issues that have arisen in attempts to solve these problems in the context of whole genome analysis pipelines.  相似文献   

19.
The evolution of noncoding DNA: how much junk, how much func?   总被引:1,自引:0,他引:1  
Comparative sequence analysis on a genomic scale has opened the door for the systematic analysis of cis-acting regulatory DNA. It is now possible to begin to answer basic questions such as, how much meaningful noncoding sequence is in the genome? How strong is natural selection on functional noncoding sequences in different species? Two recent articles have capitalized on the comparative genomic approach in an attempt to answer these questions with surprising results.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号