首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Classification and nomenclature of all human homeobox genes   总被引:2,自引:0,他引:2  

Background

The homeobox genes are a large and diverse group of genes, many of which play important roles in the embryonic development of animals. Increasingly, homeobox genes are being compared between genomes in an attempt to understand the evolution of animal development. Despite their importance, the full diversity of human homeobox genes has not previously been described.

Results

We have identified all homeobox genes and pseudogenes in the euchromatic regions of the human genome, finding many unannotated, incorrectly annotated, unnamed, misnamed or misclassified genes and pseudogenes. We describe 300 human homeobox loci, which we divide into 235 probable functional genes and 65 probable pseudogenes. These totals include 3 genes with partial homeoboxes and 13 pseudogenes that lack homeoboxes but are clearly derived from homeobox genes. These figures exclude the repetitive DUX1 to DUX5 homeobox sequences of which we identified 35 probable pseudogenes, with many more expected in heterochromatic regions. Nomenclature is established for approximately 40 formerly unnamed loci, reflecting their evolutionary relationships to other loci in human and other species, and nomenclature revisions are proposed for around 30 other loci. We use a classification that recognizes 11 homeobox gene 'classes' subdivided into 102 homeobox gene 'families'.

Conclusion

We have conducted a comprehensive survey of homeobox genes and pseudogenes in the human genome, described many new loci, and revised the classification and nomenclature of homeobox genes. The classification scheme may be widely applicable to homeobox genes in other animal genomes and will facilitate comparative genomics of this important gene superclass.  相似文献   

2.
3.
Aquaporins, members of major intrinsic proteins (MIPs), transport water across cellular membranes and play vital roles in all organisms. Adversities such as drought, salinity, or chilling affect water uptake and transport, and numerous plant MIPs are reported to be differentially regulated under such stresses. However, MIP genes have been not yet been characterized in wheat, the largest cereal crop. We have identified 24 PIP and 11 TIP aquaporin genes from wheat by gene isolation and database searches. They vary extensively in lengths, numbers, and sequences of exons and introns, and sequences and cellular locations of predicted proteins, but the intron positions (if present) are characteristic. The putative PIP proteins show a high degree of conservation of signature sequences or residues for membrane integration, water transport, and regulation. The TIPs are more diverse, some with potential for water transport and others with various selectivity filters including a new combination. Most genes appear to be expressed as expressed sequence tags, while two are likely pseudogenes. Many of the genes are highly identical to rice but some are unique, and many correspond to genes that show differential expression under salinity and/or drought. The results provide extensive information for functional studies and developing markers for stress tolerance. Electronic supplementary material The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

4.
Comparative analysis of processed pseudogenes in the mouse and human genomes   总被引:16,自引:0,他引:16  
Pseudogenes are important resources in evolutionary and comparative genomics because they provide molecular records of the ancient genes that existed in the genome millions of years ago. We have systematically identified approximately 5000 processed pseudogenes in the mouse genome, and estimated that approximately 60% are lineage specific, created after the mouse and human diverged. In both mouse and human genomes, similar types of genes give rise to many processed pseudogenes. These tend to be housekeeping genes, which are highly expressed in the germ line. Ribosomal-protein genes, in particular, form the largest sub-group. The processed pseudogenes in the mouse occur with a distinctly different chromosomal distribution than LINEs or SINEs - preferentially in GC-poor regions. Finally, the age distribution of mouse-processed pseudogenes closely resembles that of LINEs, in contrast to human, where the age distribution closely follows Alus (SINEs).  相似文献   

5.
6.
MOTIVATION: Mammalian genomes contain many 'genomic fossils' i.e. pseudogenes. These are disabled copies of functional genes that have been retained in the genome by gene duplication or retrotransposition events. Pseudogenes are important resources in understanding the evolutionary history of genes and genomes. RESULTS: We have developed a homology-based computational pipeline ('PseudoPipe') that can search a mammalian genome and identify pseudogene sequences in a comprehensive and consistent manner. The key steps in the pipeline involve using BLAST to rapidly cross-reference potential "parent" proteins against the intergenic regions of the genome and then processing the resulting "raw hits" -- i.e. eliminating redundant ones, clustering together neighbors, and associating and aligning clusters with a unique parent. Finally, pseudogenes are classified based on a combination of criteria including homology, intron-exon structure, and existence of stop codons and frameshifts.  相似文献   

7.
8.

Background

Hair represents an evolutionary innovation that appeared early on mammalian evolutionary history, and presumably contributed significantly to the rapid radiation of the group. An interesting event in hair evolution has been its secondary loss in some mammalian groups, such as cetaceans, whose hairless phenotype appears to be an adaptive response to better meet the environmental conditions. To determine whether different repertoire of keratin genes among mammals can potentially explain the phenotypic hair features of different lineages, we characterized the type I and II clusters of alpha keratins from eight mammalian species, including the hairless dolphin and minke whale representing the order Cetacea.

Results

We combined the available genomic information with phylogenetic analysis to conduct a comprehensive analysis of the evolutionary patterns of keratin gene clusters. We found that both type I and II gene clusters are fairly conserved among the terrestrial mammals included in this study, with lineage specific gene duplication and gene loss. Nevertheless, there is also evidence for an increased rate of pseudogenization in the cetacean lineage when compared to their terrestrial relatives, especially among the hair type keratins.

Conclusions

Here we present a comprehensive characterization of alpha-keratin genes among mammals and elucidate the mechanisms involved in the evolution of this gene family. We identified lineage-specific gene duplications and gene loss among the Laurasiatherian and Euarchontoglires species included in the study. Interestingly, cetaceans present an increased loss of hair-type keratin genes when compared to other terrestrial mammals. As suggested by the ‘less-is-more’ hypothesis, we do not rule out the possibility that the gene loss of hair-type keratin genes in these species might be associated to the hairless phenotype and could have been adaptive in response to new selective pressures imposed by the colonization of a new habitat. Our study provides support for the idea that pseudogenes are not simply ‘genomic fossils’ but instead have adaptive roles during the evolutionary process.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-869) contains supplementary material, which is available to authorized users.  相似文献   

9.
10.
Determining the influence of complex, molecular-system dynamics on the evolution of proteins is hindered by the significant challenge of quantifying the control exerted by the proteins on system output. We have employed a combination of systems biology and molecular evolution analyses in a first attempt to unravel this relationship. We employed a comprehensive mathematical model of mammalian phototransduction to predict the degree of influence that each protein in the system exerts on the high-level dynamic behaviour. We found that the genes encoding the most dynamically sensitive proteins exhibit relatively relaxed evolutionary constraint. We also investigated the evolutionary and epistatic influences of the many nonlinear interactions between proteins in the system and found several pairs to have coevolved, including those whose interactions are purely dynamical with respect to system output. This evidence points to a key role played by nonlinear system dynamics in influencing patterns of molecular evolution.  相似文献   

11.
Based on searches for disabled homologs to known proteins, we have identified a large population of pseudogenes in four sequenced eukaryotic genomes—the worm, yeast, fly and human (chromosomes 21 and 22 only). Each of our nearly 2500 pseudogenes is characterized by one or more disablements mid-domain, such as premature stops and frameshifts. Here, we perform a comprehensive survey of the amino acid and nucleotide composition of these pseudogenes in comparison to that of functional genes and intergenic DNA. We show that pseudogenes invariably have an amino acid composition intermediate between genes and translated intergenic DNA. Although the degree of intermediacy varies among the four organisms, in all cases, it is most evident for amino acid types that differ most in occurrence between genes and intergenic regions. The same intermediacy also applies to codon frequencies, especially in the worm and human. Moreover, the intermediate composition of pseudogenes applies even though the composition of the genes in the four organisms is markedly different, showing a strong correlation with the overall A/T content of the genomic sequence. Pseudogenes can be divided into ‘ancient’ and ‘modern’ subsets, based on the level of sequence identity with their closest matching homolog (within the same genome). Modern pseudogenes usually have a much closer sequence composition to genes than ancient pseudogenes. Collectively, our results indicate that the composition of pseudogenes that are under no selective constraints progressively drifts from that of coding DNA towards non-coding DNA. Therefore, we propose that the degree to which pseudogenes approach a random sequence composition may be useful in dating different sets of pseudogenes, as well as to assess the rate at which intergenic DNA accumulates mutations. Our compositional analyses with the interactive viewer are available over the web at http://genecensus.org/pseudogene.  相似文献   

12.
13.
The identification and classification of genes and pseudogenes in duplicated regions still constitutes a challenge for standard automated genome annotation procedures. Using an integrated homology and orthology analysis independent of current gene annotation, we have identified 9,484 and 9,017 gene duplicates in human and mouse, respectively. On the basis of the integrity of their coding regions, we have classified them into functional and inactive duplicates, allowing us to define the first consistent and comprehensive collection of 1,811 human and 1,581 mouse unprocessed pseudogenes. Furthermore, of the total of 14,172 human and mouse duplicates predicted to be functional genes, as many as 420 are not included in current reference gene databases and therefore correspond to likely novel mammalian genes. Some of these correspond to partial duplicates with less than half of the length of the original source genes, yet they are conserved and syntenic among different mammalian lineages. The genes and unprocessed pseudogenes obtained here will enable further studies on the mechanisms involved in gene duplication as well as of the fate of duplicated genes.  相似文献   

14.

Background

By reshuffling genomes, structural genomic reorganizations provide genetic variation on which natural selection can work. Understanding the mechanisms underlying this process has been a long-standing question in evolutionary biology. In this context, our purpose in this study is to characterize the genomic regions involved in structural rearrangements between human and macaque genomes and determine their influence on meiotic recombination as a way to explore the adaptive role of genome shuffling in mammalian evolution.

Results

We first constructed a highly refined map of the structural rearrangements and evolutionary breakpoint regions in the human and rhesus macaque genomes based on orthologous genes and whole-genome sequence alignments. Using two different algorithms, we refined the genomic position of known rearrangements previously reported by cytogenetic approaches and described new putative micro-rearrangements (inversions and indels) in both genomes. A detailed analysis of the rhesus macaque genome showed that evolutionary breakpoints are in gene-rich regions, being enriched in GO terms related to immune system. We also identified defense-response genes within a chromosome inversion fixed in the macaque lineage, underlying the relevance of structural genomic changes in evolutionary and/or adaptation processes. Moreover, by combining in silico and experimental approaches, we studied the recombination pattern of specific chromosomes that have suffered rearrangements between human and macaque lineages.

Conclusions

Our data suggest that adaptive alleles – in this case, genes involved in the immune response – might have been favored by genome rearrangements in the macaque lineage.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-530) contains supplementary material, which is available to authorized users.  相似文献   

15.
Like many plants, Populus has an evolutionary history in which several, both recent and more ancient, genome duplication events have occurred and, therefore, constitutes an excellent model system for studying the functional evolution of genes. In the present study, we have focused on the properties of genes with tissue-specific differential expression patterns in poplar. We identified the genes by analyzing digital expression profiles derived by mapping 90,000+ expressed sequence tags (ESTs) from 18 sources to the predicted genes of Populus. Our sequence analysis suggests that tissue-specific differentially expressed genes have less diverged paralogs than average, indicating that gene duplication events is an important event in the pathway leading to this type of expression pattern. The functional analysis showed that genes coding for proteins involved in processes of functional importance for the specific tissue(s) in which they are expressed and genes coding for regulatory or responsive proteins are most common among the differentially expressed genes, demonstrating that the expression differentiation process is under strong selective pressure. Thus, our data supports a model where gene duplication followed by gene specialization or expansion of the regulatory and responsive networks leads to tissue-specific differential expression patterns. We have also searched for clustering of genes with similar expression pattern into gene-expression neighborhoods within the Populus genome. However, we could not detect any major clustering among the analyzed genes with highly specific expression patterns. Electronic supplementary material Supplementary material is available in the online version of this article at and is accessible for authorized users.  相似文献   

16.
17.
18.
Studies of neutrally evolving sequences suggest that differences in eukaryotic genome sizes result from different rates of DNA loss. However, very few pseudogenes have been identified in microbial species, and the processes whereby genes and genomes deteriorate in bacteria remain largely unresolved. The typhus-causing agent, Rickettsia prowazekii, is exceptional in that as much as 24% of its 1.1-Mb genome consists of noncoding DNA and pseudogenes. To test the hypothesis that the noncoding DNA in the R. prowazekii genome represents degraded remnants of ancestral genes, we systematically examined all of the identified pseudogenes and their flanking sequences in three additional Rickettsia species. Consistent with the hypothesis, we observe sequence similarities between genes and pseudogenes in one species and intergenic DNA in another species. We show that the frequencies and average sizes of deletions are larger than insertions in neutrally evolving pseudogene sequences. Our results suggest that inactivated genetic material in the Rickettsia genomes deteriorates spontaneously due to a mutation bias for deletions and that the noncoding sequences represent DNA in the final stages of this degenerative process.  相似文献   

19.
20.
Homma K  Fukuchi S  Kawabata T  Ota M  Nishikawa K 《Gene》2002,294(1-2):25-33
Pseudogenes are open reading frames (ORFs) encoding dysfunctional proteins with high homology to known protein-coding genes. Although pseudogenes were reported to exist in the genomes of many eukaryotes and bacteria, no systematic search for pseudogenes in the Escherichia coli genome has been carried out. Genome comparisons of E. coli strains K-12 and O157 revealed that many protein-coding sequences have prematurely terminated orthologs encoding unstable proteins. To systematically screen for pseudogenes, we selected ORFs generated by premature termination of the orthologous protein-coding genes and subsequently excluded those possibly arising from sequence errors. Lastly we eliminated those with close homologs in this and other species, as these shortened ORFs may actually have functions. The process produced 95 and 101 pseudogene candidates in K-12 and O157, respectively. The assigned three-dimensional structures suggest that most of the encoded proteins cannot fold properly and thus are dysfunctional, indicating that they are probably pseudogenes. Therefore, the existence of a significant number of probable pseudogenes in E. coli is predicted, awaiting experimental verification. Most of them were found to be genes with paralogs or horizontally transferred genes or both. We suggest that pseudogenes constitute a small fraction of the genomes of free-living bacteria in general, reflecting the faster elimination than production of pseudogenes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号