首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Hahn MW  Han MV  Han SG 《PLoS genetics》2007,3(11):e197
Comparison of whole genomes has revealed large and frequent changes in the size of gene families. These changes occur because of high rates of both gene gain (via duplication) and loss (via deletion or pseudogenization), as well as the evolution of entirely new genes. Here we use the genomes of 12 fully sequenced Drosophila species to study the gain and loss of genes at unprecedented resolution. We find large numbers of both gains and losses, with over 40% of all gene families differing in size among the Drosophila. Approximately 17 genes are estimated to be duplicated and fixed in a genome every million years, a rate on par with that previously found in both yeast and mammals. We find many instances of extreme expansions or contractions in the size of gene families, including the expansion of several sex- and spermatogenesis-related families in D. melanogaster that also evolve under positive selection at the nucleotide level. Newly evolved gene families in our dataset are associated with a class of testes-expressed genes known to have evolved de novo in a number of cases. Gene family comparisons also allow us to identify a number of annotated D. melanogaster genes that are unlikely to encode functional proteins, as well as to identify dozens of previously unannotated D. melanogaster genes with conserved homologs in the other Drosophila. Taken together, our results demonstrate that the apparent stasis in total gene number among species has masked rapid turnover in individual gene gain and loss. It is likely that this genomic revolving door has played a large role in shaping the morphological, physiological, and metabolic differences among species.  相似文献   

3.
A large portion of the annotated genes in Drosophila melanogaster show sex-biased expression, indicating that sex and reproduction-related genes (SRR genes) represent an appreciable component of the genome. Previous studies, in which subsets of genes were compared among few Drosophila species, have found that SRR genes exhibit unusual evolutionary patterns. Here, we have used the newly released genome sequences from 12 Drosophila species, coupled to a larger set of SRR genes, to comprehensively test the generality of these patterns. Among 2505 SRR genes examined, including ESTs with biased expression in reproductive tissues and genes characterized as involved in gametogenesis, we find that a relatively high proportion of SRR genes have experienced accelerated divergence throughout the genus Drosophila. Several testis-specific genes, male seminal fluid proteins (SFPs), and spermatogenesis genes show lineage-specific bursts of accelerated evolution and positive selection. SFP genes also show evidence of lineage-specific gene loss and/or gain. These results bring us closer to understanding the details of the evolutionary dynamics of SRR genes with respect to species divergence.  相似文献   

4.
Sea urchin actin gene subtypes. Gene number, linkage and evolution   总被引:12,自引:0,他引:12  
The actin gene family of the sea urchin Strongylocentrotus purpuratus was analyzed by the genome blot method, using subcloned probes specific to the 3' terminal non-translated actin gene sequence, intervening sequence and coding region probes. We define an actin gene subtype as that gene or set of genes displaying homology with a given 3' terminal sequence probe, when hybridized at 55 degrees C, 0.75 M-Na+. By determining the often polymorphic restriction fragment band pattern displayed in genome blots by each probe, all, or almost all of the actin genes in this species could be classified. Our evidence shows that the S. purpuratus genome probably contains seven to eight actin genes, and these can be assigned to four subtypes. Studies of the expression of the genes (Shott et al., 1983) show that the actin genes of three of these subtypes code for cytoskeletal actins (Cy), while the fourth gives rise to a muscle-specific actin (M). We denote the array of S. purpuratus actin genes indicated by our data as follows. There is a single CyI actin gene, two or possibly three CyII genes (CyIIa, CyIIb, and possibly CyIIc), three CyIII actin genes (CyIIIa, CyIIIb, CyIIIc), and a single M actin gene. Comparative studies were carried out on the actin gene families of five other sea urchin species. At least the CyIIa and CyIIb genes are also linked in the Strongylocentrotus franciscanus genome, and this species also has a CyI gene, an M actin gene and at least two CyIII actin genes. It is not clear whether it also possesses a CyIIc actin gene, or a CyIIIc actin gene. The genome of a more closely related congener, Strongylocentrotus dr?bachiensis, includes 3' terminal sequences suggesting the presence of a CyIIc gene. In S. franciscanus and S. dr?bachiensis the first intron of the CyI gene has remained homologous with intron sequences of both the CyIIa and CyIIb genes, indicating a common origin of these three linked cytoskeletal actin genes. Of the four S. purpuratus 3' terminal subtype probe sequences only the CyI 3' terminal sequence has been conserved sufficiently during evolution to permit detection outside of the genus Strongylocentrotus. An unexpected observation was that a sequence found only in the 3' untranslated region of the CyII actin gene in the DNA of S. dr?bachiensis and S. purpuratus is represented as a large family of interspersed repeat sequences in the genome of S. franciscanus.  相似文献   

5.
The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a reference genome. Many species lack a reference genome, but are still important genetic models or are significant species in agricultural production or natural ecosystems. For these species, it is possible to annotate SNPs through comparison with cDNA, or data from well‐annotated genes in public repositories. We present SNPMeta, a tool which gathers information about SNPs by comparison with sequences present in GenBank databases. SNPMeta is able to annotate SNPs from contextual sequence in SNP assay designs, and SNPs discovered through genotyping by sequencing (GBS) approaches. However, SNPs discovered through GBS occur throughout the genome, rather than only in gene space, and therefore do not annotate at high rates. SNPMeta can therefore be used to annotate SNPs in nonmodel species or species that lack a reference genome. Annotations generated by SNPMeta are highly concordant with annotations that would be obtained from a reference genome.  相似文献   

6.
The number of introns varies considerably among different organisms. This can be explained by the differences in the rates of intron gain and loss. Two factors that are likely to influence these rates are selection for or against introns and the mutation rate that generates the novel intron or the intronless copy. Although it has been speculated that stronger selection for a compact genome might result in a higher rate of intron loss and a lower rate of intron gain, clear evidence is lacking, and the role of selection in determining these rates has not been established. Here, we studied the gain and loss of introns in the two closely related species Arabidopsis thaliana and A. lyrata as it was recently shown that A. thaliana has been undergoing a faster genome reduction driven by selection. We found that A. thaliana has lost six times more introns than A. lyrata since the divergence of the two species but gained very few introns. We suggest that stronger selection for genome reduction probably resulted in the much higher intron loss rate in A. thaliana, although further analysis is required as we could not find evidence that the loss rate increased in A. thaliana as opposed to having decreased in A. lyrata compared with the rate in the common ancestor. We also examined the pattern of the intron gains and losses to better understand the mechanisms by which they occur. Microsimilarity was detected between the splice sites of several gained and lost introns, suggesting that nonhomologous end joining repair of double-strand breaks might be a common pathway not only for intron gain but also for intron loss.  相似文献   

7.
Though spliceosomal introns are a major structural component of most eukaryotic genes and intron density varies by more than three orders of magnitude among eukaryotes [1-3], the origins of introns are poorly understood, and only a few cases of unambiguous intron gain are known [4-8]. We utilized population genomic comparisons of three closely related fungi to identify crucial transitory phases of intron gain and loss. We found 74 intron positions showing intraspecific presence-absence polymorphisms (PAPs) for the entire intron. Population genetic analyses identified intron PAPs at different stages of fixation and showed that intron gain or loss was very recent. We found direct support for extensive intron transposition among unrelated genes. A substantial proportion of highly similar introns in the genome either were recently gained or showed a transient phase of intron PAP. We also identified an intron transfer among paralogous genes that created a new intron. Intron loss was due mainly to homologous recombination involving reverse-transcribed mRNA. The large number of intron positions in transient phases of either intron gain or loss shows that intron evolution is much faster than previously thought and provides an excellent model to study molecular mechanisms of intron gain.  相似文献   

8.
9.
《Genomics》2021,113(2):717-726
High quality genome is of great significance for the mining of biological information resources of species. Up to now, the genomic information of several important economic flatfishes has been well explained. All these fishes are eyes on left side-type, and no high-quality genome of eyes on right side-type species has been reported. In this study, we applied a combined strategy involving stLFR and Hi-C technologies to generate sequencing data for constructing the chromosomal genome of Verasper variegates, which belongs to Pleuronectidae with characteristic of eyes on right side. The size of genome of V. variegatus is 556 Mb. More than 97.2% of BUSCO genes were detected, and N50 lengths of the contigs and scaffolds reached 79.8 Kb and 23.8 Mb, respectively, demonstrating the outstanding completeness and sequence continuity of the genome. A total of 22,199 protein-coding genes were predicted in the assembled genome, and more than 95% of those genes could be functionally annotated. Meanwhile, the genomic collinearity, gene family and phylogenetic analyses of similar species in Pleuronectiformes were also investigated and portrayed for metamorphosis and benthic adaptation. Sex related genes mapping has also been achieved at the chromosome level. This study is the first chromosomal level genome of a Pleuronectidae fish (V. variegatus). The chromosomal genome assembly constructed in this work will not only be valuable for conservation and aquaculture studies of the V. variegatus but will also be of general interest in the phylogenetic and taxonomic studies of Pleuronectiformes.  相似文献   

10.
Stable integration of foreign DNA into the frog genome has been the purpose of several studies aimed at generating transgenic animals or producing mutations of endogenous genes. Inserting DNA into a host genome can be achieved in a number of ways. In Xenopus, different strategies have been developed which exhibit specific molecular and technical features. Although several of these technologies were also applied in various model organizms, the attributes of each method have rarely been experimentally compared. Investigators are thus confronted with a difficult choice to discriminate which method would be best suited for their applications. To gain better understanding, a transgenesis workshop was organized by the X-omics consortium. Three procedures were assessed side-by-side, and the results obtained are used to illustrate this review. In addition, a number of reagents and tools have been set up for the purpose of gene expression and functional gene analyses. This not only improves the status of Xenopus as a powerful model for developmental studies, but also renders it suitable for sophisticated genetic approaches. Twenty years after the first reported transgenic Xenopus, we review the state of the art of transgenic research, focusing on the new perspectives in performing genetic studies in this species.  相似文献   

11.
Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publically available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees, and to identify the pan- and core genomes of this set of sequenced strains. A hierarchical clustering of variable genes allowed clear separation of the strains into clusters, including known pathotypes; clinically relevant serotypes can also be resolved in this way. In contrast, when in silico MLST was performed, many of the various strains appear jumbled and less well resolved. The predicted pan-genome comprises 15,741 gene families, and only 993 (6%) of the families are represented in every genome, comprising the core genome. The variable or ‘accessory’ genes thus make up more than 90% of the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of Enterobacteriaceae.  相似文献   

12.
The diversification of lineages within Pseudomonas syringae has involved a number of adaptive shifts from herbaceous hosts onto various species of tree, resulting in the emergence of highly destructive diseases such as bacterial canker of kiwi and bleeding canker of horse chestnut. This diversification has involved a high level of gene gain and loss, and these processes are likely to play major roles in the adaptation of individual lineages onto their host plants. In order to better understand the evolution of P. syringae onto woody plants, we have generated de novo genome sequences for 26 strains from the P. syringae species complex that are pathogenic on a range of woody species, and have looked for statistically significant associations between gene presence and host type (i.e. woody or herbaceous) across a phylogeny of 64 strains. We have found evidence for a common set of genes associated with strains that are able to colonize woody plants, suggesting that divergent lineages have acquired similarities in genome composition that may form the genetic basis of their adaptation to woody hosts. We also describe in detail the gain, loss and rearrangement of specific loci that may be functionally important in facilitating this adaptive shift. Overall, our analyses allow for a greater understanding of how gene gain and loss may contribute to adaptation in P. syringae.  相似文献   

13.
Potato is the third most important global food crop and the most widely grown noncereal crop. As a species highly amenable to cell culture, it has a long history of biotechnology applications for crop improvement. This review begins with a historical perspective on potato improvement using biotechnology encompassing pathogen elimination, wide hybridization, ploidy manipulation and applications of cell culture. We describe the past developments and new approaches for gene transfer to potato. Transformation is highly effective for adding single genes to existing elite potato clones with no, or minimal, disturbances to their genetic background and represents the only effective way to produce isogenic lines of specific genotypes/cultivars. This is virtually impossible via traditional breeding as, due to the high heterozygosity in the tetraploid potato genome, the genetic integrity of potato clones is lost upon sexual reproduction as a result of allele segregation. These genetic attributes have also provided challenges for the development of genetic maps and applications of molecular markers and genomics in potato breeding. Various molecular approaches used to characterize loci, (candidate) genes and alleles in potato, and associating phenotype with genotype are also described. The recent determination of the potato genome sequence has presented new opportunities for genomewide assays to provide tools for gene discovery and enabling the development of robustly unique marker haplotypes spanning QTL regions. The latter will be useful in introgression breeding and whole‐genome approaches such as genomic selection to improve the efficiency of selecting elite clones and enhancing genetic gain over time.  相似文献   

14.
With the availability of an increasing number of whole genome sequences in chordates, exhaustive comparisons of multigene families become feasible. Relationships of orthology/paralogy can not only be inferred from sequence similarity but also by comparing synteny conservation on chromosomes. More accurate scenarios for gene and expression domain gain or loss can now be proposed. Here, we take benefit from the recent release of the medaka (Oryzias latipes) genome to analyse the orthology relationships and expression patterns of the three different sub-families of the pitx homeobox genes belonging to the paired class. They are involved in a wide variety of developmental processes and have pleiotropic expression patterns, especially in the case of the pitx2 sub-family. The emerging picture is a strong conservation of expression domains, suggesting that most functions have been present in the common ancestor of actinopterygians and sarcopterygians. Almost all pitx genes are expressed in anterior placodes in all species studied so far, including medaka. It has previously been shown that in mammals, pitx1 and 2 are expressed in the pituitary. Interestingly we demonstrate here that only pitx3 is expressed in medaka pituitary. It will be interesting to analyze what are the corresponding changes in the regulatory elements of pitx genes.  相似文献   

15.
A J Lohan  K H Wolfe 《Genetics》1998,150(1):425-433
The plastid genome of the nonphotosynthetic parasitic plant Epifagus virginiana contains only 17 of the 30 tRNA genes normally found in angiosperm plastid DNA. Although this is insufficient for translation, the genome is functional, so import of cytosolic tRNAs into plastids has been suggested. This raises the question of whether the tRNA genes that remain in E. virginiana plastid DNA are active or have just fortuitously escaped deletion. We report the sequences of 20 plastid tRNA loci from Orobanche minor, which shares a nonphotosynthetic ancestor with E. virginiana. The two species have 9 intact tRNA genes in common, the others being defunct in one or both species. The intron-containing trnLUAA gene is absent from E. virginiana, but it is intact, transcribed, and spliced in O. minor. The shared intact genes are better conserved than intergenic sequences, which indicates that these genes are being maintained by natural selection and, therefore, must be functional. For the most part, the tRNA species conserved in nonphotosynthetic plastids are also those that have never been found to be imported in plant mitochondria, which suggests that the same rules may govern tRNA import in the two organelles. A small photosynthesis gene, psbI, is still intact in O. minor, and computer simulations show that some small nonessential genes have an appreciable chance of escaping deletion.  相似文献   

16.
The European rabbit (Oryctolagus cuniculus) is relevant in a large spectrum of fields: it is a livestock, a pet, a biomedical model and a biotechnology tool, a wild resource and a pest. The sequencing of the rabbit genome has opened new perspectives to study this lagomorph at the genome level. We herein investigated for the first time the O. cuniculus genome by array comparative genome hybridization (aCGH) and established a first copy number variation (CNV) genome map in this species comprising 155 copy number variation regions (CNVRs; 95 gains, 59 losses, 1 with both gain and loss) covering ~0.3% of the OryCun2.0 version. About 50% of the 155 CNVRs identified spanned 139 different protein coding genes, 110 genes of which were annotated or partially annotated (including Major Histocompatibility Complex genes) with 277 different gene ontology terms. Many rabbit CNVRs might have a functional relevance that should be further investigated.  相似文献   

17.
The occurrence of bacteria with a reduced genome, such as that found in Mycoplasmas, raises the question as to which genes should be enough to guarantee the genomic stability indispensable for the maintenance of life. The aim of this work was to compare nine Mycoplasma genomes in regard to DNA repair genes. An in silico analysis was done using six Mycoplasma species, whose genomes are accessible at GenBank, and M. synoviae, and two strains of M. hyopneumoniae, whose genomes were recently sequenced by The Brazilian National Genome Project Consortium and Southern Genome Investigation Program (Brazil) respectively. Considering this reduced genome model, our comparative analysis suggests that the DNA integrity necessary for life can be primarily maintained by nucleotide excision repair (NER), which is the only complete repair pathway. Furthermore, some enzymes involved with base excision repair (BER) and recombination are also present and can complement the NER activity. The absence of RecR and RecO-like ORFs was observed only in M. genitalium and M. pneumoniae, which can be involved with the conservation of gene order observed between these two species. We also obtained phylogenetic evidence for the recent acquisition of the ogt gene in M. pulmonis and M. penetrans by a lateral transference event. In general, the presence or nonexistence of repair genes is shared by all species analyzed, suggesting that the loss of the majority of repair genes was an ancestral event, which occurred before the divergence of the Mycoplasma species.  相似文献   

18.
Previously we showed that the mitochondrial deoxyribonucleic acid (DNA) from Paramecium aurelia consists of a linear genome and that replication of this genome is initiated at one terminus and proceeds unidirectionally to the other terminus. Analyses of mitochondria from four closely related species (1, 4, 5, and 7) indicated that the species 1, 5, and 7 DNAs are essentially completely homologous but that the species 4 mitochondrial DNA is only 40 to 50% homologous with that from species 1. The major regions of homology are those containing the genes for ribosomal ribonucleic acid (RNA). To understand the replication and organization of the linear mitochondrial genome better, we compared species 1 (Paramecium primaurelia) and 4 (Paramecium tetraaurelia) DNAs with regard to restriction fragment mapping and homology between initiation regions; we also identified the sites of the genes for ribosomal RNA. In general, the structures of the species 1 and 4 mitochondrial genomes were quite similar. Each ribosomal RNA gene was present in one copy per genome, with the large ribosomal RNA gene located near the terminal region of replication and the small ribosomal RNA gene located more centrally. These two genes were separated by about 10 kilobases in the species 1 genome and by about 12 kilobases in the species 4 genome. In contrast to our previous findings, by using nonstringent hybridization conditions we detected homology between the species 1 and 4 DNA fragments containing the initiation regions. We constructed recombinant DNA clones for many fragments, especially those containing the initiation region and the ribosomal RNA genes. We also constructed restriction enzyme maps for six enzymes for both P. primaurelia and P. tetraaurelia.  相似文献   

19.
The microbial pan-genome   总被引:1,自引:0,他引:1  
A decade after the beginning of the genomic era, the question of how genomics can describe a bacterial species has not been fully addressed. Experimental data have shown that in some species new genes are discovered even after sequencing the genomes of several strains. Mathematical modeling predicts that new genes will be discovered even after sequencing hundreds of genomes per species. Therefore, a bacterial species can be described by its pan-genome, which is composed of a "core genome" containing genes present in all strains, and a "dispensable genome" containing genes present in two or more strains and genes unique to single strains. Given that the number of unique genes is vast, the pan-genome of a bacterial species might be orders of magnitude larger than any single genome.  相似文献   

20.
Since the completion of the genome project of the nematode C. elegans in 1998, functional genomic approaches have been applied to elucidate the gene and protein networks in this model organism. The recent completion of the whole genome of C. briggsae, a close sister species of C. elegans, now makes it possible to employ the comparative genomic approaches for identifying regulatory mechanisms that are conserved in these species and to make more precise annotation of the predicted genes. RNA interference (RNAi) screenings in C. elegans have been performed to screen the whole genome for the genes whose mutations give rise to specific phenotypes of interest. RNAi screens can also be used to identify genes that act genetically together with a gene of interest. Microarray experiments have been very useful in identifying genes that exhibit co-regulated expression profiles in given genetic or environmental conditions. Proteomic approaches also can be applied to the nematode, just as in other species whose genomes are known. With all these functional genomic tools, genetics will still remain an important tool for gene function studies in the post genome era. New breakthroughs in C. elegans biology, such as establishing a feasible gene knockout method, immortalized cell lines, or identifying viruses that can be used as vectors for introducing exogenous gene constructs into the worms, will augment the usage of this small organism for genome-wide biology.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号