Similar literature
 20 similar documents found; search time 218 ms
1.
Eight recombinant phage clones containing cytoplasmic actin-like gene sequences have been isolated from a human genomic library for structural characterization. Kpn I family repeat sequences flank six of the isolated actin genes, and Alu family repeats are scattered throughout the DNA inserts of all eight phage clones. Three of these genes are γ actin-like, and the other five are β actin-like. The complete nucleotide sequence analysis of one β and one γ actin-like gene and their flanking regions demonstrates that both are processed pseudogenes. Using unique DNA sequences flanking these two pseudogenes as hybridization probes for human-mouse somatic cell hybrid DNAs, we have mapped the two actin pseudogenes on human chromosomes 8 and 3, respectively. We have also determined the DNA sequence of a human Y chromosome-linked, processed actin pseudogene. The different values of sequence divergence of these processed pseudogenes and their functional counterparts allow us to estimate the time of generation of the pseudogenes. The results suggest that the cDNA insertion events generating the human cytoplasmic actin-like pseudogenes have occurred at significantly different times during the evolution of primates, after their separation from other mammalian species.

2.
3.
Eleven daughters of NANOG   Total citations: 6, self-citations: 0, citations by others: 6
Booth HA  Holland PW 《Genomics》2004,84(2):229-238
Nanog is a recently discovered ANTP class homeobox gene. Mouse Nanog is expressed in the inner cell mass and in embryonic stem cells and has roles in self-renewal and maintenance of pluripotency. Here we describe the location, genomic organization, and relative ages of all human NANOG pseudogenes, comprising ten processed pseudogenes and one tandem duplicate. These are compared to the original, intact human NANOG gene. Eleven is an unusually high number of pseudogenes for a homeobox gene and must reflect expression in the human germ line. A pseudogene orthologous to NANOGP4 was found in chimpanzee and an expressed pseudogene in macaque. Examining pseudogenes of differing ages gives insight into pseudogene decay, which involves an excess of deletion mutations over insertions. The mouse genome has two processed pseudogenes, which are not clear orthologues of the primate pseudogenes.

4.
Three pseudogenes for the nuclear-encoded subunit VIb of cytochrome c oxidase (COX) were isolated by screening a human genomic library with cloned human cDNA coding for COX subunit VIb. The nucleotide sequences of the pseudogenes, designated psi COX6b-1, psi COX6b-2 and psi COX6b-3, were determined. Pseudogene psi COX6b-1 bears all the hallmarks of a processed pseudogene and diverged from the parental gene after the divergence of man and cow. Alu repetitive elements were integrated into the structural sequences of the other two pseudogenes. Comparison with the human and bovine cDNA sequences encoding COX subunit VIb suggests that psi COX6b-2 and psi COX6b-3 were formed earlier in evolution than psi COX6b-1. Genomic Southern analysis indicated that a few more pseudogenes for COX subunit VIb are likely to be present in the human genome. Nucleotide differences from the human cDNA sequence that are identical across the pseudogenes provide some clues to the evolution of the ancestral gene coding for COX subunit VIb.

5.
6.
Reconstruction and analysis of human Alu genes   Total citations: 39, self-citations: 0, citations by others: 39

7.
T Shimada  M J Chen  A W Nienhuis 《Gene》1984,31(1-3):1-8
A dihydrofolate reductase (DHFR) pseudogene, hDHFR-psi 3, has been isolated from a human genomic DNA fragment library. Sequence analysis of this gene revealed a lack of introns and the presence of a tract of nine adenines, 90 bp downstream from the end of the coding sequence. These features suggest that hDHFR-psi 3 was derived from a processed RNA molecule that has been converted into DNA and inserted into a chromosome, analogous to the origin of three intronless human DHFR genes previously described. An interesting feature of hDHFR-psi 3 is the presence of a member of the Alu moderately repetitive DNA sequence family within the DHFR coding region. This Alu element is flanked by a 16 bp directly repeated DNA segment derived from DHFR coding sequences. The Alu element apparently has been inserted into the intronless DHFR pseudogene; thus, there have been two insertions at a single chromosomal locus. The hDHFR-psi 3 contains only the 3' half of the DHFR coding sequence. Immediately upstream from the directly repeated sequence before the Alu element is an adenine-rich tract. The DNA farther upstream is moderately repetitive and is related to neither DHFR nor Alu DNA sequence. Therefore, it seems possible that a third insertion has occurred at the same site, further disrupting the hDHFR coding sequences.

8.
9.
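Pseudogenes in the human genome   Total citations: 5, self-citations: 0, citations by others: 5
周光金  余龙  赵寿元 《生命科学》2004,16(4):210-214,230
Pseudogenes are nonfunctional genomic DNA copies whose sequences closely resemble those of coding genes; they are generally not transcribed and have no clear physiological significance. By origin, pseudogenes are classified as duplicated pseudogenes or processed pseudogenes. To date, most of the clearly identified human pseudogenes are processed pseudogenes, numbering about 8,000. Of the nearly 25,500 protein-coding gene sequences in Swiss-Prot/TrEMBL, about 10% have one or more near-full-length processed pseudogenes in the genome; the remaining functional genes have none. Ribosomal protein genes have the largest number of processed pseudogenes, about 1,700 (22% of all processed pseudogenes), while a few genes, such as cyclophilin A, actin, keratin, GAPDH, cytochrome c and nucleophosmin, each have many processed pseudogene copies. Overall, the distribution of pseudogenes across human chromosomes is proportional to chromosome length, but the density of processed pseudogenes is highest in chromosomal regions with a GC content of 41%-46%. The copy number of processed pseudogenes is highly consistent with the expression of the functional gene in reproductive organs, indicating that many pseudogenes arose at the embryonic stage; it is also closely related to the GC content and size of the gene. Accurate identification of pseudogenes is important for genome evolution studies, molecular medicine research and medical applications.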

10.
Two human genomic clones containing the lactate dehydrogenase-B processed pseudogene were isolated from two patients deficient in lactate dehydrogenase-B isozyme. The sequences of 3,287 nucleotides, including the pseudogene and its flanking regions, from both clones were found to be identical except for three differences in the pseudogenes. The sequences of 1,286 nucleotides from these two pseudogenes exhibited 93% homology with the cDNA sequence of the lactate dehydrogenase-B functional gene, and the pseudogene contained 75/76 base substitutions, 11/12 single-base deletions, and 5 single-base insertions. This pseudogene was mapped to the X chromosome by dot-blot analysis using a probe for the pseudogene or its 5' flanking sequence.
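
As a rough consistency check, the reported 93% can be approximated from the mismatch counts in this abstract. The sketch below assumes that identity is the fraction of the 1,286 compared positions not affected by a substitution, deletion or insertion, and that the lower and upper counts pair up per clone; the abstract states neither, and the function name is illustrative only.

```python
def percent_identity(length, substitutions, deletions, insertions):
    """Approximate percent identity as unaffected positions / compared length.

    Assumption: each substitution, single-base deletion and single-base
    insertion counts as exactly one differing position.
    """
    differing = substitutions + deletions + insertions
    return 100.0 * (length - differing) / length


# Counts reported for the two clones over 1,286 nucleotides
# (75/76 substitutions, 11/12 deletions, 5 insertions).
clone_a = percent_identity(1286, 75, 11, 5)
clone_b = percent_identity(1286, 76, 12, 5)
print(round(clone_a), round(clone_b))
```

Under these assumptions both clones round to the same whole-number percentage, in line with the single figure quoted in the abstract.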

11.
The genomic sequences within the alpha-block (approximately 288-310 kb) of the human and chimpanzee MHC class I region contain ten MHC class I genes and three MIC gene fragments grouped together within alternating duplicated genomic segments or duplicons. In this study, the chimpanzee and human genomic sequences were analyzed in order to determine whether the remnants of the ERVK9 and other retrotransposon sequences are useful genomic markers for reconstructing the evolutionary history of the duplicated MHC gene families within the alpha-block. A variety of genes, pseudogenes, autologous DNA transposons and retrotransposons such as Alu and ERVK9 were used to categorize the ten duplicons into four distinct structural groups. The phylogenetic relationship of the ten duplicons was examined by using the neighbour joining method to analyze transposon sequence topologies of selected Alu members, LTR16B and Charlie9. On the basis of these structural groups and the phylogeny of the duplicated transposon sequences, a duplication model was reconstructed involving four multipartite tandem duplication steps to explain the organization and evolution of the ten duplicons within the alpha-block of the chimpanzee and human. The phylogenetic analysis and inferred duplication history suggest that Patr/HLA-F was the first MHC class I gene to have been fixed and was not required as a precursor for further duplication within the alpha-block of the ancestral species.

12.
Two human apolipoprotein C-I genes, one of which is believed to be a pseudogene, are located within the lipoprotein gene cluster on chromosome 19. Alignments were made between the apoC-I and the pseudoC-I' genes using a computer sequence editor. Particular Alu sequences may be found in one gene or in both: the proposal is that common Alu sequences (found in both genes) were present before the duplication of the C-I gene, whereas single Alu sequences (present in only one gene) were transposed afterward. Alu sequences of the C-I genes were also classified into Alu families. Common sequences belong to older families of Alu genes, whereas single sequences belong to younger families. Marked change in the apolipoprotein C-I gene began during early radiation of primate lineages. Retropositions of older Alu sequences occurred throughout the Paleocene and the Eocene periods. The numbering of uncommon substitutions in the six common Alu sequences gives a good estimate of the duplication time for the C-I gene (39 +/- 6 million years) at the end of the Eocene. After that, the other Alu sequences were transposed into each gene and further substitutions occurred to give the present form of the C-I genes in humans.
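
The general idea of dating a duplication from substitutions between two copies can be sketched as follows. This is not the authors' stated procedure: the Jukes-Cantor correction, the constant molecular-clock rate, and all numeric inputs below are assumptions chosen for illustration, since the abstract gives neither the substitution counts nor the rate used.

```python
import math


def jukes_cantor_distance(differences, sites):
    """Substitutions per site, corrected for multiple hits (Jukes-Cantor)."""
    p = differences / sites
    if p >= 0.75:
        raise ValueError("sequences too divergent for the Jukes-Cantor model")
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


def duplication_time_myr(distance, rate_per_site_per_myr):
    """Time since duplication under a constant clock.

    Both copies accumulate changes independently after the duplication,
    so the pairwise distance reflects twice the elapsed time.
    """
    return distance / (2.0 * rate_per_site_per_myr)


# Hypothetical inputs: 120 differences over 1,000 aligned sites and a
# neutral rate of 0.0015 substitutions per site per million years.
d = jukes_cantor_distance(120, 1000)
t = duplication_time_myr(d, 0.0015)
print(f"distance = {d:.4f}, estimated age = {t:.1f} Myr")
```

The factor of two matters only when both copies are assumed to evolve at the same rate; if one copy is constrained, a single-lineage rate would be the more natural choice.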

13.
Processed genes are created by retroposition from messenger RNA of expressed genes. The estimated number of processed copies of genes in the human genome is 10,000-14,000. Some of these might be pseudogenes with the expected pattern for nonfunctional sequences, but some others might be an important source of new genes. We have studied the evolution of a phosphoglycerate mutase processed gene (PGAM3) described in humans and believed to be a pseudogene. We sequenced PGAM3 in chimpanzee and macaque and obtained polymorphism data for the human coding region. We found evidence that PGAM3 likely produces a functional protein, as an example of addressing functionality for human processed pseudogenes. First, the open reading frame was intact despite many deletions that occurred in the 3' untranslated region. Second, it appears that the gene is expressed. Finally, interspecies and intraspecies variation for PGAM3 was not consistent with the neutral model proposed for pseudogenes, suggesting that a new functional primate gene has originated. Amino acid divergence was significantly higher than synonymous divergence in the PGAM3 lineage, supporting positive selection acting on this gene. This role of selection was further supported by the excess of rare alleles in a population genetic analysis. PGAM3 is located in a region of very low recombination; therefore, it is conceivable that the rapid fixation events in this newly arising gene may have contributed to a selective sweep of variation in the region.

14.

Background  

The NANOG gene is expressed in mammalian embryonic stem cells where it maintains cellular pluripotency. An unusually large family of pseudogenes arose from it with one unprocessed and ten processed pseudogenes in the human genome. This article compares the NANOG gene and its pseudogenes in the human and chimpanzee genomes and derives an evolutionary history of this pseudogene family.

15.
Three loci, designated U2/4, U2/6 and U2/7, which contain sequences related to human U2 RNA, have been studied. The U2/6 locus contains a tandem array of bona fide U2 genes. U2/4 and U2/7, in contrast, contain pseudogenes whose sequences deviate significantly from that of mammalian U2 RNA. The two pseudogenes appear to have been created by different mechanisms. The sequences that flank the pseudogene in the U2/4 locus lack homology to the corresponding sequences in functional human U2 genes, except for 10 base-pairs immediately following the 3′ end. The conserved 3′-flanking segment is homologous to those nucleotides that are present in a U2 RNA precursor. No direct repeats flank the pseudogene in the U2/4 locus. The observations thus suggest that a complementary DNA copy of the U2 RNA precursor was inserted into a blunt-ended chromosomal break to generate the U2/4 locus. The U2/7 locus, in contrast, revealed flanking sequence homology when compared to functional U2 genes, both on the 5′ and 3′ sides of the pseudogene. The homology was interrupted on both sides by repetitive sequences belonging to the Alu family. On the 5′ side the homology continues beyond the Alu repeats whereas on the 3′ side it ends precisely at the Alu repeat. This Alu repeat is inserted in a region where a homocopolymeric region of alternating C and T residues is located in functional U2 loci. The observed organization of the U2/7 locus suggests that a previously functional U2 locus was invaded by Alu repeats and subsequently accumulated base substitutions to become a pseudogene.

16.
We have determined the sequence of 2400 base pairs upstream from the human pseudo alpha globin (psi alpha) gene, and for comparison, 1100 base pairs of DNA within and upstream from the chimpanzee psi alpha gene. The region upstream from the promoter of the psi alpha gene shows no significant homology to the intergenic regions of the adult alpha 2 and alpha 1 globin genes. The chimpanzee gene has a coding defect in common with the human psi alpha gene, showing that the product of this gene, if any, was inactivated before the divergence of human and chimpanzee. However, the chimpanzee gene contains a normal ATG initiation codon, in contrast to the human gene, which has GTG as the initiation codon. The psi alpha genes of both human and chimpanzee are flanked by the same Alu family member. The structure and position of this repeat have not been altered since the divergence of human and chimpanzee, and it is at least as well conserved as its immediate flanking sequence. Comparing human and chimpanzee, the 300 bp Alu repeat has accumulated only two base substitutions and one length mutation; the adjacent 300 bp flanking region has accumulated five base substitutions and twelve length mutations.

17.
According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is indeed vital in vivo and also evolutionarily preserved encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify, to our knowledge, the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human–mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function has been retained since before the human–mouse species split, and also a larger group of primate-specific ones found from human–chimpanzee searches. Two processed sequences are notable, their conservation since the human–mouse split being as high as that of most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios.
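
The "in-frame disablements" mentioned at the end of this abstract are typically premature stop codons and frameshifts. The sketch below shows one simple way such a check could look; it is an illustration under stated assumptions, not the pipeline described in the abstract. It assumes the input is already aligned to the parent gene's reading frame (frame 0) and uses the standard genetic code; the function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def in_frame_disablements(cds):
    """List disablements found when reading `cds` in frame 0.

    Assumption: `cds` starts at the first base of a codon and its last
    complete codon is the expected terminal stop.
    """
    seq = cds.upper()
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length not a multiple of 3 (possible frameshift)")
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    # Any stop codon before the final codon truncates the protein.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            problems.append(f"premature stop {codon} at codon {index + 1}")
    return problems


print(in_frame_disablements("ATGAAATAA"))
print(in_frame_disablements("ATGTAAAAATAA"))
```

A sequence with no reported disablements is not necessarily functional, which is presumably why the abstract pairs this kind of check with a Ka/Ks-based filter.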

18.
《Gene》1997,184(2):273-278
Genes for the snRNP proteins U1-70K, U1-A, Sm-B′/B, Sm-D1 and Sm-E have been isolated from various metazoan species. The genes for Sm-D1 and Sm-E, which were isolated from a murine and human source respectively, appear to belong to a multigene family. It has been suggested that such a multigene family also exists for the mammalian U1-C protein. With the human U1-C cDNA as a probe, two genes containing sequences homologous to the probe sequence were isolated from a mouse genomic library. Simultaneously, a murine U1-C cDNA was isolated from a mouse cDNA library. This 0.74 kb cDNA contains an open reading frame (ORF) of 477 bp encoding a polypeptide of 159 amino acids (aa) which differs at only one position (position 65) from the human U1-C protein. One of the isolated U1-C genes contains an ORF as well and shares 92% nucleotide sequence identity with the mouse U1-C cDNA. The features of this gene, in particular the absence of introns, the acquisition of a 3′ poly(A) tail and flanking direct repeats, indicate that it represents a processed pseudogene. At the predicted aa sequence level, substitutions of conserved residues at functionally important positions are observed, strongly suggesting that expression of this gene would not lead to a functional polypeptide. The second U1-C gene appeared to be a pseudogene as well because it is also intronless and contains a frameshift mutation compared to the ORF in the mouse U1-C cDNA. The characterization of these two pseudogenes points to the existence of a U1-C multigene family in mice. Furthermore, comparison of aa sequences of the murine, human and Xenopus U1-C shows that the protein is highly conserved through evolution. Since the Xenopus U1-C differs from the two mammalian counterparts solely at a number of positions in the C-terminal region, it can be concluded that aa changes are less well tolerated in the N-terminal region of U1-C than in the rest of the protein.

19.
20.
The presence of at least ten mouse LDH-A pseudogenes was demonstrated by genomic blot analysis, and four different processed pseudogenes have thus far been isolated and characterized. In this report, the nucleotide sequences of two different mouse lactate dehydrogenase-A processed pseudogenes, M11 and M14, were determined and compared with the protein-coding sequences of the mouse and rat LDH-A functional genes. In the pseudogene M11, the sequence of 64 nucleotides from codon no. 257 to 278 was tandemly duplicated. In the pseudogene M14, the sequence of 22 nucleotides from codon no. 68 to 75 was replaced by an inserted repetitive sequence of 242 nucleotides homologous to a mouse truncated R element. The pattern of nucleotide substitutions accumulated in mouse LDH-A pseudogenes M11 and M14, as well as that of pseudogene M10 identified previously, was analyzed, and the substitution frequencies of the C or G at the CG dinucleotide were found to be high.
