首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
While genome-era technologies focused on complete genome sequencing in various organisms, post-genome technologies aim at the understanding of the mechanisms of genetic information processing and elucidation of within-species variation. Single nucleotide polymorphisms (SNPs) are the most common source of genome variation in the human population. Nonsynonymous SNPs that occur in coding gene regions and result in amino acid substitutions are of particular interest. It is thought that such SNPs are responsible for phenotypic variation, quantitative traits, and the etiology of common diseases. PolyPhen is a computational tool for the prediction of putatively functional nonsynonymous SNPs by combining information of various types. The application areas of PolyPhen and similar methods include the genetics of complex diseases and congenital defects, the identification of functional mutations in model organisms, and evolutionary genetics.  相似文献   

2.
Here we report a large, extensively characterized set of single-nucleotide polymorphisms (SNPs) covering the human genome. We determined the allele frequencies of 55,018 SNPs in African Americans, Asians (Japanese-Chinese), and European Americans as part of The SNP Consortium's Allele Frequency Project. A subset of 8333 SNPs was also characterized in Koreans. Because these SNPs were ascertained in the same way, the data set is particularly useful for modeling. Our results document that much genetic variation is shared among populations. For autosomes, some 44% of these SNPs have a minor allele frequency > or =10% in each population, and the average allele frequency differences between populations with different continental origins are less than 19%. However, the several percentage point allele frequency differences among the closely related Korean, Japanese, and Chinese populations suggest caution in using mixtures of well-established populations for case-control genetic studies of complex traits. We estimate that approximately 7% of these SNPs are private SNPs with minor allele frequencies <1%. A useful set of characterized SNPs with large allele frequency differences between populations (>60%) can be used for admixture studies. High-density maps of high-quality, characterized SNPs produced by this project are freely available.  相似文献   

3.
Regional and time-resolved mutation patterns of the human genome   总被引:3,自引:0,他引:3  
MOTIVATION: Substantial regional variations of substitutional processes have recently been reported from human/mouse comparisons. However, several features including the C + G dependence and the CpG-based transition effect remain obscure. RESULTS: Utilizing the vast amount of transposable elements in the human genome, we performed detailed analysis of the substitutional and insertion/deletion patterns along the human lineage in a regional and time-resolved fashion. We observed a drastic increase in the CpG-based transition frequency at about the time of the mammalian radiation. We also observed clear regional biases of substitution patterns, most notably a bias to enrich the C+G content toward the telomeres. AVAILABILITY: The programs used are available upon request from the authors.  相似文献   

4.
Despite the current good level of annotation, the Drosophila genome still holds surprises. A recent study has added perhaps 2,000 genes to the predicted total, and raises a number of questions about how genome annotation data should be stored and presented.  相似文献   

5.
Repetitive elements may comprise over two-thirds of the human genome   总被引:1,自引:0,他引:1  
Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo "clouds"). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%-69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (~25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed "element-specific" P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ~100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed.  相似文献   

6.
Although previous studies have failed to detect an association between microsatellite polymorphism and broadscale recombination rates in the human genome, there are several possible reasons why such a relationship could exist. For instance, there might be a direct link if recombination is mutagenic to microsatellite sequences or if polymorphic microsatellites act as recombination signals. Alternatively, recombination could exert an indirect effect by uncoupling of natural selection at linked loci, promoting polymorphism. As recombination is concentrated in narrow hotspot regions in the human genome, we investigated the relationship between microsatellite polymorphism and recombination hot spots. By using data from a common allele frequency database, we found several polymorphism estimates to be similar for hot spots and the genomic average. However, this is likely explained by an ascertainment bias because markers with high polymorphism information content are usually selected for genotyping in human populations and pedigrees. In contrast, by using an unbiased set of shotgun sequence data, we found an excess of microsatellite polymorphism in recombination hot spots of 14%. However, when other genomic variables are taken into account in a generalized model and using wavelet analysis, the effect is no longer detectable and the only firm predictor of microsatellite polymorphism is the incidence of SNPs and indels. One possible neutral explanation to these observations is that there is a common denominator affecting the local rate of mutation in unique as well as in repetitive DNA, for example, base composition.  相似文献   

7.
In the human genome, short tandem repetitive (STR) DNA sequences often show restriction fragment length polymorphisms (RFLPs) due to variation in the number of copies of the repeat unit. For a subset of these sequences known as minisatellites or variable number tandem repeat loci (VNTR), it has been proposed that a homologous "core" sequence of 10-12 nucleotides is involved in the mechanism(s) generating the polymorphism. In our present study we have prepared oligonucleotide probes complementary to one or two repeat units of several VNTR loci. Under stringent hybridization and wash conditions these probes hybridize locus specifically thus allowing the evaluation of the intrinsic polymorphism of individual loci. Our results indicate that not all of the loci having STR DNA sequences are polymorphic despite the fact that they share the "core" sequence. This suggests that more than the DNA sequence of the locus is involved in the mechanism(s) generating the polymorphism.  相似文献   

8.
Human endogenous retroviruses (HERVs), which constitute a significant part of the human genome, might have a serious impact on primate evolution. Over a hundred insertions of HERV-K(HML-2) family members distinguish the human genome from other primate genomes. However, only three cases of insertion polymorphisms have been reported so far, all for endogenous HERV-K proviruses. This suggests that some retroviral integrations occurred rather recently in human genome evolution. In this report, we describe a very rare case of true insertion polymorphism of a solitary HERV-K LTR in the human genome. Distribution of the LTR-containing allele was tested in 5 Africans and 83 individuals from three Russian populations. The allele frequency appeared to be relatively high in populations of both European and Asian origin. The detected polymorphic LTR could be a useful molecular genetic marker of the corresponding genomic region.  相似文献   

9.
There is a mounting evidence for the correlation between the gene expression pattern and sequence divergence. However, little is known about the relationship between the gene expression pattern and polymorphism. We compiled the gene expression, polymorphism, and divergence data from the public databases of the human genome. The ratios of nonsynonymous (A) to synonymous (S) substitutions in polymorphism and divergence in the human genome were strongly influenced by the expression pattern and breadth of genes and showed strong correlations. Among the tissues we analyzed, the brain-expressed genes have the smallest and the liver-expressed genes have the largest proportion of amino acid changes both in polymorphism and divergence. The analysis implies that negative selection is the primary factor affecting expression-dependent gene evolution and the prevalent but nonuniform distribution of slightly deleterious mutations in the genome. Although the genes under relaxed negative selection evolved faster than the other genes, these genes are even more liable to slightly deleterious mutations in the population. On the other hand, nonneutral mutations in the highly conservative genes, such as brain-expressed and housekeeping genes, are largely deleterious and eliminated before they enter the population.  相似文献   

10.
11.
12.
13.
14.
15.
The coccidium Cryptosporidium parvum is an obligate intracellular parasite of the phylum Apicomplexa. It infects the gastrointestinal tract of humans and livestock, and represents the third major cause of diarrhoeal disease worldwide. Scarcely considered for decades due to its apparently non-pathogenic nature, C. parvum has been studied very actively over the last 15 years, after its medical relevance as a dangerous opportunistic parasite and widespread water contaminant was fully recognised. Despite the lack of an efficient in vitro culture system and appropriate animal models, significant advances have been made in this relatively short period of time towards understanding C. parvum biology, immunology, genetics and epidemiology. Until recently, very little was known about the genome of C. parvum, with even basic issues, such as the number and size of chromosomes, being the object of a certain controversy. With the advent of pulsed field gradient electrophoresis and the introduction of molecular biology techniques, the overall structure and fine organisation of the genome of C. parvum have started to be disclosed. Organised into eight chromosomes distributed in a very narrow range of molecular masses, the genome of C. parvum is one of the smallest so far described among unicellular eukaryotic organisms. Although fewer than 30 C. parvum genes have been cloned so far, information about the overall structure of the parasite genome has increased exponentially over the last 2 years. From the first karyotypic analyses to the recent development of physical maps for individual chromosomes, this review will try to describe the state-of-the-art of our knowledge on the nuclear genome of C. parvum and will discuss the available experimental evidence concerning the presence of extra-chromosomal elements.  相似文献   

16.
The multilocus RAPD analysis of intergeneric, inter-and intraspecific nuclear genome polymorphism was used for the first time to assess intergeneric, interspecific, and intraspecific polymorphism in Lemnaceae growing on the territory of Russia. The origin of the chosen accessions overlapped with the natural range of duckweeds in Russia. Seventy-five Lemnaceae accessions representing eight species (L. minor, L. gibba, L. turionifera, L. japonica, L. trisulca, L. aequinoctialis, S. polyrhiza, and L. punctata) from three genera (Lemna, Spirodela, and Landoltia), were analyzed. The highest variability levels were revealed in L. minor accessions (0.03–0.20). Species L. trisulca and S. polyrhiza were characterized by values of genetic distance 0.01–0.18 and 0.03–0.16, respectively. The lowest polymorphism levels were detected for L. turionifera (0.01–0.11). The dendrogram based on RAPD data showed that L. aequinoctialis was the most genetically distant species of the genus Lemna. Accessions of species L. turionifera and L. japonica, as well as L. minor and L. gibba, did not form separate species-specific subclusters; rather, they fell into clusters with L. japonica/L. turionifera and L. minor/L. gibba. Accessions of the genera Spirodela and Landoltia formed two separate clusters combined into one group.  相似文献   

17.
The multilocus RAPD analysis of intergeneric, inter- and intraspecific nuclear genome polymorphism was used for the first time to assess intergeneric, interspecific, and intraspecific polymorphism in Lemnaceae growing on the territory of Russia. The origin of the chosen accessions overlapped with the natural range of duckweeds in Russia. Seventy-five Lemnaceae accessions representing eight species (L. minor, L. gibba, L. turionifera, L. japonica. L. trisulca, L. aequinoctialis, S. polyrhiza, and L. punctata) from three genera (Lemna, Spirodela, and Landoltia), were analyzed. The highest variability levels were revealed in L. minor accessions (0.03-0.20). Species L. trisulca and S. polyrhiza were characterized by values of genetic distance 0.01-0.18 and 0.03-0.16, respectively. The lowest polymorphism levels were detected for L. turionifera (0.01-0.11). The dendrogram based on RAPD data showed that L. aequinoctialis was the most genetically distant species of the genus Lemna. Accessions of species L. turionifera and L. japonica, as well as L. minor and L. gibba, did not form separate species-specific subclusters; rather, they fell into clusters with L. japonica/L. turionifera and L. minor/L. gibba. Accessions of the genera Spirodela and Landoltia formed two separate clusters combined into one group.  相似文献   

18.
The prospect of using linkage disequilibrium (LD) for fine-scale mapping in humans has attracted considerable attention, and, during the validation of a set of single-nucleotide polymorphisms (SNPs) for linkage analysis, a set of data for 4,833 SNPs in 538 clusters was produced that provides a rich picture of local attributes of LD across the genome. LD estimates may be biased depending on the means by which SNPs are first identified, and a particular problem of ascertainment bias arises when SNPs identified in small heterogeneous panels are subsequently typed in larger population samples. Understanding and correcting ascertainment bias is essential for a useful quantitative assessment of the landscape of LD across the human genome. Heterogeneity in the population recombination rate, rho=4Nr, along the genome reflects how variable the density of markers will have to be for optimal coverage. We find that ascertainment-corrected rho varies along the genome by more than two orders of magnitude, implying great differences in the recombinational history of different portions of our genome. The distribution of rho is unimodal, and we show that this is compatible with a wide range of mixtures of hotspots in a background of variable recombination rate. Although rho is significantly correlated across the three population samples, some regions of the genome exhibit population-specific spikes or troughs in rho that are too large to be explained by sampling. This result is consistent with differences in the genealogical depth of local genomic regions, a finding that has direct bearing on the design and utility of LD mapping and on the National Institutes of Health HapMap project.  相似文献   

19.
Mononucleotide repeats (MNRs) are abundant in eukaryotic genomes and exhibit a high degree of length variability due to insertion and deletion events. However, the relationship between these repeats and mutation rates in surrounding sequences has not been systematically investigated. We have analyzed the frequency of single nucleotide polymorphisms (SNPs) at positions close to and within MNRs in the human genome. Overall, we find a 2- to 4-fold increase in the SNP frequency at positions immediately adjacent to the boundaries of MNRs, relative to that at more distant bases. This relationship exhibits a strong asymmetry between 3' and 5' ends of repeat tracts and is dependent upon the repeat motif, length and orientation of surrounding repeats. Our analysis suggests that the incorporation or exclusion of bases adjacent to the boundary of the repeat through substitutions, in which these nucleotides mutate towards or away from the base present within the repeat, respectively, may be another mechanism by which MNRs expand and contract in the human genome.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号