Similar articles
 Found 20 similar articles; search took 31 ms
1.
2.
The presence of trinucleotide microsatellites within genes is a well-known cause of a number of genetic diseases. However, the precise distribution of dinucleotide microsatellites within genes is less well documented. Here we report 15 unique cDNAs containing dinucleotide repeats from the channel catfish Ictalurus punctatus. Gene identities of nine of the 15 cDNAs were determined, of which three encode structural proteins and six encode regulatory proteins. Five cDNAs harbored dinucleotide repeats in the 5' untranslated region (5'-NTR), nine in the 3'-NTR, and one in the coding region. The presence of these transcribed dinucleotide repeats and their potential expansion in size within coding regions could lead to disruption of the original protein and/or formation of new genes by frame shift. The low number of dinucleotide repeats within coding regions suggests that they were strongly selected against. All the transcribed microsatellite loci examined were polymorphic, making them useful for gene mapping in catfish.
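To make the notion of a dinucleotide repeat concrete, the following is a minimal sketch of how perfect repeats such as (CA)n could be located in a cDNA sequence. It is not the method used in the study above: the function name `find_dinucleotide_repeats` and the `min_units` threshold of six repeat units are illustrative assumptions.

```python
import re


def find_dinucleotide_repeats(seq, min_units=6):
    """Return (start, motif, units) for each perfect dinucleotide repeat.

    Hypothetical helper for illustration; min_units is an assumed threshold.
    """
    # Group 2 captures the first base; (?!\2) forces the second base to differ,
    # so mononucleotide runs such as AAAA are not reported as "AA" repeats.
    # \1{n,} then requires at least n further copies of the 2-base motif.
    pattern = re.compile(r"(([ACGT])(?!\2)[ACGT])\1{%d,}" % (min_units - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        units = len(match.group(0)) // 2
        hits.append((match.start(), match.group(1), units))
    return hits


# Example: eight CA units flanked by short non-repeat sequence.
example_hits = find_dinucleotide_repeats("GGG" + "CA" * 8 + "TTT")
```

Whether a hit lies in the 5' untranslated region, the coding region, or the 3' untranslated region would then depend on comparing its start position with the annotated coding-sequence boundaries, which this sketch does not attempt.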

3.
DNA Research, 2008, 15(6): 333-346
A large collection of full-length cDNAs is essential for the correct annotation of genomic sequences and for the functional analysis of genes and their products. We obtained a total of 39 936 soybean cDNA clones (GMFL01 and GMFL02 clone sets) in a full-length-enriched cDNA library which was constructed from soybean plants that were grown under various developmental and environmental conditions. Sequencing from 5′ and 3′ ends of the clones generated 68 661 expressed sequence tags (ESTs). The EST sequences were clustered into 22 674 scaffolds involving 2580 full-length sequences. In addition, we sequenced 4712 full-length cDNAs. After removing overlaps, we have so far obtained 6570 new full-length sequences of soybean cDNAs. Our data indicated that 87.7% of the soybean cDNA clones contain complete coding sequences in addition to 5′- and 3′-untranslated regions. All of the obtained data confirmed that our collection of soybean full-length cDNAs covers a wide variety of genes. Comparative analysis between the derived sequences from soybean and Arabidopsis, rice or other legumes data revealed that some specific genes were involved in our collection and a large part of them could be annotated to unknown functions. A large set of soybean full-length cDNA clones reported in this study will serve as a useful resource for gene discovery from soybean and will also aid a precise annotation of the soybean genome.
Key words: EST, full-length cDNA, functional annotation, legume, soybean

4.
Syngenta claims ownership of rice - but will give data away   Total citations: 1 (self-citations: 0, citations by others: 1)

5.
6.
The density and distribution of single-nucleotide polymorphisms (SNPs) across the genome has important implications for linkage disequilibrium mapping and association studies, and the level of simple-sequence microsatellite polymorphisms has important implications for the use of oligonucleotide hybridization methods to genotype SNPs. To assess the density of these types of polymorphisms in P. falciparum, we sampled introns and noncoding DNA upstream and downstream of coding regions among a variety of geographically diverse parasites. Across 36,229 base pairs of noncoding sequence representing 41 genetic loci, a total of 307 polymorphisms including 248 polymorphic microsatellites and 59 SNPs were identified. We found a significant excess of microsatellite polymorphisms having a repeat unit length of one or two, compared to those with longer repeat lengths, as well as a nonrandom distribution of SNP polymorphisms. Almost half of the SNPs localized to only three of the 41 genetic loci sampled. Furthermore, we find significant differences in the frequency of polymorphisms across the two chromosomes (2 and 3) examined most extensively, with an excess of SNPs and a surplus of polymorphic microsatellites on chromosome 3 as compared to chromosome 2 (P=0.0001). In addition, at some individual genetic loci we find a nonrandom distribution of polymorphisms between coding and flanking noncoding sequences, where completely monomorphic regions may flank highly polymorphic genes. These data, combined with our previous findings of nonrandom distribution of SNPs across chromosome 2, suggest that the Plasmodium falciparum genome may be a mosaic with regard to genetic diversity, containing chromosomal regions that are highly polymorphic interspersed with regions that are much less polymorphic.

7.
Ubiquitin coding sequences were isolated from a human genomic library and two cDNA libraries. One human ubiquitin gene consists of 2055 nucleotides and codes for a polyprotein consisting of 685 amino acid residues. The polyprotein contains nine direct repeats of the ubiquitin amino acid sequence and the last ubiquitin sequence is extended with an additional valyl residue at the C-terminal end. No spacer sequences separate the ubiquitin repeats and the coding regions are not interrupted by intervening sequences. This particular gene is transcribed since cDNAs corresponding to the genomic sequence have been isolated. At least two more types of ubiquitin genes are encoded in the human genome, one coding for a ubiquitin monomer while another presumably codes for three or four direct repeats of the ubiquitin sequence. Human DNA contains many copies of the ubiquitin sequence. Ubiquitin is therefore encoded in the human genome as a multigene family.
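The figures in the abstract above can be checked with a few lines of arithmetic. This sketch assumes the standard ubiquitin length of 76 amino acids, which the abstract itself does not state; the variable names are illustrative.

```python
# Assumption: mature ubiquitin is 76 amino acids long (not stated in the abstract).
UBIQUITIN_LENGTH_AA = 76

repeats = 9          # direct repeats reported in the abstract
extra_residues = 1   # the additional C-terminal valine

# Nine head-to-tail repeats with no spacers, plus the extra valine.
polyprotein_aa = repeats * UBIQUITIN_LENGTH_AA + extra_residues

# Three nucleotides per codon, with no introns interrupting the coding region.
coding_nt = polyprotein_aa * 3

print(polyprotein_aa, coding_nt)
```

Under that assumption the result is 685 residues and 2055 nucleotides, matching the reported values; since 2055 is exactly 685 codons, the nucleotide figure appears to exclude a stop codon.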

8.
The polyploid nature of wheat is a key characteristic of the plant. Full-length complementary DNAs (cDNAs) provide essential information that can be used to annotate the genes and provide a functional analysis of these genes and their products. We constructed a full-length cDNA library derived from young spikelets of common wheat, and obtained 24056 expressed sequence tags (ESTs) from both ends of the cDNA clones. These ESTs were grouped into 3605 contigs using the phrap method, representing expressed loci from each of the three genomes. Using BLAST, 3605 contigs were grouped into 1902 gene clusters, showing that loci of the three genomes are not always expressed. A homology search of these gene clusters against a wheat EST database (15964 gene clusters) and a rice full-length cDNA database (21447 gene clusters) revealed that a quarter of the wheat full-length cDNAs were novel. A protein database of Arabidopsis was used to examine the functional classification of these gene clusters. The GC-content in the 5′-UTR region of wheat cDNAs was compared to that of rice. Forty-three genes (3.5% of wheat cDNAs homologous to those of rice) possessed distinct GC-content in the 5′-UTR region, suggesting different breeding behaviors of wheat and rice.

9.

Background

Cynomolgus macaques (Macaca fascicularis) are a valuable resource for linkage studies of genetic disorders, but their microsatellite markers are not sufficient. In genetic studies, a prerequisite for mapping genes is development of a genome-wide set of microsatellite markers in target organisms. A whole genome sequence and its annotation also facilitate identification of markers for causative mutations. The aim of this study is to establish hundreds of microsatellite markers and to develop an integrative cynomolgus macaque genome database with a variety of datasets including marker and gene information that will be useful for further genetic analyses in this species.

Results

We investigated the level of polymorphisms in cynomolgus monkeys for 671 microsatellite markers that are covered by our established Bacterial Artificial Chromosome (BAC) clones. Four hundred and ninety-nine (74.4%) of the markers were found to be polymorphic using standard PCR analysis. The average number of alleles and average expected heterozygosity at these polymorphic loci in ten cynomolgus macaques were 8.20 and 0.75, respectively.

Conclusion

BAC clones and novel microsatellite markers were assigned to the rhesus genome sequence and linked with our cynomolgus macaque cDNA database (QFbase). Our novel microsatellite marker set and genomic database will be valuable integrative resources in analyzing genetic disorders in cynomolgus macaques.
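The Results above report an average expected heterozygosity of 0.75 across the polymorphic loci. As a minimal sketch, expected heterozygosity at one locus is commonly computed as one minus the sum of squared allele frequencies. The abstract does not say which estimator the authors used (for example, whether a small-sample correction was applied), so the function below, with its hypothetical name `expected_heterozygosity`, shows only the basic form.

```python
def expected_heterozygosity(allele_counts):
    """Expected heterozygosity at one locus: 1 - sum(p_i ** 2).

    allele_counts holds the number of observed copies of each allele.
    Basic, uncorrected form; an illustrative assumption, not the study's method.
    """
    total = sum(allele_counts)
    if total == 0:
        raise ValueError("at least one allele copy is required")
    return 1.0 - sum((count / total) ** 2 for count in allele_counts)


# Ten diploid animals give 20 allele copies per locus.
# Example: four alleles observed 5 times each.
example_he = expected_heterozygosity([5, 5, 5, 5])
```

With four equally frequent alleles the value is 0.75; a locus with a single allele gives 0, which is why monomorphic markers carry no mapping information.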

10.
11.
We have accumulated information on the coding sequences of uncharacterized human genes, which are known as KIAA genes, and the number of these genes exceeds 2000 at present. As an extension of this sequencing project, we have recently begun to accumulate mouse KIAA-homologous cDNAs, because it would be useful to prepare a set of human and mouse homologous cDNA pairs for further functional analysis of the KIAA genes. We herein present the entire sequences of 400 mouse KIAA cDNA clones and 4 novel cDNA clones which were incidentally identified during this project. Most of the clones entirely sequenced in this study were selected by computer-assisted analysis of terminal sequences of the cDNAs. The average size of the 404 cDNA sequences reached 5.3 kb and that of the deduced amino acid sequences from these cDNAs was 868 amino acid residues. The results of sequence analyses of these clones showed that single mouse KIAA cDNAs bridged two different human KIAA cDNAs in some cases, which indicated that these two human KIAA cDNAs were derived from single genes although they had been supposed to originate from different genes. Furthermore, we successfully mapped all the mouse KIAA cDNAs along the genome using a recently published mouse genome draft sequence.

12.
Aptamer-dependent full-length cDNA synthesis by overlap extension PCR   Total citations: 5 (self-citations: 0, citations by others: 5)
Mitani Y, Nakayama T, Harbers M, Hayashizaki Y. BioTechniques, 2004, 37(1): 124, 126, 128-129

13.
To accumulate information on the coding sequences (CDSs) of unidentified genes, we have conducted a sequencing project of human long cDNA clones. Both end sequences of approximately 10,000 cDNA clones from two size-fractionated human spleen cDNA libraries (average sizes of 4.5 kb and 5.6 kb) were determined by single-pass sequencing to select cDNAs with unidentified sequences. We herein present the entire sequences of 81 cDNA clones, most of which were selected by two approaches based on their protein-coding potential in silico: fifty-eight cDNA clones were selected as those having protein-coding potential at the 5'-end of single-pass sequences by applying the GeneMark analysis; and 20 cDNA clones were selected as those expected to encode proteins larger than 100 amino acid residues by analysis of the human genome sequences flanked by both the end sequences of cDNAs using the GENSCAN gene prediction program. In addition to these newly identified cDNAs, three cDNA clones were isolated by colony hybridization experiments using probes corresponding to known gene sequences since these cDNAs are likely to contain considerable amounts of new information regarding the genes already annotated. The sequence data indicated that the average sizes of the inserts and corresponding CDSs of cDNA clones analyzed here were 5.0 kb and 2.0 kb (670 amino acid residues), respectively. From the results of homology and motif searches against the public databases, functional categories of the 29 predicted gene products could be assigned; 86% of these predicted gene products (25 gene products) were classified into proteins relating to cell signaling/communication, nucleic acid management, and cell structure/motility.

14.
15.
We have accumulated information on protein-coding sequences of uncharacterized human genes, which are known as KIAA genes, through cDNA sequencing. For comprehensive functional analysis of the KIAA genes, it is necessary to prepare a set of cDNA clones which direct the synthesis of functional KIAA gene products. However, since the KIAA cDNAs were derived from long mRNAs (> 4 kb), it was not expected that all of them were full-length. Thus, as the first step toward preparing these clones, we evaluated the integrity of protein-coding sequences of KIAA cDNA clones through comparison with homologous protein entries in the public database. As a result, 1141 KIAA cDNAs had at least one homologous entry in the database, and 619 of them (54%) were found to be truncated at the 5' and/or 3' ends. In this study, 290 KIAA cDNA clones were tailored to be full-length or have considerably longer sequences than the original clones by isolating additional cDNA clones and/or connecting parts of additional cDNAs or PCR products of the missing portion to the original cDNA clone. Consequently, 265, 8, and 17 predicted CDSs of KIAA cDNA clones were extended in the amino-, carboxy-, and both terminal sequences, respectively. In addition, 40 cDNA clones were modified to remove spurious interruption of protein-coding sequences. The total length of the resultant extensions at the amino- and carboxy-termini of KIAA gene products reached 97,000 and 7,216 amino acid residues, respectively, and various protein domains were found in these extended portions.

16.
J Y Tso, X H Sun, T H Kao, K S Reece, R Wu. Nucleic Acids Research, 1985, 13(7): 2485-2502
Full-length cDNAs encoding the glycolytic enzyme glyceraldehyde-3-phosphate dehydrogenase (GAPDH) from rat and man have been isolated and sequenced. Many GAPDH gene-related sequences have been found in both genomes based on genomic blot hybridization analysis. Only one functional gene product is known. Results from genomic library screenings suggest that there are 300-400 copies of these sequences in the rat genome and approximately 100 in the human genome. Some of these related sequences have been shown to be processed pseudogenes. We have isolated several rat cDNA clones corresponding to these pseudogenes, indicating that some pseudogenes are transcribed. Rat and human cDNAs are 89% homologous in the coding region, and 76% homologous in the first 100 base pairs of the 3'-noncoding region. Comparison of these two cDNA sequences with those of the chicken, Drosophila and yeast genes allows the analysis of the evolution of the GAPDH genes in detail.

17.
In order to increase the number of markers on the horse cytogenetic map and expand the integration with the linkage map, an equine BAC library was screened for genes and for microsatellites. Eighty-nine intra-exon primers were designed from consensus gene sequences in documented species. After PCR screening, 38 clones containing identified genes were isolated and FISH mapped. These data allowed us to refine the available Zoo-FISH results, to define ten new conserved cytogenetic segments and expand two others, thus leading to the identification of a total of 26 conserved segments between horse and human. Interestingly, a new homeology segment was detected between ECA6p and HSA2q. Screening BAC clones for dinucleotide repeats led to the isolation of 33 microsatellites. Ten of the clones each contained at least one polymorphic microsatellite and one specific gene. The success of the approach in the production of integrative anchor loci, and their potential use in localization and analysis of traits of interest by the candidate gene and positional cloning approaches, are discussed.

18.
19.
Seven cDNA fragments containing polymorphic (AAT)n trinucleotide repeats were isolated from a human brain cDNA library and mapped by linkage to specific loci. These repeats may serve as gene markers or as candidates for diseases caused by expansion mutation.

20.