首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
《Journal of phycology》2001,37(Z3):21-21
Grossman, A. R.1, Davies, J.2, Shrager, J.1, Chang, C.-W1, McDermott, J.3 & Zhang, Z.1 1Department of Plant Biology, The Carnegie Institution of Washington, 260 Panama Street, Stanford, CA 94305 USA; 2Exelixis Pharmaceuticals, Inc. 170 Harbor Way, So. San Francisco, CA 94083-051 USA; 3Department of Botany, Iowa State University, Ames, IA 50011-1020 USA The Chlamydomonas genome project involves 1) sequencing of cDNAs isolated from cells exposed to various environmental conditions, 2) construction of a high density DNA microarray, 3) construction of genomic contigs that nucleate around specific physical and genetic markers, 4) generation of a complete chloroplast genome sequence and analyses of chloroplast gene expression, and 5) placement of the genomic information on the network in a user friendly format. The aspects of the project emphasized by our group at Stanford involves the generation of normalized cDNA libraries, sequencing of the cDNAs to generate both contigs and unigene families and the use of this information to construct a high density cDNA microarray. I will discuss the techniques involved in securing cDNA sequence information and the ways in which that information is assembled and analyzed. I will also discuss the use of the different cDNA libraries to identifying differentially expressed genes (by in silico subtractions) and strategies for constructing high-density cDNA microarrays. Finally, if completed, the first global expression studies, and the use of the microarrays to elucidate global expression in mutant strains will be discussed.  相似文献   

2.
We have accumulated information of the coding sequences of uncharacterized human genes, which are known as KIAA genes, and the number of these genes exceeds 2000 at present. As an extension of this sequencing project, we recently have begun to accumulate mouse KIAA-homologous cDNAs, because it would be useful to prepare a set of human and mouse homologous cDNA pairs for further functional analysis of the KIAA genes. We herein present the entire sequences of 400 mouse KIAA cDNA clones and 4 novel cDNA clones which were incidentally identified during this project. Most of clones entirely sequenced in this study were selected by computer-assisted analysis of terminal sequences of the cDNAs. The average size of the 404 cDNA sequences reached 5.3 kb and that of the deduced amino acid sequences from these cDNAs was 868 amino acid residues. The results of sequence analyses of these clones showed that single mouse KIAA cDNAs bridged two different human KIAA cDNAs in some cases, which indicated that these two human KIAA cDNAs were derived from single genes although they had been supposed to originate from different genes. Furthermore, we successfully mapped all the mouse KIAA cDNAs along the genome using a recently published mouse genome draft sequence.  相似文献   

3.
To accumulate information on the coding sequences (CDSs) of unidentified genes, we have conducted a sequencing project of human long cDNA clones. Both the end sequences of approximately 10,000 cDNA clones from two size-fractionated human spleen cDNA libraries (average sizes of 4.5 kb and 5.6 kb) were determined by single-pass sequencing to select cDNAs with unidentified sequences. We herein present the entire sequences of 81 cDNA clones, most of which were selected by two approaches based on their protein-coding potentialities in silico: Fifty-eight cDNA clones were selected as those having protein-coding potentialities at the 5'-end of single-pass sequences by applying the GeneMark analysis; and 20 cDNA clones were selected as those expected to encode proteins larger than 100 amino acid residues by analysis of the human genome sequences flanked by both the end sequences of cDNAs using the GENSCAN gene prediction program. In addition to these newly identified cDNAs, three cDNA clones were isolated by colony hybridization experiments using probes corresponding to known gene sequences since these cDNAs are likely to contain considerable amounts of new information regarding the genes already annotated. The sequence data indicated that the average sizes of the inserts and corresponding CDSs of cDNA clones analyzed here were 5.0 kb and 2.0 kb (670 amino acid residues), respectively. From the results of homology and motif searches against the public databases, functional categories of the 29 predicted gene products could be assigned; 86% of these predicted gene products (25 gene products) were classified into proteins relating to cell signaling/communication, nucleic acid management, and cell structure/motility.  相似文献   

4.
The common marmoset is a new world monkey, which has become a valuable experimental animal for biomedical research. This study developed cDNA libraries for the common marmoset from five different tissues. A total of 290 426 high-quality EST sequences were obtained, where 251 587 sequences (86.5%) had homology (1E−100) with the Refseqs of six different primate species, including human and marmoset. In parallel, 270 673 sequences (93.2%) were aligned to the human genome. When 247 090 sequences were assembled into 17 232 contigs, most of the sequences (218 857 or 15 089 contigs) were located in exonic regions, indicating that these genes are expressed in human and marmoset. The other 5578 sequences (or 808 contigs) mapping to the human genome were not located in exonic regions, suggesting that they are not expressed in human. Furthermore, a different set of 118 potential coding sequences were not similar to any Refseqs in any species, and, thus, may represent unknown genes. The cDNA libraries developed in this study are available through RIKEN Bio Resource Center. A Web server for the marmoset cDNAs is available at http://marmoset.nig.ac.jp/index.html, where each marmoset EST sequence has been annotated by reference to the human genome. These new libraries will be a useful genetic resource to facilitate research in the common marmoset.  相似文献   

5.
6.
7.
A compilation of soybean ESTs: generation and analysis.   总被引:18,自引:0,他引:18  
Whole-genome sequencing is fundamental to understanding the genetic composition of an organism. Given the size and complexity of the soybean genome, an alternative approach is targeted random-gene sequencing, which provides an immediate and productive method of gene discovery. In this study, more than 120000 soybean expressed sequence tags (ESTs) generated from more than 50 cDNA libraries were evaluated. These ESTs coalesced into 16928 contigs and 17336 singletons. On average, each contig was composed of 6 ESTs and spanned 788 bases. The average sequence length submitted to dbEST was 414 bases. Using only those libraries generating more than 800 ESTs each and only those contigs with 10 or more ESTs each, correlated patterns of gene expression among libraries and genes were discerned. Two-dimensional qualitative representations of contig and library similarities were generated based on expression profiles. Genes with similar expression patterns and, potentially, similar functions were identified. These studies provide a rich source of publicly available gene sequences as well as valuable insight into the structure, function, and evolution of a model crop legume genome.  相似文献   

8.
9.
10.
11.
Reference cDNA library facilities available from European sources   总被引:1,自引:0,他引:1  
cDNA libraries are the cornerstone of efforts to identify the relatively small regions of genomes that are responsible for biological effects. Gene hunter seeking candidate genes, via a variety of approaches, ultimately focus on the cloning, sequencing, and expression of cDNAs. Assistance is now available to researchers in the form of genome programs, whose initial goals include assembly of a complete collection of expressed sequences derived from the genome of interest. The concept of reference sets of cDNA libraries is that the aims of genome programs are served most effectively by different laboratories working on a common set of high-quality arrayed cDNA libraries, using different experimental approaches, thereby reducing unnecessary duplication of effort, and maximizing the amount of information that one set of resources can provide.  相似文献   

12.
13.
A chemiluminescent approach for sequential DNA hybridizations to high-density filter arrays of cDNAs, using a biotin-based random priming method followed by a streptavidin/alkaline phosphatase/CDP-Star detection protocol, is presented. The method has been applied to the Brugia malayi genome project, wherein cDNA libraries, cosmid and bacterial artificial chromosome (BAC) libraries have been gridded at high density onto nylon filters for subsequent analysis by hybridization. Individual probes and pools of rRNA probes, ribosomal protein probes and expressed sequence tag probes show correct specificity and high signal-to-noise ratios even after ten rounds of hybridization, detection, stripping of the probes from the membranes and rehybridization with additional probe sets. This approach provides a subtraction method that leads to a reduction in redundant DNA sequencing, thus increasing the rate of novel gene discovery. The method is also applicable for detecting target sequences, which are present in one or only a few copies per cell; it has proven useful for physical mapping of BAC and cosmid high-density filter arrays, wherein multiple probes have been hybridized at one time (multiplexed) and subsequently "deplexed" into individual components for specific probe localizations.  相似文献   

14.
The fathead minnow Pimephales promelas serves as a model organism for assessing the effects of environmental contaminants on early life stage growth and development. Yet, the utilization of genomic tools has been hindered by the lack of genome sequence and genomic information known from this model species. Utilizing published cDNA library sequences, the authors used sequence similarity to compare 4105 cDNAs isolated from fathead minnow fry (<14 days old) with over 250 000 adult cDNA sequences derived from whole body and various tissue types. The objectives of the computational subtraction were to (1) assess the extent of sequence similarity between developing and adult cDNA libraries and (2) predict which cDNA clones are expressed only in developing organisms. The results of the computational predictions were assessed through the construction of a development‐specific DNA microarray targeting all 4105 sequences in the fry cDNA library as well as 56 known mRNAs in P. promelas. Gene expression was determined by comparing total RNA isolated from fry with total RNA isolated from adult samples (whole animal, kidney, liver, brain, ovary and testes). The results showed that 1381 of the targeted fry cDNA sequences (34%) displayed expression across all sample comparisons, and of these, only 166 genes were found to harbour fry‐specific expression (i.e. no expression in adult samples). Of note, 69% of the genes computationally predicted to be fry specific were found across all experimental results; yet, only 27% of the computationally predicted fry‐specific sequences were experimentally confirmed to be fry specific. An important result was the identification of many novel mRNA sequences specific to the developing minnow, which lack homology with any other known sequence. In addition, the study results included tissue‐specific expression in adult samples. These results demonstrate the capabilities and limitations of inter‐library sequence comparisons as a predictor of gene activity in non‐sequenced organisms and tissues, as well as DNA microarray gene expression studies in non‐sequenced organisms.  相似文献   

15.
Birds have played a central role in many biological disciplines, particularly ecology, evolution, and behavior. The chicken, as a model vertebrate, also represents an important experimental system for developmental biologists, immunologists, cell biologists, and geneticists. However, genomic resources for the chicken have lagged behind those for other model organisms, with only 1845 nonredundant full-length chicken cDNA sequences currently deposited in the EMBL databank. We describe a large-scale expressed-sequence-tag (EST) project aimed at gene discovery in chickens (http://www.chick.umist.ac.uk). In total, 339,314 ESTs have been sequenced from 64 cDNA libraries generated from 21 different embryonic and adult tissues. These were clustered and assembled into 85,486 contiguous sequences (contigs). We find that a minimum of 38% of the contigs have orthologs in other organisms and define an upper limit of 13,000 new chicken genes. The remaining contigs may include novel avian specific or rapidly evolving genes. Comparison of the contigs with known chicken genes and orthologs indicates that 30% include cDNAs that contain the start codon and 20% of the contigs represent full-length cDNA sequences. Using this dataset, we estimate that chickens have approximately 35,000 genes in total, suggesting that this number may be a characteristic feature of vertebrates.  相似文献   

16.
Discovery of single nucleotide polymorphisms (SNPs) requires analysis of redundant sequences such as those available in large public databases. The ability to detect SNPs, especially those of low frequency, is dependent on the depth and scale of the discovery effort. Large numbers of SNPs have been identified by mining large-scale EST surveys and whole genome sequencing projects. These surveys however are subject to ascertainment bias and the inherent errors in large-scale single pass sequencing efforts. For example, the number of steps involved in the construction and sequencing of cDNA libraries make ESTs highly error prone, resulting in an increased frequency of nonvalid SNPs obtained in these surveys. Sequences of mtDNA genes are often incorporated into cDNA libraries as an artifact of the library construction process and are typically either subtracted from cDNA libraries or are considered superfluous when evaluating the information content of EST datasets. Sequences of mtDNA genes provide a unique resource for the analysis of SNP parameters in EST projects. This study uses sequences from four turkey muscle cDNA libraries to demonstrate how mtDNA sequences gleaned from collections of ESTs can be used to estimate SNP parameters and thus help predict the validity of SNPs.  相似文献   

17.
Discovery of single nucleotide polymorphisms (SNPs) requires analysis of redundant sequences such as those available in large public databases. The ability to detect SNPs, especially those of low frequency, is dependent on the depth and scale of the discovery effort. Large numbers of SNPs have been identified by mining large-scale EST surveys and whole genome sequencing projects. These surveys however are subject to ascertainment bias and the inherent errors in large-scale single pass sequencing efforts. For example, the number of steps involved in the construction and sequencing of cDNA libraries make ESTs highly error prone, resulting in an increased frequency of nonvalid SNPs obtained in these surveys. Sequences of mtDNA genes are often incorporated into cDNA libraries as an artifact of the library construction process and are typically either subtracted from cDNA libraries or are considered superfluous when evaluating the information content of EST datasets. Sequences of mtDNA genes provide a unique resource for the analysis of SNP parameters in EST projects. This study uses sequences from four turkey muscle cDNA libraries to demonstrate how mtDNA sequences gleaned from collections of ESTs can be used to estimate SNP parameters and thus help predict the validity of SNPs.  相似文献   

18.
The partial amino acid sequences of 121 rice proteins separated by two-dimensional gel electrophoresis (2D-PAGE), were determined for a protein sequence data file. In the Rice Genome Research Program (RGP), more than 20,000 cDNA clones randomly selected from rice cDNA libraries have been sequenced to construct a cDNA catalog. Complimentary DNAs encoding about 30% of proteins in the protein sequence data file could be identified in the catalog by computer search. It was deduced that 20,000–40,000 genes are present in the rice genome. Only half of about 20,000 cDNAs sequenced in the RGP, corresponding to 1/4–1/2 of genes present in the entire rice genome, should have unique sequences after considering gene redundancy. This is consistent with the fact that the cDNAs encoding about 30% of the sequenced proteins could be identified in the catalog. If the size of the cDNA catalog is enlarged further, cDNAs encoding all proteins separated by 2D-PAGE could be easily identified from the catalog by using the protein sequence data.  相似文献   

19.
Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina) and Ion Torrent (Life Technology) sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare). Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels) between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNP but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis.  相似文献   

20.
The Filarial Genome Project (FGP) was initiated in 1994 under the auspices of the World Health Organisation. Brugia malayi was chosen as the model organism due to the availability of all life cycle stages for the construction of cDNA libraries. To date, over 20000 cDNA clones have been partially sequenced and submitted to the EST database (dbEST). These ESTs define approximately 7000 new Brugia genes. Analysis of the EST dataset provides useful information on the expression pattern of the most abundantly expressed Brugia genes. Some highly expressed genes have been identified that are expressed in all stages of the parasite's life cycle, while other highly expressed genes appear to be stage-specific. To elucidate the structure of the Brugia genome and to provide a basis for comparison to the Caenorhabditis elegans genome, the FGP is also constructing a physical map of the Brugia chromosomes and is sequencing genomic BAC clones. In addition to the nuclear genome, B. malayi possesses two other genomes: the mitochondrial genome and the genome of a bacterial endosymbiont. Eighty percent of the mitochondrial genome of B. malayi has been sequenced and is being compared to mitochondrial sequences of other nematodes. The bacterial endosymbiont genome found in B. malayi is closely related to the Wolbachia group of rickettsia-like bacteria that infects many insect species. A set of overlapping BAC clones is being assembled to cover the entire bacterial genome. Currently, half of the bacterial genome has been assembled into four contigs. A consortium has been established to sequence the entire genome of the Brugia endosymbiont. The sequence and mapping data provided by the FGP is being utilised by the nematode research community to develop a better understanding of the biology of filarial parasites and to identify new vaccine candidates and drug targets to aid the elimination of human filariasis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号