首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Loss of heterozygosity for a locus on human chromosome 11q22-23 is observed at high frequency in non-small cell lung carcinoma (NSCLC). Introduction of a 1.1 Mb fragmented yeast artificial chromosome (YAC) mapping to this region completely suppresses the tumorigenic properties of a human NSCLC cell line, A549. Smaller fragmented YACs give partial but not complete suppression. To further localize the gene(s) responsible for this partial suppression, a bacterial artificial chromosome (BAC) and P1-based artificial chromosome (PAC) contig was constructed, completely spanning the candidate region. End sequence generated in the construction of the BAC/PAC contig identified a previously unmapped EST and served to order genomic sequence contigs from the publicly available Celera Genomics (CG) and Human Genome Project (HGP) efforts. Comparison showed that CG provided larger contigs, while HGP provided more coverage. Neither CG nor HGP provided complete sequence coverage, alone or in combination. The sequence was used to map 110 ESTs and to predict new genes, including two GenScan gene predictions that overlapped ESTs and were shown to be differentially expressed in tumorigenic and suppressed A549 cell lines.  相似文献   

2.
Expressed sequence tags (ESTs) provide researchers with a quick and inexpensive route for discovering new genes, data on gene expression and regulation, and also provide genic markers that help in constructing genome maps. Cacao is an important perennial crop of humid tropics. Cacao EST sequences, as available in the public domain, were downloaded and made into contigs. Microsatellites were located in these ESTs and contigs using five softwares (MISA, TRA, TROLL, SSRIT and SSR primer). MISA gave maximum coverage of SSRs in cacao ESTs and contigs, although TRA was able to detect higher order (>5-mer) repeats. The frequency of SSRs was one per 26.9 kb in the known set of ESTs. One-third of the repeats in EST-contigs were found to be trimeric. A few rare repeats like 21-mer repeat were also located. A/T repeats were most abundant among the mononucleotide repeats and the AG/GA/TC/CT type was the most frequent among dimerics. Flanking primers were designed using Primer3 program and verified experimentally for PCR amplification. The results of the study are made available freely online database (). Seven primer pairs amplified genomic DNA isolated from leaves were used to screen a representative set of 12 accessions of cacao.  相似文献   

3.
Expressed sequence tags (ESTs) are partial cDNA sequences read from both ends of random expressed gene fragments used for discovering new genes. DNA libraries from four different developmental stages of Schistosoma mansoni used in this study generated 141 ESTs representing about 2.5% of S. mansoni sequences in dbEST. Sequencing was done by the dideoxy chain termination method. The sequences were submitted to GenBank for homology searching in nonredundant databases using Basic Local Alignment Search Tool for DNA (BLASTN) alignment and for protein (BLASTX) alignment at the National Center for Biotechnology Information (NCBI). Among submitted ESTs, 29 were derived from lambdagt11 sporocyst library, 70 from lambdaZap adult worm library, 31 from lambdaZap cercarial library, and 11 from lambdaZap female B worm library. Homology search revealed that eight (5.6%) ESTs shared homology to previously identified S.mansoni genes in dbEST, 15 (10.6%) are homologous to known genes in other organisms, 116 (81.7%) showed no significant sequence homology in the databases, and the remaining sequences (2.1%) showed low homologies to rRNA or mitochondrial DNA sequences. Thus, among the 141 ESTs studied, 116 sequences are derived from noval, uncharactarized S. mansoni genes. Those 116 ESTs are important for identification of coding regions in the sequences, helping in mapping of schistosome genome, and identifying genes of immunological and pharmacological significance.  相似文献   

4.
5.
6.
A bacterial artificial chromosome (BAC) library has been established from genomic DNA isolated from the trematode parasite of human, Schistosoma mansoni. This library consists of more than 21,000 recombinant clones carrying inserts in the pBeloBAC11 vector. The mean insert size was 100 kb, representing an approximate 7.95-fold genome coverage. Library screening with eight chromosome-specific or single-copy gene probes yielded between 1 and 9 positive clones, and none of those tested was absent from the library. End sequences were obtained for 93 randomly selected clones, and 37 showed sequence identity to S. mansoni sequences (ESTs, genes, or repetitive sequences). A preliminary analysis by fluorescence in situ hybridization localized 8 clones on schistosome chromosomes 1 (2 clones), 2, 3, 5, Z, and W (3 clones). This library provides a new resource for the physical mapping and sequencing of the genome of this important human pathogen.  相似文献   

7.
8.
9.
10.
Birds have played a central role in many biological disciplines, particularly ecology, evolution, and behavior. The chicken, as a model vertebrate, also represents an important experimental system for developmental biologists, immunologists, cell biologists, and geneticists. However, genomic resources for the chicken have lagged behind those for other model organisms, with only 1845 nonredundant full-length chicken cDNA sequences currently deposited in the EMBL databank. We describe a large-scale expressed-sequence-tag (EST) project aimed at gene discovery in chickens (http://www.chick.umist.ac.uk). In total, 339,314 ESTs have been sequenced from 64 cDNA libraries generated from 21 different embryonic and adult tissues. These were clustered and assembled into 85,486 contiguous sequences (contigs). We find that a minimum of 38% of the contigs have orthologs in other organisms and define an upper limit of 13,000 new chicken genes. The remaining contigs may include novel avian specific or rapidly evolving genes. Comparison of the contigs with known chicken genes and orthologs indicates that 30% include cDNAs that contain the start codon and 20% of the contigs represent full-length cDNA sequences. Using this dataset, we estimate that chickens have approximately 35,000 genes in total, suggesting that this number may be a characteristic feature of vertebrates.  相似文献   

11.
RNA-Seq is a powerful tool for the annotation of genomes, in particular for the identification of isoforms and UTRs. Nevertheless, several software tools exist and no standard strategy to obtain a reliable annotation is yet established. We tested different combinations of the most commonly used reference-based alignment tools (TopHat, GSNAP) in combination with two frequently used reference-based assemblers (Cufflinks, Scripture) and evaluated the potential of RNA-Seq to improve the annotation of Drosophila pseudoobscura. While GSNAP maps a higher proportion of reads, TopHat resulted in a more accurate annotation when used in combination with Cufflinks. Scripture had the lowest sensitivity. Interestingly, after subsampling to the same coverage for GSNAP and TopHat, we find that both mappers have similar performance, implying that the advantage of TopHat is mainly an artifact of the lower coverage. Overall, we observed a low concordance among the different approaches tested both at junction and isoform levels. Using data from both sexes of two adult strains of D. pseudoobscura we detected alternative splicing for about 30% of the FlyBase multiple-exon genes. Moreover, we extended the boundaries for 6523 genes (about 40%). We annotated 669 new genes, 45% of them with splicing evidence. Most of the new genes are located on unassembled contigs, reflecting their incomplete annotation. Finally, we identified 99 additional new genes that are not represented in the current genome contigs of D. pseudoobscura, probably due to location in genomic regions that are difficult to assemble (e.g. heterochromatic regions).  相似文献   

12.
For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.  相似文献   

13.
Expressed Sequence Tag (EST) analysis has pioneered genome-wide gene discovery and expression profiling. In order to establish a gene expression index in the rice cultivar indica, we sequenced and analyzed 86,136 ESTs from nine rice cDNA libraries from the super hybrid cultivar LYP9 and its parental cultivars. We assembled these ESTs into 13,232 contigs and leave 8,976 singletons. Overall, 7,497 sequences were found similar to the existing sequences in GenBank and 14,711 are novel. These sequences are classified by molecular function, biological process and pathways according to the Gene Ontology. We compared our sequenced ESTs with the publicly available 95,000 ESTs from japonica, and found little sequence variation, despite the large difference between genome sequences. We then assembled the combined 173,000 rice ESTs for further analysis. Using the pooled ESTs, we compared gene expression in metabolism pathway between rice and Arabidopsis according to KEGG. We further profiled gene expression pattern  相似文献   

14.
15.
16.
Efforts to construct a genetic linkage map of channel catfish have involved identification of random genomic microsatellite markers, as well as anchored Type I loci (expressed genes) from channel catfish. To identify Type I markers we constructed a directional cDNA library from brain tissue to obtain expressed catfish sequences that could be used for single nucleotide polymorphism (SNP) marker development. These cDNA sequences surprisingly contained a high proportion of microsatellites (about 14%) in noncoding regions of expressed sequence tags (ESTs), many of which were not associated with known sequences. To further identify cDNAs with microsatellites and reduce the number of sequencing reactions needed for marker development, we enriched this library for repeat sequences and sequenced clones from both directions. A total of 1644 clones from seven repeat-enriched captures (CA, GT, CT, GA, MTT, TAG, and TAC) were sequenced from both ends, and 795 nonredundant clones were assembled. Thirty-seven percent of the clones contained microsatellites in the trimmed sequence. After assembly in the TIGR Catfish Gene Index (CfGI), 154 contigs matched known vertebrate genes and 92 contigs contained microsatellites. When BLAST-matched orthologues were available for similarity alignments, 28% of these contigs contained repeats in the 5'-UTR, 72% contained repeats in the 3'-UTR, and 8% contained repeats at both ends. Using biotinylated repeat oligonucleotides coupled with streptavidin-coated magnetic beads, and rapid, single-pass hybridization, we were able to enrich our plasmid library greater than two-fold for repeat sequences and increase the ability to link these ESTs with known sequences greater than six-fold.  相似文献   

17.
Root-knot nematodes are obligatory sedentary endoparasites that require a plant host to complete their life cycle. To understand the functions of Meloidogyne incognita nematode genes transcribed from eggs and second-stage juveniles (J2), we have constructed a normalized full-length M. incognita cDNA library and analyzed the ESTs using Pendant-Pro Sequence Analysis Suite. The 5,832 M. incognita ESTs formed 3,263 clusters and 2,241 singletons. The sequences ranged from 51 to 1,740 base pairs, and their average size was 699 base pairs. The protein length of M. incognita ESTs ranged from 150 to 299 amino acids. Forty contigs of predicted proteins that were grouped by BLASTP identity values had significant homology to the genes expressed in their organelle structures (cuticle, epidermis, extracellular matrix and muscle). Using the gomerger method of contigs, we could functionally assign GO terms to 2,147 (53.4%) of 4,024 contigs. Following the E.C. numbers method using UniProt database hits, we could functionally classify E.C. numbers to 288 (7.2%) of 4024 contigs. Also, the taxonomy was classified to 2,329 (57.9%) of 4,024 contigs. We could predict transmembrane regions of 4,024 clusters using the TMpred algorithm. Of the 4,024 contigs with transmembrane regions, 1,457 (36.2%) were assigned more than one domain, and 2,567 (63.8%) could not be assigned a transmembrane domain. The M. incognita ESTs will provide a foundation for developing novel target genes for parasite control and contribute to accelerating the research of biologically-related species.  相似文献   

18.
Efforts to construct a genetic linkage map of channel catfish have involved identification of random genomic microsatellite markers, as well as anchored Type I loci (expressed genes) from channel catfish. To identify Type I markers we constructed a directional cDNA library from brain tissue to obtain expressed catfish sequences that could be used for single nucleotide polymorphism (SNP) marker development. These cDNA sequences surprisingly contained a high proportion of microsatellites (about 14%) in noncoding regions of expressed sequence tags (ESTs), many of which were not associated with known sequences. To further identify cDNAs with microsatellites and reduce the number of sequencing reactions needed for marker development, we enriched this library for repeat sequences and sequenced clones from both directions. A total of 1644 clones from seven repeat-enriched captures (CA, GT, CT, GA, MTT, TAG, and TAC) were sequenced from both ends, and 795 nonredundant clones were assembled. Thirty-seven percent of the clones contained microsatellites in the trimmed sequence. After assembly in the TIGR Catfish Gene Index (CfGI), 154 contigs matched known vertebrate genes and 92 contigs contained microsatellites. When BLAST-matched orthologues were available for similarity alignments, 28% of these contigs contained repeats in the 5'-UTR, 72% contained repeats in the 3'-UTR, and 8% contained repeats at both ends. Using biotinylated repeat oligonucleotides coupled with streptavidin-coated magnetic beads, and rapid; single-pass hybridization, we were able to enrich our plasmid library greater than two-fold for repeat sequences and increase the ability to link these ESTs with known sequences greater than six-fold.  相似文献   

19.
A unigene set of 1411 contigs was constructed from 2629 redundant maize expressed sequence tags (ESTs) mapped on the maizeDB genetic map. Rice orthologous sequences were identified by blast alignment against the rice genomic sequence. A total of 1046 (74%) maize contigs were associated with their corresponding homologues in the rice genome and 656 (47%) defined as potential orthologous relationships. One hundred and seventeen (8%) maize EST contigs mapped to two distinct loci on the maize genetic map, reflecting the tetraploid nature of the maize genome. Among 492 mono-locus contigs, 344 (484 redundant ESTs) identify collinear blocks between maize chromosomes 2 and 4 and a single rice chromosome, defining six new collinear regions. Fine-scale analysis of collinearity between rice chromosomes 1 and 5 with maize chromosomes 3, 6 and 8 shows the presence of internal rearrangements within collinear regions. Mapping of maize contigs to two distinct loci on the rice sequence identifies five new duplication events in rice. Detailed analysis of a duplication between rice chromosomes 1 and 5 shows that 11% of the annotated genes from the chromosome 1 locus are found duplicated on the chromosome 5 paralogous counterpart, indicating a high degree of re-organisations. The implications of these findings for map-based cloning in collinear regions are discussed.  相似文献   

20.
A fine physical map of the CACNA1A gene region on 19p13.1-p13.2 chromosome   总被引:5,自引:0,他引:5  
The P/Q-type Ca(2+) channel alpha(1A) subunit gene (CACNA1A) was cloned on the short arm of chromosome 19 between the markers D19S221 and D19S179 and found to be responsible for Episodic Ataxia type 2, Familial Hemiplegic Migraine and Spinocerebellar Ataxia type 6. This region was physically mapped by 11 cosmid contigs spanning about 1. 4Mb, corresponding to less than 70% of the whole region. The cosmid contig used to characterize the CACNA1A gene accounted only for the coding region of the gene lacking, therefore, the promoter and possible regulation regions. The present study improves the physical map around and within the CACNA1A by giving a complete cosmid or BAC contig coverage of the D19S221-D19S179 interval. A number of new STSs, whether polymorphic or not, were characterized and physically mapped within this region. Four ESTs were also assigned to cosmids belonging to specific contigs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号