Similar literature
 Found 20 similar articles (search time: 31 ms)
1.
2.
3.
Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.
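The genome-equivalent figures quoted above are conventionally estimated as the number of usable clones times the average insert size, divided by the haploid genome size. The sketch below illustrates that arithmetic only. The clone count and the genome size are hypothetical placeholders (the abstract reports neither), and discounting empty-vector and organellar clones is an assumption about how such a figure might be adjusted, not the authors' stated procedure.

```python
def genome_equivalents(n_clones, avg_insert_kb, genome_size_mb,
                       empty_rate=0.0, organellar_rate=0.0):
    """Estimate library coverage in genome equivalents (fold).

    Clones with empty vectors or organellar inserts are discounted,
    which is an assumed adjustment, not one taken from the abstract.
    """
    usable_clones = n_clones * (1.0 - empty_rate) * (1.0 - organellar_rate)
    genome_size_kb = genome_size_mb * 1000.0
    return usable_clones * avg_insert_kb / genome_size_kb


if __name__ == "__main__":
    # Hypothetical library: 150,000 clones, 120 kb mean insert,
    # and an assumed 2,300 Mb haploid genome.
    coverage = genome_equivalents(150_000, 120, 2_300,
                                  empty_rate=0.02, organellar_rate=0.01)
    print(f"Estimated coverage: {coverage:.1f}-fold")
```

Keeping the two discount rates as explicit parameters makes it easy to see how much of the nominal coverage is lost to non-nuclear or insert-free clones.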

4.
5.
6.
7.
8.
Selenoproteins are proteins containing the uncommon amino acid selenocysteine (Sec). Sec is inserted by a specific translational machinery that recognizes a stem-loop structure, the SECIS element, in the 3′ UTR of selenoprotein genes and recodes a UGA codon within the coding sequence. As UGA is normally a translational stop signal, selenoproteins are generally misannotated, and dedicated tools have to be developed for this class of proteins. Here, we present two new computational methods for selenoprotein identification and analysis, which we provide publicly through the web servers at http://gladyshevlab.org/SelenoproteinPredictionServer or http://seblastian.crg.es. SECISearch3 replaces its predecessor SECISearch as a tool for prediction of eukaryotic SECIS elements. Seblastian is a new method for selenoprotein gene detection that uses SECISearch3 and then predicts selenoprotein sequences encoded upstream of SECIS elements. Seblastian is able both to identify known selenoproteins and to predict new ones. By applying these tools to diverse eukaryotic genomes, we provide a ranked list of newly predicted selenoproteins together with their annotated cysteine-containing homologues. An analysis of a representative candidate belonging to the AhpC family shows how the use of Sec in this protein evolved in bacterial and eukaryotic lineages.

9.
The Tomato Genomic Resources Database (TGRD) allows interactive browsing of tomato genes, microRNAs, simple sequence repeats (SSRs), important quantitative trait loci and the Tomato-EXPEN 2000 genetic map, together or separately, along the twelve chromosomes of tomato in a single window. The database was created using the sequence of the cultivar Heinz 1706. High-quality single nucleotide polymorphism (SNP) sites between the genes of Heinz 1706 and the wild tomato S. pimpinellifolium LA1589 are also included. Genes are classified into different families. The 5′-upstream sequences (5′-US) of all the genes and their tissue-specific expression profiles are provided. Sequences of the microRNA loci and their putative target genes are catalogued. SSRs and SNPs present in genes and 5′-US are indicated. SSRs located in the genomic, genic and 5′-US regions can be analysed separately for the presence of any particular motif. Primer sequences for all the SSRs and flanking sequences for all the genic SNPs are provided. TGRD is a user-friendly, web-accessible relational database and uses the CMAP viewer for graphical scanning of all the features. Integration and graphical presentation of important genomic information will facilitate better and easier use of the tomato genome. TGRD can be accessed as an open source repository at http://59.163.192.91/tomato2/.

10.
11.
We determined 36 310 bovine expressed sequence tag (EST) sequences using 10 different cDNA libraries. For massive EST sequencing, we devised a new system with two major features. First, we constructed cDNA libraries in which the poly(A) tails were removed using nested deletion at the 3′-ends. This permitted high-quality reading of sequences from the 3′-end of the cDNA, which is otherwise difficult to achieve. Second, we increased throughput by sequencing directly on templates generated by colony PCR. Using this system, we determined 600 cDNA sequences per day. The read-out length was >450 bases in >90% of the sequences. Furthermore, we established a data management system for analysis, storage and manipulation of the sequence data. Finally, 16 358 non-redundant ESTs were derived from ~6900 independent genes. These data will facilitate the construction of a precise comparative map across mammalian species and the isolation of functional genes that govern economic traits. This system is applicable to other organisms, including livestock, for which EST data are limited.

12.
13.

Background

Armolipid Plus (AP) is a nutraceutical that contains policosanol, fermented rice with red yeast, berberine, coenzyme Q10, folic acid, and astaxanthin. It has been shown to be effective in reducing plasma LDL cholesterol (LDLc) levels. In the multicenter randomized trial NCT01562080, there was large interindividual variability in the plasma LDLc response to AP supplementation. We hypothesized that the variability in LDLc response to AP supplementation may be linked to LDLR and PCSK9 polymorphisms.

Material and Methods

We sequenced the LDLR 3′ and 5′ untranslated regions (UTR) and the PCSK9 5′ UTR of 102 participants with moderate hypercholesterolemia in trial NCT01562080. In this trial, 50 individuals were treated with AP supplementation and the rest with placebo.

Results

Multiple linear regression analysis, using the response of LDLc levels to AP as the dependent variable, revealed that polymorphisms rs2149041 (c.-3383C>G) in the PCSK9 5′ UTR and rs14158 (c.*52G>A) in the LDLR 3′ UTR explained 14.1% and 6.4%, respectively, of the variability after adjusting for gender, age, and BMI of individuals. Combining polymorphisms rs2149041 and rs14158 explained 20.5% of this variability (p < 0.004).

Conclusions

Three polymorphisms in the 3′ UTR of LDLR, c.*52G>A, c.*504G>A, and c.*773A>G, and two in the 5′ UTR of PCSK9, c.-3383C>G and c.-2063A>G, were associated with response to AP. These results could explain the variability observed in the response to berberine among people with moderate hypercholesterolemia, and they may be useful in identifying patients who could potentially benefit from supplementation with AP.

14.

Background

Pathogenic bacteria infecting both animals and plants use various mechanisms to transport virulence factors across their cell membranes and channel these proteins into the infected host cell. The type III secretion system represents one such mechanism. Proteins transported via this pathway (“effector proteins”) have to be distinguished from all other proteins that are not exported from the bacterial cell. Although a special targeting signal at the N-terminal end of effector proteins has been proposed in the literature, its exact characteristics remain unknown.

Methodology/Principal Findings

In this study, we demonstrate that the signals encoded in the sequences of type III secretion system effectors can be consistently recognized and predicted by machine learning techniques. Known protein effectors were compiled from the literature and sequence databases, and served as training data for artificial neural networks and support vector machine classifiers. Common sequence features were most pronounced in the first 30 amino acids of the effector sequences. The classifiers achieved a cross-validated Matthews correlation coefficient of 0.63 and allowed genome-wide prediction of potential type III secretion system effectors in 705 proteobacterial genomes (12% of proteins predicted as candidates), their chromosomes (11%) and plasmids (13%), as well as in 213 Firmicute genomes (7%).
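For readers unfamiliar with the Matthews correlation coefficient reported above, it summarizes a binary confusion matrix as (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)), ranging from −1 to 1. The following is a minimal sketch of that formula. The confusion-matrix counts in the usage example are hypothetical and are not taken from the study.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient for a binary confusion matrix."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        # Common convention: report 0 when any marginal total is zero.
        return 0.0
    return (tp * tn - fp * fn) / denominator


if __name__ == "__main__":
    # Hypothetical counts for an effector / non-effector classifier.
    print(f"MCC = {mcc(tp=80, tn=85, fp=15, fn=20):.2f}")
```

Unlike plain accuracy, this score stays informative when effectors are a small minority of all proteins, which is the usual situation in genome-wide screens.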

Conclusions/Significance

We present a signal prediction method together with a comprehensive survey of potential type III secretion system effectors extracted from 918 published bacterial genomes. Our study demonstrates that the analyzed signal features are common across a wide range of species, and provides a substantial basis for the identification of exported pathogenic proteins as targets for future therapeutic intervention. The prediction software is publicly accessible from our web server (www.modlab.org).

15.
Microarray-based enrichment of selected genomic loci is a powerful method for genome complexity reduction for next-generation sequencing. Since the vast majority of exons in vertebrate genomes are smaller than 150 nt, we explored the use of short fragment libraries (85–110 bp) to achieve higher enrichment specificity by reducing carryover and adverse effects of flanking intronic sequences. High enrichment specificity (60–75%) was obtained with relatively even base coverage. Up to 98% of the target sequence was covered more than 20× at an average coverage depth of about 200×. To verify the accuracy of SNP/mutation detection, we evaluated 384 known non-reference SNPs in the targeted regions. At ∼200× average sequence coverage, we were able to survey 96.4% of 1.69 Mb of genomic sequence with only 4.2% false negative calls, mostly due to low coverage. Using the same settings, a total of 1197 novel candidate variants were detected. Verification experiments revealed only eight false positive calls, indicating an overall false positive rate of less than 1 per ∼200 000 bp. Taken together, short fragment libraries provide highly efficient and flexible enrichment of exonic targets and yield relatively even base coverage, which facilitates accurate SNP and mutation detection. Raw sequencing data, alignment files and called SNPs have been submitted to the GEO database (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE18542.
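The quoted false positive rate can be checked with simple arithmetic from the figures in the abstract: eight false positive calls over the surveyed portion (96.4%) of a 1.69 Mb target. The sketch below assumes that the surveyed bases are the appropriate denominator, which the abstract does not state explicitly; the function name is illustrative.

```python
def bases_per_false_positive(target_bp, surveyed_fraction, false_positives):
    """Return how many surveyed bases correspond to one false positive call."""
    if false_positives <= 0:
        raise ValueError("false_positives must be positive")
    surveyed_bp = target_bp * surveyed_fraction
    return surveyed_bp / false_positives


if __name__ == "__main__":
    # Figures from the abstract: 1.69 Mb target, 96.4% surveyed, 8 false positives.
    rate = bases_per_false_positive(1_690_000, 0.964, 8)
    print(f"About 1 false positive per {rate:,.0f} bp")
```

Under this assumption the result lands slightly above 200 000 bp per false call, consistent with the abstract's statement of less than 1 per ∼200 000 bp.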

16.

Background

Recombinant human granulocyte-macrophage colony-stimulating factor (rhGM-CSF) is usually administered by injection, and its oral administration in a clinical setting has not yet been reported. Here we demonstrate the bioavailability of orally administered rhGM-CSF in healthy volunteers. The rhGM-CSF was expressed in a Bombyx mori expression system (BmrhGM-CSF).

Methods and Findings

In a single-dose, randomized, open-label, two-period crossover clinical trial, 19 healthy volunteers received BmrhGM-CSF orally (8 µg/kg) and rhGM-CSF by subcutaneous injection (3.75 µg/kg). Serum samples were drawn at 0.0, 0.5, 0.75, 1.0, 1.5, 2.0, 3.0, 4.0, 5.0, 6.0, 8.0, 10.0 and 12.0 h after administration. The hGM-CSF serum concentrations were determined by ELISA. The AUC was calculated using the trapezoid method. The relative bioavailability of BmrhGM-CSF was determined from the ratio of the AUC after oral administration to the AUC after subcutaneous injection. Three volunteers were randomly selected from the 15 orally dosed subjects with ELISA-detectable values. Their serum samples taken at 0.0, 1.0, 2.0, 3.0 and 4.0 h after administration were analyzed by Q-Trap MS/MS TOF. Differential peaks were identified by comparing the spectrogram profiles of the 1.0, 2.0, 3.0 and 4.0 h samples with that of the 0.0 h sample, and were further analyzed using both Enhanced Product Ion (EPI) scanning and Peptide Mass Fingerprinting Analysis. rhGM-CSF was detected in the serum samples of 15 of the 19 volunteers given BmrhGM-CSF. Its bioavailability averaged 1.0%, with a maximum of 3.1%. rhGM-CSF peptide sequences were detected in the serum samples by MS analysis, with sizes ranging from 2,039 to 7,336 Da.
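The AUC and relative bioavailability calculation described above can be sketched as follows. The sampling times and the two doses come from the abstract, but the concentration values are invented for illustration, and dose normalization of the AUC ratio is an assumption (it is common pharmacokinetic practice, yet the abstract only says the AUC ratio was used).

```python
def auc_trapezoid(times_h, concentrations):
    """Area under the concentration-time curve by the linear trapezoid rule."""
    if len(times_h) != len(concentrations):
        raise ValueError("times and concentrations must have equal length")
    area = 0.0
    for i in range(1, len(times_h)):
        width = times_h[i] - times_h[i - 1]
        mean_height = (concentrations[i] + concentrations[i - 1]) / 2.0
        area += width * mean_height
    return area


def relative_bioavailability(auc_oral, dose_oral, auc_sc, dose_sc):
    """Dose-normalized AUC ratio (oral vs. subcutaneous), in percent."""
    return (auc_oral / dose_oral) / (auc_sc / dose_sc) * 100.0


if __name__ == "__main__":
    # Sampling schedule from the abstract (hours after dosing).
    times = [0.0, 0.5, 0.75, 1.0, 1.5, 2.0, 3.0, 4.0, 5.0, 6.0, 8.0, 10.0, 12.0]
    # Hypothetical serum concentrations (arbitrary units), for illustration only.
    oral = [0, 2, 4, 6, 8, 9, 7, 5, 4, 3, 2, 1, 0]
    subcutaneous = [0, 40, 90, 150, 260, 330, 380, 340, 280, 220, 120, 60, 20]
    f_rel = relative_bioavailability(auc_trapezoid(times, oral), 8.0,
                                     auc_trapezoid(times, subcutaneous), 3.75)
    print(f"Relative bioavailability: {f_rel:.2f}%")
```

The dose normalization matters here because the oral dose (8 µg/kg) was more than twice the subcutaneous dose (3.75 µg/kg); a raw AUC ratio would overstate oral uptake.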

Conclusions

The results demonstrate that orally administered BmrhGM-CSF was absorbed into the blood. This study provides an approach for oral administration of rhGM-CSF protein in clinical settings.

Trial Registration

www.chictr.org ChiCTR-TRC-00000107

17.
18.
The effect of light and calcium depletion on in vivo protein phosphorylation was tested using dark-grown roots of Merit corn. Light caused rapid and specific promotion of phosphorylation of three polypeptides. Pretreatment of roots with ethylene glycol bis(β-aminoethyl ether)-N,N,N′,N′-tetraacetic acid and A23187 prevented light-induced changes in protein phosphorylation. We postulate that these changes in protein phosphorylation are involved in the light-induced gravity response.

19.
Alpha-helical transmembrane proteins constitute roughly 30% of a typical genome and are involved in a wide variety of important biological processes including cell signalling, transport of membrane-impermeable molecules and cell recognition. Despite significant efforts to predict transmembrane protein topology, comparatively little attention has been directed toward developing a method to pack the helices together. Here, we present a novel approach to predict lipid exposure, residue contacts, helix-helix interactions and finally the optimal helical packing arrangement of transmembrane proteins. Using molecular dynamics data, we have trained and cross-validated a support vector machine (SVM) classifier to predict per residue lipid exposure with 69% accuracy. This information is combined with additional features to train a second SVM to predict residue contacts which are then used to determine helix-helix interaction with up to 65% accuracy under stringent cross-validation on a non-redundant test set. Our method is also able to discriminate native from decoy helical packing arrangements with up to 70% accuracy. Finally, we employ a force-directed algorithm to construct the optimal helical packing arrangement which demonstrates success for proteins containing up to 13 transmembrane helices. This software is freely available as source code from http://bioinf.cs.ucl.ac.uk/memsat/mempack/.

20.
We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an “encyclopedia” of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.
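A pairwise overlap analysis of the kind described above can be sketched with plain set operations. The abstract does not say how the overlap percentage was defined, so the sketch assumes shared genes divided by the union of the two gene lists (a Jaccard index); the screen names and gene identifiers are hypothetical placeholders.

```python
from itertools import combinations


def pairwise_overlaps(screens):
    """Map each pair of screen names to its overlap fraction.

    Overlap is computed as |A & B| / |A | B| (Jaccard index), which is an
    assumed definition, not one given in the abstract.
    """
    overlaps = {}
    for (name_a, genes_a), (name_b, genes_b) in combinations(screens.items(), 2):
        union = genes_a | genes_b
        shared = genes_a & genes_b
        overlaps[(name_a, name_b)] = len(shared) / len(union) if union else 0.0
    return overlaps


if __name__ == "__main__":
    # Hypothetical hit lists from three screens.
    screens = {
        "screen_1": {"GENE_A", "GENE_B", "GENE_C"},
        "screen_2": {"GENE_C", "GENE_D"},
        "screen_3": {"GENE_E", "GENE_A"},
    }
    for pair, fraction in pairwise_overlaps(screens).items():
        print(pair, f"{fraction:.0%}")
```

Other denominators, such as the size of the smaller list, would give larger percentages, so the chosen definition should be stated whenever overlaps between screens are compared.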
