首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 16 毫秒
1.
正Dear Editor,An increasing number of single-nucleotide polymorphisms(SNP)have been identified and extensively used in genetics studies with advances in next-generation sequencing(NGS)and SNP arrays.However,due to frequent calling errors,the accuracy of SNPs of interest must usually be validated.In recent years,the single-nucleotide polymorphism genotyping by fluorescent competitive allele-specific polymerase chain reaction(Ni PTag,the commercial technology named KASP),a relatively new and powerful SNP genotyping  相似文献   

2.

Background

Set comparisons permeate a large number of data analysis workflows, in particular workflows in biological sciences. Venn diagrams are frequently employed for such analysis but current tools are limited.

Results

We have developed InteractiVenn, a more flexible tool for interacting with Venn diagrams including up to six sets. It offers a clean interface for Venn diagram construction and enables analysis of set unions while preserving the shape of the diagram. Set unions are useful to reveal differences and similarities among sets and may be guided in our tool by a tree or by a list of set unions. The tool also allows obtaining subsets’ elements, saving and loading sets for further analyses, and exporting the diagram in vector and image formats. InteractiVenn has been used to analyze two biological datasets, but it may serve set analysis in a broad range of domains.

Conclusions

InteractiVenn allows set unions in Venn diagrams to be explored thoroughly, by consequence extending the ability to analyze combinations of sets with additional observations, yielded by novel interactions between joined sets. InteractiVenn is freely available online at: www.interactivenn.net.  相似文献   

3.
SNP2CAPS: a SNP and INDEL analysis tool for CAPS marker development   总被引:7,自引:0,他引:7  
With the influx of various SNP genotyping assays in recent years, there has been a need for an assay that is robust, yet cost effective, and could be performed using standard gel-based procedures. In this context, CAPS markers have been shown to meet these criteria. However, converting SNPs to CAPS markers can be a difficult process if done manually. In order to address this problem, we describe a computer program, SNP2CAPS, that facilitates the computational conversion of SNP markers into CAPS markers. 413 multiple aligned sequences derived from barley ESTs were analysed for the presence of polymorphisms in 235 distinct restriction sites. 282 (90%) of 314 alignments that contain sequence variation due to SNPs and InDels revealed at least one polymorphic restriction site. After reducing the number of restriction enzymes from 235 to 10, 31% of the polymorphic sites could still be detected. In order to demonstrate the usefulness of this tool for marker development, we experimentally validated some of the results predicted by SNP2CAPS.  相似文献   

4.
Mass spectrometry for genotyping: an emerging tool for molecular medicine   总被引:4,自引:0,他引:4  
Recent technological innovations have made proteins and nucleic acids accessible to mass spectrometric analysis. As a result of their inherently high specificity, accuracy and throughput, there is considerable interest in developing mass spectrometric methods for genotype analysis in clinical diagnostic and research applications. This review outlines some of the most promising genotyping methods developed using electrospray and matrix-assisted laser-desorption-ionization mass spectrometry.  相似文献   

5.

Background

Genomic selection (GS) promises to improve accuracy in estimating breeding values and genetic gain for quantitative traits compared to traditional breeding methods. Its reliance on high-throughput genome-wide markers and statistical complexity, however, is a serious challenge in data management, analysis, and sharing. A bioinformatics infrastructure for data storage and access, and user-friendly web-based tool for analysis and sharing output is needed to make GS more practical for breeders.

Results

We have developed a web-based tool, called solGS, for predicting genomic estimated breeding values (GEBVs) of individuals, using a Ridge-Regression Best Linear Unbiased Predictor (RR-BLUP) model. It has an intuitive web-interface for selecting a training population for modeling and estimating genomic estimated breeding values of selection candidates. It estimates phenotypic correlation and heritability of traits and selection indices of individuals. Raw data is stored in a generic database schema, Chado Natural Diversity, co-developed by multiple database groups. Analysis output is graphically visualized and can be interactively explored online or downloaded in text format. An instance of its implementation can be accessed at the NEXTGEN Cassava breeding database, http://cassavabase.org/solgs.

Conclusions

solGS enables breeders to store raw data and estimate GEBVs of individuals online, in an intuitive and interactive workflow. It can be adapted to any breeding program.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0398-7) contains supplementary material, which is available to authorized users.  相似文献   

6.
TRFMA provides a Web environment for analyzing T-RFLP results based on molecular weights of the fragments, rather than the numbers of nucleotides, to increase accuracy. The 16S rRNA data are saved as an XML file containing around 650 sequences (light version) and a MySQL database containing around 50 000 sequences (full version), which are connected to Web server via PHP5 and manipulated on an Internet browser. AVAILABILITY: TRFMA is freely available at http://myamagu.dent.kyushu-u.ac.jp/bioinformatics/trfma/index.html and can be downloaded from the same site.  相似文献   

7.

Background

As Next-Generation Sequencing data becomes available, existing hardware environments do not provide sufficient storage space and computational power to store and process the data due to their enormous size. This is and will be a frequent problem that is encountered everyday by researchers who are working on genetic data. There are some options available for compressing and storing such data, such as general-purpose compression software, PBAT/PLINK binary format, etc. However, these currently available methods either do not offer sufficient compression rates, or require a great amount of CPU time for decompression and loading every time the data is accessed.

Results

Here, we propose a novel and simple algorithm for storing such sequencing data. We show that, the compression factor of the algorithm ranges from 16 to several hundreds, which potentially allows SNP data of hundreds of Gigabytes to be stored in hundreds of Megabytes. We provide a C++ implementation of the algorithm, which supports direct loading and parallel loading of the compressed format without requiring extra time for decompression. By applying the algorithm to simulated and real datasets, we show that the algorithm gives greater compression rate than the commonly used compression methods, and the data-loading process takes less time. Also, The C++ library provides direct-data-retrieving functions, which allows the compressed information to be easily accessed by other C++ programs.

Conclusions

The SpeedGene algorithm enables the storage and the analysis of next generation sequencing data in current hardware environment, making system upgrades unnecessary.  相似文献   

8.
Restriction enzyme-detectable polymorphisms have been used for assessing genetic differences and generating informative genetic markers. The most detailed fingerprinting analyses have been obtained using the AFLP (amplified fragment length polymorphism) technique, which accesses subsets of polymorphisms at one or two restriction sites. To combine increased discriminatory power with the stringency of polymerase chain reaction amplification, it would be beneficial to access additional independent restriction sites per analysis, and to amplify subsets of DNA restriction fragments with only one pair of oligonucleotide primers. We have now developed a unique approach that permits the simultaneous use of four or more endonucleases in combination with one pair of adapters/primers, and applied it to genotype 21 trypanosome populations to subspecific level. The approach takes advantage of the fact that some endonucleases create cohesive ends that are compatible with the overhang sites created by other endonucleases. We demonstrate the greater resolution of identifiable polymorphic fragments over the conventional ligation-mediated restriction analysis method, and discuss the value of the approach as a tool for fine genetic mapping of Trypanosoma brucei. Finally, we propose use of the method for fine characterisation and for identifying co-dominant genetic markers in a variety of other taxa. Edited by: W. HennigAn erratum to this article can be found at  相似文献   

9.
10.
We report here the release of a web-based tool (MDDNA) to study and model the fine structural details of DNA on the basis of data extracted from a set of molecular dynamics (MD) trajectories of DNA sequences involving all the unique tetranucleotides. The dynamic web interface can be employed to analyze the first neighbor sequence context effects on the 10 unique dinucleotide steps of DNA. Functionality is included to build all atom models of any user-defined sequence based on the MD results. The backend of this interface is a relational database storing the conformational details of DNA obtained in 39 different MD simulation trajectories comprising all the 136 unique tetranucleotide steps. Examples of the use of this data to predict DNA structures are included. Availability: http://humphry.chem.wesleyan.edu:8080/MDDNA. Supplementary information: Supplementary data including color figures are available at Bioinformatics online.  相似文献   

11.
NetAlign is a web-based tool designed to enable comparative analysis of protein interaction networks (PINs). NetAlign compares a query PIN with a target PIN by combining interaction topology and sequence similarity to identify conserved network substructures (CoNSs), which may derive from a common ancestor and disclose conserved topological organization of interactions in evolution. To exemplify the application of NetAlign, we perform two genome-scale comparisons with (1) the Escherichia coli PIN against the Helicobacter pylori PIN and (2) the Saccharomyces cerevisiae PIN against the Caenorrhabditis elegans PIN. Many of the identified CoNSs correspond to known complexes; therefore, cross-species PIN comparison provides a way for discovery of conserved modules. In addition, based on the species-to-species differences in CoNSs, we reformulate the problems of protein-protein interaction (PPI) prediction and species divergence from a network perspective. AVAILABILITY: http://www1.ustc.edu.cn/lab/pcrystal/NetAlign.  相似文献   

12.
BarleyExpress is a web-based microarray experiment data submission tool for BarleyBase, a public data resource of Affymetrix GeneChip data for plants. BarleyExpress uses the Plant Ontology vocabularies and enhances the MIAME guidelines to standardize the annotation of microarray gene expression experiments. In addition, BarleyExpress provides explicit support for factorial experiment design and template loading methods to ease the submission process for large experiments. AVAILABILITY: http://barleybase.org SUPPLEMENTARY INFORMATION: BarleyExpress Users Manual.  相似文献   

13.
Minuk L  Civetta A 《Génome》2011,54(4):341-347
The recent completion of genome sequencing of 12 species of Drosophila has provided a powerful resource for hypothesis testing, as well as the development of technical tools. Here we take advantage of genome sequence data from two closely related species of Drosophila, Drosophila simulans and Drosophila sechellia, to quickly identify candidate molecular markers for genotyping based on expected insertion or deletion (indel) differences between species. Out of 64 candidate molecular markers selected along the second and third chromosome of Drosophila, 51 molecular markers were validated using PCR and gel electrophoresis. We found that the 20% error rate was due to sequencing errors in the genome data, although we cannot rule out possible indel polymorphisms. The approach has the advantage of being affordable and quick, as it only requires the use of bioinformatics tools for predictions and a PCR and agarose gel based assay for validation. Moreover, the approach could be easily extended to a wide variety of taxa with the only limitation being the availability of complete or partial genome sequence data.  相似文献   

14.
Toxin/antitoxin (TA) systems, viewed as essential regulators of growth arrest and programmed cell death, are widespread among prokaryotes, but remain sparsely annotated. We present RASTA-Bacteria, an automated method allowing quick and reliable identification of TA loci in sequenced prokaryotic genomes, whether they are annotated open reading frames or not. The tool successfully confirmed all reported TA systems, and spotted new putative loci upon screening of sequenced genomes. RASTA-Bacteria is publicly available at .  相似文献   

15.
SMART: a web-based tool for the study of genetically mobile domains   总被引:61,自引:2,他引:59  
SMART (a Simple Modular Architecture Research Tool) allows the identification and annotation of genetically mobile domains and the analysis of domain architectures (http://SMART.embl-heidelberg.de ). More than 400 domain families found in signalling, extra-cellular and chromatin-associated proteins are detectable. These domains are extensively annotated with respect to phyletic distributions, functional class, tertiary structures and functionally important residues. Each domain found in a non-redundant protein database as well as search parameters and taxonomic information are stored in a relational database system. User interfaces to this database allow searches for proteins containing specific combinations of domains in defined taxa.  相似文献   

16.
17.

Background  

The functional annotation of proteins relies on published information concerning their close and remote homologues in sequence databases. Evidence for remote sequence similarity can be further strengthened by a similar biological background of the query sequence and identified database sequences. However, few tools exist so far, that provide a means to include functional information in sequence database searches.  相似文献   

18.
19.
SUMMARY: With the availability of whole genome sequence in many species, linkage analysis, positional cloning and microarray are gradually becoming powerful tools for investigating the links between phenotype and genotype or genes. However, in these methods, causative genes underlying a quantitative trait locus, or a disease, are usually located within a large genomic region or a large set of genes. Examining the function of every gene is very time consuming and needs to retrieve and integrate the information from multiple databases or genome resources. PGMapper is a software tool for automatically matching phenotype to genes from a defined genome region or a group of given genes by combining the mapping information from the Ensembl database and gene function information from the OMIM and PubMed databases. PGMapper is currently available for candidate gene search of human, mouse, rat, zebrafish and 12 other species. AVAILABILITY: Available online at http://www.genediscovery.org/pgmapper/index.jsp.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号