共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Cava Claudia Zoppis Italo Gariboldi Manuela Castiglioni Isabella Mauri Giancarlo Antoniotti Marco 《Journal of clinical bioinformatics》2014,4(1):1-13
Background
The human body plays host to a vast array of bacteria, found in oral cavities, skin, gastrointestinal tract and the vagina. Some bacteria are harmful while others are beneficial to the host. Despite the availability of many methods to identify bacteria, most of them are only applicable to specific and cultivable bacteria and are also tedious. Based on high throughput sequencing technology, this work derives 16S rRNA sequences of bacteria and analyzes probiotics and pathogens species.Results
We constructed a database that recorded the species of probiotics and pathogens from literature, along with a modified Smith-Waterman algorithm for assigning the taxonomy of the sequenced 16S rRNA sequences. We also constructed a bacteria disease risk model for seven diseases based on 98 samples. Applicability of the proposed platform is demonstrated by collecting the microbiome in human gut of 13 samples.Conclusions
The proposed platform provides a relatively easy means of identifying a certain amount of bacteria and their species (including uncultivable pathogens) for clinical microbiology applications. That is, detecting how probiotics and pathogens inhabit humans and how affect their health can significantly contribute to develop a diagnosis and treatment method. 相似文献3.
Use of molecular beacons to verify that the serine hydroxymethyltransferase pseudogene SHMT-ps1 is unique to the order Primates
下载免费PDF全文
![点击此处可从《Genome biology》网站下载免费的PDF全文](/ch/ext_images/free.gif)
Devor EJ 《Genome biology》2001,2(2):research0006.1-research00065
Background
The serine hydroxymethyltransferase processed pseudogene SHMT-ps1 has been suggested to be unique to the order Primates because of the failure to amplify this sequence by PCR from genomic DNAs of any non-primate mammal species. Here, 'molecular beacon' probes specific to SHMT-ps1 were used in an attempt to verify this suggestion.Results
In a search for SHMT-ps1-specific sequences using molecular beacons across a range of mammalian species, SHMT-ps1 was only found in primates. The molecular beacon assays also showed that SHMT-ps1 is present in both Old World and New World species but not among prosimians.Conclusions
These results suggest that SHMT-ps1 originated close to the origin of the Anthropoidea, some 40 to 50 million years ago. 相似文献4.
Background
Diverse plant and animal species have B chromosomes, also known as accessory, extra or supernumerary chromosomes. Despite being widely distributed among different taxa, the genomic nature and genetic behavior of B chromosomes are still poorly understood.Results
In this study we describe the occurrence of B chromosomes in the African cichlid fish Haplochromis obliquidens. One or two large B chromosome(s) occurring in 39.6% of the analyzed individuals (both male and female) were identified. To better characterize the karyotype and assess the nature of the B chromosomes, fluorescence in situ hybridization (FISH) was performed using probes for telomeric DNA repeats, 18S and 5S rRNA genes, SATA centromeric satellites, and bacterial artificial chromosomes (BACs) enriched in repeated DNA sequences. The B chromosomes are enriched in repeated DNAs, especially non-active 18S rRNA gene-like sequences.Conclusion
Our results suggest that the B chromosome could have originated from rDNA bearing subtelo/acrocentric A chromosomes through formation of an isochromosome, or by accumulation of repeated DNAs and rRNA gene-like sequences in a small proto-B chromosome derived from the A complement. 相似文献5.
Association between fatty acid compositions and genotypes of FABP4 and LXR-alpha in Japanese Black cattle 总被引:4,自引:0,他引:4
Shogo Hoashi Tomoko Hinenoya Atsuko Tanaka Hideki Ohsaki Shinji Sasazaki Masaaki Taniguchi Kenji Oyama Fumio Mukai Hideyuki Mannen 《BMC genetics》2008,9(1):1-7
Background
Single nucleotide polymorphisms (SNPs) and small insertions or deletions (indels) are the most common type of polymorphisms and are frequently used for molecular marker development. Such markers have become very popular for all kinds of genetic analysis, including haplotype reconstruction. Haplotypes can be reconstructed for whole chromosomes but also for specific genes, based on the SNPs present. Haplotypes in the latter context represent the different alleles of a gene. The computational approach to SNP mining is becoming increasingly popular because of the continuously increasing number of sequences deposited in databases, which allows a more accurate identification of SNPs. Several software packages have been developed for SNP mining from databases. From these, QualitySNP is the only tool that combines SNP detection with the reconstruction of alleles, which results in a lower number of false positive SNPs and also works much faster than other programs. We have build a web-based SNP discovery and allele detection tool (HaploSNPer) based on QualitySNP.Results
HaploSNPer is a flexible web-based tool for detecting SNPs and alleles in user-specified input sequences from both diploid and polyploid species. It includes BLAST for finding homologous sequences in public EST databases, CAP3 or PHRAP for aligning them, and QualitySNP for discovering reliable allelic sequences and SNPs. All possible and reliable alleles are detected by a mathematical algorithm using potential SNP information. Reliable SNPs are then identified based on the reconstructed alleles and on sequence redundancy.Conclusion
Thorough testing of HaploSNPer (and the underlying QualitySNP algorithm) has shown that EST information alone is sufficient for the identification of alleles and that reliable SNPs can be found efficiently. Furthermore, HaploSNPer supplies a user friendly interface for visualization of SNP and alleles. HaploSNPer is available from http://www.bioinformatics.nl/tools/haplosnper/. 相似文献6.
Hye-young Wang Sunghyun Kim Hyunjung Kim Jungho Kim Yeun Kim Soon-Deok Park Hyunwoo Jin Yeonim Choi Young Uh Hyeyoung Lee 《Annals of clinical microbiology and antimicrobials》2014,13(1):1-10
Background
Sepsis is one of the main causes of mortality and morbidity. The rapid detection of pathogens in blood of septic patients is essential for adequate antimicrobial therapy and better prognosis. This study aimed to accelerate the detection and discrimination of Gram-positive (GP) and Gram-negative (GN) bacteria and Candida species in blood culture samples by molecular methods.Methods
The Real-GP®, -GN®, and -CAN® real-time PCR kit (M&D, Wonju, Republic of Korea) assays use the TaqMan probes for detecting pan-GP, pan-GN, and pan-Candida species, respectively. The diagnostic performances of the real-time PCR kits were evaluated with 115 clinical isolates, 256 positive and 200 negative blood culture bottle samples, and the data were compared to results obtained from conventional blood culture.Results
Eighty-seven reference strains and 115 clinical isolates were correctly identified with specific probes corresponding to GP-bacteria, GN-bacteria and Candida, respectively. The overall sensitivity and specificity of the real-time PCR kit with blood culture samples were 99.6% and 89.5%, respectively.Conclusions
The Real-GP®, -GN®, and -CAN® real-time PCR kits could be useful tools for the rapid and accurate screening of bloodstream infections (BSIs). 相似文献7.
8.
Background
We previously developed the DBRF-MEGN (difference-based regulation finding-minimum equivalent gene network) method, which deduces the most parsimonious signed directed graphs (SDGs) consistent with expression profiles of single-gene deletion mutants. However, until the present study, we have not presented the details of the method's algorithm or a proof of the algorithm.Results
We describe in detail the algorithm of the DBRF-MEGN method and prove that the algorithm deduces all of the exact solutions of the most parsimonious SDGs consistent with expression profiles of gene deletion mutants.Conclusions
The DBRF-MEGN method provides all of the exact solutions of the most parsimonious SDGs consistent with expression profiles of gene deletion mutants. 相似文献9.
Background
Genomic islands play an important role in medical, methylation and biological studies. To explore the region, we propose a CpG islands prediction analysis platform for genome sequence exploration (CpGPAP).Results
CpGPAP is a web-based application that provides a user-friendly interface for predicting CpG islands in genome sequences or in user input sequences. The prediction algorithms supported in CpGPAP include complementary particle swarm optimization (CPSO), a complementary genetic algorithm (CGA) and other methods (CpGPlot, CpGProD and CpGIS) found in the literature. The CpGPAP platform is easy to use and has three main features (1) selection of the prediction algorithm; (2) graphic visualization of results; and (3) application of related tools and dataset downloads. These features allow the user to easily view CpG island results and download the relevant island data. CpGPAP is freely available at http://bio.kuas.edu.tw/CpGPAP/.Conclusions
The platform's supported algorithms (CPSO and CGA) provide a higher sensitivity and a higher correlation coefficient when compared to CpGPlot, CpGProD, CpGIS, and CpGcluster over an entire chromosome. 相似文献10.
Jaturon Harnsomburana Jason M Green Adrian S Barb Mary Schaeffer Leszek Vincent Chi-Ren Shyu 《BMC bioinformatics》2011,12(1):1-21
Background
Gene regulatory networks have an essential role in every process of life. In this regard, the amount of genome-wide time series data is becoming increasingly available, providing the opportunity to discover the time-delayed gene regulatory networks that govern the majority of these molecular processes.Results
This paper aims at reconstructing gene regulatory networks from multiple genome-wide microarray time series datasets. In this sense, a new model-free algorithm called GRNCOP2 (Gene Regulatory Network inference by Combinatorial OPtimization 2), which is a significant evolution of the GRNCOP algorithm, was developed using combinatorial optimization of gene profile classifiers. The method is capable of inferring potential time-delay relationships with any span of time between genes from various time series datasets given as input. The proposed algorithm was applied to time series data composed of twenty yeast genes that are highly relevant for the cell-cycle study, and the results were compared against several related approaches. The outcomes have shown that GRNCOP2 outperforms the contrasted methods in terms of the proposed metrics, and that the results are consistent with previous biological knowledge. Additionally, a genome-wide study on multiple publicly available time series data was performed. In this case, the experimentation has exhibited the soundness and scalability of the new method which inferred highly-related statistically-significant gene associations.Conclusions
A novel method for inferring time-delayed gene regulatory networks from genome-wide time series datasets is proposed in this paper. The method was carefully validated with several publicly available data sets. The results have demonstrated that the algorithm constitutes a usable model-free approach capable of predicting meaningful relationships between genes, revealing the time-trends of gene regulation. 相似文献11.
Background and Aims
The role and linkage of endophytic bacteria to resistance of peanut seeds to biotic stress is poorly understood. The aims of the present study were to survey the experimental (axenic) and control (conventional) peanut plants for the predominant endophytic bacteria, and to characterize isolates with activity against selected A. flavus strains.Methods
Young axenic plants were grown from presumably bacteria-free embryos in the lab, and then they were grown in a field. Endophytic bacterial species were identified by the analysis of DNA sequences of their 16S-ribosomal RNA gene. DNA extracted from soil was also analyzed for predominant bacteria.Results
Mature seeds from the experimental and control plants contained several species of nonpathogenic endophytic bacteria. Among the eight bacterial species isolated from seeds, and DNA sequences detected in soil, Bacillus thuringiensis was dominant. All B. amyloliquefaciens isolates, the second abundant species in seeds demonstrated activity against A. flavus. This effect was not observed with any other bacterial isolates. There was no significant difference in number and relative occurrence of the two major bacterial species between the experimental and conventionally grown control seeds.Conclusion
Endophytic bacterial colonization derives from local soil and not from the seed source, and the peanut plant accommodates only selected species of bacteria from diverse soil populations. Some bacterial isolates showed antibiosis against A. flavus. 相似文献12.
Background
The availability of sequences from whole genomes to reconstruct the tree of life has the potential to enable the development of phylogenomic hypotheses in ways that have not been before possible. A significant bottleneck in the analysis of genomic-scale views of the tree of life is the time required for manual curation of genomic data into multi-gene phylogenetic matrices.Results
To keep pace with the exponentially growing volume of molecular data in the genomic era, we have developed an automated technique, ASAP (Automated Simultaneous Analysis Phylogenetics), to assemble these multigene/multi species matrices and to evaluate the significance of individual genes within the context of a given phylogenetic hypothesis.Conclusion
Applications of ASAP may enable scientists to re-evaluate species relationships and to develop new phylogenomic hypotheses based on genome-scale data. 相似文献13.
14.
Background
With an increasing number of plant genome sequences, it has become important to develop a robust computational method for detecting plant promoters. Although a wide variety of programs are currently available, prediction accuracy of these still requires further improvement. The limitations of these methods can be addressed by selecting appropriate features for distinguishing promoters and non-promoters.Methods
In this study, we proposed two feature selection approaches based on hexamer sequences: the Frequency Distribution Analyzed Feature Selection Algorithm (FDAFSA) and the Random Triplet Pair Feature Selecting Genetic Algorithm (RTPFSGA). In FDAFSA, adjacent triplet-pairs (hexamer sequences) were selected based on the difference in the frequency of hexamers between promoters and non-promoters. In RTPFSGA, random triplet-pairs (RTPs) were selected by exploiting a genetic algorithm that distinguishes frequencies of non-adjacent triplet pairs between promoters and non-promoters. Then, a support vector machine (SVM), a nonlinear machine-learning algorithm, was used to classify promoters and non-promoters by combining these two feature selection approaches. We referred to this novel algorithm as PromoBot.Results
Promoter sequences were collected from the PlantProm database. Non-promoter sequences were collected from plant mRNA, rRNA, and tRNA of PlantGDB and plant miRNA of miRBase. Then, in order to validate the proposed algorithm, we applied a 5-fold cross validation test. Training data sets were used to select features based on FDAFSA and RTPFSGA, and these features were used to train the SVM. We achieved 89% sensitivity and 86% specificity.Conclusions
We compared our PromoBot algorithm to five other algorithms. It was found that the sensitivity and specificity of PromoBot performed well (or even better) with the algorithms tested. These results show that the two proposed feature selection methods based on hexamer frequencies and random triplet-pair could be successfully incorporated into a supervised machine learning method in promoter classification problem. As such, we expect that PromoBot can be used to help identify new plant promoters. Source codes and analysis results of this work could be provided upon request. 相似文献15.
Background
High-density oligonucleotide arrays have become a valuable tool for high-throughput gene expression profiling. Increasing the array information density and improving the analysis algorithms are two important computational research topics.Results
A new algorithm, Match-Only Integral Distribution (MOID), was developed to analyze high-density oligonucleotide arrays. Using known data from both spiking experiments and no-change experiments performed with Affymetrix GeneChip® arrays, MOID and the Affymetrix algorithm implemented in Microarray Suite 4.0 (MAS4) were compared. While MOID gave similar performance to MAS4 in the spiking experiments, better performance was observed in the no-change experiments. MOID also provides a set of alternative statistical analysis tools to MAS4. There are two main features that distinguish MOID from MAS4. First, MOID uses continuous P values for the likelihood of gene presence, while MAS4 resorts to discrete absolute calls. Secondly, MOID uses heuristic confidence intervals for both gene expression levels and fold change values, while MAS4 categorizes the significance of gene expression level changes into discrete fold change calls.Conclusions
The results show that by using MOID, Affymetrix GeneChip® arrays may need as little as ten probes per gene without compromising analysis accuracy. 相似文献16.
Background
DNA copy number alterations are one of the main characteristics of the cancer cell karyotype and can contribute to the complex phenotype of these cells. These alterations can lead to gains in cellular oncogenes as well as losses in tumor suppressor genes and can span small intervals as well as involve entire chromosomes. The ability to accurately detect these changes is central to understanding how they impact the biology of the cell.Results
We describe a novel algorithm called CARAT (Copy Number Analysis with Regression And Tree) that uses probe intensity information to infer copy number in an allele-specific manner from high density DNA oligonuceotide arrays designed to genotype over 100, 000 SNPs. Total and allele-specific copy number estimations using CARAT are independently evaluated for a subset of SNPs using quantitative PCR and allelic TaqMan reactions with several human breast cancer cell lines. The sensitivity and specificity of the algorithm are characterized using DNA samples containing differing numbers of X chromosomes as well as a test set of normal individuals. Results from the algorithm show a high degree of agreement with results from independent verification methods.Conclusion
Overall, CARAT automatically detects regions with copy number variations and assigns a significance score to each alteration as well as generating allele-specific output. When coupled with SNP genotype calls from the same array, CARAT provides additional detail into the structure of genome wide alterations that can contribute to allelic imbalance. 相似文献17.
Background
Drosophila mojavensishas been a model system for genetic studies of ecological adaptation and speciation. However, despite its use for over half a century, no linkage map has been produced for this species or its close relatives.Results
We have developed and mapped 90 microsatellites in D. mojavensis, and we present a detailed recombinational linkage map of 34 of these microsatellites. A slight excess of repetitive sequence was observed on the X-chromosome relative to the autosomes, and the linkage groups have a greater recombinational length than the homologous D. melanogaster chromosome arms. We also confirmed the conservation of Muller's elements in 23 sequences between D. melanogaster and D. mojavensis.Conclusions
The microsatellite primer sequences and localizations are presented here and made available to the public. This map will facilitate future quantitative trait locus mapping studies of phenotypes involved in adaptation or reproductive isolation using this species. 相似文献18.
Ricardo Gutiérrez Francisco Gómez Lucía Roa-Peña Eduardo Romero 《Diagnostic pathology》2011,6(1):1-14
Background
Expectation maximizing (EM) is one of the common approaches for image segmentation.Methods
an improvement of the EM algorithm is proposed and its effectiveness for MRI brain image segmentation is investigated. In order to improve EM performance, the proposed algorithms incorporates neighbourhood information into the clustering process. At first, average image is obtained as neighbourhood information and then it is incorporated in clustering process. Also, as an option, user-interaction is used to improve segmentation results. Simulated and real MR volumes are used to compare the efficiency of the proposed improvement with the existing neighbourhood based extension for EM and FCM.Results
the findings show that the proposed algorithm produces higher similarity index.Conclusions
experiments demonstrate the effectiveness of the proposed algorithm in compare to other existing algorithms on various noise levels. 相似文献19.