期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Characterization of Aqueous Dispersions and Gels Made of Sodium Caseinate and Basil Seed Gum: Phase Behavior,Rheology, and Microstructure

Sarabi-Aghdam Vahideh Hosseini-Parvar Seyed H. Motamedzadegan Ali Razi Saeed Mirarab Rashidinejad Ali 《Food biophysics》2020,15(4):495-508

The interactions between sodium caseinate (NaCas) and basil seed gum (BSG) in the presence of calcium chloride (CaCl₂) were investigated. The phase behavior of the mixed aqueous dispersions and their gels revealed a homogeneous mixture, obtained at the higher concentrations of both CaCl₂ and BSG. The Herschel-Bulkley model sufficiently fitted the flow behavior of the mixture solution data. Apparent viscosity increased significantly (p < 0.05) by increasing the concentration of BSG, where the addition of CaCl₂ had no significant effect on the viscosity of the samples (p > 0.05). Furthermore, there was an increase in thixotropy due to the higher concentrations of BSG and CaCl₂. Based on the frequency sweep test, at the low frequencies, a more gel-like behavior was observed in the case of the higher concentrations of either BSG or CaCl₂. The rheological and SEM data suggested that the stronger structure of NaCas-BSG gel in the presence of the higher concentrations of CaCl₂ was related to the induction of complex formation between the two biopolymers.

相似文献

2.

HIPPI: highly accurate protein family classification with ensembles of HMMs

Nguyen Nam-phuong Nute Michael Mirarab Siavash Warnow Tandy 《BMC genomics》2016,17(10):765-100

Background

Given a new biological sequence, detecting membership in a known family is a basic step in many bioinformatics analyses, with applications to protein structure and function prediction and metagenomic taxon identification and abundance profiling, among others. Yet family identification of sequences that are distantly related to sequences in public databases or that are fragmentary remains one of the more difficult analytical problems in bioinformatics.

Results

We present a new technique for family identification called HIPPI (Hierarchical Profile Hidden Markov Models for Protein family Identification). HIPPI uses a novel technique to represent a multiple sequence alignment for a given protein family or superfamily by an ensemble of profile hidden Markov models computed using HMMER. An evaluation of HIPPI on the Pfam database shows that HIPPI has better overall precision and recall than blastp, HMMER, and pipelines based on HHsearch, and maintains good accuracy even for fragmentary query sequences and for protein families with low average pairwise sequence identity, both conditions where other methods degrade in accuracy.

Conclusion

HIPPI provides accurate protein family identification and is robust to difficult model conditions. Our results, combined with observations from previous studies, show that ensembles of profile Hidden Markov models can better represent multiple sequence alignments than a single profile Hidden Markov model, and thus can improve downstream analyses for various bioinformatic tasks. Further research is needed to determine the best practices for building the ensemble of profile Hidden Markov models. HIPPI is available on GitHub at https://github.com/smirarab/sepp.

相似文献

3.

Anchoring quartet-based phylogenetic distances and applications to species tree reconstruction

Sayyari Erfan Mirarab Siavash 《BMC genomics》2016,17(10):783-113

Background

Inferring species trees from gene trees using the coalescent-based summary methods has been the subject of much attention, yet new scalable and accurate methods are needed.

Results

We introduce DISTIQUE, a new statistically consistent summary method for inferring species trees from gene trees under the coalescent model. We generalize our results to arbitrary phylogenetic inference problems; we show that two arbitrarily chosen leaves, called anchors, can be used to estimate relative distances between all other pairs of leaves by inferring relevant quartet trees. This results in a family of distance-based tree inference methods, with running times ranging between quadratic to quartic in the number of leaves.

Conclusions

We show in simulated studies that DISTIQUE has comparable accuracy to leading coalescent-based summary methods and reduced running times.

相似文献

4.

MRL and SuperFine+MRL: new supertree methods

Nguyen N Mirarab S Warnow T 《Algorithms for molecular biology : AMB》2012,7(1):3-13

Background

Supertree methods combine trees on subsets of the full taxon set together to produce a tree on the entire set of taxa. Of the many supertree methods, the most popular is MRP (Matrix Representation with Parsimony), a method that operates by first encoding the input set of source trees by a large matrix (the "MRP matrix") over {0,1, ?}, and then running maximum parsimony heuristics on the MRP matrix. Experimental studies evaluating MRP in comparison to other supertree methods have established that for large datasets, MRP generally produces trees of equal or greater accuracy than other methods, and can run on larger datasets. A recent development in supertree methods is SuperFine+MRP, a method that combines MRP with a divide-and-conquer approach, and produces more accurate trees in less time than MRP. In this paper we consider a new approach for supertree estimation, called MRL (Matrix Representation with Likelihood). MRL begins with the same MRP matrix, but then analyzes the MRP matrix using heuristics (such as RAxML) for 2-state Maximum Likelihood.

Results

We compared MRP and SuperFine+MRP with MRL and SuperFine+MRL on simulated and biological datasets. We examined the MRP and MRL scores of each method on a wide range of datasets, as well as the resulting topological accuracy of the trees. Our experimental results show that MRL, coupled with a very good ML heuristic such as RAxML, produced more accurate trees than MRP, and MRL scores were more strongly correlated with topological accuracy than MRP scores.

Conclusions

SuperFine+MRP, when based upon a good MP heuristic, such as TNT, produces among the best scores for both MRP and MRL, and is generally faster and more topologically accurate than other supertree methods we tested. 相似文献

5.

Corrigendum to: ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy

Chao Zhang Celine Scornavacca Erin K Molloy Siavash Mirarab 《Molecular biology and evolution》2021,38(10):4655

相似文献

6.

Log Transformation Improves Dating of Phylogenies

Uyen Mai Siavash Mirarab 《Molecular biology and evolution》2021,38(3):1151

Phylogenetic trees inferred from sequence data often have branch lengths measured in the expected number of substitutions and therefore, do not have divergence times estimated. These trees give an incomplete view of evolutionary histories since many applications of phylogenies require time trees. Many methods have been developed to convert the inferred branch lengths from substitution unit to time unit using calibration points, but none is universally accepted as they are challenged in both scalability and accuracy under complex models. Here, we introduce a new method that formulates dating as a nonconvex optimization problem where the variance of log-transformed rate multipliers is minimized across the tree. On simulated and real data, we show that our method, wLogDate, is often more accurate than alternatives and is more robust to various model assumptions. 相似文献

7.

Weighted Statistical Binning: Enabling Statistically Consistent Genome-Scale Phylogenetic Analyses

Md Shamsuzzoha Bayzid Siavash Mirarab Bastien Boussau Tandy Warnow 《PloS one》2015,10(6)

Because biological processes can result in different loci having different evolutionary histories, species tree estimation requires multiple loci from across multiple genomes. While many processes can result in discord between gene trees and species trees, incomplete lineage sorting (ILS), modeled by the multi-species coalescent, is considered to be a dominant cause for gene tree heterogeneity. Coalescent-based methods have been developed to estimate species trees, many of which operate by combining estimated gene trees, and so are called "summary methods". Because summary methods are generally fast (and much faster than more complicated coalescent-based methods that co-estimate gene trees and species trees), they have become very popular techniques for estimating species trees from multiple loci. However, recent studies have established that summary methods can have reduced accuracy in the presence of gene tree estimation error, and also that many biological datasets have substantial gene tree estimation error, so that summary methods may not be highly accurate in biologically realistic conditions. Mirarab et al. (Science 2014) presented the "statistical binning" technique to improve gene tree estimation in multi-locus analyses, and showed that it improved the accuracy of MP-EST, one of the most popular coalescent-based summary methods. Statistical binning, which uses a simple heuristic to evaluate "combinability" and then uses the larger sets of genes to re-calculate gene trees, has good empirical performance, but using statistical binning within a phylogenomic pipeline does not have the desirable property of being statistically consistent. We show that weighting the re-calculated gene trees by the bin sizes makes statistical binning statistically consistent under the multispecies coalescent, and maintains the good empirical performance. Thus, "weighted statistical binning" enables highly accurate genome-scale species tree estimation, and is also statistically consistent under the multi-species coalescent model. New data used in this study are available at DOI: http://dx.doi.org/10.6084/m9.figshare.1411146, and the software is available at https://github.com/smirarab/binning. 相似文献

8.

Interspecific Gene Flow Shaped the Evolution of the Genus Canis

Shyam Gopalakrishnan Mikkel-Holger S. Sinding Jazmín Ramos-Madrigal Jonas Niemann Jose A. Samaniego Castruita Filipe G. Vieira Christian Carøe Marc de Manuel Montero Lukas Kuderna Aitor Serres Víctor Manuel González-Basallote Yan-Hu Liu Guo-Dong Wang Tomas Marques-Bonet Siavash Mirarab Carlos Fernandes Philippe Gaubert Klaus-Peter Koepfli M. Thomas P. Gilbert 《Current biology : CB》2018,28(21):3441-3449.e5

相似文献

9.

FastSP: linear time calculation of alignment accuracy

Mirarab S Warnow T 《Bioinformatics (Oxford, England)》2011,27(23):3250-3258

相似文献

10.

Estimating repeat spectra and genome length from low-coverage genome skims with RESPECT

Shahab Sarmashghi Metin Balaban Eleonora Rachtman Behrouz Touri Siavash Mirarab Vineet Bafna 《PLoS computational biology》2021,17(11)

The cost of sequencing the genome is dropping at a much faster rate compared to assembling and finishing the genome. The use of lightly sampled genomes (genome-skims) could be transformative for genomic ecology, and results using k-mers have shown the advantage of this approach in identification and phylogenetic placement of eukaryotic species. Here, we revisit the basic question of estimating genomic parameters such as genome length, coverage, and repeat structure, focusing specifically on estimating the k-mer repeat spectrum. We show using a mix of theoretical and empirical analysis that there are fundamental limitations to estimating the k-mer spectra due to ill-conditioned systems, and that has implications for other genomic parameters. We get around this problem using a novel constrained optimization approach (Spline Linear Programming), where the constraints are learned empirically. On reads simulated at 1X coverage from 66 genomes, our method, REPeat SPECTra Estimation (RESPECT), had 2.2% error in length estimation compared to 27% error previously achieved. In shotgun sequenced read samples with contaminants, RESPECT length estimates had median error 4%, in contrast to other methods that had median error 80%. Together, the results suggest that low-pass genomic sequencing can yield reliable estimates of the length and repeat content of the genome. The RESPECT software will be publicly available at https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_shahab-2Dsarmashghi_RESPECT.git&d=DwIGAw&c=-35OiAkTchMrZOngvJPOeA&r=ZozViWvD1E8PorCkfwYKYQMVKFoEcqLFm4Tg49XnPcA&m=f-xS8GMHKckknkc7Xpp8FJYw_ltUwz5frOw1a5pJ81EpdTOK8xhbYmrN4ZxniM96&s=717o8hLR1JmHFpRPSWG6xdUQTikyUjicjkipjFsKG4w&e=. 相似文献