Found 20 similar documents; search took 0 ms
1.
It is well known that molecular data "saturates" with increasing sequence divergence (thereby losing phylogenetic information) and that in addition the accumulation of misleading information due to chance similarities or to systematic bias may accompany saturation as well. Exploratory data analysis methods that can quantify the extent of signal loss or convergence for a given data set are scarce. Such methods are needed because genomics delivers very long sequence alignments spanning substantial phylogenetic depth, where site saturation may be compounded by systematic biases or other alternative signals. Here we introduce the Treeness Triangle (TT) graph, in which signals detectable by Hadamard (spectral) analysis are summed into 3 categories: those supporting 1) external and 2) internal branches in the optimal tree, in addition to 3) the residuals (potential internal branches not present in the optimal tree). These 3 values are plotted in a standard ternary coordinate system. The approach is illustrated with simulated and real data sets, the latter from complete chloroplast genomes, where potential problems of paralogy or lateral gene acquisition can be excluded. The TT uncovers the divergence-dependent loss of phylogenetic signal as subsets of chloroplast genomes are investigated that span increasingly deeper evolutionary timescales. The rate of signal loss (or signal retention) varies with the gene and/or the method of analysis.
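The ternary plotting step described in this abstract can be illustrated with a minimal sketch. It assumes the three category sums (external, internal, residual) have already been obtained as non-negative numbers from a spectral analysis that is not reproduced here. The function name `treeness_triangle_point` and the assignment of categories to triangle corners are illustrative assumptions, not taken from the paper.

```python
import math


def treeness_triangle_point(external, internal, residual):
    """Map three non-negative signal sums to ternary proportions and 2-D coordinates.

    Assumed (hypothetical) corner layout of an equilateral triangle with unit sides:
      external -> (0, 0), internal -> (1, 0), residual -> (0.5, sqrt(3)/2)
    """
    values = (external, internal, residual)
    if any(v < 0 for v in values):
        raise ValueError("signal sums must be non-negative")
    total = sum(values)
    if total == 0:
        raise ValueError("at least one signal sum must be positive")

    # Normalise so that the three proportions add up to 1.
    a, b, c = (v / total for v in values)

    # Barycentric-to-Cartesian conversion for the corner layout above.
    x = b + 0.5 * c
    y = (math.sqrt(3) / 2) * c
    return (a, b, c), (x, y)


if __name__ == "__main__":
    proportions, point = treeness_triangle_point(2.0, 1.0, 1.0)
    print(proportions, point)
```

Under this layout, a data set whose signal lies entirely in external branches sits at the (0, 0) corner, and points move toward the top corner as the residual share grows.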
2.
3.
4.
Avian higher-level phylogeny: well-supported clades and what we can learn from a phylogenetic analysis of 2954 morphological characters (total citations: 1; self-citations: 0; citations by others: 1)
G. Mayr 《Journal of Zoological Systematics and Evolutionary Research》2008,46(1):63-72
It has been shown that increased character sampling betters the accuracy of phylogenetic reconstructions in the case of molecular data. A recently published analysis of avian higher-level phylogenetics based on 2954 morphological characters now provides an empirical example to test whether this is also true in the case of morphological characters. Several clades are discussed which are supported by multiple analyses of mutually independent molecular data (sequences of nuclear genes on different chromosomes and mitochondrial genes) as well as morphological apomorphies, but did not result from parsimony analysis of the large morphological data set. Incorrect character scorings in that analysis notwithstanding, it is concluded that in the case of morphological data, increased character sampling does not necessarily better the accuracy of a phylogenetic reconstruction. Because morphological characters usually have a strongly varying complexity, many simple and homoplastic characters may overrule fewer ones of greater phylogenetic significance in large data sets, thus producing a low ratio of phylogenetic signal to 'noise' in the data.
5.
We propose and compare two approaches for regression analysis of multilevel binary data when clusters are not necessarily nested: a GEE method that relies on a working independence assumption coupled with a three-step method for obtaining empirical standard errors, and a likelihood-based method implemented using Bayesian computational techniques. Implications of time-varying endogenous covariates are addressed. The methods are illustrated using data from the Breast Cancer Surveillance Consortium to estimate mammography accuracy from a repeatedly screened population.
6.
7.
8.
A Markov chain Monte Carlo (MCMC) algorithm to sample an exchangeable covariance matrix, such as the one of the error terms (R0) in a multiple trait animal model with missing records under normal-inverted Wishart priors is presented. The algorithm (FCG) is based on a conjugate form of the inverted Wishart density that avoids sampling the missing error terms. Normal prior densities are assumed for the "fixed" effects and breeding values, whereas the covariance matrices are assumed to follow inverted Wishart distributions. The inverted Wishart prior for the environmental covariance matrix is a product density of all patterns of missing data. The resulting MCMC scheme eliminates the correlation between the sampled missing residuals and the sampled R0, which in turn has the effect of decreasing the total amount of samples needed to reach convergence. The use of the FCG algorithm in a multiple trait data set with an extreme pattern of missing records produced a dramatic reduction in the size of the autocorrelations among samples for all lags from 1 to 50, and this increased the effective sample size from 2.5 to 7 times and reduced the number of samples needed to attain convergence, when compared with the "data augmentation" algorithm.
9.
K. Bremer A. Backlund B. Sennblad U. Swenson K. Andreasen M. Hjertson J. Lundberg M. Backlund B. Bremer 《Plant Systematics and Evolution》2001,229(3-4):137-169
A data matrix of 143 morphological and chemical characters for 142 genera of euasterids according to the APG system was compiled
and complemented with rbcL and ndhF sequences for most of the genera. The data were subjected to parsimony analysis and support was assessed by bootstrapping.
Strict consensus trees from analyses of morphology alone and morphology + rbcL + ndhF are presented. The morphological data recover several groups supported by molecular data but at the level of orders and
above relationships are only superficially in agreement with molecular studies. The analyses provide support for monophyly
of Gentianales, Aquifoliales, Apiales, Asterales, and Dipsacales. All data indicate that Adoxaceae are closely related to
Dipsacales and hence they should be included in that order. The trees were used to assess some possible morphological synapomorphies
for euasterids I and II and for the orders of the APG system. Euasterids I are generally characterised by opposite leaves,
entire leaf margins, hypogynous flowers, “early sympetaly” with a ring-shaped corolla primordium, fusion of stamen filaments
with the corolla tube, and capsular fruits. Euasterids II often have alternate leaves, serrate-dentate leaf margins, epigynous
flowers, “late sympetaly” with distinct petal primordia, free stamen filaments, and indehiscent fruits. It is unclear which
of these characters represent synapomorphies and symplesiomorphies for the two groups, respectively, and there are numerous
exceptions to be interpreted as reversals and parallelisms.
Received August 28, 2000; accepted August 7, 2001
10.
11.
Summary: In estimation of the ROC curve, when the true disease status is subject to nonignorable missingness, the observed likelihood involves the missing mechanism given by a selection model. In this article, we proposed a likelihood‐based approach to estimate the ROC curve and the area under the ROC curve when the verification bias is nonignorable. We specified a parametric disease model in order to make the nonignorable selection model identifiable. With the estimated verification and disease probabilities, we constructed four types of empirical estimates of the ROC curve and its area based on imputation and reweighting methods. In practice, a reasonably large sample size is required to estimate the nonignorable selection model in our settings. Simulation studies showed that all four estimators of ROC area performed well, and imputation estimators were generally more efficient than the other estimators proposed. We applied the proposed method to a data set from research in Alzheimer's disease.
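The reweighting idea mentioned in this abstract can be illustrated with a generic inverse-probability-weighted estimate of the ROC area. This is a sketch under strong assumptions, not the article's estimator. It takes the verification probabilities as already estimated and given (the nonignorable selection model itself is not fitted here), and the function name `ipw_auc` and its arguments are hypothetical.

```python
def ipw_auc(scores, disease, verified, verify_prob):
    """Inverse-probability-weighted empirical AUC from verified subjects only.

    scores      : test results (higher means more suspicious)
    disease     : 1 = diseased, 0 = healthy (only used where verified is true)
    verified    : True if the true disease status was ascertained
    verify_prob : assumed-known probability (> 0) that each subject is verified
    """
    cases, controls = [], []
    for s, d, v, p in zip(scores, disease, verified, verify_prob):
        if not v:
            continue  # unverified subjects contribute no pair
        if p <= 0:
            raise ValueError("verification probability must be positive")
        # Each verified subject stands in for 1/p subjects like it.
        (cases if d == 1 else controls).append((s, 1.0 / p))

    numerator = 0.0
    denominator = 0.0
    for s_case, w_case in cases:
        for s_ctrl, w_ctrl in controls:
            weight = w_case * w_ctrl
            denominator += weight
            if s_case > s_ctrl:
                numerator += weight
            elif s_case == s_ctrl:
                numerator += 0.5 * weight  # ties count as half
    if denominator == 0:
        raise ValueError("need at least one verified case and one verified control")
    return numerator / denominator


if __name__ == "__main__":
    print(ipw_auc([0.9, 0.4, 0.6, 0.2], [1, 1, 0, 0],
                  [True, True, True, True], [1.0, 0.5, 1.0, 1.0]))
```

When every subject is verified with probability 1, the function reduces to the ordinary empirical (Mann-Whitney) AUC.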
12.
Chi Keung Cheng Man Kit Cheung Wenyan Nong Patrick Tik Wan Law Jing Qin Julia Mei-Lun Ling Kai Man Kam William Man Wai Cheung Hoi Shan Kwan 《BMC genomics》2015,16(1)
Background
Salmonella Typhimurium is frequently isolated from foodborne infection cases in Hong Kong, but the lack of genome sequences has hindered in-depth epidemiological and phylogenetic studies. In this study, we sought to reconstruct the phylogenetic relationship and investigate the distribution and mutation patterns of virulence determinants among local S. Typhimurium clinical isolates using their genome sequences.
Results
We obtained genome sequences of 20 S. Typhimurium clinical isolates from a local hospital cluster using a 454 GS FLX Titanium sequencing platform. Phylogenetic analysis was performed based on single nucleotide polymorphism positions of the core genome against the reference strain LT2. Antimicrobial susceptibility was determined using minimal inhibitory concentration for five antimicrobial agents and analyses of virulence determinants were performed through referencing to various databases. Through phylogenetic analysis, we revealed two distinct clades of S. Typhimurium isolates and three outliers in Hong Kong, which differ remarkably in antimicrobial susceptibility and presentation and mutations of virulence determinants. The local isolates were not closely related to many of the previously sequenced S. Typhimurium isolates, except LT2. As the isolates in the two clades spanned over 10 years of isolation, they probably represent endemic strains. The outliers are possibly introduced from outside of Hong Kong. The close relatedness of members in one of the clades to LT2 and the Japanese stool isolate T000240 suggests the potential reemergence of LT2 progeny in regions nearby.
Conclusions
Our study demonstrated the utility of next-generation sequencing coupled to traditional microbiological testing method in a retrospective epidemiological study involving multiple clinical isolates. The evolution of multidrug- and ciprofloxacin-resistant strains among the more virulent clade is also an increasing concern.
Electronic supplementary material
The online version of this article (doi:10.1186/s12864-015-1900-y) contains supplementary material, which is available to authorized users.
13.
Yassin A Araripe LO Capy P Da Lage JL Klaczko LB Maisonhaute C Ogereau D David JR 《Molecular phylogenetics and evolution》2008,47(3):903-915
A molecular phylogeny for the drosophilid genus Zaprionus was inferred using a mitochondrial (CO-II) and a nuclear (Amyrel) gene using 22 available species. The combined molecular tree does not support the current classification, dubbed phylogenetic, based entirely upon a morphocline of forefemoral ornamentation. For species for which DNA was not available, phylogenetic positioning was only assigned using morphological characters. In order to avoid conflict between DNA and morphology in the combined analyses (supermatrix method), we developed a new method in which few morphological characters were sampled according to an a priori homoplasy assessment on the consensus molecular tree. At each internal node of the tree, a number of synapomorphies was determined, and species with no molecular sequences were grafted thereon. Analogously to tree vocabulary, we called our method 'morphological grafting'. New species groups and complexes were then defined in the light of our findings. Further, divergence times were estimated under a relaxed molecular clock, and historical biogeography was reconstructed under a maximum likelihood model. Zaprionus appears to be of recent origin in the Oriental region during the Late Miocene (approximately 10 MYA), and colonization of Africa started shortly after (approximately 7 MYA) via the maritime route of the Indian Ocean Islands. Most of the morphological and ecological diversification took place later, in Western Africa, during the Quaternary cyclic climatic changes. Furthermore, some species became recent invaders, with one, Zaprionus indianus, having successfully invaded South and North America during the last decade.
14.
Das D Xu QS Lee JY Ankoudinova I Huang C Lou Y DeGiovanni A Kim R Kim SH 《Journal of structural biology》2007,158(3):494-502
Crystal structures of the bacterial multidrug transporter AcrB in R32 and C2 space groups showing both symmetric and asymmetric trimeric assemblies, respectively, supplemented with biochemical investigations, have provided most of the structural basis for a molecular level understanding of the protein structure and mechanisms for substrate uptake and translocation carried out by this 114-kDa inner membrane protein. They suggest that AcrB captures ligands primarily from the periplasm. Substrates can also enter the inner cavity of the transporter from the cytoplasm, but the exact mechanism of this remains undefined. Analysis of the amino acid sequences of AcrB and its homologs revealed the presence of conserved residues at the N-terminus including two phenylalanines which may be exposed to the cytoplasm. Any potential role that these conserved residues may play in function has not been addressed by existing biochemical or structural studies. Since phenylalanine residues elsewhere in the protein have been implicated in ligand binding, we explored the structure of this N-terminal region to investigate structural determinants near the cytoplasmic opening that may mediate drug uptake. Our structure of AcrB in R32 space group reveals an N-terminus loop, reducing the diameter of the central opening to approximately 15 Å as opposed to the previously reported value of approximately 30 Å for crystal structures in this space group with disordered N-terminus. Recent structures of the AcrB in C2 space group have revealed a helical conformation of this N-terminus but have not discussed its possible implications. We present the crystal structure of AcrB that reveals the structure of the N-terminus containing the conserved residues. We hope that the structural information provides a structural basis for others to design further biochemical investigation of the role of this portion of AcrB in mediating cytoplasmic ligand discrimination and uptake.
15.
The complex of butterfly taxa close to Melitaea didyma includes the traditionally recognized species Melitaea didyma, Melitaea didymoides and Melitaea sutschana, the taxa that were recognized as species only relatively recently (Melitaea latonigena, Melitaea interrupta, Melitaea chitralensis and Melitaea mixta) as well as numerous described subspecies and forms with unclear taxonomic status. Here analysis of mitochondrial DNA barcodes is used to demonstrate that this complex is a monophyletic group consisting of at least 12 major haplogroups strongly differentiated with respect to the gene COI. Six of these haplogroups are shown to correspond to six of the above-mentioned species (Melitaea didymoides, Melitaea sutschana, Melitaea latonigena, Melitaea interrupta, Melitaea chitralensis and Melitaea mixta). It is hypothesized that each of the remaining six haplogroups also represents a distinct species (Melitaea mauretanica, Melitaea occidentalis, Melitaea didyma, Melitaea neera, Melitaea liliputana and Melitaea turkestanica), since merging these haplogroups would result in a polyphyletic assemblage and the genetic distances between them are comparable with those found between the other six previously recognized species.
16.
17.
Sean A. Locke Fabiana B. Drago Danimar López-Hernández Fred D. Chibwana Verónica Núñez Alex Van Dam María Fernanda Achinelly Pieter T.J. Johnson Jordana Costa Alves de Assis Alan Lane de Melo Hudson Alves Pinto 《International journal for parasitology》2021,51(8):667-683
When subjected to molecular study, species of digeneans believed to be cosmopolitan are usually found to consist of complexes of species with narrower distributions. We present molecular and morphological evidence of transcontinental distributions in two species of Apharyngostrigea Ciurea, 1924, based on samples from Africa and the Americas. Sequences of cytochrome c oxidase I and, in some samples, internal transcribed spacer, revealed Apharyngostrigea pipientis (Faust, 1918) in Tanzania (first known African record), Argentina, Brazil, USA and Canada. Sequences from A. pipientis also match previously published sequences identified as Apharyngostrigea cornu (Zeder, 1800) originating in Mexico. Hosts of A. pipientis surveyed include definitive hosts from the Afrotropic, Neotropic and Nearctic, as well as first and second intermediate hosts from the Americas, including the type host and type region. In addition, metacercariae of A. pipientis were obtained from experimentally infected Poecilia reticulata, the first known record of this parasite in a non-amphibian second intermediate host. Variation in cytochrome c oxidase I haplotypes in A. pipientis is consistent with a long established, wide-ranging species with moderate genetic structure among Nearctic, Neotropic and Afrotropic regions. We attribute this to natural dispersal by birds and find no evidence of anthropogenic introductions of exotic host species. Sequences of CO1 and ITS from adult Apharyngostrigea simplex (Johnston, 1904) from Egretta thula in Argentina matched published data from cercariae from Biomphalaria straminea from Brazil and metacercariae from Cnesterodon decemmaculatus in Argentina, consistent with previous morphological and life-cycle studies reporting this parasite—originally described in Australia—in South America. Analyses of the mitochondrial genome and rDNA operon from A. 
pipientis support prior phylogenies based on shorter markers showing the Strigeidae Railliet, 1919 to be polyphyletic.
18.
Mahonia bealei (Berberidaceae) is a frequently used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unrearranged, which is consistent with the hypothesis that large IRs stabilize the cp genome and reduce gene loss-and-gain probabilities during the evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei; 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequent repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields a tree topology identical to previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae.
Molecular dating analyses suggest that Ranunculaceae and Berberidaceae diverged between 90 and 84 mya, which is congruent with the fossil records and with recent estimates of the divergence time of these two taxa.
19.
Graham Solomons JT Zimmerly EM Burns S Krishnamurthy N Swan MK Krings S Muirhead H Chirgwin J Davies C 《Journal of molecular biology》2004,342(3):847-860
Phosphoglucose isomerase (PGI) is an enzyme of glycolysis that interconverts glucose 6-phosphate (G6P) and fructose 6-phosphate (F6P) but, outside the cell, is a multifunctional cytokine. High-resolution crystal structures of the enzyme from mouse have been determined in native form and in complex with the inhibitor erythrose 4-phosphate, and with the substrate glucose 6-phosphate. In the substrate-bound structure, the glucose sugar is observed in both straight-chain and ring forms. This structure supports a specific role for Lys518 in enzyme-catalyzed ring opening and we present a "push-pull" mechanism in which His388 breaks the O5-C1 bond by donating a proton to the ring oxygen atom and, simultaneously, Lys518 abstracts a proton from the C1 hydroxyl group. The reverse occurs in ring closure. The transition from ring form to straight-chain substrate is achieved through rotation of the C3-C4 bond, which brings the C1-C2 region into close proximity to Glu357, the base catalyst for the isomerization step. The structure with G6P also explains the specificity of PGI for glucose 6-phosphate over mannose 6-phosphate (M6P). To isomerize M6P to F6P requires a rotation of its C2-C3 bond but in PGI this is sterically blocked by Gln511.
20.
Estela R. Álvarez-Martínez Ángel Valverde Martha Helena Ramírez-Bahena Paula García-Fraile Carmen Tejedor Pedro F. Mateos Nery Santillana Doris Zúñiga Alvaro Peix Encarna Velázquez 《Archives of microbiology》2009,191(8):659-668
In this work, we analysed the core and symbiotic genes of rhizobial strains isolated from Vicia sativa in three soils from the Northwest of Spain, and compared them with other Vicia endosymbionts isolated in other geographical locations. The analysis of rrs, recA and atpD genes and 16S–23S rRNA intergenic spacer showed that the Spanish strains nodulating V. sativa are phylogenetically close to those isolated from V. sativa and V. faba in different European, American and Asian countries forming a group related to Rhizobium leguminosarum. The analysis of the nodC gene of strains nodulating V. sativa and V. faba in different continents showed they belong to a phylogenetically compact group indicating that these legumes are restrictive hosts. The results of the nodC gene analysis allow the delineation of the biovar viciae showing a common phylogenetic origin of V. sativa and V. faba endosymbionts in several continents. Since these two legume species are indigenous to Europe, our results suggest a world distribution of strains from R. leguminosarum together with the V. sativa and V. faba seeds and a close coevolution among chromosome, symbiotic genes and legume host in this Rhizobium–Vicia symbiosis.