首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The successful application of genomic selection (GS) approaches is dependent on genetic makers derived from high-throughput and low-cost genotyping methods. Recent GS studies in trees have predominantly relied on SNP arrays as the source of genotyping, though this technology has a high entry cost. The recent development of alternative genotyping platforms, tailored to specific species and with low entry cost, has become possible due to advances in next-generation sequencing and genome complexity reduction methods such as sequence capture. However, the performance of these new platforms in GS models has not yet been evaluated, or compared to models developed from SNP arrays. Here, we evaluate the impact of these genotyping technologies on the development of GS prediction models for a Eucalyptus breeding population composed of 739 trees phenotyped for 13 wood quality and growth traits. Genotyping data obtained with both methods were compared for linkage disequilibrium, minor allele frequency, and missing data. Phenotypic prediction methods RR-BLUP and BayesB were employed, while predictive ability using cross validation was used to evaluate the performance of GS models derived from the different genotyping platforms. Differences in linkage disequilibrium patterns, minor allele frequency, missing data, and marker distribution were detected between sequence capture and SNP arrays. However, RR-BLUP and BayesB GS models resulted in similar predictive abilities. These results demonstrate that both genotyping methods are equivalent for genomic prediction of the traits evaluated. Sequence capture offers an alternative for species where SNP arrays are not available, or for when the initial development cost is too high.  相似文献   

2.
Parmelioid lichens comprise about 1500 species and have a worldwide distribution. Numerous species are widely distributed and well known, including important bioindicators for atmospheric pollution. The phylogeny and classification of parmelioid lichens has been a matter of debate for several decades. Previous studies using molecular data have helped to establish hypotheses of the phylogeny of certain clades within this group. In this study, we infer the phylogeny of major clades of parmelioid lichens using DNA sequence data from two nuclear loci and one mitochondrial locus from 145 specimens (117 species) that represent the morphological and chemical diversity in these taxa. Parmelioid lichens are not monophyletic; however, a core group is strongly supported as monophyletic, excluding Arctoparmelia and Melanelia s. str., and including Parmeliopsis and Parmelaria. Within this group, seven well-supported clades are found, but the relationships among them remain unresolved. Stochastic mapping on a MC/MCMC tree sampling was employed to infer the evolution of two morphological and two chemical traits believed to be important for the evolutionary success of these lichens, and have also been used as major characters for classification. The results suggest that these characters have been gained and lost multiple times during the diversification of parmelioid lichens.  相似文献   

3.
Parmeliaceae is the largest family of lichen-forming fungi with more than 2000 species and includes taxa with different growth forms. Morphology was widely employed to distinguish groups within this large, cosmopolitan family. In this study we test these morphology-based groupings using DNA sequence data from three nuclear and one mitochondrial marker from 120 taxa that include 59 genera and represent the morphological and chemical diversity in this lineage. Parmeliaceae is strongly supported as monophyletic and six well-supported main clades can be distinguished within the family. The relationships among them remain unresolved. The clades largely agree with the morphology-based groupings and only the placement of four of the genera studied is rejected by molecular data, while four other genera belong to clades previously unrecognised. The classification of these previously misplaced genera, however, has already been questioned by some authors based on morphological evidence. These results support morphological characters as important for the identification of monophyletic clades within Parmeliaceae.  相似文献   

4.
Conformation and dynamics of heparin and heparan sulfate   总被引:10,自引:0,他引:10  
Mulloy B  Forster MJ 《Glycobiology》2000,10(11):1147-1156
The glycosaminoglycans heparin and heparan sulfate contain similar structural units in varying proportions providing considerable diversity in sequence and biological function. Both compounds are alternating copolymers of glucosamine with both iduronate- and glucuronate-containing sequences bearing N-sulfate, N-acetyl, and O-sulfate substitution. Protein recognition of these structurally-diverse compounds depends upon substitution pattern, overall molecular shape, and on internal mobility. In this review particular attention is paid to the dynamic aspects of heparin/heparan sulfate conformation. The iduronate residue possesses an unusually flexible pyranose ring conformation. This extra source of internal mobility creates special problems in rationalization of experimental data for these compounds. We present herein the solution-state NMR parameters, fiber diffraction data, crystallographic data, and molecular modeling methods employed in the investigation of heparin and heparan sulfate. Heparin is a useful model compound for the sulfated, protein-binding regions of heparan sulfate. The literature contains a number of solution and solid-state studies of heparin oligo- and polysaccharides for both isolated heparin species and those bound to protein receptors. These studies indicate a diversity of iduronate ring conformations, but a limited range of glycosidic linkage geometries in the repeating disaccharides. In this sense, heparin exhibits a well-defined overall shape within which iduronate ring forms can freely interconvert. Recent work suggests that computational modeling could potentially identify heparin binding sites on protein surfaces.  相似文献   

5.
Recombination and selection at Brassica self-incompatibility loci   总被引:1,自引:0,他引:1  
Awadalla P  Charlesworth D 《Genetics》1999,152(1):413-425
In Brassica species, self-incompatibility is controlled genetically by haplotypes involving two known genes, SLG and SRK, and possibly an as yet unknown gene controlling pollen incompatibility types. Alleles at the incompatibility loci are maintained by frequency-dependent selection, and diversity at SLG and SRK appears to be very ancient, with high diversity at silent and replacement sites, particularly in certain "hypervariable" portions of the genes. It is important to test whether recombination occurs in these genes before inferences about function of different parts of the genes can be made from patterns of diversity within their sequences. In addition, it has been suggested that, to maintain the relationship between alleles within a given S-haplotype, recombination is suppressed in the S-locus region. The high diversity makes many population genetic measures of recombination inapplicable. We have analyzed linkage disequilibrium within the SLG gene of two Brassica species, using published coding sequences. The results suggest that intragenic recombination has occurred in the evolutionary history of these alleles. This is supported by patterns of synonymous nucleotide diversity within both the SLG and SRK genes, and between domains of the SRK gene. Finally, clusters of linkage disequilibrium within the SLG gene suggest that hypervariable regions are under balancing selection, and are not merely regions of relaxed selective constraint.  相似文献   

6.
Two Drosophila pseudoobscura genomic clones have sequence similarity to the Drosophila melanogaster amylase region that maps to the 53CD region on the D. melanogaster cytogenetic map. The two clones with similarity to amylase map to sections 73A and 78C of the D. pseudoobscura third chromosome cytogenetic map. The complete sequences of both the 73A and 78C regions were compared to the D. melanogaster genome to determine if the coding region for amylase is present in both regions and to determine the evolutionary mechanism responsible for the observed distribution of the amylase gene or genes. The D. pseudoobscura 73A and 78C linkage groups are conserved with the D. melanogaster 41E and 53CD regions, respectively. The amylase gene, however, has not maintained its conserved linkage between the two species. These data indicate that amylase has moved via a transposition event in the D. melanogaster or D. pseudoobscura lineage. The predicted genes within the 73A and 78C regions show patterns of molecular evolution in synonymous and nonsynonymous sites that are consistent with previous studies of these two species.  相似文献   

7.
Akashi H  Goel P  John A 《PloS one》2007,2(10):e1065
Reliable inference of ancestral sequences can be critical to identifying both patterns and causes of molecular evolution. Robustness of ancestral inference is often assumed among closely related species, but tests of this assumption have been limited. Here, we examine the performance of inference methods for data simulated under scenarios of codon bias evolution within the Drosophila melanogaster subgroup. Genome sequence data for multiple, closely related species within this subgroup make it an important system for studying molecular evolutionary genetics. The effects of asymmetric and lineage-specific substitution rates (i.e., varying levels of codon usage bias and departures from equilibrium) on the reliability of ancestral codon usage was investigated. Maximum parsimony inference, which has been widely employed in analyses of Drosophila codon bias evolution, was compared to an approach that attempts to account for uncertainty in ancestral inference by weighting ancestral reconstructions by their posterior probabilities. The latter approach employs maximum likelihood estimation of rate and base composition parameters. For equilibrium and most non-equilibrium scenarios that were investigated, the probabilistic method appears to generate reliable ancestral codon bias inferences for molecular evolutionary studies within the D. melanogaster subgroup. These reconstructions are more reliable than parsimony inference, especially when codon usage is strongly skewed. However, inference biases are considerable for both methods under particular departures from stationarity (i.e., when adaptive evolution is prevalent). Reliability of inference can be sensitive to branch lengths, asymmetry in substitution rates, and the locations and nature of lineage-specific processes within a gene tree. Inference reliability, even among closely related species, can be strongly affected by (potentially unknown) patterns of molecular evolution in lineages ancestral to those of interest.  相似文献   

8.
The genus Rickettsiella comprises intracellular bacterial pathogens of a wide range of arthropods that are currently classified in four recognized species and numerous further pathotypes. However, both the delineation of and the synonymization of pathotypes with species are highly problematic. In the sequel of a previous phylogenomic study at the supra-generic level, nine selected genes - the 16S and 23S rRNA genes and the protein-encoding genes dnaG, ftsY, gidA, ksgA, rpoB, rpsA, and sucB - were evaluated for their potential as markers for the generic and infra-generic taxonomic classification of Rickettsiella-like bacteria. A methodological approach combining phylogenetic reconstruction with likelihood-based significance testing was employed on the basis of sequence data from the species Rickettsiella grylli and Rickettsiella popilliae, pathotypes 'Rickettsiella melolonthae' and 'Rickettsiella tipulae'. This study provides the first multilocus sequence typing (MLST) data for the genus Rickettsiella and identifies two new genetic markers, gidA and sucB, for the infra-generic classification within this taxon. In particular, aforesaid genes were found more reliable and informative markers than the corresponding 16S rRNA-encoding sequences that failed to produce strictly significant infra-generic taxonomic assignments. However, gidA- and sucB-based phylogenies were consistent with the currently accepted view of species delineation and species-pathotype synonymization within the genus Rickettsiella.  相似文献   

9.
Oak gallwasps are cyclically parthenogenetic insects that induce a wide diversity of highly complex species- and generation-specific galls on oaks and other Fagaceae. Phylogenetic relationships within oak gallwasps remain to be established, while sexual and parthenogenetic generations of many species remain unpaired. Previous work on oak gallwasps has revealed substantial intra-specific variation, particularly between regions known to represent discrete Pleistocene glacial refuges. Here we use statistical phylogenetic inference methods on sequence data for a fragment of the mitochondrial cytochrome b gene to reconstruct the relationships among 62 oak gallwasp species. For 16 of these we also include 23 additional cytochrome b haplotype sequences from different Pleistocene refuge areas to test the effect of intra-specific variation on inter-specific phylogeny reconstruction. The reconstructed phylogenies show good intra-generic resolution and identify several conserved clades, but fail to reconstruct either very recent or very ancient divergences. Nine of the 16 species represented by multiple haplotypes are not monophyletic. The apparent discordance between the recovered gene tree and the current taxonomic classification can be explained through: (a) collapsing of some species currently known only from either a sexual or a parthenogenetic generation into a single cyclically parthenogenetic entity; (b) sorting of ancestral polymorphism in diverging lineages, and (c) horizontal transfer of haplotypes, perhaps due to hybridization within glacial refuges. Our conclusions emphasise the need for careful intra-specific sampling when reconstructing phylogenies for radiations of closely related species and imply that for certain taxonomic groups full phylogenetic resolution (using molecular markers) may not be attainable.  相似文献   

10.
Dinoflagellate taxonomy is based primarily on morphology and morphometric data that can be difficult to obtain. In contrast, molecular data can be rapidly and cost‐effectively acquired, which has led to a rapid accumulation of sequence data in GenBank. Currently there are no systematic criteria for utilizing taxonomically unassigned sequence data to identify putative species that could in turn serve as a basis for testable hypotheses concerning the taxonomy, diversity, distribution, and toxicity of these organisms. The goal of this research was to evaluate whether simple, uncorrected genetic distances (p) calculated using ITS1/5.8S/ITS2 (ITS region) rDNA sequences could be used to develop criteria for recognizing putative species before formal morphological evaluation and classification. The current analysis used sequences from 81 dinoflagellate species belonging to 14 genera. For this diverse assemblage of dinoflagellate species, the within‐species genetic distances between ITS region copies (p=0.000–0.021 substitutions per site) were consistently less than those observed between species (p=0.042–0.580). Our results indicate that a between‐species uncorrected genetic distance of p≥0.04 could be used to delineate most free‐living dinoflagellate species. Recently evolved species, however, may have ITS p values <0.04 and would require more extensive morphological and genetic analyses to resolve. For most species, the sequence of the dominant ITS region allele has the potential to serve as a unique species‐specific “DNA barcode” that could be used for the rapid identification of dinoflagellates in field and laboratory studies.  相似文献   

11.
DNA barcoding protocols require the linkage of each sequence record to a voucher specimen that has, whenever possible, been authoritatively identified. Natural history collections would seem an ideal resource for barcode library construction, but they have never seen large-scale analysis because of concerns linked to DNA degradation. The present study examines the strength of this barrier, carrying out a comprehensive analysis of moth and butterfly (Lepidoptera) species in the Australian National Insect Collection. Protocols were developed that enabled tissue samples, specimen data, and images to be assembled rapidly. Using these methods, a five-person team processed 41,650 specimens representing 12,699 species in 14 weeks. Subsequent molecular analysis took about six months, reflecting the need for multiple rounds of PCR as sequence recovery was impacted by age, body size, and collection protocols. Despite these variables and the fact that specimens averaged 30.4 years old, barcode records were obtained from 86% of the species. In fact, one or more barcode compliant sequences (>487 bp) were recovered from virtually all species represented by five or more individuals, even when the youngest was 50 years old. By assembling specimen images, distributional data, and DNA barcode sequences on a web-accessible informatics platform, this study has greatly advanced accessibility to information on thousands of species. Moreover, much of the specimen data became publically accessible within days of its acquisition, while most sequence results saw release within three months. As such, this study reveals the speed with which DNA barcode workflows can mobilize biodiversity data, often providing the first web-accessible information for a species. These results further suggest that existing collections can enable the rapid development of a comprehensive DNA barcode library for the most diverse compartment of terrestrial biodiversity – insects.  相似文献   

12.
P. Seperack  M. Slatkin    N. Arnheim 《Genetics》1988,119(4):943-949
Members of the rDNA multigene family within a species do not evolve independently, rather, they evolve together in a concerted fashion. Between species, however, each multigene family does evolve independently indicating that mechanisms exist which will amplify and fix new mutations both within populations and within species. In order to evaluate the possible mechanisms by which mutation, amplification and fixation occur we have determined the level of linkage disequilibrium between two polymorphic sites in human ribosomal genes in five racial groups and among individuals within two of these groups. The marked linkage disequilibrium we observe within individuals suggests that sister chromatid exchanges are much more important than homologous or nonhomologous recombination events in the concerted evolution of the rDNA family and further that recent models of molecular drive may not apply to the evolution of the rDNA multigene family.  相似文献   

13.
Ebner K  Pinsker W  Lion T 《Journal of virology》2005,79(20):12635-12642
The adenovirus (AdV) hexon constitutes the major virus capsid protein. The epitopes located on the hexon protein are targets of neutralizing antibodies in vivo, serve in the recognition by cytotoxic T cells, and provide the basis for the classification of adenoviruses into the 51 serotypes known to date. We have sequenced the entire hexon gene from human serotypes with incomplete or no sequence information available (n = 34) and performed a comparative analysis of all sequences. The overall sequence divergence between the 51 human serotypes ranged from 0.7 to 25.4% at the protein level. The sequence information has been exploited to assess the phylogeny of the adenovirus family, and protein distances between the six AdV species (A to F) and among individual serotypes within each species were calculated. The analysis revealed that the differences among serotypes within individual species range from 0.3 to 5.4% in the conserved regions (765 amino acids [aa]) and from 1.5 to 59.6% in the variable regions (154 to 221 aa). Serotypes of different species showed an expectedly greater divergence both in the conserved (5.9 to 12.3%) and variable (49.0 to 74.7%) regions. Construction of a phylogenetic tree revealed three major clades comprising the species B+D+E, A+F, and C, respectively. For serotypes 50 and 51, the original assignment to species B and D, respectively, is not in accordance with the hexon DNA and protein sequence data, which placed serotype 50 within species D and serotype 51 within species B. Moreover, the hexon gene of serotype 16, a member of species B, was identified as the product of recombination between sequences of species B and E. In addition to providing a basis for improved molecular diagnostics and classification, the elucidation of the complete hexon gene sequence in all AdV serotypes yields information on putative epitopes for virus recognition, which may have important implications for future treatment strategies permitting efficient targeting of any AdV serotype.  相似文献   

14.
Simultaneous molecular dating of population and species divergences is essential in many biological investigations, including phylogeography, phylodynamics and species delimitation studies. In these investigations, multiple sequence alignments consist of both intra‐ and interspecies samples (mixed samples). As a result, the phylogenetic trees contain interspecies, interpopulation and within‐population divergences. Bayesian relaxed clock methods are often employed in these analyses, but they assume the same tree prior for both inter‐ and intraspecies branching processes and require specification of a clock model for branch rates (independent vs. autocorrelated rates models). We evaluated the impact of a single tree prior on Bayesian divergence time estimates by analysing computer‐simulated data sets. We also examined the effect of the assumption of independence of evolutionary rate variation among branches when the branch rates are autocorrelated. Bayesian approach with coalescent tree priors generally produced excellent molecular dates and highest posterior densities with high coverage probabilities. We also evaluated the performance of a non‐Bayesian method, RelTime, which does not require the specification of a tree prior or a clock model. RelTime's performance was similar to that of the Bayesian approach, suggesting that it is also suitable to analyse data sets containing both populations and species variation when its computational efficiency is needed.  相似文献   

15.
Molecular codes can be considered a special type of mapping among molecular species in biochemical systems. The formalization of molecular codes allows to identify these in network models of real world systems. Analyzing algorithmically identified codes leads to the observation that codes does not necessarily stand alone, but that we can identify certain relations among codes. In this paper I will define two types of relations that can occur among codes, (1) code linkage and (2) code nesting, and will discuss implications of this finding.  相似文献   

16.
Prediction of neuropeptide cleavage sites in insects   总被引:1,自引:0,他引:1  
MOTIVATION: The production of neuropeptides from their precursor proteins is the result of a complex series of enzymatic processing steps. Often, the annotation of new neuropeptide genes from sequence information outstrips biochemical assays and so bioinformatics tools can provide rapid information on the most likely peptides produced by a gene. Predicting the final bioactive neuropeptides from precursor proteins requires accurate algorithms to determine which locations in the protein are cleaved. RESULTS: Predictive models were trained on Apis mellifera and Drosophila melanogaster precursors using binary logistic regression, multi-layer perceptron and k-nearest neighbor models. The final predictive models included specific amino acids at locations relative to the cleavage sites. Correct classification rates ranged from 78 to 100% indicating that the models adequately predicted cleaved and non-cleaved positions across a wide range of neuropeptide families and insect species. The model trained on D.melanogaster data had better generalization properties than the model trained on A. mellifera for the data sets considered. The reliable and consistent performance of the models in the test data sets suggests that the bioinformatics strategies proposed here can accurately predict neuropeptides in insects with sequence information based on neuropeptides with biochemical and sequence information in well-studied species.  相似文献   

17.
Obtaining fundamental biodiversity metrics such as alpha, beta and gamma diversity for arthropods is often complicated by a lack of prior taxonomic information and/or taxonomic expertise, which can result in unreliable morphologically based estimates. We provide a set of standardized ecological and molecular sampling protocols that can be employed by researchers whose taxonomic skills may be limited, and where there may be a lack of robust a priori information regarding the regional pool of species. These protocols combine mass sampling of arthropods, classification of samples into parataxonomic units (PUs) and selective sampling of individuals for mtDNA sequencing to infer biological species. We sampled ten lowland rainforest plots located on the volcanic oceanic island of Réunion (Mascarene archipelago) for spiders, a group with limited taxonomic and distributional data for this region. We classified adults and juveniles into PUs and then demonstrated the reconciliation of these units with presumed biological species using mtDNA sequence data, ecological data and distributional data. Because our species assignment protocol is not reliant upon prior taxonomic information, or taxonomic expertise, it minimizes the problem of the Linnean shortfall to yield diversity estimates that can be directly compared across independent studies. Field sampling can be extended to other arthropod groups and habitats by adapting our field sampling protocol accordingly.  相似文献   

18.
非序列联配的序列分析方法,将序列中特定寡聚核苷酸的kmer统计频率作为特征,在序列间按特征进行比较和分析。这种方法综合考虑了所有变异类型对序列整体特征的影响,因而在组学数据分析上有独特的优势。但是,这类方法在复杂多细胞生物基因组系统发育中的适用性仍然有待检验。在本文中,我们使用基于非序列联配方法的CVTree软件,以45种哺乳动物的蛋白质组数据建立了系统发育关系NJ树,并据此探讨了哺乳动物系统发育的若干问题。在广受关注的真兽下纲四个总目的关系问题上,CVTree支持形态学的普遍结论即上兽类(Epitheria)假说。这与基于序列联配方法支持的外非洲胎盘类(Exafro-placentalia )假说不同。在哺乳动物内部目的层次上,CVTree树的结论与分子和形态所普遍接受的系统发育关系基本一致。但是在目的内部,CVTree树会有较多的差异。研究结果初步显示非序列联配方法在使用复杂多细胞生物的组学数据进行系统发育关系分析中的可行性。对非序列联配方法自身的改进及其与传统基于取代的序列联配方法之间的比较仍有待深入研究。  相似文献   

19.
Although it has been recognized for many years that the genus Centaurea L. is an artificial assemblage of taxa, its partition into more natural affiliations has been impossible due to its incredible complexity. One of the most reliable characteristics for establishing the phylogeny within this group is the type of pollen. Most of the classification difficulties centre in the Jacea group, which has a characteristic Jacea pollen type. Recent molecular studies indicate that this assemblage is probably polyphyletic. Specifically, previous DNA sequence analyses indicate that Centaurea pulchella and the genera Oligochaeta and Zoegea represent different lineages. This finding prompted an investigation of their pollen types, using scanning electron microscopy, and for some species, transmission microscopy. For a rigorous comparison, the study also included a wide representation of other species across the entire Jacea group. Results showed that both Oligochaeta and Zoegea , but not C. pulchella , can be clearly distinguished from the Jacea group on the basis of pollen morphology. The genus Oligochaeta has a peculiar pollen type that may represent a simplified form of the Serratula pollen type, and the genus Zoegea has Serratula pollen type.  相似文献   

20.
Selective genotyping of extreme progeny is a powerful method to increase the information content per individual when looking for quantitative trait loci (QTLs) using molecular markers for which a map is known. However, if marker information from the selected individuals is used to construct the map of the markers, this can lead to distorted segregation of the markers that in turn can lead to the estimation of a spurious linkage between independently inherited markers. The mistaken estimation of linkage between independently inherited markers will occur when there are two (or more) independently inherited QTLs linked to two (or more) markers and the same individuals are used to estimate the map of the markers and to do the QTL estimation. The incorrect linkage occurs because in selecting individuals from the tails of the phenotypic distribution we will also be selecting certain combinations of the markers instead of obtaining a random sample of the true distribution of the marker genotypes. Analytical results are outlined and the analyses of a simulated data set illustrate the problems that could arise when data from individuals chosen by selective genotyping are incorrectly employed to construct a marker map. A strategy is proposed to remedy this problem.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号