首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 687 毫秒
1.
We present a method for genome comparisons and high-resolution hybridization analyses using megabase stretches of known DNA sequences as a reference. The method employs two-dimensional gel electrophoresis, separating genomic segments cut with different restriction endonucleases in the first and second dimensions, to generate filters suitable for image analysis and repeated nucleic acid hybridizations. The corresponding two-dimensional pattern is computed from the reference nucleotide sequence and matched to the observed pattern, thereby identifying each fragment on the filter; at the same time the technique uncovers discrepancies from the reference sequence. This permits genome comparisons as well as automated identification and quantification of hybridization patterns with various probes. The technique is illustrated by an analysis ofSaccharomyces cerevisiaechromosome IX.  相似文献   

2.
Different methods are available to determine the G + C content (e.g. thermal denaturation temperature or high performance liquid chromatography, HPLC), but obtained values may differ significantly between strains, as well as between laboratories. Recently, several authors have demonstrated that the genomic DNA G + C content of prokaryotes can be reliably estimated from one or several protein coding gene nucleotide sequences. Few G + C content values have been published for the Aeromonas species described and the data, when available, are often incomplete or provide only a range of values. Our aim in this current work was twofold. First, the genomic G + C content of the type or reference strains of all species and subspecies of the genus Aeromonas was determined with a traditional experimental method in the same laboratory. Second, we wanted to see if the sequence-based method to estimate the G + C content described by Fournier et al. [7] could be applied to determine the G + C content of the different species of Aeromonas from the sequences of the genes used in taxonomy or phylogeny for this genus.  相似文献   

3.
As derived from a cDNA clone, the structure of the b-32 protein ofZea mays, a putative regulatory factor of zein expression, has a central acidic region separated by two domains covered by secondary structure motifs. In this work, three b-32 genomic clones were selected from two genomic libraries obtained from the maize inbred lines W64A and A69Y. The nucleotide sequences of the complete coding region of eachb-32 gene, as well as long stretches of their 5 and 3 flanking regions, were determined. Introns are not present in the b-32 genomic sequences. Minor variations among the three genes and an earlier reported b-32 cDNA indicates that they constitute a gene family showing a characteristic polymorphism. Such a polymorphism is highly evident in large segments of the upstream regulatory sequences. Interestingly, when compared with cDNA (W64A) or with geneb-32.120 (W64A), the genesb-32.129 (W64A) andb-32.152 (A69Y) show three jumps of the reading frame in the central part of the coding region, resulting in a completely different sequence of the b-32 protein central domain. In all cases, variations in the N- and C-terminal domains account only for microheterogeneity.  相似文献   

4.
We describe a novel modification of the polymerase chain reaction for efficient in vitro amplification of genomic DNA sequences flanking short stretches of known sequence. The technique utilizes a target enrichment step, based on the selective isolation of biotinylated fragments from the bulk of genomic DNA on streptavidin-containing support. Subsequently, following ligation with a second universal linker primer, the selected fragments can be amplified to amounts suitable for further molecular studies. The procedure has been applied to recover T-DNA flanking sequences in transgenic tomato plants which could subsequently be used to assign the positions of T-DNA to the molecular map of tomato. The method called supported PCR (sPCR) is a simple and efficient alternative to techniques used in the isolation of specific sequences flanking a known DNA segment.  相似文献   

5.
Liriodendron tulipifera L., a member of the Magnoliaceae, occupies an important phylogenetic position as a basal angiosperm that has retained numerous putatively ancestral morphological characters, and thus has often been used in studies of the evolution of flowering plants and of specific gene families. However, genomic resources for these early branching angiosperm lineages are very limited. In this study, we describe the construction of a large-insert bacterial artificial chromosome (BAC) library from L. tulipifera. Flow cytometry estimates that this nuclear genome is approximately 1,802 Mbp per haploid genome (±16 SD). The BAC library contains 73,728 clones, a 4.8-fold genome coverage, with an average insert size of 117 kb, a chloroplast DNA content of 0.2%, and little to no bacterial sequences nor empty vector content clones. As a test of the utility of this BAC library, we screened the library with six single/low-copy genic probes. We obtained at least two positive clones for each gene and confirmed the clones by DNA sequencing. A total of 182 paired end sequences were obtained from 96 of the BAC clones. Using BLAST searches, we found that 25% of the BAC end sequences were similar to DNA sequences in GenBank. Of these, 68% shared sequence with transposable elements and 25% with genes from other taxa. This result closely reflected the content of random sequences obtained from a small insert genomic library for L. tulipifera, indicating that the BAC library construction process was not biased. The first genomic DNA sequences for Liriodendron genes are also reported. All the Liriodendron genomic sequences described in this paper have been deposited in the GenBank data library. The end sequences from shotgun genomic clones and BAC clones are under accession DU169330–DU169684. Partial sequences of Gigantea, Frigida, LEAFY, cinnamyl alcohol dehydrogenase, 4-coumarate:CoA ligase, and phenylalanine ammonia-lyase genes are under accession DQ223429–DQ223434. Electronic supplementary material Supplementary material is available in the online version of this article at and is accessible for authorized users.  相似文献   

6.
A novel set of informative microsatellite markers for pepper (Capsicum annuum L.) is provided. Screening of approximately 168 000 genomic clones and 23 174 public database entries resulted in a total of 411 microsatellite-containing sequences that could be used for primer design and functional testing. A set of 154 microsatellite markers originated from short-insert genomic libraries and 257 markers originated from database sequences. Of those markers, 147 (61 from genomic libraries and 86 from database sequences) showed specific and scoreable amplification products and detected polymorphisms between at least 2 of the 33 lines of a test panel consisting of cultivated and wild Capsicum genotypes. These informative markers were subsequently surveyed for allelic variation and information content. The usefulness of the new markers for diversity and taxonomic studies was demonstrated by the construction of consistent phylogenetic trees based on the microsatellite polymorphisms. Conservation of a subset of microsatellite loci in pepper, tomato, and potato was proven by cross-species amplification and sequence comparisons. For several informative pepper microsatellite markers, homologous expressed sequence tag (EST) counterparts could be identified in these related species that also carry microsatellite motifs. Such orthologs can potentially be used as reference markers and common anchoring points on the genetic maps of different solanaceous species.  相似文献   

7.
Triticeae species (including wheat, barley and rye) have huge and complex genomes due to polyploidization and a high content of transposable elements (TEs). TEs are known to play a major role in the structure and evolutionary dynamics of Triticeae genomes. During the last 5 years, substantial stretches of contiguous genomic sequence from various species of Triticeae have been generated, making it necessary to update and standardize TE annotations and nomenclature. In this study we propose standard procedures for these tasks, based on structure, nucleic acid and protein sequence homologies. We report statistical analyses of TE composition and distribution in large blocks of genomic sequences from wheat and barley. Altogether, 3.8 Mb of wheat sequence available in the databases was analyzed or re-analyzed, and compared with 1.3 Mb of re-annotated genomic sequences from barley. The wheat sequences were relatively gene-rich (one gene per 23.9 kb), although wheat gene-derived sequences represented only 7.8% (159 elements) of the total, while the remainder mainly comprised coding sequences found in TEs (54.7%, 751 elements). Class I elements [mainly long terminal repeat (LTR) retrotransposons] accounted for the major proportion of TEs, in terms of sequence length as well as element number (83.6% and 498, respectively). In addition, we show that the gene-rich sequences of wheat genome A seem to have a higher TE content than those of genomes B and D, or of barley gene-rich sequences. Moreover, among the various TE groups, MITEs were most often associated with genes: 43.1% of MITEs fell into this category. Finally, the TRIM and copia elements were shown to be the most active TEs in the wheat genome. The implications of these results for the evolution of diploid and polyploid wheat species are discussed. Electronic Supplementary Material Supplementary material is available for this article at  相似文献   

8.
We have isolated two clones containing 19 kilobases (kb) of the human gene coding for a pro-alpha 1 (II) collagen chain from human lambda genomic DNA libraries. A 3' clone, HC2A, was selected by cross-hybridization with a cDNA clone containing sequences coding for the carboxy propeptide of chick type II procollagen. A second clone, HC2B, was obtained by screening the library with the 5' part of HC2A. The sequence analysis of exon 3 corresponding to the C propeptide reveals the presence of stretches of conserved nucleotides between the human and the chick type II procollagen genes. On Northern blots, the human collagen clone hybridizes strongly to a 5.5-kb RNA for the rat type II procollagen chain. Finally, studies of genomic DNAs from normal individuals reveal the presence of a HindIII and a BamHI polymorphic site at the 3' end of the gene.  相似文献   

9.
Microsatellite repeat sequences were investigated as sequenced-tagged site (STS) DNA markers to determine the potential for genetic analysis of the grapevine genome. The PCR-generated markers detect codominant alleles at a single locus or site in the genome. The marker type is very informative detecting high heterozygosity (69%–88%) within individual grapevine cultivars and high genetic variation between cultivars, making it a useful marker type for plant genome mapping and genome typing. For five loci a screening of 26 V. vinifera cultivars found 13, 12, 8, 5, and 4 different length alleles respectively with some alleles more common than others. The genomic DNA sequences surrounding microsatellite sequences were conserved within the genus permitting STS primers to amplify STSs from other Vitis species. These Vitis species were found to have some unique alleles not present in V. vinifera.  相似文献   

10.
Zhang Y  Zhu S  Yan D  Liu G  Bai R  Wang D  Chen L  Zhu H  An H  Kew O  Xu W 《PloS one》2010,5(12):e15300

Background

Ten uncommon natural type 3/type 2 intertypic poliovirus recombinants were isolated from stool specimens from nine acute flaccid paralysis case patients and one healthy vaccinee in China from 2001 to 2008.

Principal Findings

Complete genomic sequences revealed their vaccine-related genomic features and showed that their first crossover sites were randomly distributed in the 3′ end of the VP1 coding region. The length of donor Sabin 2 sequences ranged from 55 to 136 nucleotides, which is the longest donor sequence reported in the literature for this type of poliovirus recombination. The recombination resulted in the introduction of Sabin 2 neutralizing antigenic site 3a (NAg3a) into a Sabin 3 genomic background in the VP1 coding region, which may have been altered by some of the type 3-specific antigenic properties, but had not acquired any type 2-specific characterizations. NAg3a of the Sabin 3 strain seems atypical; other wild-type poliovirus isolates that have circulated in recent years have sequences of NAg3a more like the Sabin 2 strain.

Conclusions

10 natural type 3/type 2 intertypic VP1 capsid-recombinant polioviruses, in which the first crossover sites were found to be in the VP1 coding region, were isolated and characterized. In spite of the complete replacement of NAg3a by type 2-specific amino acids, the serotypes of the recombinants were not altered, and they were totally neutralized by polyclonal type 3 antisera but not at all by type 2 antisera. It is possible that recent type 3 wild poliovirus isolates may be a recombinant having NAg3a sequences derived from another strain during between 1967 and 1980, and the type 3/type 2 recombination events in the 3′ end of the VP1 coding region may result in a higher fitness.  相似文献   

11.
We report a new experimental method of direct selection, identification, and mapping of potential enhancer sequences within extended stretches of genomic DNA. The method allows simultaneous cloning of a quantity of sequences instead of tedious screening of the separate ones, thus providing a robust and high-throughput approach to the mapping of enhancers. The selection procedure is based on the ability of such sequences to activate a minimal promoter that drives expression of a selective gene. To this end a mixture of short DNA fragments derived from the segment of interest was cloned in a retroviral vector containing the neomycin phosphotransferase II gene under control of a cytomegalovirus (CMV) minimal promoter. The pool of retroviruses obtained was used to infect HeLa cells and then to select neomycin-resistant colonies containing constructs with enhancer-like sequences. The pool of the genomic fragments was rescued by PCR and cloned, forming a library of the potential enhancers. Fifteen enhancer-like fragments were selected from 1-Mb human genome locus, and enhancer activity of 13 of them was verified in a transient transfection reporter gene assay. The sequences selected were found to be predominantly located near 5' regions of genes or within gene introns.  相似文献   

12.
13.
Complete genomes can be recovered from metagenomes by assembling and binning DNA sequences into metagenome assembled genomes (MAGs). Yet, the presence of microdiversity can hamper the assembly and binning processes, possibly yielding chimeric, highly fragmented and incomplete genomes. Here, the metagenomes of four samples of aerobic granular sludge bioreactors containing Candidatus (Ca.) Accumulibacter, a phosphate-accumulating organism of interest for wastewater treatment, were sequenced with both PacBio and Illumina. Different strategies of genome assembly and binning were investigated, including published protocols and a binning procedure adapted to the binning of long contigs (MuLoBiSC). Multiple criteria were considered to select the best strategy for Ca. Accumulibacter, whose multiple strains in every sample represent a challenging microdiversity. In this case, the best strategy relies on long-read only assembly and a custom binning procedure including MuLoBiSC in metaWRAP. Several high-quality Ca. Accumulibacter MAGs, including a novel species, were obtained independently from different samples. Comparative genomic analysis showed that MAGs retrieved in different samples harbour genomic rearrangements in addition to accumulation of point mutations. The microdiversity of Ca. Accumulibacter, likely driven by mobile genetic elements, causes major difficulties in recovering MAGs, but it is also a hallmark of the panmictic lifestyle of these bacteria.  相似文献   

14.
Lee YK  Lee KH  Kim SG  Melhem R  Moon CS  Liu S  Greenhalgh DG  Cho K 《PloS one》2012,7(4):e35156
The entirety of all protein coding sequences is reported to represent a small fraction (~2%) of the mouse and human genomes; the vast majority of the rest of the genome is presumed to be repetitive elements (REs). In this study, the C57BL/6J mouse reference genome was subjected to an unbiased RE mining to establish a whole-genome profile of RE occurrence and arrangement. The C57BL/6J mouse genome was fragmented into an initial set of 5,321 units of 0.5 Mb, and surveyed for REs using unbiased self-alignment and dot-matrix protocols. The survey revealed that individual chromosomes had unique profiles of RE arrangement structures, named RE arrays. The RE populations in certain genomic regions were arranged into various forms of complexly organized structures using combinations of direct and/or inverse repeats. Some of these RE arrays spanned stretches of over 2 Mb, which may contribute to the structural configuration of the respective genomic regions. There were substantial differences in RE density among the 21 chromosomes, with chromosome Y being the most densely populated. In addition, the RE array population in the mouse chromosomes X and Y was substantially different from those of the reference human chromosomes. Conversion of the dot-matrix data pertaining to a tandem 13-repeat structure within the Ch7.032 genome unit into a line map of known REs revealed a repeat unit of ~11.3 Kb as a mosaic of six different RE types. The data obtained from this study allowed for a comprehensive RE profiling, including the establishment of a library of RE arrays, of the reference mouse genome. Some of these RE arrays may participate in a spectrum of normal and disease biology that are specific for mice.  相似文献   

15.
Recombination events between human enteroviruses (HEV) are known to occur frequently and to participate in the evolution of these viruses. In a previous study, we reported the isolation of a panel of viruses belonging to the Human enterovirus species C (HEV-C) that had been cocirculating in a small geographic area of Madagascar in 2002. This panel included type 2 vaccine-derived polioviruses (PV) that had caused several cases of acute flaccid paralysis in humans. Previous partial sequencing of the genome of these HEV-C isolates revealed considerable genetic diversity, mostly due to recombination. In the work presented herein, we carried out a more detailed characterization of the genomes of viruses from this collection. First, we determined the full VP1 sequence of 41 of these isolates of different types. These sequences were compared with those of HEV-C isolates obtained from other countries or in other contexts. The sequences of the Madagascan isolates of a given type formed specific clusters clearly differentiated from those formed by other strains of the same type isolated elsewhere. Second, we sequenced the entire genome of 10 viruses representing most of the lineages present in this panel. All but one of the genomes appeared to be mosaic assemblies of different genomic fragments generated by intra- and intertypic recombination. The location of the breakpoints suggested potential preferred genomic regions for recombination. Our results also suggest that recombination between type HEV-99 and other HEV-C may be quite rare. This first exhaustive genomic analysis of a panel of non-PV HEV-C cocirculating in a small human population highlights the high frequency of inter and intra-typic genetic recombination, constituting a widespread mechanism of genetic plasticity and continually shifting the HEV-C biodiversity.  相似文献   

16.
Polyploidy is a key factor in the evolution of higher plants and plays an important role in the variation of plant genomes, leading to speciation in some cases. During polyploidisation, different balancing processes take place at the genomic level that can promote variation in nuclear DNA content. We estimated genome size using flow cytometry in 84 populations of 67 Artemisia species and one population of Crossostephium chinense. A total of 73 sequences of nrDNA ITS and 3′‐ETS were newly generated and analysed, together with previously published sequences, to address the evolution of genome size in a phylogenetic framework. Differences in 2C values were detected among some lineages, as well as an increase of genome size heterogeneity in subgenera whose phylogenetic relationships are still unclear. We confirmed that the increase in 2C values in Artemisia polyploids was not proportional to ploidy level, but 1Cx genome size tended to decrease significantly when high ploidy levels were reached. The results lead us to hypothesise that genome size in polyploids tends to a maximum as it follows saturation behaviour, in agreement with the Michaelis–Menten model. We tested different arithmetic functions with our dataset that corroborated a non‐linear relationship of genome size increase in polyploids, allowing us to suggest a theoretical upper limit for the DNA content of this genus.  相似文献   

17.
Retrospective molecular and phenotypic characterization of a vaccine-derived poliovirus (VDPV) type 1 isolate (7/b/97) isolated from sewage in Athens, Greece, in 1997 is reported. VP1 sequencing of this isolate revealed 1.87% divergence from the VP1 region of reference strain Sabin 1, while further genomic characterization of isolate 7/b/97 revealed a recombination event in the nonstructural part of the genome between a vaccine strain and a nonvaccine strain probably belonging to Enterovirus species C. Amino acid substitutions commonly found in previous studies were identified in the capsid coding region of the isolate, while most of the attenuation and temperature sensitivity determinants were reverted. The ultimate source of isolate 7/b/97 is unknown. The recovery of such a highly divergent derivative of a vaccine strain emphasizes the need for urgent implementation of environmental surveillance as a supportive procedure in the polio surveillance system even in countries with high rates of OPV coverage in order to prevent cases or even outbreaks of poliomyelitis that otherwise would be inevitable.  相似文献   

18.
Group C rotaviruses are an important cause of acute gastroenteritis in humans and animals. Fecal samples were collected from a porcine herd in July, 2009. Group C rotavirus RNA was detected using RT-PCR for the VP6 gene. The identified strain was further characterized by sequencing and phylogenetic analysis of the partial VP4, and complete VP6 and VP7 gene sequences. The partial VP4 and complete VP6 gene sequences of the CUK-5 strain were most closely related to those of the CUK-6 strain of group C rotaviruses. Phylogenetic analysis of the VP7 gene of the 2 strains (CUK-5 and CUK-6) and reference strains of group G rotavirus by the neighbor-joining method also confirmed that CUK-5 and CUK-6 belonged to type G5 and G1 strains, respectively. This study provides useful data for the prediction of newly appearing variants of porcine group C rotaviruses in neighboring countries through comparisons with GCRVs and fundamental research for vaccine development.  相似文献   

19.
Two synthetic oligonucleotide probe mixtures, whose sequences were inferred from two separate stretches of amino acids, one closer to the carboxy terminal and the other closer to the amino terminal, of ferredoxin I protein ofAzotobacter vinelandii, were used to select ferredoxin I gene clones from a cosmid gene library ofAzotobacter vinelandii. Restriction analysis revealed that 7 out of 10 selected clones were of the same type. All these clones were found to hybridize withfixABCX genes ofRhizobium meliloti.  相似文献   

20.
Introns are generally believed to evolve too rapidly and too erratically to be of much use in phylogenetic reconstructions. Few phylogenetically informative intron sequences are available, however, to ascertain the validity of this supposition. In the present study the supposition was tested on the example of the mammalian class II major histocompatibility complex (Mhc) genes of the DRB family. Since the Mhc genes evolve under balancing selection and are believed to recombine or rearrange frequently, the evolution of their introns could be expected to be particularly rapid and subject to scrambling. Sequences of intron 4 and 5 DRB genes were obtained from polymerase chain reaction-amplified fragments of genomic DNA from representatives of six eutherian orders—Primates, Scandentia, Chiroptera, Dermoptera, Lagomorpha, and Insectivora. Although short stretches of the introns have indeed proved to be unalignable, the bulk of the intron sequences from all six orders, spanning >85 million years (my) of evolution, could be aligned and used in a study of the tempo and mode of intron evolution. The analysis has revealed the Mhc introns to evolve at a rate similar to that of other genes and of synonymous sites of non-Mhc genes. No evidence of homogenization or large-scale scrambling of the intron sequences could be found. The Mhc introns apparently evolve largely by point mutations and insertions/deletions. The phylogenetic signals contained in the intron sequences could be used to identify Scandentia as the sister group of Primates, to support the existence of the Archonta superorder, and to confirm the monophyly of the Chiroptera. Received: 26 October 1998 / Accepted: 21 December 1998  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号