首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 9 毫秒
1.
The Human Microbiome Project (HMP) aims to characterize the microbial communities of 18 body sites from healthy individuals. To accomplish this, the HMP generated two types of shotgun data: reference shotgun sequences isolated from different anatomical sites on the human body and shotgun metagenomic sequences from the microbial communities of each site. The alignment strategy for characterizing these metagenomic communities using available reference sequence is important to the success of HMP data analysis. Six next-generation aligners were used to align a community of known composition against a database comprising reference organisms known to be present in that community. All aligners report nearly complete genome coverage (>97%) for strains with over 6X depth of coverage, however they differ in speed, memory requirement and ease of use issues such as database size limitations and supported mapping strategies. The selected aligner was tested across a range of parameters to maximize sensitivity while maintaining a low false positive rate. We found that constraining alignment length had more impact on sensitivity than does constraining similarity in all cases tested. However, when reference species were replaced with phylogenetic neighbors, similarity begins to play a larger role in detection. We also show that choosing the top hit randomly when multiple, equally strong mappings are available increases overall sensitivity at the expense of taxonomic resolution. The results of this study identified a strategy that was used to map over 3 tera-bases of microbial sequence against a database of more than 5,000 reference genomes in just over a month.  相似文献   

2.
Genome-wide data, both from SNP arrays and from complete genome sequencing, are becoming increasingly abundant and are now even available from extinct hominins. These data are providing new insights into population history; in particular, when combined with model-based analytical approaches, genome-wide data allow direct testing of hypotheses about population history. For example, genome-wide data from both contemporary populations and extinct hominins strongly support a single dispersal of modern humans from Africa, followed by two archaic admixture events: one with Neanderthals somewhere outside Africa and a second with Denisovans that (so far) has only been detected in New Guinea. These new developments promise to reveal new stories about human population history, without having to resort to storytelling.  相似文献   

3.
Neely RK  Deen J  Hofkens J 《Biopolymers》2011,95(5):298-311
The technologies associated with DNA sequencing are rapidly evolving. Indeed, single-molecule DNA sequencing strategies are cheaper and faster than ever before. Despite this progress, every sequencing platform to date relies on reading the genome in small, abstract fragments, typically of less than 1000 bases in length. The overarching aim of the optical map is to complement the information derived from DNA sequencing by providing long-range context on which these short sequence reads can be built. This is typically done using an enzyme to target and modify at short DNA sequences of, say, six bases in length throughout the genome. By accurately placing these short pieces of sequence on long genomic DNA fragments, up to several millions of bases in length, a scaffold for sequence assembly can be obtained. This review focuses on three enzymatic approaches to optical mapping. Optical mapping was first developed using restriction enzymes to sequence-specifically cleave DNA that is immobilized on a surface. More recently, nicking enzymes have found application in the sequence-specific fluorescent labeling of DNA for optical mapping. Such covalent modification allows the DNA to be imaged in solution, and this, in combination with developing nanofluidic technologies, is enabling new high-throughput approaches to mapping. And, finally, this review will discuss the recent development of mapping with subdiffraction-limit precision using methyltransferase enzymes to label the DNA with an ultrahigh density.  相似文献   

4.
Comparative studies of DNA in recent populations and characterisation of ancient hereditary material have contributed very interesting facts to our understanding of evolution of modern mankind. Analysis of DNA homology in related species, assessment of mutations and polymorphisms in various populations and new DNA sequence data from prehistoric finds allowed - via sophisticated DNA extraction techniques, PCR, sequencing and digitalised processing of genetic information - insights into possible roots of Homo sapiens and related species, migration patterns and ancient cultural habits, thus enrhing the palaeoanthropological discipline. However, a presentation of this development would not be complete without pointing towards the methodological limitations and manifold presentations burdened with artifacts, data misinterpretation and unjustified conclusions. Presently, this modern field of research is in its consolidation phase and new parameters for quality control and authentication are being implemented to avoid spectacular but unfounded reports. It is expected that most of the problems connected to old biomolecules may be closely related to fossilisation parameters. The future challenge will be the full understanding of the complex and multi-faceted processes underlying diagenesis, including the elucidation of nucleic acid postmortem damage".  相似文献   

5.
6.
A polymorphic noncoding region of chloroplast DNA (cpDNA) was successfully amplified by the polymerase chain reaction (PCR) from various oak wood samples, including recent and more ancient (about 600-years-old) samples from different oak species. Adaptation of DNA isolation and amplification protocols was necessary to obtain this result. Polymorphisms useful to distinguish species or geographical origin of these samples could be scored through sequencing. These polymorphisms include one substitution and two microsatellite-type polymorphisms, due to a variable number of A/T repeats. Identical results were obtained independently in two separate laboratories.  相似文献   

7.
Turano-Mongolian cattle are a group of taurine cattle from Northern and Eastern Asia with distinct morphological traits, which are known for their ability to tolerate harsh environments, such as the Asian steppe and the Tibetan plateau. Through the analysis of 170 mitogenomes from ten modern breeds, two sub-lineages within T3 (T3119 and T3055) were identified as specific of Turano-Mongolian cattle. These two T3 sub-lineages, together with the previously identified T4, were also present in six Neolithic samples, dated to ~3900 years BP, which might represent the earliest domestic taurine stocks from Southwest Asia. The rare haplogroup Q, found in three Tibetan cattle, testifies for the legacy of ancient migrations from Southwest Asia and suggests that the isolated Tibetan Plateau preserved unique prehistoric genetic resources. These findings confirm the geographic substructure of Turano-Mongolian cattle breeds, which have been shaped by ancient migrations and geographic barriers.Subject terms: Animal migration, Haplotypes, Phylogenetics  相似文献   

8.
High‐capacity sequencing technologies have dramatically reduced both the cost and time required to generate complete human genome sequences. Besides expanding our knowledge about existing diversity, the nature of these technologies makes it possible to extend knowledge in yet another dimension: time. Recently, the complete genome sequence of a 4,000‐year‐old human from the Saqqaq culture of Greenland was determined to 20‐fold coverage. These data make it possible to investigate the population affinities of this enigmatic culture and, by identifying several phenotypic traits of this individual, provide a limited glimpse into how these people may have looked. While undoubtedly a milestone in ancient DNA research, the cost to generate an ancient genome, even from such an exceptionally preserved specimen, remains out of reach for most. Nonetheless, recently developed DNA capture methods, already applied to Neanderthal and fossil human mitochondrial DNA, may soon make large‐scale genome‐wide analysis of ancient human diversity a reality, providing a fresh look at human population history.  相似文献   

9.

Background

Seed location filtering is critical in DNA read mapping, a process where billions of DNA fragments (reads) sampled from a donor are mapped onto a reference genome to identify genomic variants of the donor. State-of-the-art read mappers 1) quickly generate possible mapping locations for seeds (i.e., smaller segments) within each read, 2) extract reference sequences at each of the mapping locations, and 3) check similarity between each read and its associated reference sequences with a computationally-expensive algorithm (i.e., sequence alignment) to determine the origin of the read. A seed location filter comes into play before alignment, discarding seed locations that alignment would deem a poor match. The ideal seed location filter would discard all poor match locations prior to alignment such that there is no wasted computation on unnecessary alignments.

Results

We propose a novel seed location filtering algorithm, GRIM-Filter, optimized to exploit 3D-stacked memory systems that integrate computation within a logic layer stacked under memory layers, to perform processing-in-memory (PIM). GRIM-Filter quickly filters seed locations by 1) introducing a new representation of coarse-grained segments of the reference genome, and 2) using massively-parallel in-memory operations to identify read presence within each coarse-grained segment. Our evaluations show that for a sequence alignment error tolerance of 0.05, GRIM-Filter 1) reduces the false negative rate of filtering by 5.59x–6.41x, and 2) provides an end-to-end read mapper speedup of 1.81x–3.65x, compared to a state-of-the-art read mapper employing the best previous seed location filtering algorithm.

Conclusion

GRIM-Filter exploits 3D-stacked memory, which enables the efficient use of processing-in-memory, to overcome the memory bandwidth bottleneck in seed location filtering. We show that GRIM-Filter significantly improves the performance of a state-of-the-art read mapper. GRIM-Filter is a universal seed location filter that can be applied to any read mapper. We hope that our results provide inspiration for new works to design other bioinformatics algorithms that take advantage of emerging technologies and new processing paradigms, such as processing-in-memory using 3D-stacked memory devices.
  相似文献   

10.
Problems concerning the concept of biocoenosis in ecology (the antecedent of the pathocoenosis concept) are discussed first of all. Six main problems are identified: the problem of emergent properties of ecological communities; the problem of ambiguity; the problem of heterogeneity; the boundary problem; the problem of retrospective differential diagnosis; and the problem of explaining change over time. The rest of the paper gives illustrations of these problems in relation to human pathogens drawn mainly from the interactions of malaria with other diseases, particularly but not exclusively in the Mediterranean world, from antiquity through to modern times.  相似文献   

11.
Ancient DNA research has developed rapidly over the past few decades due to improvements in PCR and next‐generation sequencing (NGS) technologies, but challenges still exist. One major challenge in relation to ancient DNA research is to recover genuine endogenous ancient DNA sequences from raw sequencing data. This is often difficult due to degradation of ancient DNA and high levels of contamination, especially homologous contamination that has extremely similar genetic background with that of the real ancient DNA. In this study, we collected whole‐genome sequencing (WGS) data from 6 ancient samples to compare different mapping algorithms. To further explore more effective methods to separate endogenous DNA from homologous contaminations, we attempted to recover reads based on ancient DNA specific characteristics of deamination, depurination, and DNA fragmentation with different parameters. We propose a quick and improved pipeline for separating endogenous ancient DNA while simultaneously decreasing homologous contaminations to very low proportions. Our goal in this research was to develop useful recommendations for ancient DNA mapping and for separation of endogenous DNA to facilitate future studies of ancient DNA.  相似文献   

12.
13.
古代病原微生物基因组研究对病理学、微生物学、考古学等领域均具有重要的价值。在过去的十年里,高通量测序和靶向富集技术的发展和应用使古代微生物基因组的获取成为可能,通过对古代人群样本中获取的宏基因组进行筛查,使得引发古代疫情的相关病原体的基因组得以重建,为研究人类传染病的起源、传播和演化提供了一个独特的窗口。在当今全球化的背景下,新发及再发传染性疾病的出现频率促使我们回顾过去,以便更好地了解现代病原菌出现和古代病原菌重新出现的过程和生态环境。在这篇文章中,我们总结了近十年古代病原微生物基因组水平的研究进展,并提出了这项研究所面临的挑战以及未来的研究前景和方向。  相似文献   

14.
Different strains of the Escherichia coli reference collection (ECOR) differ widely in chromosomal size. To analyze the nature of the differential gene pool carried by different strains, we have followed an approach in which random amplified polymorphic DNA (RAPD) was used to generate several PCR fragments. Those present in some but not all the strains were screened by hybridization to assess their distribution throughout the ECOR collection. Thirteen fragments with various degrees of occurrence were sequenced. Three of them corresponded to RAPD markers of widespread distribution. Of these, two were housekeeping genes shown by hybridization to be present in all the E. coli strains and in Salmonella enterica LT2; the third fragment contained a paralogous copy of dnaK with widespread, but not global, distribution. The other 10 RAPD markers were found in only a few strains. However, hybridization results demonstrated that four of them were actually present in a large selection of the ECOR collection (between 42 and 97% of the strains); three of these fragments contained open reading frames associated with phages or plasmids known in E. coli K-12. The remaining six fragments were present in only between one and four strains; of these, four fragments showed no similarity to any sequence in the databases, and the other two had low but significant similarity to a protein involved in the Klebsiella capsule synthesis and to RNA helicases of archaeal genomes, respectively. Their percent GC, dinucleotide content, and codon adaptation index suggested an exogenous origin by horizontal transfer. These results can be interpreted as reflecting the presence of a large pool of strain-specific genes, whose origin could be outside the species boundaries.  相似文献   

15.
16.
In the present study, nuclear (autosomal and Y-chromosome short tandem repeats) and mitochondrial (hypervariable region I) ancient DNA data previously obtained from a 2,300-year-old Xiongnu population of the Egyin Gol Valley (south of Lake Baikal in northern Mongolia) (Keyser-Tracqui et al. 2003 Am. J. Hum. Genet. 73:247-260) were compared with data from two contemporary Mongolian populations: one from the same location (Egyin Gol Valley plus a perimeter of less than 100 km around the valley), and one from the whole of Mongolia. The principal objective of this comparative analysis was to assess the likelihood that genetic continuity exists between ancient and present-day Mongolian populations. Since the ancient Xiongnu sample might have been composed of some of the ancestors of the present-day Yakuts, data from a present-day Yakut population, as well as published data from Turkish populations, were also included in the comparative analysis. The main result of our study was the genetic similarity observed among Mongolian samples from different periods and geographic areas. This result supports the hypothesis that the succession over time of different Turkic and Mongolian tribes in the current territory of Mongolia resulted in cultural rather than genetic exchanges. Furthermore, it appears that the Yakuts probably did not find their origin among the Xiongnu tribes, as we previously hypothesized.  相似文献   

17.
Physical mapping of bacterial genomes.   总被引:11,自引:3,他引:8       下载免费PDF全文
  相似文献   

18.
19.
Evolution of ancient satellite DNAs in sturgeon genomes   总被引:1,自引:0,他引:1  
This study characterizes a repetitive DNA family of sequences in sturgeon, the PstI satellite DNA. We have found a high degree of preservation for these sequences, which are present in all 13 species analyzed, including within the genera Acipenser, Huso, and Scaphirhynchus of the family Acipenseridae. This is one of the most ancient satellite DNAs found to date, because it has been estimated to be more than 100 million years old. Alternatively, to the current view that most satellite DNAs are species-specific or preserved in a few closely related species, the PstI family and other previously characterized sturgeon satellite DNA, the HindIII, represent the most fascinating exceptions to the rapid sequence change usually undergone by satellite DNAs. Here, we compare the evolutionary pattern of these two satellite DNA families, PstI and HindIII, which differ markedly in length, sequence, and nucleotide composition. We have found that, in contrast to the situation in most other living beings, a high degree of preservation, a slow sequence change rate and slowed concerted evolution, appears to be a general rule for sturgeon satellite DNAs. The possible causes for all these features are discussed in the light of the evolutionary specifics found within these ancient organisms.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号