首页 | 本学科首页   官方微博 | 高级检索  
文章检索
  按 检索   检索词:      
出版年份:   被引次数:   他引次数: 提示:输入*表示无穷大
  收费全文   2345篇
  免费   252篇
  国内免费   1篇
  2023年   21篇
  2022年   42篇
  2021年   86篇
  2020年   44篇
  2019年   64篇
  2018年   75篇
  2017年   66篇
  2016年   94篇
  2015年   129篇
  2014年   123篇
  2013年   147篇
  2012年   213篇
  2011年   178篇
  2010年   114篇
  2009年   109篇
  2008年   153篇
  2007年   120篇
  2006年   118篇
  2005年   89篇
  2004年   111篇
  2003年   77篇
  2002年   75篇
  2001年   23篇
  2000年   14篇
  1999年   18篇
  1998年   14篇
  1997年   18篇
  1996年   14篇
  1995年   14篇
  1994年   12篇
  1993年   12篇
  1992年   7篇
  1991年   10篇
  1990年   8篇
  1989年   15篇
  1988年   7篇
  1987年   7篇
  1986年   8篇
  1985年   9篇
  1984年   17篇
  1983年   8篇
  1982年   12篇
  1981年   10篇
  1980年   9篇
  1979年   7篇
  1978年   5篇
  1977年   6篇
  1976年   8篇
  1974年   7篇
  1953年   3篇
排序方式: 共有2598条查询结果,搜索用时 46 毫秒
71.
Single amplified genomes and genomes assembled from metagenomes have enabled the exploration of uncultured microorganisms at an unprecedented scale. However, both these types of products are plagued by contamination. Since these genomes are now being generated in a high-throughput manner and sequences from them are propagating into public databases to drive novel scientific discoveries, rigorous quality controls and decontamination protocols are urgently needed. Here, we present ProDeGe (Protocol for fully automated Decontamination of Genomes), the first computational protocol for fully automated decontamination of draft genomes. ProDeGe classifies sequences into two classes—clean and contaminant—using a combination of homology and feature-based methodologies. On average, 84% of sequence from the non-target organism is removed from the data set (specificity) and 84% of the sequence from the target organism is retained (sensitivity). The procedure operates successfully at a rate of ~0.30 CPU core hours per megabase of sequence and can be applied to any type of genome sequence.Recent technological advancements have enabled the large-scale sampling of genomes from uncultured microbial taxa, through the high-throughput sequencing of single amplified genomes (SAGs; Rinke et al., 2013; Swan et al., 2013) and assembly and binning of genomes from metagenomes (GMGs; Cuvelier et al., 2010; Sharon and Banfield, 2013). The importance of these products in assessing community structure and function has been established beyond doubt (Kalisky and Quake, 2011). Multiple Displacement Amplification (MDA) and sequencing of single cells has been immensely successful in capturing rare and novel phyla, generating valuable references for phylogenetic anchoring. However, efforts to conduct MDA and sequencing in a high-throughput manner have been heavily impaired by contamination from DNA introduced by the environmental sample, as well as introduced during the MDA or sequencing process (Woyke et al., 2011; Engel et al., 2014; Field et al., 2014). Similarly, metagenome binning and assembly often carries various errors and artifacts depending on the methods used (Nielsen et al., 2014). Even cultured isolate genomes have been shown to lack immunity to contamination with other species (Parks et al., 2014; Mukherjee et al., 2015). As sequencing of these genome product types rapidly increases, contaminant sequences are finding their way into public databases as reference sequences. It is therefore extremely important to define standardized and automated protocols for quality control and decontamination, which would go a long way towards establishing quality standards for all microbial genome product types.Current procedures for decontamination and quality control of genome sequences in single cells and metagenome bins are heavily manual and can consume hours/megabase when performed by expert biologists. Supervised decontamination typically involves homology-based inspection of ribosomal RNA sequences and protein coding genes, as well as visual analysis of k-mer frequency plots and guanine–cytosine content (Clingenpeel, 2015). Manual decontamination is also possible through the software SmashCell (Harrington et al., 2010), which contains a tool for visual identification of contaminants from a self-organizing map and corresponding U-matrix. Another existing software tool, DeconSeq (Schmieder and Edwards, 2011), automatically removes contaminant sequences, however, the contaminant databases are required input. The former lacks automation, whereas the latter requires prior knowledge of contaminants, rendering both applications impractical for high-throughput decontamination.Here, we introduce ProDeGe, the first fully automated computational protocol for decontamination of genomes. ProDeGe uses a combination of homology-based and sequence composition-based approaches to separate contaminant sequences from the target genome draft. It has been pre-calibrated to discard at least 84% of the contaminant sequence, which results in retention of a median 84% of the target sequence. The standalone software is freely available at http://prodege.jgi-psf.org//downloads/src and can be run on any system that has Perl, R (R Core Team, 2014), Prodigal (Hyatt et al., 2010) and NCBI Blast (Camacho et al., 2009) installed. A graphical viewer allowing further exploration of data sets and exporting of contigs accompanies the web application for ProDeGe at http://prodege.jgi-psf.org, which is open to the wider scientific community as a decontamination service (Supplementary Figure S1).The assembly and corresponding NCBI taxonomy of the data set to be decontaminated are required inputs to ProDeGe (Figure 1a). Contigs are annotated with genes following which, eukaryotic contamination is removed based on homology of genes at the nucleotide level using the eukaryotic subset of NCBI''s Nucleotide database as the reference. For detecting prokaryotic contamination, a curated database of reference contigs from the set of high-quality genomes within the Integrated Microbial Genomes (IMG; Markowitz et al., 2014) system is used as the reference. This ensures that errors in public reference databases due to poor quality of sequencing, assembly and annotation do not negatively impact the decontamination process. Contigs determined as belonging to the target organism based on nucleotide level homology to sequences in the above database are defined as ‘Clean'', whereas those aligned to other organisms are defined as ‘Contaminant''. Contigs whose origin cannot be determined based on alignment are classified as ‘Undecided''. Classified clean and contaminated contigs are used to calibrate the separation in the subsequent 5-mer based binning module, which classifies undecided contigs as ‘Clean'' or ‘Contaminant'' using principal components analysis (PCA) of 5-mer frequencies. This parameter can also be specified by the user. When data sets do not have taxonomy deeper than phylum level, or a single confident taxonomic bin cannot be detected using sequence alignment, solely 9-mer based binning is used due to more accurate overall classification. In the absence of a user-defined cutoff, a pre-calibrated cutoff for 80% or more specificity separates the clean contigs from contaminated sequences in the resulting PCA of the 9-mer frequency matrix. Details on ProDeGe''s custom database, evaluation of the performance of the system and exploration of the parameter space to calibrate ProDeGe for a high accurate classification rate are provided in the Supplementary Material.Open in a separate windowFigure 1(a) Schematic overview of the ProDeGe engine. (b) Features of data sets used to validate ProDeGe: SAGs from the Arabidopsis endophyte sequencing project, MDM project, public data sets found in IMG but not sequenced at the JGI, as well as genomes from metagenomes. All the data and results can be found in Supplementary Table S3.The performance of ProDeGe was evaluated using 182 manually screened SAGs (Figure 1b,Supplementary Table S1) from two studies whose data sets are publicly available within the IMG system: genomes of 107 SAGs from an Arabidopsis endophyte sequencing project and 75 SAGs from the Microbial Dark Matter (MDM) project* (only 75/201 SAGs from the MDM project had 1:1 mapping between contigs in the unscreened and the manually screened versions, hence these were used; Rinke et al., 2013). Manual curation of these SAGs demonstrated that the use of ProDeGe prevented 5311 potentially contaminated contigs in these data sets from entering public databases. Figure 2a demonstrates the sensitivity vs specificity plot of ProDeGe results for the above data sets. Most of the data points in Figure 2a cluster in the top right of the box reflecting a median retention of 89% of the clean sequence (sensitivity) and a median rejection of 100% of the sequence of contaminant origin (specificity). In addition, on average, 84% of the bases of a data set are accurately classified. ProDeGe performs best when the target organism has sequenced homologs at the class level or deeper in its high-quality prokaryotic nucleotide reference database. If the target organism''s taxonomy is unknown or not deeper than domain level, or there are few contigs with taxonomic assignments, a target bin cannot be assessed and thus ProDeGe removes contaminant contigs using sequence composition only. The few samples in Figure 2a that demonstrate a higher rate of false positives (lower specificity) and/or reduced sensitivity typically occur when the data set contains few contaminant contigs or ProDeGe incorrectly assumes that the largest bin is the target bin. Some data sets contain a higher proportion of contamination than target sequence and ProDeGe''s performance can suffer under this condition. However, under all other conditions, ProDeGe demonstrates high speed, specificity and sensitivity (Figure 2). In addition, ProDeGe demonstrates better performance in overall classification when nucleotides are considered than when contigs are considered, illustrating that longer contigs are more accurately classified (Supplementary Table S1).Open in a separate windowFigure 2ProDeGe accuracy and performance scatterplots of 182 manually curated single amplified genomes (SAGs), where each symbol represents one SAG data set. (a) Accuracy shown by sensitivity (proportion of bases confirmed ‘Clean'') vs specificity (proportion of bases confirmed ‘Contaminant'') from the Endophyte and Microbial Dark Matter (MDM) data sets. Symbol size reflects input data set size in megabases. Most points cluster in the top right of the plot, showing ProDeGe''s high accuracy. Median and average overall results are shown in Supplementary Table S1. (b) ProDeGe completion time in central processing unit (CPU) core hours for the 182 SAGs. ProDeGe operates successfully at an average rate of 0.30 CPU core hours per megabase of sequence. Principal components analysis (PCA) of a 9-mer frequency matrix costs more computationally than PCA of a 5-mer frequency matrix used with blast-binning. The lack of known taxonomy for the MDM data sets prevents blast-binning, thus showing longer finishing times than the endophyte data sets, which have known taxonomy for use in blast-binning.All SAGs used in the evaluation of ProDeGe were assembled using SPAdes (Bankevich et al., 2012). In-house testing has shown that reads assembled with SPAdes from different strains or even slightly divergent species of the same genera may be combined into the same contig (Personal communications, KT and Robert Bowers). Ideally, the DNA in a well that gets sequenced belongs to a single cell. In the best case, contaminant sequences need to be at least from a different species to be recognized as such by the homology-based screening stage. In the absence of closely related sequenced organisms, contaminant sequences need to be at least from a different genus to be recognized as such by the composition-based screening stage (Supplementary Material). Thus, there is little risk of ProDeGe separating sequences from clonal populations or strains. We have found species- and genus-level contamination in MDA samples to be rare.To evaluate the quality of publicly available uncultured genomes, ProDeGe was used to screen 185 SAGs and 14 GMGs (Figure 1b). Compared with CheckM (Parks et al., 2014), a tool which calculates an estimate of genome sequence contamination using marker genes, ProDeGe generally marks a higher proportion of sequence as ‘Contaminant'' (Supplementary Table S2). This is because ProDeGe has been calibrated to perform at high specificity levels. The command line version of ProDeGe allows users to conduct their own calibration and specify a user-defined distance cutoff. Further, CheckM only outputs the proportion of contamination, but ProDeGe actually labels each contig as ‘Clean'' or ‘Contaminant'' during the process of automated removal.The web application for ProDeGe allows users to export clean and contaminant contigs, examine contig gene calls with their corresponding taxonomies, and discover contig clusters in the first three components of their k-dimensional space. Non-linear approaches for dimensionality reduction of k-mer vectors are gaining popularity (van der Maaten and Hinton, 2008), but we observed no systematic advantage of using t-Distributed Stochastic Neighbor Embedding over PCA (Supplementary Figure S2).ProDeGe is the first step towards establishing a standard for quality control of genomes from both cultured and uncultured microorganisms. It is valuable for preventing the dissemination of contaminated sequence data into public databases, avoiding resulting misleading analyses. The fully automated nature of the pipeline relieves scientists of hours of manual screening, producing reliably clean data sets and enabling the high-throughput screening of data sets for the first time. ProDeGe, therefore, represents a critical component in our toolkit during an era of next-generation DNA sequencing and cultivation-independent microbial genomics.  相似文献   
72.
We present a novel regularization scheme called The Generalized Elastic Net (GELnet) that incorporates gene pathway information into feature selection. The proposed formulation is applicable to a wide variety of problems in which the interpretation of predictive features using known molecular interactions is desired. The method naturally steers solutions toward sets of mechanistically interlinked genes. Using experiments on synthetic data, we demonstrate that pathway-guided results maintain, and often improve, the accuracy of predictors even in cases where the full gene network is unknown. We apply the method to predict the drug response of breast cancer cell lines. GELnet is able to reveal genetic determinants of sensitivity and resistance for several compounds. In particular, for an EGFR/HER2 inhibitor, it finds a possible trans-differentiation resistance mechanism missed by the corresponding pathway agnostic approach.  相似文献   
73.
Understanding the physiological and genetic basis of growth and body size variation has wide‐ranging implications, from cancer and metabolic disease to the genetics of complex traits. We examined the evolution of body and wing size in high‐altitude Drosophila melanogaster from Ethiopia, flies with larger size than any previously known population. Specifically, we sought to identify life history characteristics and cellular mechanisms that may have facilitated size evolution. We found that the large‐bodied Ethiopian flies laid significantly fewer but larger eggs relative to lowland, smaller‐bodied Zambian flies. The highland flies were found to achieve larger size in a similar developmental period, potentially aided by a reproductive strategy favoring greater provisioning of fewer offspring. At the cellular level, cell proliferation was a strong contributor to wing size evolution, but both thorax and wing size increases involved important changes in cell size. Nuclear size measurements were consistent with elevated somatic ploidy as an important mechanism of body size evolution. We discuss the significance of these results for the genetic basis of evolutionary changes in body and wing size in Ethiopian D. melanogaster.  相似文献   
74.
75.
BackgroundHigher serum 25-hydroxyvitamin D [25(OH)D] concentrations have been associated with a lower risk of multiple cancer types across a range of 25(OH)D concentrations.ObjectivesTo investigate whether the previously reported inverse association between 25(OH)D and cancer risk could be replicated, and if a 25(OH)D response region could be identified among women aged 55 years and older across a broad range of 25(OH)D concentrations.MethodsData from two cohorts representing different median 25(OH)D concentrations were pooled to afford a broader range of 25(OH)D concentrations than either cohort alone: the Lappe cohort (N = 1,169), a randomized clinical trial cohort (median 25(OH)D = 30 ng/ml) and the GrassrootsHealth cohort (N = 1,135), a prospective cohort (median 25(OH)D = 48 ng/ml). Cancer incidence over a multi-year period (median: 3.9 years) was compared according to 25(OH)D concentration. Kaplan-Meier plots were developed and the association between 25(OH)D and cancer risk was examined with multivariate Cox regression using multiple 25(OH)D measurements and spline functions. The study included all invasive cancers excluding skin cancer.ResultsAge-adjusted cancer incidence across the combined cohort (N = 2,304) was 840 cases per 100,000 person-years (1,020 per 100,000 person-years in the Lappe cohort and 722 per 100,000 person-years in the GrassrootsHealth cohort). Incidence was lower at higher concentrations of 25(OH)D. Women with 25(OH)D concentrations ≥40 ng/ml had a 67% lower risk of cancer than women with concentrations <20 ng/ml (HR = 0.33, 95% CI = 0.12–0.90).Conclusions25(OH)D concentrations ≥40 ng/ml were associated with substantial reduction in risk of all invasive cancers combined.  相似文献   
76.
77.
78.
Background:Use of electronic cigarettes (e-cigarettes) among adolescents has not been fully described, in particular their motivations for using them and factors associated with use. We sought to evaluate the frequency, motivations and associated factors for e-cigarette use among adolescents in Ontario.Methods:We conducted a cross-sectional study in the Niagara region of Ontario, Canada, involving universal screening of students enrolled in grade 9 in co-operation with the Heart Niagara Inc. Healthy Heart Schools’ Program (for the 2013–2014 school year). We used a questionnaire to assess cigarette, e-cigarette and other tobacco use, and self-rated health and stress. We assessed household income using 2011 Canadian census data by matching postal codes to census code.Results:Of 3312 respondents, 2367 answered at least 1 question in the smoking section of the questionnaire (1274 of the 2367 respondents [53.8%] were male, with a mean [SD] age of 14.6 [0.5] yr) and 2292 answered the question about use of e-cigarettes. Most respondents to the questions about use of e-cigarettes (n = 1599, 69.8%) had heard of e-cigarettes, and 380 (23.8%) of these respondents had learned about them from a store sign or display. Use of e-cigarettes was reported by 238 (10.4%) students. Most of the respondents who reported using e-cigarettes (171, 71.9%) tried them because it was “cool/fun/new,” whereas 14 (5.8%) reported using them for smoking reduction or cessation. Male sex, recent cigarette or other tobacco use, family members who smoke and friends who smoke were strongly associated with reported e-cigarette use. Reported use of e-cigarettes was associated with self-identified fair/poor health rating (odds ratio [OR] 1.9 (95% confidence interval [CI] 1.2–3.0), p < 0.001), high stress level (OR 1.7 (95% CI 1.1–2.7), p < 0.001) and lower mean (33.4 [8.4] × $1000 v. 36.1 [10.7] × $1000, p = 0.001) and median [interquartile range] (26.2 [5.6] × $1000 v. 28.1 [5.7] × $1000) household incomes.Interpretation:Use of e-cigarettes is common among adolescents in the Niagara region and is associated with sociodemographic features. Engaging in seemingly exciting new behaviours appears to be a key motivating factor rather than smoking cessation.Electronic cigarettes (e-cigarettes) are novel devices that are designed to mimic the physical and tactile experience of conventional cigarettes while producing a smoke-free vapour. They have quickly gained popularity despite limited evidence regarding the health risks associated with their use and a lack of regulation.1 In addition, existing literature about e-cigarettes suggests that they may not be effective for achieving smoking reduction or cessation, a use for which they are often marketed.13 Given their physical similarities to conventional cigarettes, there are concerns that the increasing use of e-cigarettes may result in the “renormalization” of cigarette smoking.4,5 Previous studies have suggested that use of e-cigarettes among adolescents and young adults may be associated with use of and exposure to tobacco.1,6,7Rates of the use of e-cigarettes at least once among high school students in the United States have increased annually.6,8 Among adolescents in Canada, use of e-cigarettes is now more common than cigarette use.9 However, questions still remain regarding the motivations and factors associated with e-cigarette use among adolescents. Therefore, we sought to evaluate the frequency, motivations and associated factors for use of e-cigarettes by students in grade 9 who were undergoing universal school-based screening for cardiovascular risk factors in the Niagara region in Ontario.  相似文献   
79.
80.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号