Recently, single nucleotide polymorphisms (SNPs) located in specific loci or genes have been identified associated with susceptibility to colorectal cancer (CRC) in Genome-Wide Association Studies (GWAS). However, in different ethnicities and regions, the genetic variations and the environmental factors can widely vary. Therefore, here we propose a post-GWAS analysis method to investigate the CRC susceptibility SNPs in Taiwan by conducting a replication analysis and bioinformatics analysis. One hundred and forty-four significant SNPs from published GWAS results were collected by a literature survey, and two hundred and eighteen CRC samples and 385 normal samples were collected for post-GWAS analysis. Finally, twenty-six significant SNPs were identified and reported as associated with susceptibility to colorectal cancer, other cancers, obesity, and celiac disease in a previous GWAS study. Functional analysis results of 26 SNPs indicate that most biological processes identified are involved in regulating immune responses and apoptosis. In addition, an efficient prediction model was constructed by applying Jackknife feature selection and ANOVA testing. As compared to another risk prediction model of CRC for European Caucasians population, which performs 0.616 of AUC by using 54 SNPs, the proposed model shows good performance in predicting CRC risk within the Taiwanese population, i.e., 0.724 AUC by using 16 SNPs. We believe that the proposed risk prediction model is highly promising for predicting CRC risk within the Taiwanese population. In addition, the functional analysis results could be helpful to explore the potential associated regulatory mechanisms that may be involved in CRC development. 相似文献
Viruses infect humans and progress inside the body leading to various diseases and complications. The phosphorylation of viral proteins catalyzed by host kinases plays crucial regulatory roles in enhancing replication and inhibition of normal host-cell functions. Due to its biological importance, there is a desire to identify the protein phosphorylation sites on human viruses. However, the use of mass spectrometry-based experiments is proven to be expensive and labor-intensive. Furthermore, previous studies which have identified phosphorylation sites in human viruses do not include the investigation of the responsible kinases. Thus, we are motivated to propose a new method to identify protein phosphorylation sites with its kinase substrate specificity on human viruses. The experimentally verified phosphorylation data were extracted from virPTM - a database containing 301 experimentally verified phosphorylation data on 104 human kinase-phosphorylated virus proteins. In an attempt to investigate kinase substrate specificities in viral protein phosphorylation sites, maximal dependence decomposition (MDD) is employed to cluster a large set of phosphorylation data into subgroups containing significantly conserved motifs. The experimental human phosphorylation sites are collected from Phospho.ELM, grouped according to its kinase annotation, and compared with the virus MDD clusters. This investigation identifies human kinases such as CK2, PKB, CDK, and MAPK as potential kinases for catalyzing virus protein substrates as confirmed by published literature. Profile hidden Markov model is then applied to learn a predictive model for each subgroup. A five-fold cross validation evaluation on the MDD-clustered HMMs yields an average accuracy of 84.93% for Serine, and 78.05% for Threonine. Furthermore, an independent testing data collected from UniProtKB and Phospho.ELM is used to make a comparison of predictive performance on three popular kinase-specific phosphorylation site prediction tools. In the independent testing, the high sensitivity and specificity of the proposed method demonstrate the predictive effectiveness of the identified substrate motifs and the importance of investigating potential kinases for viral protein phosphorylation sites. 相似文献
One of the most common and recurrent vaginal infections is bacterial vaginosis (BV). The diagnosis is based on changes to the “normal” vaginal microbiome; however, the normal microbiome appears to differ according to reproductive status and ethnicity, and even among individuals within these groups. The Amsel criteria and Nugent score test are widely used for diagnosing BV; however, these tests are based on different criteria, and so may indicate distinct changes in the vaginal microbial community. Nevertheless, few studies have compared the results of these test against metagenomics analysis.
Methods
Vaginal flora samples from 77 participants were classified according to the Amsel criteria and Nugent score test. The microbiota composition was analyzed using 16S ribosome RNA gene amplicon sequencing. Bioinformatics analysis and multivariate statistical analysis were used to evaluate the microbial diversity and function.
Results
Only 3 % of the participants diagnosed BV negative using the Amsel criteria (A−) were BV-positive according to the Nugent score test (N+), while over half of the BV-positive patients using the Amsel criteria (A+) were BV-negative according to the Nugent score test (N−). Thirteen genera showed significant differences in distribution among BV status defined by BV tests (e.g., A − N−, A + N− and A + N+). Variations in the four most abundant taxa, Lactobacillus, Gardnerella, Prevotella, and Escherichia, were responsible for most of this dissimilarity. Furthermore, vaginal microbial diversity differed significantly among the three groups classified by the Nugent score test (N−, N+, and intermediate flora), but not between the Amsel criteria groups. Numerous predictive microbial functions, such as bacterial chemotaxis and bacterial invasion of epithelial cells, differed significantly among multiple BV test, but not between the A− and A+ groups.
Conclusions
Metagenomics analysis can greatly expand our current understanding of vaginal microbial diversity in health and disease. Metagenomics profiling may also provide more reliable diagnostic criteria for BV testing.
Human gut microbiome has an essential role in human health and disease. Although the major dominant microbiota within individuals have been reported, the change of gut microbiome caused by external factors, such as antibiotic use and bowel cleansing, remains unclear. We conducted this study to investigate the change of gut microbiome in overweight male adults after bowel preparation, where none of the participants had been diagnosed with any systemic diseases.
Methods
A total of 20 overweight, male Taiwanese adults were recruited, and all participants were omnivorous. The participants provided fecal samples and blood samples at three time points: prior to bowel preparation, 7 days after colonoscopy, and 28 days after colonoscopy. The microbiota composition in fecal samples was analyzed using 16S ribosome RNA gene amplicon sequencing.
Results
Our results demonstrated that the relative abundance of the most dominant bacteria hardly changed from prior to bowel preparation to 28 days after colonoscopy. Using the ratio of Prevotella to the sum of Prevotella and Bacteroides in the fecal samples at baseline, the participants were separated into two groups. The fecal samples of the Type 1 group was Bacteroides-dominant, and that of the Type 2 group was Prevotella-dominant with a noticeable presence Bacteroides. Bulleidia appears more in the Type 1 fecal samples, while Akkermensia appears more in the Type 2 fecal samples. Of each type, the gut microbial diversity differed slightly among the three collection times. Additionally, the Type 2 fecal microbiota was temporarily susceptible to bowel cleansing. Predictive functional analysis of microbial community reveals that their activities for the mineral absorption metabolism and arachidonic acid metabolism differed significantly between the two types. Depending on their fecal type, the variance of triglycerides and C-reactive protein also differed between the two types of participants.
Conclusions
Depending upon the fecal type, the microbial diversity and the predictive functional modules of microbial community differed significantly after bowel preparation. In addition, blood biochemical markers presented somewhat associated with fecal type. Therefore, our results might provide some insights as to how knowledge of the microbial community could be used to promote health through personalized clinical treatment.
Uric acid is the primary byproduct of purine metabolism. Hyperuricemia is associated with body mass index (BMI), sex, and multiple complex diseases including gout, hypertension (HTN), renal disease, and type 2 diabetes (T2D). Multiple genome-wide association studies (GWAS) in individuals of European ancestry (EA) have reported associations between serum uric acid levels (SUAL) and specific genomic loci. The purposes of this study were: 1) to replicate major signals reported in EA populations; and 2) to use the weak LD pattern in African ancestry population to better localize (fine-map) reported loci and 3) to explore the identification of novel findings cognizant of the moderate sample size.
Methods
African American (AA) participants (n = 1,017) from the Howard University Family Study were included in this study. Genotyping was performed using the Affymetrix® Genome-wide Human SNP Array 6.0. Imputation was performed using MACH and the HapMap reference panels for CEU and YRI. A total of 2,400,542 single nucleotide polymorphisms (SNPs) were assessed for association with serum uric acid under the additive genetic model with adjustment for age, sex, BMI, glomerular filtration rate, HTN, T2D, and the top two principal components identified in the assessment of admixture and population stratification.
Results
Four variants in the gene SLC2A9 achieved genome-wide significance for association with SUAL (p-values ranging from 8.88 × 10-9 to 1.38 × 10-9). Fine-mapping of the SLC2A9 signals identified a 263 kb interval of linkage disequilibrium in the HapMap CEU sample. This interval was reduced to 37 kb in our AA and the HapMap YRI samples.
Conclusions
The most strongly associated locus for SUAL in EA populations was also the most strongly associated locus in this AA sample. This finding provides evidence for the role of SLC2A9 in uric acid metabolism across human populations. Additionally, our findings demonstrate the utility of following-up EA populations GWAS signals in African-ancestry populations with weaker linkage disequilibrium. 相似文献
Some previous studies have identified bacteria in semen as being a potential factor in male infertility. However, only few types of bacteria were taken into consideration while using PCR-based or culturing methods. Here we present an analysis approach using next-generation sequencing technology and bioinformatics analysis to investigate the associations between bacterial communities and semen quality. Ninety-six semen samples collected were examined for bacterial communities, measuring seven clinical criteria for semen quality (semen volume, sperm concentration, motility, Kruger''s strict morphology, antisperm antibody (IgA), Atypical, and leukocytes). Computer-assisted semen analysis (CASA) was also performed. Results showed that the most abundant genera among all samples were Lactobacillus (19.9%), Pseudomonas (9.85%), Prevotella (8.51%) and Gardnerella (4.21%). The proportion of Lactobacillus and Gardnerella was significantly higher in the normal samples, while that of Prevotella was significantly higher in the low quality samples. Unsupervised clustering analysis demonstrated that the seminal bacterial communities were clustered into three main groups: Lactobacillus, Pseudomonas, and Prevotella predominant group. Remarkably, most normal samples (80.6%) were clustered in Lactobacillus predominant group. The analysis results showed seminal bacteria community types were highly associated with semen health. Lactobacillus might not only be a potential probiotic for semen quality maintenance, but also might be helpful in countering the negative influence of Prevotella and Pseudomonas. In this study, we investigated whole seminal bacterial communities and provided the most comprehensive analysis of the association between bacterial community and semen quality. The study significantly contributes to the current understanding of the etiology of male fertility. 相似文献
Protein ubiquitination catalyzed by E3 ubiquitin ligases play important modulatory roles in various biological processes. With the emergence of high-throughput mass spectrometry technology, the proteomics research community embraced the development of numerous experimental methods for the determination of ubiquitination sites. The result is an accumulation of ubiquitinome data, coupled with a lack of available resources for investigating the regulatory networks among E3 ligases and ubiquitinated proteins. In this study, by integrating existing ubiquitinome data, experimentally validated E3 ligases and established protein-protein interactions, we have devised a strategy to construct a comprehensive map of protein ubiquitination networks.
Results
In total, 41,392 experimentally verified ubiquitination sites from 12,786 ubiquitinated proteins of humans have been obtained for this study. Additional 494 E3 ligases along with 1220 functional annotations and 28588 protein domains were manually curated. To characterize the regulatory networks among E3 ligases and ubiquitinated proteins, a well-established network viewer was utilized for the exploration of ubiquitination networks from 40892 protein-protein interactions. The effectiveness of the proposed approach was demonstrated in a case study examining E3 ligases involved in the ubiquitination of tumor suppressor p53. In addition to Mdm2, a known regulator of p53, the investigation also revealed other potential E3 ligases that may participate in the ubiquitination of p53.
Conclusion
Aside from the ability to facilitate comprehensive investigations of protein ubiquitination networks, by integrating information regarding protein-protein interactions and substrate specificities, the proposed method could discover potential E3 ligases for ubiquitinated proteins. Our strategy presents an efficient means for the preliminary screen of ubiquitination networks and overcomes the challenge as a result of limited knowledge about E3 ligase-regulated ubiquitination.