The ’omics revolution has made a large amount of sequence data available to researchers and the industry. This has had a profound impact in the field of bioinformatics, stimulating unprecedented advancements in this discipline. Mostly, this is usually looked at from the perspective of human ’omics, in particular human genomics. Plant and animal genomics, however, have also been deeply influenced by next‐generation sequencing technologies, with several genomics applications now popular among researchers and the breeding industry. Genomics tends to generate huge amounts of data, and genomic sequence data account for an increasing proportion of big data in biological sciences, due largely to decreasing sequencing and genotyping costs and to large‐scale sequencing and resequencing projects. The analysis of big data poses a challenge to scientists, as data gathering currently takes place at a faster pace than does data processing and analysis, and the associated computational burden is increasingly taxing, making even simple manipulation, visualization and transferring of data a cumbersome operation. The time consumed by the processing and analysing of huge data sets may be at the expense of data quality assessment and critical interpretation. Additionally, when analysing lots of data, something is likely to go awry—the software may crash or stop—and it can be very frustrating to track the error. We herein review the most relevant issues related to tackling these challenges and problems, from the perspective of animal genomics, and provide researchers that lack extensive computing experience with guidelines that will help when processing large genomic data sets. 相似文献
Introduction: Proteins have been historically regarded as ‘nature’s robots’: Molecular machines that are essential to cellular/extracellular physical mechanical properties and catalyze key reactions for cell/system viability. However, these robots are kept in check by other protein-based machinery to preserve proteome integrity and stability. During aging, protein homeostasis is challenged by oxidation, decreased synthesis, and increasingly inefficient mechanisms responsible for repairing or degrading damaged proteins. In addition, disruptions to protein homeostasis are hallmarks of many neurodegenerative diseases and diseases disproportionately affecting the elderly.
Areas covered: Here we summarize age- and disease-related changes to the protein machinery responsible for preserving proteostasis and describe how both aging and disease can each exacerbate damage initiated by the other. We focus on alteration of proteostasis as an etiological or phenomenological factor in neurodegenerative diseases such as Alzheimer’s, Parkinson’s, and Huntington’s, along with Down syndrome, ophthalmic pathologies, and cancer.
Expert commentary: Understanding the mechanisms of proteostasis and their dysregulation in health and disease will represent an essential breakthrough in the treatment of many (senescence-associated) pathologies. Strides in this field are currently underway and largely attributable to the introduction of high-throughput omics technologies and their combination with novel approaches to explore structural and cross-link biochemistry. 相似文献
Tomato (Solanum lycopersicum), which is used for both processing and fresh markets, is a major crop species that is the top ranked vegetable produced over the world. Tomato is also a model species for research in genetics, fruit development and disease resistance. Genetic resources available in public repositories comprise the 12 wild related species and thousands of landraces, modern cultivars and mutants. In addition, high quality genome sequences are available for cultivated tomato and for several wild relatives, hundreds of accessions have been sequenced, and databases gathering sequence data together with genetic and phenotypic data are accessible to the tomato community. Major breeding goals are productivity, resistance to biotic and abiotic stresses, and fruit sensorial and nutritional quality. New traits, including resistance to various biotic and abiotic stresses and root architecture, are increasingly being studied. Several major mutations and quantitative trait loci (QTLs) underlying traits of interest in tomato have been uncovered to date and, thanks to new populations and advances in sequencing technologies, the pace of trait discovery has considerably accelerated. In recent years, clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 gene editing (GE) already proved its remarkable efficiency in tomato for engineering favorable alleles and for creating new genetic diversity by gene disruption, gene replacement, and precise base editing. Here, we provide insight into the major tomato traits and underlying causal genetic variations discovered so far and review the existing genetic resources and most recent strategies for trait discovery in tomato. Furthermore, we explore the opportunities offered by CRISPR/Cas9 and their exploitation for trait editing in tomato. 相似文献
Next-generation sequencing projects have underappreciated information management tasks requiring detailed attention to
specimen curation, nucleic acid sample preparation and sequence production methods required for downstream data processing,
comparison, interpretation, sharing and reuse. The few existing metadata management tools for genome-based studies provide
weak curatorial frameworks for experimentalists to store and manage idiosyncratic, project-specific information, typically offering
no automation supporting unified naming and numbering conventions for sequencing production environments that routinely
deal with hundreds, if not thousands of samples at a time. Moreover, existing tools are not readily interfaced with bioinformatics
executables, (e.g., BLAST, Bowtie2, custom pipelines). Our application, the Omics Metadata Management Software (OMMS),
answers both needs, empowering experimentalists to generate intuitive, consistent metadata, and perform analyses and
information management tasks via an intuitive web-based interface. Several use cases with short-read sequence datasets are
provided to validate installation and integrated function, and suggest possible methodological road maps for prospective users.
Provided examples highlight possible OMMS workflows for metadata curation, multistep analyses, and results management and
downloading. The OMMS can be implemented as a stand alone-package for individual laboratories, or can be configured for webbased
deployment supporting geographically-dispersed projects. The OMMS was developed using an open-source software base,
is flexible, extensible and easily installed and executed. The OMMS can be obtained at http://omms.sandia.gov.