Although the knowledge about biological systems has advanced exponentially in recent decades, it is surprising to realize that the very definition of Life keeps presenting theoretical challenges. Even if several lines of reasoning seek to identify the essence of life phenomenon, most of these thoughts contain fundamental problem in their basic conceptual structure. Most concepts fail to identify either necessary or sufficient features to define life. Here, we analyzed the main conceptual frameworks regarding theoretical aspects that have been supporting the most accepted concepts of life, such as (i) the physical, (ii) the cellular and (iii) the molecular approaches. Based on an ontological analysis, we propose that Life should not be positioned under the ontological category of Matter. Yet, life should be better understood under the top-level ontology of “Process”. Exercising an epistemological approach, we propose that the essential characteristic that pervades each and every living being is the presence of organic codes. Therefore, we explore theories in biosemiotics and code biology in order to propose a clear concept of life as a macrocode composed by multiple inter-related coding layers. This way, as life is a sort of metaphysical process of encoding, the living beings became the molecular materialization of that process. From the proposed concept, we show that the evolutionary process is a fundamental characteristic for life’s maintenance but it is not necessary to define life, as many organisms are clearly alive but they do not participate in the evolutionary process (such as infertile hybrids). The current proposition opens a fertile field of debate in astrobiology, epistemology, biosemiotics, code biology and robotics.
The production of nucleic acid sequences by automatic DNA sequencer machines is always associated with some base-calling errors. In order to produce a high-quality DNA sequence from a molecule of interest, researchers normally sequence the same sample many times. Considering base-calling errors as rare events, re-sequencing the same molecule and assembling the reads produced are frequently thought to be a good way to generate reliable sequences. However, a relevant question on this issue is: how many times the sample needs to be re-sequenced to minimize costs and achieve a high-fidelity sequence? We examined how both the number of re-sequenced reads and PHRED trimming parameters affect the accuracy and size of final consensus sequences. Hundreds of single-pool reaction pUC18 reads were generated and assembled into consensus sequences with CAP3 software. Using local alignment against the published pUC18 cloning vector sequence, the position and number of errors in the consensus were identified and stored in MySQL databases. Stringent PHRED trimming parameters proved to be efficient for the reduction of errors; however, this procedure also decreased consensus size. Moreover, re-sequencing did not have a clear effect on the removal of consensus errors, although it was able to slightly increase consensus. 相似文献
One of the most studied topics about AIDS disease is the presence of different progression levels in patients infected by
HIV. Several studies have shown that this progression is directly associated with host genetics, although viral factors are
also known to play a role. Here we explore the contribution of Vpu protein in the evolution of viral population. The sequence
variation of Vpu was analyzed during HIV infection in peripheral blood monocyte cells of 12 patients in different clinical
stages of HIV-1 infection early and late stages of infections, separated by at least 4 years. The clustering analysis of Vpu
sequences showed higher diversity of early alleles, non-random distribution of sequences, and viral evolution strains selection.
Forty-two amino acid modifications were found in the multiple alignments of the 57 different alleles found for early stage
were 23 modifications were found in the late stage dataset. Interestingly fourteen alteration of early stage were located
in conserved site related with Vpu functions alterations while these alterations appear with less frequency in the late stage
of infection. Moreover, late stage alleles tend to be similar with the Vpu wild type sequence, suggesting viral selection
toward populations harboring more efficient variants during the course of infection. This would contribute to higher infectivity
and viral replication actually observed at the aggressive late stages of infection. These data, in conjunction with in vitro
experiments, will be important to elucidation of the physiological relevance of Vpu protein in the pathogenic mechanisms of
AIDS. 相似文献
A procedure to recruit members to enlarge protein family databases is described here. The procedure makes use of UniRef50 clusters produced by UniProt. Current family entries are used to recruit additional members based on the UniRef50 clusters to which they belong. Only those additional UniRef50 members that are not fragments and whose length is within a restricted range relative to the original entry are recruited. The enriched dataset is then limited to contain only genomes from selected clades. We used the COG database - used for genome annotation and for studies of phylogenetics and gene evolution - as a model. To validate the method, a UniRef-Enriched COG0151 (UECOG) was tested with distinct procedures to compare recruited members with the recruiters: PSI-BLAST, secondary structure overlap (SOV), Seed Linkage, COGnitor, shared domain content, and neighbor-joining single-linkage, and observed that the former four agree in their validations. Presently, the UniRef50-based recruitment procedure enriches the COG database for Archaea, Bacteria and its subgroups Actinobacteria, Firmicutes, Proteobacteria, and other bacteria by 2.2-, 8.0-, 7.0-, 8.8-, 8.7-, and 4.2-fold, respectively, in terms of sequences, and also considerably increased the number of species. 相似文献