Similar articles
4.

Background

The feline genome is valuable to the veterinary and model organism genomics communities because the cat is an obligate carnivore and a model for endangered felids. The initial public release of the Felis catus genome assembly provided a framework for investigating the genomic basis of feline biology. However, the entire set of protein-coding genes has not been elucidated.

Results

We identified and characterized 1227 protein-coding feline sequences, of which 913 map to public sequences and 314 are novel. These sequences have been deposited in NCBI's GenBank database and complement public genomic resources by providing additional protein-coding sequences that fill some of the gaps in the feline genome assembly. Through functional and comparative genomic analyses, we gained an understanding of the role of these sequences in feline development, nutrition and health. Specifically, we identified 104 orthologs of human genes associated with Mendelian disorders. We detected negative selection within sequences with gene ontology annotations associated with intracellular trafficking, cytoskeleton and muscle functions. We detected relatively less negative selection on sequences with gene ontology annotations associated with extracellular networks, apoptotic pathways and mitochondria. Additionally, we characterized feline cDNA sequences that have mouse orthologs associated with clinical, nutritional and developmental phenotypes. Together, this analysis provides an overview of the value of our cDNA sequences and enhances our understanding of how the feline genome is similar to, and different from, other mammalian genomes.

Conclusions

The cDNA sequences reported here expand existing feline genomic resources by providing high-quality sequences annotated with comparative genomic information, including functional, clinical, nutritional and orthologous gene information.

6.
Novel sequences are DNA sequences present in an individual's genome but absent from the human reference assembly. They are predicted to be biologically important, both individual- and population-specific, and consistent with the known human migration paths. Recent studies have shown that an average person harbors 2–5 Mb of such sequences and have estimated that the human pan-genome contains as much as 19–40 Mb of novel sequences. To identify them in a de novo genome assembly, some existing sequence aligners have been used, but no computational method has been proposed specifically for this task. In this work, we developed NSIT (Novel Sequence Identification Tool), a software tool that can accurately and efficiently identify novel sequences in an individual's de novo whole genome assembly. We identified and characterized 1.1 Mb, 1.2 Mb, and 1.0 Mb of novel sequences in the NA18507 (African), YH (Asian), and NA12878 (European) de novo genome assemblies, respectively. Our results show very high concordance with the previous work using the respective reference assembly. In addition, our results using the latest human reference assembly suggest that the amount of novel sequence per individual may not be as high as previously reported. We additionally developed a graphical viewer for comparing novel sequence contents. The viewer also helped in identifying sequence contamination; we found 130 kb of Epstein-Barr virus sequence in the previously published NA18507 novel sequences as well as 287 kb of zebrafish repeats in the NA12878 de novo assembly. NSIT requires 2 GB of RAM and 1.5–2 h on a commodity desktop. The program is applicable to input assemblies with varying contig/scaffold sizes, ranging from 100 bp to as large as 50 Mb. It runs on both 32-bit and 64-bit systems and outperforms, by large margins, other fast sequence aligners previously applied to this task.
To our knowledge, NSIT is the first software designed specifically for novel sequence identification in a de novo human genome assembly.
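
As a rough illustration of what identifying novel sequence in an assembly involves, the sketch below reports the stretches of a contig that are not covered by any k-mer shared with a reference. This is not NSIT's algorithm, which the abstract does not describe. The function names, the k-mer size and the toy sequences are assumptions made purely for illustration, and the sketch ignores the reverse strand and sequencing errors.

```python
def kmer_set(seq, k):
    """Return the set of all k-mers (substrings of length k) in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def novel_intervals(contig, reference, k=4):
    """Return half-open (start, end) intervals of contig bases that are
    not covered by any k-mer also present in reference.

    Hypothetical helper for illustration only; forward strand only.
    """
    ref_kmers = kmer_set(reference, k)

    # Mark every contig base that lies inside a k-mer shared with the reference.
    covered = [False] * len(contig)
    for i in range(len(contig) - k + 1):
        if contig[i:i + k] in ref_kmers:
            for j in range(i, i + k):
                covered[j] = True

    # Collect maximal runs of uncovered bases as candidate novel intervals.
    intervals = []
    start = None
    for pos, is_covered in enumerate(covered):
        if not is_covered and start is None:
            start = pos
        elif is_covered and start is not None:
            intervals.append((start, pos))
            start = None
    if start is not None:
        intervals.append((start, len(contig)))
    return intervals


# Toy data (made up): a contig carrying a 6-base insertion absent from the reference.
reference = "ACGTACGTACGT"
contig = "ACGTACGT" + "GGGGGG" + "ACGTACGT"
candidates = novel_intervals(contig, reference, k=4)
```

A real tool would work on megabase-scale assemblies and would need an indexed, memory-efficient representation of the reference rather than a Python set.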

12.
The CHORI-212 bacterial artificial chromosome (BAC) library was constructed by cloning EcoRI/EcoRI partially digested DNA into the pTARBAC2.1 vector. The library has an average insert size of 161 kb and provides 10.6-fold coverage of the channel catfish haploid genome. Screening of 32 genes using overgo or cDNA probes indicated that this library has good representation of the genome, as all tested genes were present in the library. We previously reported sequencing of approximately 25,000 BAC ends that generated 20,366 high-quality BAC end sequences (BES) and identified a large number of sequences similar to known genes using BLASTX searches. In this work, particular attention was given to identifying BAC mate pairs with known genes at both ends. When such mate pairs were identified, comparative genome analysis was conducted to determine syntenic regions of the catfish genome with the genomes of zebrafish and Tetraodon. Of the 141 mate pairs with known genes from channel catfish, conserved syntenies were identified in 34 (24.1%), with 30 conserved in the zebrafish genome and 14 conserved in the Tetraodon genome. Additional analysis of three of the 34 conserved syntenic groups by direct sequencing indicated conserved gene contents in all three species. This indicates that comparative genome analysis may provide shortcuts to genome analysis in catfish, especially for short genomic regions, once the conserved syntenies are identified. Shaolin Wang and Peng Xu contributed equally to the article.
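
The synteny counts in this abstract can be checked with a few lines of arithmetic. The sketch below recomputes the reported percentage and, under the assumption that each of the 34 conserved mate pairs is conserved in at least one of the two species, derives how many must be conserved in both. The split into "both", "zebrafish only" and "Tetraodon only" is an inference from the reported totals, not a figure stated in the abstract, and the function and variable names are hypothetical.

```python
def percent(part, whole):
    """Percentage of whole represented by part, rounded to one decimal place."""
    return round(100 * part / whole, 1)


def overlap(count_a, count_b, union):
    """Size of the intersection of two sets by inclusion-exclusion:
    |A and B| = |A| + |B| - |A or B|."""
    return count_a + count_b - union


# Figures taken from the abstract.
mate_pairs_with_genes = 141
conserved_total = 34
conserved_zebrafish = 30
conserved_tetraodon = 14

conserved_percent = percent(conserved_total, mate_pairs_with_genes)

# Assumption: every one of the 34 conserved pairs is conserved in
# zebrafish, Tetraodon, or both, so 34 is the size of the union.
in_both = overlap(conserved_zebrafish, conserved_tetraodon, conserved_total)
zebrafish_only = conserved_zebrafish - in_both
tetraodon_only = conserved_tetraodon - in_both
```

Under that assumption the three derived counts sum back to the 34 conserved mate pairs, which is a quick consistency check on the reported totals.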

15.
Protein-tyrosine phosphatases (PTPs) have an important role in cell survival, differentiation, proliferation, migration and other cellular processes in conjunction with protein-tyrosine kinases. Still, relatively little is known about the function of PTPs in vivo. We set out to systematically identify all classical PTPs in the zebrafish genome and characterize their expression patterns during zebrafish development. We identified 48 PTP genes in the zebrafish genome by BLAST searches with human PTP sequences. We verified all in silico hits by sequencing and established the spatio-temporal expression patterns of all PTPs by in situ hybridization of zebrafish embryos at six distinct developmental stages. The zebrafish genome encodes 48 PTP genes. Fourteen human orthologs are duplicated in the zebrafish genome and 3 human orthologs were not identified. Based on sequence conservation, most zebrafish orthologs of human PTP genes were readily assigned. Interestingly, the duplicated form of ptpn23, a catalytically inactive PTP, has lost its PTP domain, indicating that PTP activity is not required for its function, or that ptpn23b has lost its PTP domain in the course of evolution. All 48 PTPs are expressed in zebrafish embryos. Most PTPs are maternally provided and are broadly expressed early on. PTP expression becomes progressively restricted during development. Interestingly, some duplicated genes retained their expression pattern, whereas expression of other duplicated genes was distinct or even mutually exclusive, suggesting that the function of the latter PTPs has diverged. In conclusion, we have identified all members of the family of classical PTPs in the zebrafish genome and established their expression patterns. This is the first time the expression patterns of all members of the large family of PTP genes have been established in a vertebrate. Our results provide the first step towards elucidation of the function of the family of classical PTPs.

18.
Zebrafish embryonic slow muscle cells, with their superficial localization and clear sarcomere organization, provide a useful model system for genetic analysis of muscle cell differentiation and sarcomere assembly. To develop a quick assay for testing CRISPR-mediated gene editing in slow muscles of zebrafish embryos, we targeted a red fluorescent protein (RFP) reporter gene specifically expressed in slow muscles of myomesin-3-RFP (Myom3-RFP) zebrafish embryos. We demonstrated that microinjection of RFP-sgRNA with Cas9 protein or Cas9 mRNA resulted in a mosaic loss of RFP expression in slow muscle fibers of the injected zebrafish embryos. To uncover gene functions in sarcomere organization, we targeted two endogenous genes, slow myosin heavy chain-1 (smyhc1) and heat shock protein 90 α1 (hsp90α1), which are specifically expressed in zebrafish muscle cells. We demonstrated that injection of Cas9 protein or mRNA with the respective sgRNAs targeted to smyhc1 or hsp90α1 resulted in a mosaic pattern of myosin thick filament disruption in slow myofibers of the injected zebrafish embryos. Moreover, Myom3-RFP expression and M-line localization were also abolished in these defective myofibers. Given that zebrafish embryonic slow muscles are a rapid in vivo system for testing genome editing and uncovering gene functions in muscle cell differentiation, we investigated whether microinjection of the Natronobacterium gregoryi Argonaute (NgAgo) system could induce genetic mutations and muscle defects in zebrafish embryos. Single-strand guide DNAs targeted to RFP, smyhc1, or hsp90α1 were injected with NgAgo mRNA into Myom3-RFP zebrafish embryos, and Myom3-RFP expression was analyzed in the injected embryos. The results showed that, in contrast to the CRISPR/Cas9 system, injection of the NgAgo-gDNA system did not affect Myom3-RFP expression or sarcomere organization in myofibers of the injected embryos. Sequence analysis failed to detect genetic mutations at the target genes.
Together, our studies demonstrate that zebrafish embryonic slow muscle is a rapid model for testing gene editing technologies in vivo and uncovering gene functions in muscle cell differentiation.
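
The abstract does not say how the sgRNAs were chosen. As a minimal sketch of the usual first step, the code below scans one strand of a DNA sequence for candidate target sites, assuming the common Streptococcus pyogenes Cas9 convention of a 20-nt protospacer immediately followed by an NGG PAM. The Cas9 variant, the function name and the toy sequence are assumptions for illustration; real designs also scan the reverse strand and score off-target risk.

```python
import re


def find_sgrna_targets(seq, protospacer_len=20):
    """Return (start, protospacer, pam) for every forward-strand site where a
    protospacer of the given length is directly followed by an NGG PAM.

    Hypothetical helper; assumes the SpCas9 NGG convention.
    """
    seq = seq.upper()
    # A lookahead lets overlapping candidate sites all be reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % protospacer_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


# Toy sequence (made up): a 20-nt stretch followed by an AGG PAM and two extra bases.
toy_protospacer = "GATTACAGATTACAGATTAC"
toy_sequence = toy_protospacer + "AGG" + "TT"
targets = find_sgrna_targets(toy_sequence)
```

Each returned tuple gives the 0-based start of the protospacer, the protospacer itself and the PAM that follows it.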

20.
Syngenta claims ownership of rice - but will give data away   (Total citations: 1; self-citations: 0; citations by others: 1)
