Similar articles
1.
2.
The genome of a Gondwanan mammal   Total citations: 1, self-citations: 0, cited by others: 1
Australia is thought of as the home of marsupials, but South America has 60 or so species of these interesting mammals. The genome of one of these, the South American grey short-tailed opossum, Monodelphis domestica, was sequenced and published in June [1]. The high-quality 6x-coverage assembly is the first marsupial genome completed, pipping the 2x coverage of the Australian tammar wallaby at the post by half a year. The opossum genome has an unusual structure, with fewer chromosomes than the human genome (9 pairs versus 23 pairs) but a longer total length (3.4 billion versus 3 billion bases). The opossum autosomes, like those of all marsupials, are extremely large but, in contrast, the X chromosome is only 76 Mb long. The opossum genome has turned up several surprises and provided critical new information on the evolution of mammalian genomes.

3.
Genomics, 2021, 113(2): 717-726
A high-quality genome is of great significance for mining the biological information resources of a species. To date, the genomes of several economically important flatfishes have been well characterized. All of these fishes have their eyes on the left side, and no high-quality genome of a species with eyes on the right side has been reported. In this study, we applied a combined strategy involving stLFR and Hi-C technologies to generate sequencing data for constructing the chromosome-level genome of Verasper variegatus, which belongs to Pleuronectidae and has its eyes on the right side. The genome size of V. variegatus is 556 Mb. More than 97.2% of BUSCO genes were detected, and the N50 lengths of the contigs and scaffolds reached 79.8 kb and 23.8 Mb, respectively, demonstrating the outstanding completeness and sequence continuity of the genome. A total of 22,199 protein-coding genes were predicted in the assembled genome, and more than 95% of those genes could be functionally annotated. Genomic collinearity, gene family and phylogenetic analyses of related species in Pleuronectiformes were also carried out to investigate metamorphosis and benthic adaptation. Sex-related genes were also mapped at the chromosome level. This study presents the first chromosome-level genome of a Pleuronectidae fish (V. variegatus). The chromosomal genome assembly constructed in this work will not only be valuable for conservation and aquaculture studies of V. variegatus but will also be of general interest in phylogenetic and taxonomic studies of Pleuronectiformes.
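The contig and scaffold N50 values quoted in the abstract above follow the usual definition: the length L such that sequences of length L or longer make up at least half of the total assembly. A minimal sketch of that calculation is given below; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the paper or from any particular assembly tool.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the summed length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy contig lengths (hypothetical, in kb), not data from the study.
toy_contigs_kb = [2, 3, 4, 5, 6]
toy_n50 = n50(toy_contigs_kb)
```

With the toy input, the two longest contigs (6 and 5) already cover 11 of 20 units, so the N50 is 5.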

4.
The African trypanosome genome   Total citations: 1, self-citations: 0, cited by others: 1
The haploid nuclear genome of the African trypanosome, Trypanosoma brucei, is about 35 Mb and varies in size among different trypanosome isolates by as much as 25%. The nuclear DNA of this diploid organism is distributed among three size classes of chromosomes: the megabase chromosomes, of which there are at least 11 pairs ranging from 1 Mb to more than 6 Mb (numbered I-XI from smallest to largest); several intermediate chromosomes of 200-900 kb and uncertain ploidy; and about 100 linear minichromosomes of 50-150 kb. Size differences of as much as four-fold can occur, both between the two homologues of a megabase chromosome pair in a specific trypanosome isolate and among chromosome pairs in different isolates. The genomic DNA sequences determined to date indicate that about 50% of the genome is coding sequence. The chromosomal telomeres possess TTAGGG repeats and many, if not all, of the telomeres of the megabase and intermediate chromosomes are linked to expression sites for genes encoding variant surface glycoproteins (VSGs). The minichromosomes serve as repositories for VSG genes, since some but not all of their telomeres are linked to unexpressed VSG genes. A gene discovery program, based on sequencing the ends of cloned genomic DNA fragments, has generated more than 20 Mb of discontinuous single-pass genomic sequence data during the past year, and the complete sequences of chromosomes I and II (about 1 Mb each) in T. brucei GUTat 10.1 are currently being determined. It is anticipated that the entire genomic sequence of this organism will be known in a few years. Analysis of a test microarray of 400 cDNAs and small random genomic DNA fragments probed with RNAs from two developmental stages of T. brucei demonstrates that microarray technology can be used to identify batteries of genes differentially expressed during the various life cycle stages of this parasite.

5.
Japanese chestnut (Castanea crenata Sieb. et Zucc.), unlike other Castanea species, is resistant to most diseases and wasps. However, genomic data of Japanese chestnut that could be used to determine its biotic stress resistance mechanisms have not been reported to date. In this study, we employed long-read sequencing and genetic mapping to generate genome sequences of Japanese chestnut at the chromosome level. Long reads (47.7 Gb; 71.6× genome coverage) were assembled into 781 contigs, with a total length of 721.2 Mb and a contig N50 length of 1.6 Mb. Genome sequences were anchored to the chestnut genetic map, comprising 14,973 single nucleotide polymorphisms (SNPs) and covering 1,807.8 cM map distance, to establish a chromosome-level genome assembly (683.8 Mb), with 69,980 potential protein-encoding genes and 425.5 Mb repetitive sequences. Furthermore, comparative genome structure analysis revealed that Japanese chestnut shares conserved chromosomal segments with woody plants, but not with herbaceous plants, of rosids. Overall, the genome sequence data of Japanese chestnut generated in this study is expected to enhance not only its genetics and genomics but also the evolutionary genomics of woody rosids.

6.
Members of the family Tetraodontidae are known to have relatively small and compact genomes compared to other vertebrates. The obscure puffer fish Takifugu obscurus is an anadromous species that migrates to freshwater from the sea for spawning. Thus the euryhaline characteristics of T. obscurus have been investigated to gain understanding of their survival ability, osmoregulation, and other homeostatic mechanisms in both freshwater and seawater. In this study, a high-quality chromosome‐level reference genome for T. obscurus was constructed using long‐read Pacific Biosciences (PacBio) Sequel sequencing and a Hi‐C‐based chromatin contact map platform. The final genome assembly of T. obscurus is 381 Mb, with a contig N50 length of 3,296 kb and longest length of 10.7 Mb, from a total of 62 Gb of raw reads generated using single‐molecule real‐time sequencing technology from a PacBio Sequel platform. The PacBio data were further clustered into chromosome‐scale scaffolds using a Hi‐C approach, resulting in a 373 Mb genome assembly with a contig N50 length of 15.2 Mb and longest length of 28 Mb. When we directly compared the 22 longest scaffolds of T. obscurus to the 22 chromosomes of the tiger puffer Takifugu rubripes, a clear one‐to‐one orthologous relationship was observed between the two species, supporting the chromosome‐level assembly of T. obscurus. This genome assembly can serve as a valuable genetic resource for exploring fugu‐specific compact genome characteristics, and will provide essential genomic information for understanding molecular adaptations to salinity fluctuations and the evolution of osmoregulatory mechanisms.

7.
Recent genome sequencing papers have given genome sizes of 180 Mb for Drosophila melanogaster Iso-1 and 125 Mb for Arabidopsis thaliana Columbia. The former agrees with early cytochemical estimates, but numerous cytometric estimates of around 170 Mb imply that a genome size of 125 Mb for arabidopsis is an underestimate. In this study, nuclei of species pairs were compared directly using flow cytometry. Co-run Columbia and Iso-1 female gave a 2C peak for arabidopsis only approx. 15 % below that for drosophila, and 16C endopolyploid Columbia nuclei had approx. 15 % more DNA than 2C chicken nuclei (with >2280 Mb). Caenorhabditis elegans Bristol N2 (genome size approx. 100 Mb) co-run with Columbia or Iso-1 gave a 2C peak for drosophila approx. 75 % above that for 2C C. elegans, and a 2C peak for arabidopsis approx. 57 % above that for C. elegans. This confirms that 1C in drosophila is approx. 175 Mb and, combined with other evidence, leads us to conclude that the genome size of arabidopsis is not approx. 125 Mb, but probably approx. 157 Mb. It is likely that the discrepancy represents extra repeated sequences in unsequenced gaps in heterochromatic regions. Complete sequencing of the arabidopsis genome until no gaps remain at telomeres, nucleolar organizing regions or centromeres is still needed to provide the first precise angiosperm C-value as a benchmark calibration standard for plant genomes, and to ensure that no genes have been missed in arabidopsis, especially in centromeric regions, which are clearly larger than once imagined.
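The arithmetic behind the estimates in the abstract above can be sketched as follows, assuming that a co-run 2C peak position scales linearly with DNA content, so that a peak a given percentage above the C. elegans peak implies a genome that much larger than the roughly 100 Mb reference. The function name is a hypothetical helper, and the sketch ignores the measurement error that the "approx." figures carry.

```python
def genome_size_from_ratio(reference_mb, percent_above):
    """Scale a reference genome size by a co-run peak ratio.

    Assumes peak position is proportional to DNA content
    (an idealisation of the flow cytometry comparison).
    """
    return reference_mb * (1 + percent_above / 100)


c_elegans_mb = 100  # approx. reference size quoted in the abstract

# Peaks approx. 75 % and 57 % above the C. elegans peak.
drosophila_mb = genome_size_from_ratio(c_elegans_mb, 75)
arabidopsis_mb = genome_size_from_ratio(c_elegans_mb, 57)

# Cross-check: arabidopsis peak relative to drosophila, in percent below.
percent_below = (1 - arabidopsis_mb / drosophila_mb) * 100
```

Under these assumptions the sketch gives about 175 Mb for drosophila and about 157 Mb for arabidopsis, matching the figures in the abstract. The derived cross-check puts arabidopsis roughly 10 % below drosophila, somewhat less than the "approx. 15 %" reported for the direct co-run, which is a reminder that the quoted percentages are approximate.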

8.
9.
The genome sequence of silkworm, Bombyx mori.   Total citations: 21, self-citations: 0, cited by others: 21
We performed threefold shotgun sequencing of the silkworm (Bombyx mori) genome to obtain a draft sequence and establish a basic resource for comprehensive genome analysis. By using the newly developed RAMEN assembler, the sequence data derived from whole-genome shotgun (WGS) sequencing were assembled into 49,345 scaffolds that span a total length of 514 Mb including gaps and 387 Mb without gaps. Because the genome size of the silkworm is estimated to be 530 Mb, almost 97% of the genome has been organized in scaffolds, of which 75% has been sequenced. By carrying out a BLAST search for 50 characteristic Bombyx genes and 11,202 non-redundant expressed sequence tags (ESTs) in a Bombyx EST database against the WGS sequence data, we evaluated the validity of the sequence for elucidating the majority of silkworm genes. Analysis of the WGS data revealed that the silkworm genome contains many repetitive sequences with an average length of <500 bp. These repetitive sequences appear to have been derived from truncated transposons, which are interspersed at 2.5- to 3-kb intervals throughout the genome. This pattern suggests that silkworm may have an active mechanism that promotes removal of transposons from the genome. We also found evidence for insertions of mitochondrial DNA fragments at 9 sites. A search for Bombyx orthologs to Drosophila genes controlling sex determination in the WGS data revealed 11 Bombyx genes and suggested that the sex-determining systems differ profoundly between the two species.
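The two percentages in the abstract above can be checked directly from the quoted sizes. The short sketch below assumes that "almost 97%" refers to scaffold span over estimated genome size and that "75%" refers to gap-free sequence over scaffold span; the variable names are illustrative.

```python
# Sizes quoted in the abstract, in Mb.
estimated_genome_mb = 530
scaffold_span_mb = 514   # total scaffold length including gaps
sequenced_mb = 387       # scaffold length without gaps

# Share of the estimated genome organized in scaffolds.
scaffold_fraction = scaffold_span_mb / estimated_genome_mb

# Share of the scaffold span that is actual sequence rather than gap.
sequenced_fraction = sequenced_mb / scaffold_span_mb

# Total length of gaps inside the scaffolds.
gap_mb = scaffold_span_mb - sequenced_mb
```

The ratios come out at about 0.970 and 0.753, consistent with the "almost 97%" and "75%" in the abstract, with 127 Mb of the scaffold span made up of gaps.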

10.
The greenfin horse‐faced filefish, Thamnaconus septentrionalis, is a valuable commercial fish species that is widely distributed in the Indo‐West Pacific Ocean. This fish has characteristic blue–green fins, rough skin and a spine‐like first dorsal fin. Thamnaconus septentrionalis is of conservation concern because its population has declined sharply, and it is an important marine aquaculture fish species in China. Genomic resources for the filefish are lacking, and no reference genome has been released. In this study, the first chromosome‐level genome of T. septentrionalis was constructed using nanopore sequencing and Hi‐C technology. A total of 50.95 Gb polished nanopore sequences were generated and were assembled into a 474.31‐Mb genome, accounting for 96.45% of the estimated genome size of this filefish. The assembled genome contained only 242 contigs, and the achieved contig N50 was 22.46 Mb, a surprisingly high value among all sequenced fish species. Hi‐C scaffolding of the genome resulted in 20 pseudochromosomes containing 99.44% of the total assembled sequences. The genome contained 67.35 Mb of repeat sequences, accounting for 14.2% of the assembly. A total of 22,067 protein‐coding genes were predicted, 94.82% of which were successfully annotated with putative functions. Furthermore, a phylogenetic tree was constructed using 1,872 single‐copy orthologous genes, and 67 unique gene families were identified in the filefish genome. This high‐quality assembled genome will be a valuable resource for a range of future genomic, conservation and breeding studies of T. septentrionalis.

11.
Isochore structures in the mouse genome   Total citations: 2, self-citations: 0, cited by others: 2
Zhang CT, Zhang R. Genomics, 2004, 83(3): 384-394
The distribution of the G+C content in the mouse genome has been studied using a windowless technique. We have found that: (i) abrupt variations of the G+C content from a GC-rich region to a GC-poor region, and vice versa, occur frequently at some sites along the sequence of the mouse genome; (ii) long domains with relatively homogeneous G+C content (isochores) exist, which usually have sharp boundaries. In total, 28 isochores longer than 1 Mb have been identified in the mouse genome. A homogeneity index was used to quantify the variations of the G+C content within isochores. The precise boundaries, sizes, and G+C contents of these isochores have been determined. The windowless technique for the G+C content computation was also used to analyze the DNA sequence containing the mouse MHC region, which has a GC-poor isochore. This isochore is located at the central part of the sequence with boundaries at 468459 and 812716 bp, where the sequence is extended from the centromeric end to the telomeric end. In addition, the analysis of a segment of the rat genome shows that the rat genome also has clear isochore structures.

12.
Novel sequences are DNA sequences present in an individual's genome but absent in the human reference assembly. They are predicted to be biologically important, both individual and population specific, and consistent with the known human migration paths. Recent works have shown that an average person harbors 2–5 Mb of such sequences and estimated that the human pan-genome contains as much as 19–40 Mb of novel sequences. To identify them in a de novo genome assembly, some existing sequence aligners have been used, but no computational method has been specifically proposed for this task. In this work, we developed NSIT (Novel Sequence Identification Tool), a software tool that can accurately and efficiently identify novel sequences in an individual's de novo whole genome assembly. We identified and characterized 1.1 Mb, 1.2 Mb, and 1.0 Mb of novel sequences in NA18507 (African), YH (Asian), and NA12878 (European) de novo genome assemblies, respectively. Our results show very high concordance with the previous work using the respective reference assembly. In addition, our results using the latest human reference assembly suggest that the amount of novel sequences per individual may not be as high as previously reported. We additionally developed a graphical viewer for comparisons of novel sequence contents. The viewer also helped in identifying sequence contamination; we found 130 kb of Epstein-Barr virus sequence in the previously published NA18507 novel sequences as well as 287 kb of zebrafish repeats in the NA12878 de novo assembly. NSIT requires 2 GB of RAM and 1.5–2 hrs on a commodity desktop. The program is applicable to input assemblies with varying contig/scaffold sizes, ranging from 100 bp to as high as 50 Mb. It works on both 32-bit and 64-bit systems and outperforms, by large margins, other fast sequence aligners previously applied to this task. To our knowledge, NSIT is the first software designed specifically for novel sequence identification in a de novo human genome assembly.

13.
The European rabbit (Oryctolagus cuniculus) is a domesticated species with one of the broadest ranges of economic and scientific applications and fields of investigation. Rabbit genome information and assembly are available (oryCun2.0), but so far few studies have investigated its variability, and no large-scale discovery of polymorphisms has yet been published for this species. Here, we sequenced two reduced representation libraries (RRLs) to identify single nucleotide polymorphisms (SNPs) in the rabbit genome. Genomic DNA of 10 rabbits belonging to different breeds was pooled and digested with two restriction enzymes (HaeIII and RsaI) to create two RRLs, which were sequenced using the Ion Torrent Personal Genome Machine. The two RRLs produced 2 917 879 and 4 046 871 reads, for a total of 280.51 Mb (248.49 Mb with quality >20) and 417.28 Mb (360.89 Mb with quality >20) respectively of sequenced DNA. About 90% and 91% respectively of the obtained reads were mapped on the rabbit genome, covering a total of 15.82% of the oryCun2.0 genome version. The mapping and ad hoc filtering procedures allowed 62 491 SNPs to be called reliably. SNPs in a few genomic regions were validated by Sanger sequencing. The Variant Effect Predictor Web tool was used to map SNPs on the current version of the rabbit genome. The obtained results will be useful for many applied and basic research programs for this species and will contribute to the development of cost‐effective solutions for high‐throughput SNP genotyping in the rabbit.

14.
Fetal DNA is present in the plasma of pregnant women. Massively parallel sequencing of maternal plasma DNA has been used to detect fetal trisomies 21, 18, 13 and selected sex chromosomal aneuploidies noninvasively. Case reports describing the detection of fetal microdeletions from maternal plasma using massively parallel sequencing have been published. However, these previous reports were either polymorphism-dependent or used statistical analyses which were confined to one or a small number of selected parts of the genome. In this report, we describe a procedure for performing noninvasive prenatal karyotyping at 3 Mb resolution across the whole genome through the massively parallel sequencing of maternal plasma DNA. This method has been used to analyze the plasma obtained from 6 cases. In three cases, fetal microdeletions were detected successfully from maternal plasma. In two cases, fetal microduplications were detected successfully from maternal plasma. In the remaining case, the plasma DNA sequencing result was consistent with the pregnant mother being a carrier of a microduplication. Simulation analyses were performed to determine the number of plasma DNA molecules that would need to be sequenced and aligned to enhance the diagnostic resolution of noninvasive prenatal karyotyping to 2 Mb and 1 Mb. In conclusion, noninvasive prenatal molecular karyotyping from maternal plasma by massively parallel sequencing is feasible and would enhance the diagnostic spectrum of noninvasive prenatal testing.

15.
Helps NR, Cohen PT. FEBS Letters, 1999, 450(1-2): 72-76
Two genome sequences of Escherichia coli K-12 substrains, one partial W3110 and one complete MG1655, have been determined by Japanese and American genome projects, respectively. In order to estimate the rate of nucleotide changes, we directly compared 2 Mb of the nucleotide sequences from these closely related E. coli substrains. Given that the two substrains separated about 40 years ago, the rate of nucleotide changes was estimated to be less than 10^-7 per site per year. This rate was supported by a further comparison between partial genome sequences of E. coli and Shigella flexneri.

16.
For 50 years now, one of the enigmas of molecular evolution has been the C-value paradox, which refers to the often massive, counterintuitive and seemingly arbitrary differences in genome size observed among eukaryotic organisms. For example, the genome of the fruitfly Drosophila melanogaster is 180 megabases (Mb), whereas that of the European brown grasshopper Podisma pedestris is 18,000 Mb. The difference in genome size of a factor of 100 is difficult to explain in view of the apparently similar levels of evolutionary, developmental and behavioural complexity of these organisms.

17.
Several publicly funded large-scale sequencing efforts have been initiated with the goal of completing the first reference human genome sequence by the year 2005. Here we present the results of analysis of 11.8 Mb of genomic sequence from chromosome 16. The apparent gene density varies throughout the region, but the number of genes predicted (84) suggests that this is a gene-poor region. This result may also suggest that the total number of human genes is likely to be at the lower end of published estimates. One of the most interesting aspects of this region of the genome is the presence of highly homologous, recently duplicated tracts of sequence distributed throughout the p-arm. Such duplications have implications for mapping and gene analysis as well as the predisposition to recurrent chromosomal structural rearrangements associated with genetic disease.

18.
Electrophoretic karyotypes of the two most widely studied strains of Phanerochaete chrysosporium, BKMF-1767 and ME-446, have been determined using transverse alternating field electrophoresis. The genomic DNA of BKMF-1767 was resolved into 10 chromosomes ranging in size from 1.8–5.0 Mb, amounting to a total genome size of about 29 Mb. The genomic DNA of strain ME-446, on the other hand, was resolved into 11 chromosomes, amounting to a total genome size of about 32 Mb. Lignin peroxidase genes have been localized to five chromosomes in strain BKMF-1767 and to four chromosomes in strain ME-446.

19.
We have followed the transmission of Ophiostoma ulmi s.l. chromosome length polymorphisms (CLPs) into the F2 generation to determine the reproducibility of a genome rearrangement culminating in the conversion of a 1.0 Mb chromosome into an 800 kb chromosome. The 1.0 Mb chromosome in strain CESS16K is thus far unique among O. ulmi s.l. wild-type strains, as no other wild-type strains have been observed with chromosomes smaller than 2.3 Mb. It has been previously shown that the 1.0 Mb chromosome is mitotically stable, carries at least one normally expressed gene, and is transmitted through meiosis. In this study, a series of crosses was performed to further elucidate the pattern of inheritance of the 1.0 Mb chromosome and the process of conversion of the 1.0 Mb species to 800 kb. In crosses where the 1.0 Mb chromosome was allowed to pair with itself or with the 800 kb chromosome, all progeny inherited a copy of the 1.0 Mb or 800 kb form, further demonstrating the A-type nature of these small chromosomes. When a cross was repeated between the strains CESS16K (1.0 Mb chromosome) and FG245Br-O (no 1.0 Mb or 800 kb chromosome), an 800 kb chromosome was observed in 9% of the progeny. A reciprocal cross between an 800 kb strain and a strain with no 800 kb or 1.0 Mb chromosome was conducted, and a progeny strain containing a 1.0 Mb chromosome was recovered. The reproducibility and reciprocality of the 1.0 Mb to 800 kb chromosome conversion demonstrate that meiotic processes are responsible for this CLP, and that O. ulmi s.l. strains with various divergent genome architectures can remain sexually compatible. Received: 6 February 1996 / Accepted: 21 January 1997

