Similar articles
1.
All sequences containing the centromeric (CEN) mouse minor satellite (MiSat) or the pericentromeric (periCEN) mouse major satellite (MaSat) were searched for in the whole genome shotgun (WGS) database. The sequences were checked for the presence of known dispersed repeats using the Censor software. The presence of tandem repeats was tested using Tandem Repeats Finder (TRF). Monotonous MiSat and MaSat arrays and MaSat to MiSat array transitions were detected. Moreover, two other types of contacts were revealed: (1) MiSat transitions to fragments of the retroelements LINE and IAP (ERV family, intracisternal A-type particles), mainly to elements containing ORF2 and the 5′-LTR; (2) MaSat transitions to two tandem repeats with monomers 21 bp and 31 bp in size. The presence of the MiSat/IAP transition could be checked experimentally. The common DNA motif among the IAP fragments close to MiSat was isolated. IAP-specific primers were constructed, and the fragments obtained by PCR with IAP and MiSat primers were used to build a plasmid vector library. Clone n51, with the maximum length of the possible insertion (∼800 bp), was selected from the library. FISH on extended chromatin fibers (fiberFISH) carried out with the n51 clone demonstrated that the main signal definitely belonged to CEN. However, signals on the chromosome arms were also detected, which could be due to the partial homology of n51 to dispersed repeats. Double fiberFISH with MiSat and n51 made it possible to measure the distances between the fragments. The previously obtained MS3 sequence has some homology to IAP and localizes to CEN. Thus, regular associations of MiSat with IAP retroelements were shown in silico and in situ. Together with the published data, the present findings suggest that retroelements or their fragments may be essential components of the normal centromere of higher eukaryotes.

2.
3.
Nucleolar precursor bodies (NPB) are discrete entities in zygotic pronuclei and in the nuclei of two-cell mouse embryos. Centromeric (CEN) and pericentromeric (periCEN) chromosome regions are associated with the chromatin layer surrounding NPB. Four types of satellite DNA (satDNA) are currently known in Mus musculus: mouse minor satellite (MiSat) and mouse satellite 3 (MS3) in the CEN region, and mouse major satellite (MaSat) and mouse satellite 4 (MS4) in the periCEN region. We determined the localization of these four types of mouse satDNA and associated proteins (RNA-helicase p68, the SMC3 and Rad21 subunits of the cohesin complex, and the SYCP3 subunit of the synaptonemal complex) with respect to NPB. Partially flattened nuclei of one- and two-cell intact embryos and of embryos treated with okadaic acid (OA) were used. Different satDNAs were found to be localized in different regions at the NPB surface: periCEN MaSat occupied almost the whole NPB surface; CEN MiSat, MS3 and periCEN MS4 were located more peripherally. The four satDNAs together did not cover the entire NPB area, which indicates the presence of other DNA sequences involved in the association with the NPB periphery. Among the proteins probed, RNA-helicase p68 and components of the multiprotein cohesin and synaptonemal complexes (SCs) showed the most prominent colocalization with NPB. Our results support the idea that NPB are chromocenter precursors.

4.
Although satellite DNAs are well-explored components of heterochromatin and centromeres, little is known about the emergence, dispersal and possible impact of comparably structured tandem repeats (TRs) on the genome-wide scale. Our bioinformatics analysis of the assembled Tribolium castaneum genome revealed a significant contribution of TRs in euchromatic chromosomal arms and a clear predominance of satellite DNA-typical 170 bp monomers in arrays of ≥5 repeats. By applying different experimental approaches, we revealed that the nine most prominent TR families, Cast1–Cast9, extracted from the assembly comprise ∼4.3% of the entire genome and reside almost exclusively in euchromatic regions. Among them, seven families that build ∼3.9% of the genome are based on ∼170 and ∼340 bp long monomers. Results of phylogenetic analyses of 2500 monomers originating from these families show high sequence dynamics, evident from extensive exchanges between arrays on non-homologous chromosomes. In addition, our analysis shows that concerted evolution acts more efficiently on longer than on shorter arrays. The efficient genome-wide distribution of the nine TR families implies a role for transposition only in the expansion of the most dispersed family, and the involvement of other mechanisms is anticipated. Despite similarities in sequence features, FISH experiments indicate high-level compartmentalization of centromeric and euchromatic tandem repeats.

5.
The interspersed repeat content of mammalian genomes has been best characterized in human, mouse and cow. In this study, we carried out de novo identification of repeated elements in the equine genome and identified previously unknown elements present at low copy number. The equine genome contains typical eutherian mammal repeats, but also has a significant number of hybrid repeats in addition to clade-specific Long Interspersed Nuclear Elements (LINE). Equus caballus clade-specific LINE 1 (L1) repeats can be classified into approximately five subfamilies, three of which have undergone significant expansion. There are 1115 full-length copies of these equine L1, but of the 103 presumptive active copies, 93 fall within a single subfamily, indicating a rapid recent expansion of this subfamily. We also analysed both interspersed and simple sequence repeats (SSR) genome-wide, finding that some repeat classes are spatially correlated with each other as well as with G+C content and gene density. Based on these spatial correlations, we have confirmed that recently described ancestral vs. clade-specific genome territories can be defined by their repeat content. The clade-specific Short Interspersed Nuclear Element correlations were scattered over the genome and appear to have been extensively remodelled. In contrast, territories enriched for ancestral repeats tended to be contiguous domains. To determine whether the latter territories were evolutionarily conserved, we compared these results with a similar analysis of the human genome, and observed similar ancestral repeat-enriched domains. These results indicate that ancestral, evolutionarily conserved mammalian genome territories can be identified on the basis of repeat content alone. Interspersed repeats of different ages appear to be analogous to geologic strata, allowing identification of ancient vs. newly remodelled regions of mammalian genomes.

6.
There is evidence that Matrix Attachment Region (MAR)-binding proteins also bind satellite DNA (satDNA). The aim of the current work was to determine whether the major nuclear matrix (NM) MAR-binding proteins are able to recognize satDNAs of different locations and which DNA structural features are important for the recognition. In nuclei and NM, a number of the same polypeptides were recognized on a southwestern blot when the MAR of the immunoglobulin kappa gene (Ig kappa MAR) and pericentromeric (periCEN) satDNA fragments were used. However, the binding decreased dramatically when human and mouse CEN satDNA were used as probes. After an NM extract was subjected to ion exchange chromatography, the main DNA-binding proteins were identified as SAF-A (scaffold attachment factor A) and lamin B. It was not possible to test the binding of lamin B by gel mobility shift assay (GMSA), but SAF-A showed an ability to distinguish CEN and periCEN satDNA fragments in GMSA. While periCEN fragments have an abnormally slow mobility on electrophoresis, which is a hallmark of bent DNA, CEN satDNA fragments have a normal mobility. A computer analysis was done using the wedge model (Ulanovsky and Trifonov [1987] Nature 326:720-722), which describes how the curved state depends on particular nucleotide sequences. The curved states of the fragments predicted by the model are in good agreement with their ability to be recognized by NM proteins. Thus SAF-A and lamin B are able to recognize conserved structural features of satDNA in the same way that MAR-binding proteins recognize MARs in spite of the lack of a consensus sequence. CEN and periCEN satDNAs are distinguished by proteins in correlation with the helical curvature of these fragments.

7.
Tandem repeats (TRs) represent one of the most prevalent features of genomic sequences. Due to their abundance and functional significance, a plethora of detection tools has been devised over the last two decades. Despite this longstanding interest, TR detection is still not resolved. Our large-scale tests reveal that current detectors produce different, often nonoverlapping inferences, reflecting characteristics of the underlying algorithms rather than the true distribution of TRs in genomic data. Our simulations show that the power to detect TRs depends on the degree of their divergence and on repeat characteristics such as the length of the minimal repeat unit and the number of units in tandem. To reconcile the diverse predictions of current algorithms, we propose and evaluate several statistical criteria for measuring the quality of predicted repeat units. In particular, we propose a model-based phylogenetic classifier, entailing a maximum-likelihood estimation of the repeat divergence. Applied in conjunction with state-of-the-art detectors, our statistical classification scheme for inferred repeats makes it possible to filter out false-positive predictions. Since different algorithms appear to specialize in predicting TRs with certain properties, we advise applying multiple detectors with subsequent filtering to obtain the most complete set of genuine repeats.
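
The dependence of detection power on repeat divergence can be illustrated with a minimal sketch. It is not the model-based phylogenetic classifier proposed in the abstract; it is a much simpler stand-in that assumes a fixed unit length and compares an exact-match detector with a mean identity score between adjacent units. The function names `find_exact_tandem` and `unit_identity` are hypothetical and chosen only for this example.

```python
def find_exact_tandem(seq, unit_len, min_copies=2):
    """Return (start, copies) for runs of exact tandem copies of a unit of length unit_len."""
    hits = []
    i = 0
    while i + unit_len * min_copies <= len(seq):
        unit = seq[i:i + unit_len]
        copies = 1
        # Extend the run while the next window is an exact copy of the unit.
        while seq[i + copies * unit_len:i + (copies + 1) * unit_len] == unit:
            copies += 1
        if copies >= min_copies:
            hits.append((i, copies))
            i += copies * unit_len
        else:
            i += 1
    return hits


def unit_identity(seq, start, unit_len, copies):
    """Mean fraction of matching positions between adjacent repeat units."""
    units = [seq[start + k * unit_len:start + (k + 1) * unit_len] for k in range(copies)]
    if copies < 2 or any(len(u) != unit_len for u in units):
        raise ValueError("region does not contain the requested number of full units")
    matches = sum(a == b for u, v in zip(units, units[1:]) for a, b in zip(u, v))
    return matches / (unit_len * (copies - 1))


perfect = "ACGT" * 4
diverged = "ACGTACGAACGTTCGT"  # four 4-bp units, one substitution between neighbours

print(find_exact_tandem(perfect, 4))    # the exact detector finds the perfect array
print(find_exact_tandem(diverged, 4))   # and misses the diverged one entirely
print(unit_identity(diverged, 0, 4, 4))  # an identity score still flags it as repeat-like
```

In this toy case the exact detector reports nothing for the diverged array, whereas the identity score remains high, which is one way to see why detectors built on different criteria can return nonoverlapping predictions.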

8.
Tandem repeats (TRs) are often present in proteins with crucial functions, responsible for resistance, pathogenicity and associated with infectious or neurodegenerative diseases. This motivates numerous studies of TRs and their evolution, requiring accurate multiple sequence alignment. TRs may be lost or inserted at any position of a TR region by replication slippage or recombination, but current methods assume fixed unit boundaries, and yet are of high complexity. We present a new global graph-based alignment method that does not restrict TR unit indels by unit boundaries. TR indels are modeled separately and penalized using the phylogeny-aware alignment algorithm. This ensures enhanced accuracy of reconstructed alignments, disentangling TRs and measuring indel events and rates in a biologically meaningful way. Our method detects not only duplication events but also all changes in TR regions owing to recombination, strand slippage and other events inserting or deleting TR units. We evaluate our method by simulation incorporating TR evolution, either by sampling TRs from a profile hidden Markov model or by mimicking strand slippage with duplications. The new method is illustrated on a family of type III effectors, a pathogenicity determinant in the agriculturally important bacterium Ralstonia solanacearum. We show that TR indel rate variation contributes to the diversification of this protein family.

9.
10.
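Number, distribution and density of microsatellite repeats in the mouse genome   Total citations: 1 (self-citations: 0; citations by others: 1)
The authors analyzed the number and density of each type of microsatellite (repeats with units of 1-6 bases) on every chromosome of the mouse genome and in its introns, exons and intergenic regions. SSRs make up about 2.85% of the mouse genome; of these, 46.2% lie in intergenic regions, 4.75% in exons and 49.05% in introns, so the non-coding regions are rich in microsatellites. The number of microsatellites is related to the size of the chromosome or genic region, but their density is only weakly related to it. In the exonic regions of chromosome 4, all six types of SSR are less abundant than in other regions. A, T, AC, AG, AT, AAC, AAG, AGG, AAAC, AAAG, AAAT, AACC, AAAAC, AAAAG, AAAAT, AAACC, AAAGG, AAGAG, AAAAAC, AAAAAG, AAAAAT, AAAGAG, ACACAT, ACAGAG, ACAGGC and ACATAT are the main SSR types in the mouse genome, whereas SSRs with certain 5-base repeat units are absent from one or even several chromosomes of the mouse genome.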
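
The definition used in this abstract, a microsatellite as a tandem run of a 1-6 base unit, together with the notion of density, can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the minimum of four repeats is an assumed threshold that the abstract does not state, the function names `find_ssrs` and `ssr_density_per_mb` are hypothetical, and motifs are reported as found rather than merged into canonical classes (for example, CA is not folded into AC).

```python
import re


def find_ssrs(seq, min_repeats=4):
    """Return (start, motif, repeat_count) for perfect SSRs with 1-6 bp motifs.

    min_repeats is an assumed threshold, not a value taken from the abstract.
    """
    # The lazy {1,6}? makes the regex try the shortest motif first, so a run
    # such as ACACACAC is reported with motif AC rather than ACAC.
    pattern = re.compile(r"([ACGT]{1,6}?)\1{%d,}" % (min_repeats - 1))
    return [
        (m.start(), m.group(1), len(m.group(0)) // len(m.group(1)))
        for m in pattern.finditer(seq.upper())
    ]


def ssr_density_per_mb(seq, min_repeats=4):
    """Number of SSR loci per megabase of the given sequence."""
    return len(find_ssrs(seq, min_repeats)) / len(seq) * 1_000_000


toy = "GG" + "AC" * 5 + "TTC" + "AAG" * 4 + "C"
for start, motif, count in find_ssrs(toy):
    print(start, motif, count)
```

Running the same functions separately on exon, intron and intergenic sequences would give per-region counts and densities of the kind the abstract compares, under the stated assumptions.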

11.
Expansion or shrinkage of existing tandem repeats (TRs) associated with various biological processes has been actively studied in both prokaryotic and eukaryotic genomes, while their origin and biological implications remain mostly unknown. Here we describe various duplications (de novo TRs) that occurred in the coding region of a β-lactamase gene, where a conserved structure called the omega loop is encoded. These duplications, which occurred under selection with ceftazidime, extended the substrate spectrum to include the antibiotic. Under selective pressure with one of the original substrates (amoxicillin), a high level of reversion occurred in the mutant β-lactamase genes, completing a cycle back to the original substrate spectrum. The de novo TRs coupled with reversion constitute a genetic toggling mechanism enabling reversible switching between the two phases of the substrate spectrum of β-lactamases. This toggle exemplifies the effective adaptation of de novo TRs for enhanced bacterial survival. We found pairs of direct repeats that mediated the DNA duplication (TR formation). In addition, we found different duos of sequences that mediated the DNA duplication. These novel elements, which we named SCSs (same-strand complementary sequences), were also found associated with β-lactamase TR mutations from clinical isolates. Both direct repeats and SCSs had a high correlation with TRs in diverse bacterial genomes throughout the major phylogenetic lineages, suggesting that they comprise a fundamental mechanism shaping bacterial evolution.

12.
The distribution of SINE and LINE sequences in the pig genome was examined by fluorescence in situ hybridization (FISH), interspersed repeat PCR, and restriction analysis of high molecular weight DNA. FISH revealed a largely uniform hybridization to the euchromatic chromosome regions with both interspersed repeats, although a bias toward the G-bands was observed for the LINE probe. Southern blots of inter-SINE and inter-LINE PCR products showed strong hybridization to LINE and SINE probes, respectively. High molecular weight DNA derived from a pig × hamster hybrid cell line was cut with a panel of G + C and A + T rich rare cutter restriction enzymes, then run on a pulsed field gel and Southern blotted. Sequential hybridization with SINE and LINE probes showed that SINE hybridization was to relatively low molecular weight fragments with the G + C rich enzymes, whereas the LINE probe gave hybridization to significantly larger fragments produced by these enzymes. DNA samples digested with A + T rich enzymes gave essentially similar patterns with SINE and LINE probes. We conclude that the pattern of differential distribution of SINEs and LINEs, which has been described in man and mouse, does exist in the pig but is much less pronounced. Received: 25 April 1995 / Accepted: 1 September 1995

13.
Genomes contain various types of repetitive sequences. They may be used as probes for seeking genome rearrangements because they are largely free from natural selection when they are located in intergenic regions. In this study, we searched for tandem repeats (TRs) in 44 prokaryotic genomes by the color-coding method and sought signs of genome rearrangements by detailed analysis of the detected TRs. We found 13,542 tandem repeats in the 44 prokaryotic genomes in total, ranging from several tens to one thousand per genome. The results of statistical analysis show that TRs tend to lie in regions of high base-composition bias in some genomes. Moreover, we recognized characteristic distribution patterns of equivalent TR-pairs in 12 genomes, which are expected to indicate the occurrence of whole-genome duplication (WGD) in those genomes. This demonstrates that TRs can indeed be used for seeking genome rearrangements. Although it is not yet clear whether WGD has occurred in prokaryotic genomes, the results of the analyses of equivalent TR-pairs in this study are thought to be evidence of WGD in these genomes.

14.
15.
Stedman W, Deng Z, Lu F, Lieberman PM. Journal of Virology, 2004, 78(22): 12566-12575
The viral genome of Kaposi's sarcoma-associated herpesvirus (KSHV) persists as an extrachromosomal plasmid in latently infected cells. The KSHV latency-associated nuclear antigen (LANA) stimulates plasmid maintenance and DNA replication by binding to an approximately 150-bp region within the viral terminal repeats (TR). We have used chromatin immunoprecipitation assays to demonstrate that LANA binds specifically to the replication origin sequence within the KSHV TR in latently infected cells. The latent replication origin within the TR was also bound by LANA-associated proteins CBP, double-bromodomain-containing protein 2 (BRD2), and the origin recognition complex 2 protein (ORC2) and was enriched in hyperacetylated histones H3 and H4 relative to other regions of the latent genome. Cell cycle analysis indicated that the minichromosome maintenance complex protein, MCM3, bound TR in late-G1/S-arrested cells, which coincided with the loss of histone H3 K4 methylation. Micrococcal nuclease studies revealed that TRs are embedded in a highly ordered nucleosome array that becomes disorganized in late G1/S phase. ORC binding to TR was LANA dependent when reconstituted in transfected plasmids. DNA affinity purification confirmed that LANA, CBP, BRD2, and ORC2 bound TR specifically and identified the histone acetyltransferase HBO1 (histone acetyltransferase binding to ORC1) as a potential TR binding protein. Disruption of ORC2, MCM5, and HBO1 expression by small interfering RNA reduced LANA-dependent DNA replication of TR-containing plasmids. These findings are the first demonstration that cellular replication and origin licensing factors are required for KSHV latent cycle replication. These results also suggest that the KSHV latent origin of replication is a unique chromatin environment containing histone H3 hyperacetylation within heterochromatic tandem repeats.

16.
17.
This report presents the characterization of the first virulent phages infecting Brevibacterium aurantiacum, a bacterial species used during the manufacture of surface-ripened cheeses. These phages were also responsible for flavour and colour defects in surface-ripened cheeses. Sixteen phages (out of 62 isolates) were selected for genome sequencing and comparative analyses. These cos-type phages with a long non-contractile tail currently belong to the Siphoviridae family (Caudovirales order). Their genome sizes vary from 35,637 to 36,825 bp and, similar to their host, they have a high GC content (~61%). Genes encoding an immunity repressor, an excisionase and a truncated integrase were found, suggesting that these virulent phages may be derived from a prophage. Their genomic organization is highly conserved, with most of the diversity coming from the presence of long (198 bp) DNA tandem repeats (TRs) within an open reading frame coding for a protein of unknown function. We categorized these phages into seven genomic groups according to their number of TRs, which ranged from two to eight. Moreover, we showed that TRs are widespread in phage genomes, being found in more than 85% of the genomes available in public databases.

18.
Arabidopsis thaliana is an important and long-established model species for plant molecular biology, genetics, epigenetics, and genomics. However, the latest version of the reference genome still contains a significant number of missing segments. Here, we report a high-quality and almost complete Col-0 genome assembly with two gaps (named Col-XJTU), obtained by combining Oxford Nanopore Technologies ultra-long reads, Pacific Biosciences high-fidelity long reads, and Hi-C data. The total genome assembly size is 133,725,193 bp, introducing 14.6 Mb of novel sequences compared to the TAIR10.1 reference genome. All five chromosomes of the Col-XJTU assembly are highly accurate, with consensus quality (QV) scores > 60 (ranging from 62 to 68), which are higher than those of the TAIR10.1 reference (ranging from 45 to 52). We completely resolved chromosome (Chr) 3 and Chr5 in a telomere-to-telomere manner. Chr4 was completely resolved except for the nucleolar organizing regions, which comprise long repetitive DNA fragments. The Chr1 centromere (CEN1), reportedly around 9 Mb in length, is particularly challenging to assemble due to the presence of tens of thousands of CEN180 satellite repeats. Using the cutting-edge sequencing data and novel computational approaches, we assembled a 3.8-Mb-long CEN1 and a 3.5-Mb-long CEN2. We also investigated the structure and epigenetics of centromeres. Four clusters of CEN180 monomers were detected, and the centromere-specific histone H3-like protein (CENH3) exhibited a strong preference for CEN180 Cluster 3. Moreover, we observed hypomethylation patterns in CENH3-enriched regions. We believe that this high-quality genome assembly, Col-XJTU, will serve as a valuable reference for better understanding the global pattern of centromeric polymorphisms, as well as the genetic and epigenetic features in plants.

19.
20.