Computational Genomics Group
  • Home
  • Research
  • Publications
  • Teaching
  • Blog
  • Group Members
  • News
  • Computational Biology Book
  • Data Analysis with R Book
  • CG2 github
  • Fiction

TADs in yeast and how you can go around a reviewer (if you are right)

4/20/2017

23 Comments

 
Just a few weeks ago we published a paper on Genome Urbanization, a concept describing the spatial clustering of genes in the unicellular eukaryote genome of S. cerevisiae (you can read more about it here). One of the things we put forward in that paper was the existence of discrete topological domains in the yeast genome that strongly resembled the Topologically-Associated Domains (TADs) initially discovered in mammals and now found in most complex eukaryotes. We based our arguments on some rather clear topological boundaries that we were able to observe on HiC contact maps obtained from a widely cited (Duan et al, 2010) public dataset. As you may see in the bottom of the Figure below (adapted from Figure 4D of our paper), one can locate boundaries between TAD-like domains even by eye inspection. In order to do so, we used an insulating approach (as described by Crane et al, Nature 2015) that is largely independent of the local read density. After defining such insulating regions we went on to show that genes that are up-regulated upon topological stress tend to cluster within these regions. 

​But since TADs had not (at the time) been reported in yeast, one of the main criticisms that we received during the review process was directed at this analysis. The reviewer's comment was:
"Authors declare the existence of TAD-like globules in budding yeast. However, these kinds of structures have, so far, never been detected in Saccharomyces cerevisiae. If the authors want to establish the existence of such TAD like structures, they must reinforce their analysis."

At the time, we were eager to get the paper accepted and so we down-played the original term "TAD-like" in "insulated domains" in the final version. Figure 4D remained though and was further supported by a number of analyses that showed the robustness of the boundaries upon different normalization strategies and the lack of LTRs in these regions. In our view, it mattered little to get the message of "TADs also exist in yeast" across, as our main interest was to show the tendency for spatial clustering of genes.
Picture
HiC contact maps for yeast chromosome IV. Top: Figure 1B from Eser et al. (PNAS, 2017). Bottom: Figure 4D from our Genome Urbanization paper (Tsochatzidou et al, Nucleic Acids Res, 2017). Even though the original HiC datasets are not the same (top: Noble lab 2017, bottom: Noble lab 2010) the maps show great similarity. The location of the boundaries shows significant discrepancies as the way to define them differs (see text below).

We were nevertheless right all along as was only recently shown in a paper by the Noble lab (whose original data we had used in our analysis). In a paper that just came out in PNAS, Eser et al., show that TADs do exist in yeast and that they have some very interesting properties. Eser et al., use a new HiC dataset (it seems that you cannot escape the curse of having to RE-do the experiments even if you were the one to perform them originally) but apply a different method to call the boundaries. Their "coverage score" is interesting as an approach because it is insensitive to the resolution of the obtained boundaries (a problem we had to deal with by arbitrarily choosing a 10kb window) and leads to fewer TADs that we were able to define but this is likely related to thresholds in the calling process (we used a 5%-percentile approach, while Eser et al use a local minimum function). Eser et al, find 41 TADs, (we found 85) with a median size that is more than double of the one we found (260kb vs our 100kb). The fact that remains is that there is significant coincidence between both the maps and the boundaries as you can see in the Figure above (adapted from Figures 1B from Eser et al, 2017 and 4B from our paper). 

​What is more important, the properties that are shared by the TAD boundaries in Eser et al. are matching our observations in many respects, as they are shown to be enriched in transcription activity (as originally shown by the group of Sergei Razin) and activating histone marks. Above all, Eser et al. report that regions between the defined TADs are significantly overlapping areas of topoII depletion, which constitutes an immediate link to our finding that genes that tend to be up-regulated by topoII inactivation are enriched in insulated regions (remember this was our way to call TAD boundaries, without using the term "TAD"). Thus, even though one of the main arguments in Eser et al. is that TADs in yeast are mostly related to DNA replication than transcription, it seems that you cannot really do away with transcriptional effects in relation to chromatin organization, especially in a unicellular eukaryote genome where the two processes are expected to be more tightly connected.


In closing, we can now be confident that TADs, or TAD-like domains if you will, do exist in yeast and that they are inherently related to both DNA replication and transcription (even though perhaps indirectly).  Our observations under topological stress lie in the interphase between the two processes as torsional stress accumulation inevitably affects the DNA replication process. Even more interesting, in our view, is the fact that the embedded constraints in the organization of genome architecture are reflected on the evolution of gene distribution along chromosomes, but then again we have already discussed this elsewhere. One last point that we can make is that it is reassuring to see you can constructively argue with a reviewer (provided the reviewer's sanity) if your hypothesis is solid and supported by the data and that it is always nice to see you were right in the first place, even though sometimes courtesy towards a reviewer obliges you to be less audacious in the choice of terminology.
23 Comments

Genome Urbanization: Gene segregation in eukaryotes

3/27/2017

0 Comments

 
Our group's last paper has just appeared in Nucleic Acids Research. In it we introduce a new concept that is related to the architecture of eukaryotic genomes and how it may have evolved to segregate functions in different genomic compartments in a way that is reminiscent of the semi-spontaneous social/ratial segregation of cities.

Genome Urbanization is inspired by the pioneering works on social segregation by Thomas Schelling whereby weak constraints may aggregate to shape macroscopic behaviour in various systems. Using an already published experiment in which we had previously studied how the chromatin structure of yeast promoters may shape gene expression under the accumulation of topological stress, we set out to investigate  whether differential gene expression upon a structurally-related stimulus may be reflected upon the spatial organization of genes. The first thing we found was that when put under topological stress, genes tend to form clusters in space, with groups of 6 or more adjacent genes being consistently up- or down-regulated. This was not so much of a surprise. DNA torsional stress acts exactly on the topology of the nucleus and so it would be expected for genes to follow this constraint. Surprisingly though, some clustering of gene expression also happens under other stress conditions  such as heat shock or nutrient deprivation albeit to a much lesser extent. What this pointed to is that genes occupy positions in the nucleus (even in lthe inear dimension) that allows them to respond to stimuli in a coordinated manner, thus the "micro-motives" of Schelling may be reflected on the local prerequisites of certain genes to be close to others due to their shared affinities for the same transcription factors or because the local environment is more favourable to their dedicated function.

Gene clustering is, of course, nothing new and many people have spotted preferences for genes that are spatially related in terms of expression, co-evolution or co-regulation. Our work, in this respect, focused on the chromatin and local gene structure of the observed gene clusters. We were thus able to define two distinct genome components with properties so different that the allusion to urban neighborhoods was almost spontaneous.

Genes that are "shut-down" upon the accumulation of topological stress were predominantly found close to the centromeres and the nuclear core. They were "old" genes in the sense that they coded for conserved, fundamental functions such as gene transcription and protein processing. More importantly, they were placed within very short distances from each other and with a rather "crammed" orientation that put, very frequently, adjacent genes transcribed in opposing directions. On the other hand, genes that were positively regulated, were found on the other extreme of the nucleus, close to subtelomeric regions. They coded for "newly" acquired stress-response functions, they had complex regulation and were surprisingly aligned with a clear tendency for co-directionality. They also had significantly longer "breathing" intergenic space between them. The set of these structural properties may allow them not only to be "resistant" to topological stress but to even harness DNA supercoiling in order to propel transcription.

This discrepancy in so many levels lead us to propose the Genome Urbanization model according to which, older, more conserved genes are preferentially located in the "old city center". The genome's core resembles the urban plan of a medieval city with its narrow meandering streets leaving little space between houses that appear as if touching each other. At the edges of the chromosomes lies what we call the "suburban genome" where the genome's "nouveau riches", new genes with complex functions that are not necessarily constitutively expressed and are employed only under specific conditions, have created a much different landscape. Here, genes are organized in tandem and with longer intergenic spacers inbetween them, in a way that brings to mind the tract housing of US city suburbia.   
Picture
Genome Urbanization may not be a particular property of the yeast genome, even though unicellularity and the increased gene density compared to more complex eukaryotic genomes is likely to make positional constraints more apparent. We nonetheless believe that similar tendencies may exist in bigger, mammalian genomes and that they may reflect even more intricate regulatory patterns that balance transcriptional homeostasis with expression noise and with the ability to respond to a great number of external and internal stimuli.

Given that the original data were already in our hands since 2006 and the fact that I first presented this concept more than 2 years ago in a talk at the IMBB, FORTH in Crete, this work has been no easy task to complete. It took the combined work of two undergraduate students (Maria Tsochatzidou and Maria Malliarou), the crucial assistance of a fellow bioinformatician (Nikolas Papanikolaou) and the support of long-standing collaborator Joaquim Roca at the CSIC, Barcelona who first introduced me to DNA topology. We are currently looking into many interesting perspectives that this work opens up regarding the evolution of genome architecture in eukaryotes and how gene positioning and chromosome structure may provide insight on the way cells employ transcriptional regulation under various conditions.

This paper is also, strictly speaking, the first paper to come entirely out of our group and thus seeing it published on my son's 5th birthday adds to a sense of accomplishment.
0 Comments

    RSS Feed

    It's all about...

    Bioinformatics and computational biology with a focus on chromatin and genome architecture, plus a little bit of football and occasional aspects of  University education.

    Archives

    April 2021
    December 2020
    March 2020
    November 2018
    September 2017
    April 2017
    March 2017
    December 2016
    November 2016
    February 2016
    May 2015
    November 2014
    September 2014
    July 2014
    February 2014
    November 2013
    October 2013

    Categories

    All
    Academic Life
    Bioinformatics
    ChIPSeq
    ChIPSeq Bias
    Cpg Islands
    Data Analysis
    Exons
    Football
    Footballomics
    Gene Regulation
    Genetic Diseases
    Genome Architecture
    Genome Structure
    Inflammation
    Journalism
    Math Illiteracy
    NGS
    Nucleosome Positioning
    Nucleotide Composition
    Nucleotide Skews
    Promoters
    R
    Splicing
    Statistics
    Systems Biology
    Tnf
    Transcriptome
    Variation
    Whole Exome

Powered by Create your own unique website with customizable templates.