Category Archives: genomics

The Reading List – Day 1

Last week I posted a holiday reading list (link). I’ll be posting brief summaries of some of the more interesting papers over the next week or two.

Topic: Tumor Mutational Landscape

The Papers

Age related variants of variants occurred in the genes DNMT3A, TET2, and ASXL1 are associated with hematological malignancy risk – NEJM-1 and NEJM-2
Nature Genetics on the NEJM papers: Commentary

The Highlights

– Interesting papers on the power of whole exome and targeted gene sequencing – and a large number of patients followed for a long time – to uncover putative initiating cancer mutations. These can be termed initiating in the sense that they predate the occurrence of malignancy.

– Both studies identified somatic mutations in peripheral blood mononuclear cells that occurred more frequently in patients who developed hematological malignancies than those who did not.

– Here is a peak at some of the data. This first figure is from the paper out of Benjamin Ebert’s lab at Brigham & Women’s Hospital, Harvard Med School, Boston.

Screen Shot 2014-12-30 at 5.06.06 PM

On the left we see the 10 most commonly detected mutated genes, on the right the distribution of mutated genes per patient – overwhelmingly just one (Fig.2 from Jaiswal et al. 2014).

The second paper is from the McCarroll lab at the Broad Institute, MIT and Harvard, Cambridge, MA, across the river from Boston. Their list is a little different, as seen in this figure:

Screen Shot 2014-12-30 at 5.13.20 PM

Notably, however, the top three genes are the same (Fig. 2 from Genovese et al. 2014).

Many of the other mutations are in genes well studies in the context of hematological malignancies (JAK2, IDH2, MYD88) and other tumors (TP53, STAT3). The differences are mainly in the rarer mutations and reflect statistical noise, the difference in patient populations studied, or both.

– The top 3 somatically mutated genes are various types of DNA transcriptional regulators. This suggests that the drivers for these malignancies are actually the targets of the activity of DNMTA3 (a DNA methyltransferase), TET2 (a methylcytosine dioxygenase) and ASXL1 (an epigenetic regulator). All three had been previously identified as mutated genes in patients with various lymphoid and myeloid malignancies, however these new studies show the mutations to be present and stable years before cancer develops.

– These somatic mutations are one-hit wonders, that is, they are clonal, and very likely causative. The McCarroll group’s paper has a nice figure illustrating this principal:

Screen Shot 2014-12-30 at 5.35.59 PM

Thus clonal hematopoiesis evolves from one of the initiating mutations.

– A few other very interesting messages in these papers:

1) the number of mutations increases with age
2) mutations predict malignancy, but at a pretty low rate
3) mutations are associated with survival

The association with survival is not only due to the link with cancer: mutations in these transcriptional regulators appear cause other physiologies as well. The Ebert paper finds an association between age-related somatic mutations and cardiovascular disease for example. I’d caution here that this paper studies patients already predisposed to develop cardiovascular disease, and so this association might wash out in general population studies. Regardless, the relative risk of carrying one of these somatic mutations is not large, so it is not as if we should all run out and get these genes analyzed.

– Finally, and this has been known for a while, mutations in these transcriptional regulators are really problematic once cancer develops, and that is shown by the impact on overall survival post-diagnosis. This last point is very intriguing as it suggests that somatic mutations in transcription regulators do two distinct things, probably via action on distinct targets – one, predispose to hematologic oncogenesis and two, allow mutations to accumulate once oncogenesis has taken place.

Given that the relative risk of carrying any of these somatic mutations remains very low, what are we to do with the data? Genovese et al. anticipate this question, and offer the following (quoting here):

“Several important research directions could bring DNA sequencing for clonal hematopoiesis closer to clinical usefulness. First, some somatic mutations are likely to be associated with a particularly high risk of subsequent cancer; larger studies could identify such mutations. Second, single-cell analyses might identify high-risk combinations of mutations occurring in the same cells. Third, the sequencing of specific cell types might identify mutation–cell-type combinations with increased predictive value. Fourth, initial detection of clonal hematopoiesis might justify periodic screening for the presence of cooperating mutations at low allele frequencies that could presage cancer”.

I would add that a more detailed understanding of how somatic mutations in the DNMT3A, TET2, and ASXL1 genes trigger and support oncogenesis may yield downstream targets suitable for therapeutic intervention.

next topic: New Immunotherapy Papers

stay tuned.


The Cancer Genomic Ecosystem

There have been several important recent advances in our understanding of tumor genomic ecosystems, and these advances have interesting implications for drug discovery in oncology.

The Journal Nature recently published a large data set on gene mutations in 21 distinct tumor types ( Much of the data came from the The Cancer Genome Atlas (TCGA) database, with additional data generated by the study authors. This study is sufficiently powered to uncover significance in several different ways. There is a cluster of mutations that are significant only in the combined tumor analyses, that is, when lumping different tumor types together. Conversely there a large cluster of mutations that are significant only in the analysis of individual tumor types, that is, the significance is lost if you look too broadly. Therefore these are genes that are important for specific tumor types. Finally there is a large cluster of gene mutations that are significant in both the combined analyses and in individual tumor analyses. This complexity of analysis is nicely shown in Figure 3 (

I spent a fair amount of time staring at this figure and going through the supplemental data (posted online and see also and there are some results that I found interesting. First, the study confirmed many known cancer-related genes. The study also identified a fair number of new cancer-related genes mutated across or within tumor types, although these were found at the lower levels of significance. This is because they are mutated at a low rate, or the sample size for a particular tumor type was small, or both. The authors are transparent about this, and call for larger studies to increase sample size. This does beg the question as to the rate of gene mutation below which the knowledge is no longer actionable (because there will be so few patients), regardless the data will be critical to understanding tumor pathway biologies. Another interesting question is the extent to which new patterns of gene-mutation will emerge across tumor types, allowing binning (across tumor types) to complement subsetting (within a tumor type). Finally, the data might allow a different type of query, which is to ask which combinations of mutations are found within specific tumor types.

I want mention a few of the more common mutations, because these data held some surprises for me (although some readers know all this already, I’m sure). First, the best known cancer-related gene mutations cluster at the very highest levels of significance both across the 21 tumor types and within specific tumors. This makes sense, as these genes include those that contribute obligate cancer mutations: TP53; PTEN, PIK3CA and PIK3R1; KRAS, BRAF and NRAS; APC; EGFR, etc. There were a few genes in this category that surprised me, not so much because they made the list but because these at first glance appear more common than I had thought. GATA3 is a good example. Mutations in this gene are most commonly see in breast cancer but there are enough mutations in other tumor types to drive significance in the pooled tumor analysis, even though no tumor type other than breast is significantly associated with GATA3 mutations. Examination of the FTL3 data reveal a very similar pattern: mutations are significantly associated with acute myeloid leukemia (AML), as is well known, but also present in other tumor types, notably endometrial tumors and lung adenocarcinomas. When the mutational data across tumor type is pooled, significance is achieved. What are we do with such data? I think the answer perhaps is to simply know that these mutations can occur, and to look for them when typical mutations are missing in a given patient’s tumor. Such cataloging is of course the goal of personalized medicine. The other use of such data is to raise awareness of rare drug resistance mutations that may arise when targeting the major tumor oncogenic pathway in a particular tumor type. Many examples of this phenomena have been described (more on this below).

A different pattern emerges when we look at some other genes that are commonly mutated across tumor types but whose significant in these analyses is lower, due to a lower mutational rate. IDH1 is a good example here, having significant association with AML and glioblastoma multiforma, as is well known, but also with multiple myeloma (MM) and perhaps chronic lymphocytic leukemia (CLL). IDH2 is also most commonly associated with AML, but is present in colorectal cancer at “near significance” (love that fuzzy language). Notably, no other tumor metabolism genes appear in the analysis.

There are same gaps too I think. Looking at those genes that are significantly mutated only in a specific tumor type or types, we find some interesting genes. TGFBR2 has been described as a mutational driver in colorectal cancer, along with SMAD4. In the present analysis SMAD4 and SMAD2 are found to be significantly mutated in colorectal cancer, but the TGFBR2 mutation rate only reaches significance in Head and Neck cancer, although a few mutations do appear in the colorectal cancer data set used. Either the original studies are incorrect, which does not make sense biologically (TGFBR2 protein signals through the SMAD pathway), or this is an example of sampling error. Again, bigger data sets may be needed. Other tumor-type restricted patterns of gene mutation are very well known, such as EZH2 and CARD11 mutations in diffuse large B cell lymphoma (DLBCL). The CARD11 observation is interesting, as these mutations are associated with activation of MYD88, a gene known to be mutated in DLBCL and CLL.

There are lots of examples like these, and the data are easy to see and analyze: this is fun data to play with so have at it (see

There is much discussion in the paper on new genes identified, and we’ll have to see how much of it is actionable at the drug development level.

That brings us to a different data set. If you go to the tumor portal you can sort by tumor type. Choosing melanoma, a highly mutated cancer, brings forth a whole spectrum of genes. Here’s a screengrab right from the tumor portal site (

Screen Shot 2014-02-02 at 11.51.43 AM
In the table above, blue refers to known cancer-related genes, red indicates genes whose function is relevant to cancer biology, and black are novel genes. As many readers know, BRAF mutations are the canonical melanoma oncogenic driver, signaling through the MEK/ERK pathway to drive melanoma cell proliferation, migration and metastasis. Antagonists of the BRAF and MEK proteins have emerged as the best line of defense against melanoma, but its a complicated fight. BRAF inhibitors were developed several years ago, starting with vemurafenib (Roche). Although BRAF inhibition induced responses in many melanoma patients, BRAF resistance mutants and MEK1 escape mutations evolve quickly and patients relapse. Common BRAF resistance mutations include V600E and V600K mutations that confer protection against the first generation drugs. Second generation inhibitors that target the resistance mutations were developed, such as dabrafenib (GSK). In addition, the MEK inhibitor trametinib (GSK, Japan Tobacco) was approved last year for use in treating melanoma. Several weeks ago the combination of these two drugs was granted accelerated approval for the treatment of advanced (metastatic) or unresectable melanoma that is positive for either mutation (V600E or K). This is a great example of cancer genetics=driven drug development in action.

However, other mechanisms of resistance are independent of BRAF mutational status because of additional MEK resistance mutations. These additional mutational strategies were discussed in a series of papers published online on November 21, 2013, in Cancer Discovery. These studies used tumor samples from patients that had relapsed after either BRAF inhibitor of dual BRAF/MEK inhibitor therapy. Mutations were found in the MEK1, MEK2, ERK1, ERK2 pathway and the PI3K, AKT1, PTEN pathway (PTEN is a negative regulator of PI3K signaling to mTOR and AKT1). The papers were reviewed in the January 9th issue of SciBx.

What does this single example tell us about the mutational landscape and drug discovery. First let’s note that some of the resistance mechanisms for melanoma do not show up in the proposed melanoma mutational landscape chart above, that is, these did not appear in the tumor ecosystem until that ecosystem came under selective pressure via drug treatment. This has 2 implications: the first is that the TCGA type overview of tumor mutations is just one source of data and following patients longitudinally as they experience therapy is another source of data. The other implication is that the mutational landscape contains putative additional mechanisms of escape at least in some patients. So using our melanoma example, we see in the table evidence of other potential escape pathways (NRAS, several checkpoint genes, and KIT stand out to me). So how many drugs will any individual patient need to keep a rapidly evolving melanoma under control?

The good news is that drug developers have taken notice and ERK1/2. AKT and PI3K inhibitors of various specificity are under development. The bad news I guess is that this is just one example of how complicated cancer therapy is likely to become. One good question not addressed here is how the immune checkpoint drugs will overlay with targeted therapies, for melanoma and many other tumors. Thats a question for another day.

stay tuned.