Chapter 5 Conclusions
5.2 Limitations and future directions
Some aspects within this thesis could be further improved by future work. Although the coverages of 600 TCGA WGS data used in this study are sufficient for SV and SCNA analy- sis, much higher coverages are need to confidently call somatic SNVs, thus somatic SNVs are not investigated in this thesis. It is interesting to explore the relationship between the fea- ture of SVs in cancer genome and behavior of somatic SNVs. For example, TP53 mutations have been revealed to be highly correlated with SCNAs in various tumor types (Fridlyand et al., 2006; Snijders et al., 2005). A previous study on HNSC showed that TP53 mutation is frequently accompanied by loss of chromosome 3p and the combination of these events is associated with a surprising decrease in survival time (Gross et al., 2014).
The presence of variant subclones within the cancer cell population, termed as intratu- mor heterogeneity, is another feature of primary tumor genomes, and brought tremendous challenges in bioinformatics analysis of tumor WGS data (Vogelstein et al., 2013). For example, the existence of subclones in WGS data will confound the identification of SVs and estimation of ASCNS/ASCNG. Studies have revealed subclonal changes across distinct geographic regions of a primary tumor (Gerlinger et al., 2012) and within hematopoietic malignant populations (Ding et al., 2012). A multiregion sequencing study on LUAD re- vealed that a larger subclonal mutation fraction may be associated with increased likelihood of postsurgical relapse in patients (Zhang et al., 2014). It has been proposed that the major cause of genetic heterogeneity in cancer is genomic instability, including BFB (Gisselsson et al., 2000), which leads to an increased mutation rate and can shape the evolution of the cancer genome (Burrell et al., 2013). The development of single-cell sequencing technology has accelerated studies of cancer heterogeneity (Navin et al., 2011; Eberwine et al., 2014; Eirew et al., 2014). Single-cell sequencing is now providing cutting-edge clinical applica- tions, especially in studying blood-borne circulating tumor cells (CTCs) (Heitzer et al., 2013; Ni et al., 2013; Swennenhuis et al., 2013), derived from a solid tumor, to investigate the value of CTCs for guiding diagnosis, prognosis, and treatment of the cancer. However, the amount of DNA in single cell is not enough for current NGS technology and whole genome amplification, which has inherent risk of inducing significant bias, is inevitable (Tre↵ et al., 2011; Alix-Panabi`eres and Pantel, 2014). Moreover, the isolation of single cell from tissues and designing an optimal single-cell sampling strategy, which can comprehensively represent the mutational landscapes within a primary tumor are both challenging (Eberwine et al.,
2014). There are still pressing needs for building computational models which can tackle the heterogeneity in primary tumor sequencing data.
Although the length is growing, NGS reads are still too short to cover larger or more complex structural alterations, especially those with breakpoints inside centromere or other low sequence complexity regions. Indeed, many arm-level SCNAs might have their break- points right inside centromere gap regions and the SVs which formed these SCNAs would inevitably be missed. Hybrid approaches that combine OM or ‘third generation sequenc- ing’ provided by PacBio, together with NGS, might provide new insight into those genomic regions hard to explore by current NGS technology alone.
References
Abecasis, G. R., Auton, A., Brooks, L. D., DePristo, M. A., Durbin, R. M., Handsaker, R. E., Kang, H. M., Marth, G. T., and McVean, G. A., 2012. An integrated map of genetic variation from 1,092 human genomes. Nature, 491(7422):56–65.
Abyzov, A., Urban, A. E., Snyder, M., and Gerstein, M., 2011. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and popula- tion genome sequencing. Genome Res., 21(6):974–984.
Adey, A., Burton, J. N., Kitzman, J. O., Hiatt, J. B., Lewis, A. P., Martin, B. K., Qiu, R., Lee, C., and Shendure, J., 2013. The haplotype-resolved genome and epigenome of the aneuploid HeLa cancer cell line. Nature, 500(7461):207–211.
Albertson, D. G., 2006. Gene amplification in cancer. TRENDS in Genetics, 22(8):447–455. Albino, D., Longoni, N., Curti, L., Mello-Grand, M., Pinton, S., Civenni, G., Thalmann, G., D’Ambrosio, G., Sarti, M., Sessa, F., et al., 2012. ESE3/EHF controls epithelial cell di↵erentiation and its loss leads to prostate tumors with mesenchymal and stem-like features. Cancer Research, 72(11):2889–2900.
Alexandrov, L. B., Nik-Zainal, S., Wedge, D. C., Aparicio, S. a. J. R., Behjati, S., Biankin, A. V., Bignell, G. R., Bolli, N., Borg, A., Bø rresen Dale, A.-L., et al., 2013. Signatures of mutational processes in human cancer. Nature, 500:415–21.
Alix-Panabi`eres, C. and Pantel, K., 2014. Challenges in circulating tumour cell research. Nature Reviews Cancer, 14(9):623–631.
Amarasinghe, K. C., Li, J., Hunter, S. M., Ryland, G. L., Cowin, P. A., Campbell, I. G., and Halgamuge, S. K., 2014. Inferring copy number and genotype in tumour exome data. BMC genomics, 15(1):732.
Anguelov, D., Taskarf, B., Chatalbashev, V., Koller, D., Gupta, D., Heitz, G., and Ng, A., 2005. Discriminative learning of markov random fields for segmentation of 3d scan data. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 2, pages 169–176. IEEE.
Bailey, J. A., Yavor, A. M., Massa, H. F., Trask, B. J., and Eichler, E. E., 2001. Seg- mental duplications: organization and impact within the current human genome project assembly. Genome Res., 11(6):1005–1017.
Basecke, J., Whelan, J. T., Griesinger, F., and Bertrand, F. E., 2006. The mll partial tandem duplication in acute myeloid leukaemia. British journal of haematology, 135(4):438–449. Bass, A. J., Watanabe, H., Mermel, C. H., Yu, S., Perner, S., Verhaak, R. G., Kim, S. Y., Wardwell, L., Tamayo, P., Gat-Viks, I., et al., 2009. Sox2 is an amplified lineage-survival oncogene in lung and esophageal squamous cell carcinomas. Nature genetics, 41(11):1238– 1242.
Beerenwinkel, N., Schwarz, R. F., Gerstung, M., and Markowetz, F., 2015. Cancer evolution: mathematical models and computational inference. Systematic biology, 64(1):e1–e25. Beggs, A. D., Jones, A., El-Bahwary, M., Abulafi, M., Hodgson, S. V., and Tomlinson, I. P.,
2013. Whole-genome methylation analysis of benign and malignant colorectal tumours. The Journal of pathology, 229(5):697–704.
Bell, R. J., Rube, H. T., Kreig, A., Mancini, A., Fouse, S. D., Nagarajan, R. P., Choi, S., Hong, C., He, D., Pekmezci, M., et al., 2015. The transcription factor gabp selectively binds and activates the mutant tert promoter in cancer. Science, 348(6238):1036–1039. Beltran, H., Yelensky, R., Frampton, G. M., Park, K., Downing, S. R., MacDonald, T. Y.,
Jarosz, M., Lipson, D., Tagawa, S. T., Nanus, D. M., et al., 2013. Targeted next-generation sequencing of advanced prostate cancer identifies potential therapeutic targets and disease heterogeneity. European urology, 63(5):920–926.
Beroukhim, R., Mermel, C. H., Porter, D., Wei, G., Raychaudhuri, S., Donovan, J., Bar- retina, J., Boehm, J. S., Dobson, J., Urashima, M., et al., 2010. The landscape of somatic copy-number alteration across human cancers. Nature, 463(7283):899–905.
Berry, W. L. and Janknecht, R., 2013. Kdm4/jmjd2 histone demethylases: epigenetic regulators in cancer cells. Cancer Research, 73(10):2936–42.
Bieche, I., Olivi, M., Nogues, C., Vidaud, M., and Lidereau, R., 2002. Prognostic value of ccnd1 gene status in sporadic breast tumours, as determined by real-time quantitative pcr assays. British journal of cancer, 86(4):580–586.
Bignell, G. R., Greenman, C. D., Davies, H., Butler, A. P., Edkins, S., Andrews, J. M., Buck, G., Chen, L., Beare, D., Latimer, C., et al., 2010. Signatures of mutation and selection in the cancer genome. Nature, 463(7283):893–898.
Bignell, G. R., Santarius, T., Pole, J. C., Butler, A. P., Perry, J., Pleasance, E., Greenman, C., Menzies, A., Taylor, S., Edkins, S., et al., 2007. Architectures of somatic genomic rearrangement in human cancer amplicons at sequence-level resolution. Genome Res, 17(9):1296–303.
Blegen, H., Will, J. S., Ghadimi, B. M., Nash, H.-P., Zetterberg, A., Auer, G., and Ried, T., 2003. Dna amplifications and aneuploidy, high proliferative activity and impaired cell cycle control characterize breast carcinomas with poor prognosis. Analytical Cellular Pathology, 25(3):103–114.
Boguski, M. S., Mandl, K. D., and Sukhatme, V. P., 2009. Repurposing with a di↵erence. Science, 324(5933):1394.
Boveri, T., 1914. Zur frage der entstehung maligner tumoren. Gustav Fischer.
Boykov, Y., Veksler, O., and Zabih, R., 1998. Markov random fields with efficient approx- imations. In Computer vision and pattern recognition, 1998. Proceedings. 1998 IEEE computer society conference on, pages 648–655. IEEE.
Brennan, C. W., Verhaak, R. G., McKenna, A., Campos, B., Noushmehr, H., Salama, S. R., Zheng, S., Chakravarty, D., Sanborn, J. Z., Berman, S. H., et al., 2013. The somatic genomic landscape of glioblastoma. Cell, 155(2):462–477.
Browning, B. L. and Browning, S. R., 2009. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am. J. Hum. Genet., 84(2):210–223.
Browning, S. R. and Browning, B. L., 2011. Haplotype phasing: existing methods and new developments. Nat Rev Genet, 12(10):703–14.
Bunting, S. F. and Nussenzweig, A., 2013. End-joining, translocations and cancer. Nature Reviews Cancer, 13(7):443–454.
Burrell, R. A., McGranahan, N., Bartek, J., and Swanton, C., 2013. The causes and consequences of genetic heterogeneity in cancer evolution. Nature, 501(7467):338–345. Cahill, D. P., Lengauer, C., Yu, J., Riggins, G. J., Willson, J. K., Markowitz, S. D., Kinzler,
K. W., and Vogelstein, B., 1998. Mutations of mitotic checkpoint genes in human cancers. Nature, 392(6673):300–303.
Campbell, P. J., Yachida, S., Mudie, L. J., Stephens, P. J., Pleasance, E. D., Stebbings, L. A., Morsberger, L. A., Latimer, C., McLaren, S., Lin, M. L., et al., 2010. The patterns and dynamics of genomic instability in metastatic pancreatic cancer. Nature, 467(7319):1109–13.
Cancer Genome Atlas Research Network et al., 2011. Integrated genomic analyses of ovarian carcinoma. Nature, 474(7353):609–15.
Cancer Genome Atlas Research Network et al., 2012. Comprehensive genomic characteri- zation of squamous cell lung cancers. Nature, 489(7417):519–525.
Cancer Genome Atlas Research Network et al., 2013. Integrated genomic characterization of endometrial carcinoma. Nature, 497(7447):67–73.
Cancer Genome Atlas Research Network et al., 2014a. Comprehensive molecular character- ization of gastric adenocarcinoma. Nature, .
Cancer Genome Atlas Research Network et al., 2014b. Comprehensive molecular profiling of lung adenocarcinoma. Nature, 511(7511):543–550.
Cancer Genome Atlas Research Network et al., 2015. Comprehensive genomic characteri- zation of head and neck squamous cell carcinomas. Nature, 517(7536):576–582.
Carter, S. L., Cibulskis, K., Helman, E., McKenna, A., Shen, H., Zack, T., Laird, P. W., Onofrio, R. C., Winckler, W., Weir, B. A., et al., 2012. Absolute quantification of somatic DNA alterations in human cancer. Nat. Biotechnol., 30(5):413–421.
Cerami, E., Gao, J., Dogrusoz, U., Gross, B. E., Sumer, S. O., Aksoy, B. A., Jacobsen, A., Byrne, C. J., Heuer, M. L., Larsson, E., et al., 2012. The cBio Cancer Genomics Portal: An open platform for exploring multidimensional cancer genomics data. Cancer Discovery, 2(5):401–404.
Chan, D. W., Liu, V. W., To, R. M., Chiu, P. M., Lee, W. Y., Yao, K. M., Cheung, A. N., and Ngan, H. Y., 2009. Overexpression of foxg1 contributes to tgf-beta resistance through inhibition of p21waf1/cip1 expression in ovarian cancer. Br J Cancer, 101(8):1433–43. Cheang, M. C., Chia, S. K., Voduc, D., Gao, D., Leung, S., Snider, J., Watson, M., Davies,
S., Bernard, P. S., Parker, J. S., et al., 2009. Ki67 index, her2 status, and progno- sis of patients with luminal b breast cancer. Journal of the National Cancer Institute, 101(10):736–750.
Chen, H., Bell, J. M., Zavala, N. A., Ji, H. P., and Zhang, N. R., 2014. Allele-specific copy number profiling by next-generation dna sequencing. Nucleic acids research, :gku1252.
Chen, K., Wallis, J. W., McLellan, M. D., Larson, D. E., Kalicki, J. M., Pohl, C. S., McGrath, S. D., Wendl, M. C., Zhang, Q., Locke, D. P., et al., 2009. BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat. Methods, 6(9):677–681.
Ciriello, G., Miller, M. L., Aksoy, B. A., Senbabaoglu, Y., Schultz, N., and Sander, C., 2013. Emerging landscape of oncogenic signatures across human cancers. Nat Genet, 45(10):1127–1133.
Collins, F. S. and Varmus, H., 2015. A new initiative on precision medicine. New England Journal of Medicine, 372(9):793–795.
Davidson-Pilon, C., 2015. Lifelines. https://github.com/camdavidsonpilon/lifelines. De, S. and Michor, F., 2011. DNA replication timing and long-range DNA interactions
predict mutational landscapes of cancer genomes. Nat Biotechnol, 29(12):1103–1108. Dei Tos, A. P., Doglioni, C., Piccinin, S., Sciot, R., Furlanetto, A., Boiocchi, M., Dal Cin,
P., Maestro, R., Fletcher, C. D., and Tallini, G., et al., 2000. Coordinated expression and amplification of the mdm2, cdk4, and hmgi-c genes in atypical lipomatous tumours. The Journal of pathology, 190(5):531–536.
Deshwar, A. G., Vembu, S., Yung, C. K., Jang, G. H., Stein, L., and Morris, Q., 2015. Phylowgs: Reconstructing subclonal composition and evolution from whole-genome se- quencing of tumors. Genome biology, 16(1):35.
Diebold, J., Mosinger, K., Peiro, G., Pannekamp, U., Kaltz, C., Baretton, G. B., Meier, W., and Lohrs, U., 2000. 20q13 and cyclin d1 in ovarian carcinomas. analysis by fluorescence in situ hybridization. J Pathol, 190(5):564–71.
Ding, L., Ley, T. J., Larson, D. E., Miller, C. A., Koboldt, D. C., Welch, J. S., Ritchey, J. K., Young, M. A., Lamprecht, T., McLellan, M. D., et al., 2012. Clonal evolution in relapsed acute myeloid leukaemia revealed by whole-genome sequencing. Nature, 481(7382):506– 510.
Druck, T., Berk, L., and Huebner, K., 1997. FHITness and cancer. Oncology research, 10(7):341–345.
Dulbecco, R., 1986. A turning point in cancer research: sequencing the human genome. Science, 231(4742):1055–1056.
Eberwine, J., Sul, J.-Y., Bartfai, T., and Kim, J., 2014. The promise of single-cell sequenc- ing. Nature methods, 11(1):25–27.
Edamoto, Y., Hara, A., Biernat, W., Terracciano, L., Cathomas, G., Riehle, H.-M., Mat- suda, M., Fujii, H., Scoazec, J.-Y., and Ohgaki, H., et al., 2003. Alterations of rb1, p53 and wnt pathways in hepatocellular carcinomas associated with hepatitis c, hepatitis b and alcoholic liver cirrhosis. International journal of cancer, 106(3):334–341.
Eirew, P., Steif, A., Khattra, J., Ha, G., Yap, D., Farahani, H., Gelmon, K., Chia, S., Mar, C., Wan, A., et al., 2014. Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution. Nature, .
ENCODE Project Consortium et al., 2012. An integrated encyclopedia of dna elements in the human genome. Nature, 489(7414):57–74.
Etemadmoghadam, D., deFazio, A., Beroukhim, R., Mermel, C., George, J., Getz, G., Tothill, R., Okamoto, A., Raeder, M. B., Harnett, P., et al., 2009. Integrated genome- wide dna copy number and expression analysis identifies distinct mechanisms of primary chemoresistance in ovarian carcinomas. Clin Cancer Res, 15(4):1417–27.
Feldser, D. M., Hackett, J. A., and Greider, C. W., 2003. Telomere dysfunction and the initiation of genome instability. Nature Reviews Cancer, 3(8):623–627.
Fischer, A., Vazquez-Garcia, I., Illingworth, C. J., and Mustonen, V., 2014. High-definition reconstruction of clonal composition in cancer. Cell Rep, 7(5):1740–1752.
Fishman, M. C. and Porter, J. A., 2005. Pharmaceuticals: a new grammar for drug discov- ery. Nature, 437(7058):491–493.
Frampton, G. M., Fichtenholtz, A., Otto, G. A., Wang, K., Downing, S. R., He, J., Schnall- Levin, M., White, J., Sanford, E. M., An, P., et al., 2013. Development and validation of a clinical cancer genomic profiling test based on massively parallel dna sequencing. Nature biotechnology, 31(11):1023–1031.
Frey, B. and MacKay, D., 1998. A revolution: Belief propagation in graphs with cycles. Advances in neural information processing systems, :479–485.
Fridlyand, J., Snijders, A. M., Ylstra, B., Li, H., Olshen, A., Segraves, R., Dairkee, S., Tokuyasu, T., Ljung, B. M., Jain, A. N., et al., 2006. Breast tumor copy number aberra- tion phenotypes and genomic instability. BMC cancer, 6(1):96.
Fullwood, M. J., Liu, M. H., Pan, Y. F., Liu, J., Xu, H., Mohamed, Y. B., Orlov, Y. L., Velkov, S., Ho, A., Mei, P. H., et al., 2009. An oestrogen-receptor-&agr;-bound human chromatin interactome. Nature, 462(7269):58–64.
Garraway, L. A. and J¨anne, P. A., 2012. Circumventing cancer drug resistance in the era of personalized medicine. Cancer discovery, 2(3):214–226.
Garraway, L. A. and Lander, E. S., 2013. Lessons from the cancer genome. Cell, 153(1):17– 37.
Gerlinger, M., Rowan, A. J., Horswell, S., Larkin, J., Endesfelder, D., Gronroos, E., Mar- tinez, P., Matthews, N., Stewart, A., Tarpey, P., et al., 2012. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. New England Journal of Medicine, 366(10):883–892.
Gisselsson, D., Pettersson, L., H¨oglund, M., Heidenblad, M., Gorunova, L., Wiegant, J., Mertens, F., Dal Cin, P., Mitelman, F., and Mandahl, N., et al., 2000. Chromosomal breakage-fusion-bridge events cause genetic intratumor heterogeneity. Proceedings of the National Academy of Sciences, 97(10):5357–5362.
Gordon, D. J., Resio, B., and Pellman, D., 2012. Causes and consequences of aneuploidy in cancer. Nat. Rev. Genet., 13(3):189–203.
Greenman, C., Stephens, P., Smith, R., Dalgliesh, G. L., Hunter, C., Bignell, G., Davies, H., Teague, J., Butler, A., Stevens, C., et al., 2007. Patterns of somatic mutation in human cancer genomes. Nature, 446(7132):153–158.
Greenman, C. D., Pleasance, E. D., Newman, S., Yang, F., Fu, B., Nik-Zainal, S., Jones, D., Lau, K. W., Carter, N., Edwards, P. A., et al., 2012. Estimation of rearrangement phylogeny for cancer genomes. Genome Res., 22(2):346–361.
Gross, A. M., Orosco, R. K., Shen, J. P., Eglo↵, A. M., Carter, H., Hofree, M., Choueiri, M., Co↵ey, C. S., Lippman, S. M., Hayes, D. N., et al., 2014. Multi-tiered genomic analysis of head and neck cancer ties tp53 mutation to 3p loss. Nature genetics, .
Guichard, C., Amaddeo, G., Imbeaud, S., Ladeiro, Y., Pelletier, L., Maad, I. B., Calder- aro, J., Bioulac-Sage, P., Letexier, M., Degos, F., et al., 2012. Integrated analysis of somatic mutations and focal copy-number changes identifies key genes and pathways in hepatocellular carcinoma. Nature genetics, 44(6):694–698.
Gupta, A., Place, M., Goldstein, S., Sarkar, D., Zhou, S., Potamousis, K., Kim, J., Flana- gan, C., Li, Y., Newton, M. A., et al., 2015. Single-molecule analysis reveals widespread structural variation in multiple myeloma. Proceedings of the National Academy of Sci- ences, :201418577.
Ha, G., Roth, A., Khattra, J., Ho, J., Yap, D., Prentice, L. M., Melnyk, N., McPherson, A., Bashashati, A., Laks, E., et al., 2014. Titan: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data. Genome research, 24(11):1881–1893.
Ha, G., Roth, A., Lai, D., Bashashati, A., Ding, J., Goya, R., Giuliany, R., Rosner, J., Oloumi, A., Shumansky, K., et al., 2012. Integrative analysis of genome-wide loss of het- erozygosity and monoallelic expression at nucleotide resolution reveals disrupted pathways in triple-negative breast cancer. Genome Res, 22(10):1995–2007.
Hampton, O. A., Den Hollander, P., Miller, C. A., Delgado, D. A., Li, J., Coarfa, C., Harris, R. A., Richards, S., Scherer, S. E., Muzny, D. M., et al., 2009. A sequence-level map of chromosomal breakpoints in the mcf-7 breast cancer cell line yields insights into the evolution of a cancer genome. Genome Res, 19(2):167–77.
Hanahan, D. and Weinberg, R. A., 2000. The hallmarks of cancer. cell, 100(1):57–70. Hansen, R. S., Thomas, S., Sandstrom, R., Canfield, T. K., Thurman, R. E., Weaver, M.,
Dorschner, M. O., Gartler, S. M., and Stamatoyannopoulos, J. A., 2010. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci U S A, 107(1):139–144.
Heitzer, E., Auer, M., Gasch, C., Pichler, M., Ulz, P., Ho↵mann, E. M., Lax, S., Waldispuehl-Geigl, J., Mauermann, O., Lackner, C., et al., 2013. Complex tumor genomes inferred from single circulating tumor cells by array-cgh and next-generation sequencing.