Open Access Green as soon as Postprint is submitted to ZB.
Genomic determinants of somatic copy number alterations across human cancers.
Hum. Mol. Genet. 25, 1019-1030 (2016)
Somatic copy number alterations (SCNAs) play an important role in carcinogenesis. However, the impact of genomic architecture on the global patterns of SCNAs in cancer genomes remains elusive. In this work we conducted multiple linear regression (MLR) analyses of the pooled SCNA data from The Cancer Genome Atlas Pan-Cancer project. We performed MLR analyses for 11 individual cancer types and three different kinds of SCNAs - amplifications and deletions, telomere-bound and interstitial SCNAs, and local SCNAs. Our MLR model explains more than 30% of the pooled SCNA breakpoint variation, with the explanatory power ranging from 13% to 32% for different cancer types and SCNA types. In addition to confirming previously identified features (e.g., Long interspersed element-1 (L1) and SINEs) we also identified several novel informative features, including distance to telomere, distance to centromere, and low complexity repeats. The results of the MLR analyses were additionally confirmed on an independent SCNA data set obtained from the COSMIC database. Using a rare event logistic regression model and an extremely randomized tree classifier we revealed that genomic features are informative for defining common SCNA breakpoint hotspots. Our findings shed light on the molecular mechanisms of SCNA generation in cancer.
Edit extra informations Login
Publication type Article: Journal article
Document type Scientific Article
Keywords Dna-forming Sequences; Chromosomal Instability; Mammalian-cells; Recombination; Mechanisms; Landscape; Dysfunction; Breakpoint; Deletions; Features