英文标题:
《Factor Models for Cancer Signatures》
---
作者:
Zura Kakushadze and Willie Yu
---
最新提交年份:
2017
---
英文摘要:
We present a novel method for extracting cancer signatures by applying statistical risk models (http://ssrn.com/abstract=2732453) from quantitative finance to cancer genome data. Using 1389 whole genome sequenced samples from 14 cancers, we identify an \"overall\" mode of somatic mutational noise. We give a prescription for factoring out this noise and source code for fixing the number of signatures. We apply nonnegative matrix factorization (NMF) to genome data aggregated by cancer subtype and filtered using our method. The resultant signatures have substantially lower variability than those from unfiltered data. Also, the computational cost of signature extraction is cut by about a factor of 10. We find 3 novel cancer signatures, including a liver cancer dominant signature (96% contribution) and a renal cell carcinoma signature (70% contribution). Our method accelerates finding new cancer signatures and improves their overall stability. Reciprocally, the methods for extracting cancer signatures could have interesting applications in quantitative finance.
---
中文摘要:
我们提出了一种利用统计风险模型提取癌症特征的新方法(http://ssrn.com/abstract=2732453)从定量金融到癌症基因组数据。使用来自14种癌症的1389个全基因组测序样本,我们确定了体细胞突变噪声的“整体”模式。我们给出了一个分解噪声的方法,并给出了固定签名数的源代码。我们将非负矩阵分解(NMF)应用于按癌症亚型聚合的基因组数据,并使用我们的方法进行过滤。与未经过滤的数据相比,生成的特征具有显著更低的可变性。此外,签名提取的计算成本降低了约10倍。我们发现了3种新的癌症特征,包括肝癌显性特征(96%的贡献)和肾细胞癌特征(70%的贡献)。我们的方法加速了寻找新的癌症特征,提高了它们的整体稳定性。反过来,提取癌症特征的方法在定量金融领域也可能有有趣的应用。
---
分类信息:
一级分类:Quantitative Biology 数量生物学
二级分类:Genomics 基因组学
分类描述:DNA sequencing and assembly; gene and motif finding; RNA editing and alternative splicing; genomic structure and processes (replication, transcription, methylation, etc); mutational processes.
DNA测序与组装;基因和基序的发现;RNA编辑和选择性剪接;基因组结构和过程(复制、转录、甲基化等);突变过程。
--
一级分类:Quantitative Biology 数量生物学
二级分类:Quantitative Methods 定量方法
分类描述:All experimental, numerical, statistical and mathematical contributions of value to biology
对生物学价值的所有实验、数值、统计和数学贡献
--
一级分类:Quantitative Finance 数量金融学
二级分类:Statistical Finance 统计金融
分类描述:Statistical, econometric and econophysics analyses with applications to financial markets and economic data
统计、计量经济学和经济物理学分析及其在金融市场和经济数据中的应用
--
---
PDF下载:
-->