Title:
*K-means and Cluster Models for Cancer Signatures
---
Authors:
Zura Kakushadze and Willie Yu
---
Year of latest submission:
2017
---
Abstract:
We present *K-means clustering algorithm and source code by expanding statistical clustering methods applied in https://ssrn.com/abstract=2802753 to quantitative finance. *K-means is statistically deterministic without specifying initial centers, etc. We apply *K-means to extracting cancer signatures from genome data without using nonnegative matrix factorization (NMF). *K-means' computational cost is a fraction of NMF's. Using 1,389 published samples for 14 cancer types, we find that 3 cancers (liver cancer, lung cancer and renal cell carcinoma) stand out and do not have cluster-like structures. Two clusters have especially high within-cluster correlations with 11 other cancers indicating common underlying structures. Our approach opens a novel avenue for studying such structures. *K-means is universal and can be applied in other fields. We discuss some potential applications in quantitative finance.
---
Classification:
Category: Quantitative Biology
Subcategory: Genomics
Description: DNA sequencing and assembly; gene and motif finding; RNA editing and alternative splicing; genomic structure and processes (replication, transcription, methylation, etc); mutational processes.
--
Category: Quantitative Biology
Subcategory: Quantitative Methods
Description: All experimental, numerical, statistical and mathematical contributions of value to biology
--
Category: Quantitative Finance
Subcategory: Statistical Finance
Description: Statistical, econometric and econophysics analyses with applications to financial markets and economic data
--