摘要翻译:
我们研究了西伯利亚落叶松转录组的结构集成。用$K$-means技术对64维空间中的簇进行了识别,其中待聚类的对象是基因组的不同片段。这些碎片的分布呈四面体状。Chargaff的差异度量是为每个班级以及班级之间的差异度量确定的。它揭示了类的相对相似性。这些结果已经与每个组织的特定转录组的结果进行了比较。此外,一个替代转录组已经被开发,包括为特定组织组装的连接体;后者与真实的总转录组进行了比较,观察到有显著性差异。
---
英文标题:
《Some preliminary results on relation between triplet composition and
tissue source in larch total transcriptome》
---
作者:
Michael Sadovsky, Tatiana Guseva, Vladislav Birukov, Tatiana Shpagina,
Victoria Fedotovskaya
---
最新提交年份:
2018
---
分类信息:
一级分类:Quantitative Biology 数量生物学
二级分类:Other Quantitative Biology 其他定量生物学
分类描述:Work in quantitative biology that does not fit into the other q-bio classifications
不适合其他q-bio分类的定量生物学工作
--
---
英文摘要:
We studied the structuredness ensemble of transcriptome of Siberian larch. The clusters in 64-dimensional space were identified with $K$-means technique, where the objects to be clusterized are the different fragments of the genome. A tetrahedron like structure in distribution of these fragments was found. Chargaff's discrepancy measure was determined for each class, as well as that latter between the classes. It reveals a relative similitude of the classes. The results have been compared to those obtained for specific transcriptome of each tissue. Also, a surrogate transcriptome has been developed comprising the contigs assembled for specific tissues; that latter has been compared with the real total transcriptome, and significant difference has been observed.
---
PDF链接:
https://arxiv.org/pdf/1803.03461