摘要翻译:
CoHSI(Hartley-Shannon信息守恒)分布是一类广泛的离散系统的核心,定义了它们的分量的长度分布以及其他全局性质。离散系统,如已知的蛋白质组,其中的成分是蛋白质,计算机软件,其中的成分是功能,文本,其中的成分是书籍,都已知准确地符合这种分布。在这篇短短的论文中,我们探索了它的解及其结果性质,并为一系列论文奠定了基础,这些论文将演示为什么组分的平均长度如此高度守恒,以及为什么长组分在这些系统中如此频繁地出现。这些性质不符合局部的论据,例如蛋白质组中的自然选择或计算机软件中的人类意志,实际上是直接从CoHSI中分离出来的、为所有人所共有的离散系统的不可避免的全局性质。我们将用Uniprot蛋白质数据库的例子来说明这一点,作为后续研究的前奏。
---
英文标题:
《CoHSI I; Detailed properties of the Canonical Distribution for Discrete
Systems such as the Proteome》
---
作者:
Les Hatton and Gregory Warr
---
最新提交年份:
2018
---
分类信息:
一级分类:Quantitative Biology 数量生物学
二级分类:Other Quantitative Biology 其他定量生物学
分类描述:Work in quantitative biology that does not fit into the other q-bio classifications
不适合其他q-bio分类的定量生物学工作
--
---
英文摘要:
The CoHSI (Conservation of Hartley-Shannon Information) distribution is at the heart of a wide-class of discrete systems, defining the length distribution of their components amongst other global properties. Discrete systems such as the known proteome where components are proteins, computer software, where components are functions and texts where components are books, are all known to fit this distribution accurately. In this short paper, we explore its solution and its resulting properties and lay the foundation for a series of papers which will demonstrate amongst other things, why the average length of components is so highly conserved and why long components occur so frequently in these systems. These properties are not amenable to local arguments such as natural selection in the case of the proteome or human volition in the case of computer software, and indeed turn out to be inevitable global properties of discrete systems devolving directly from CoHSI and shared by all. We will illustrate this using examples from the Uniprot protein database as a prelude to subsequent studies.
---
PDF链接:
https://arxiv.org/pdf/1806.08785