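Abstract (translated from Chinese):
Rank-frequency distributions of nucleotide sequences in mitochondrial DNA are defined in a way analogous to the linguistic approach, with the most frequent nucleobase playing the role of whitespace. Entropy and mean length are calculated for these sequences. The two parameters distinguish species of the <I>Felidae</I> (cats) family from those of the <I>Ursidae</I> (bears) family. From the numerical values alone, one can see in particular that giant pandas are bears while koalas are not. A simple probabilistic model explains the observed linear relation between the parameters. The frequency spectra of the nucleotide sequences are analyzed with an approach based on the nonadditive generalization of the Bose distribution. In this case the separation of the families is not very sharp. Nevertheless, the distributions for <I>Felidae</I> have longer tails on average than those for <I>Ursidae</I>.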
---
English title:
Telling apart <I>Felidae</I> and <I>Ursidae</I> from the distribution of nucleotides in mitochondrial DNA
---
Author:
Andrij Rovenchak
---
Latest submission year:
2018
---
Categories:
Top-level category: Quantitative Biology
Subcategory: Other Quantitative Biology
Description: Work in quantitative biology that does not fit into the other q-bio classifications
--
Top-level category: Physics
Subcategory: Biological Physics
Description: Molecular biophysics, cellular biophysics, neurological biophysics, membrane biophysics, single-molecule biophysics, ecological biophysics, quantum phenomena in biological systems (quantum biophysics), theoretical biophysics, molecular dynamics/modeling and simulation, game theory, biomechanics, bioinformatics, microorganisms, virology, evolution, biophysical methods.
--
Top-level category: Physics
Subcategory: Data Analysis, Statistics and Probability
Description: Methods, software and hardware for physics data analysis: data processing and storage; measurement methodology; statistical and mathematical aspects such as parametrization and uncertainties.
--
---
English abstract:
Rank-frequency distributions of nucleotide sequences in mitochondrial DNA are defined in a way analogous to the linguistic approach, with the most frequent nucleobase serving as a whitespace. For such sequences, entropy and mean length are calculated. These parameters are shown to discriminate the species of the <I>Felidae</I> (cats) and <I>Ursidae</I> (bears) families. From purely numerical values we are able to see in particular that giant pandas are bears while koalas are not. The observed linear relation between the parameters is explained using a simple probabilistic model. The approach based on the nonadditive generalization of the Bose distribution is used to analyze the frequency spectra of the nucleotide sequences. In this case, the separation of families is not very sharp. Nevertheless, the distributions for <I>Felidae</I> have longer tails on average compared with <I>Ursidae</I>.
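
The procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation, and several choices are assumptions made only for illustration: empty "words" between adjacent whitespace bases are discarded, entropy is taken as the Shannon entropy (natural logarithm) of the relative frequencies of distinct words, and mean length is averaged over all word occurrences. The function names are hypothetical.

```python
from collections import Counter
from math import log


def split_into_words(dna):
    """Split a sequence on its most frequent base, treated as whitespace."""
    seq = dna.upper()
    space = Counter(seq).most_common(1)[0][0]
    # Assumption: empty strings from adjacent "whitespace" bases are dropped.
    return [word for word in seq.split(space) if word]


def rank_frequency(words):
    """Return occurrence counts of distinct words, sorted from rank 1 down."""
    return sorted(Counter(words).values(), reverse=True)


def entropy(freqs):
    """Shannon entropy (natural log) of the relative word frequencies."""
    total = sum(freqs)
    return -sum(f / total * log(f / total) for f in freqs)


def mean_length(words):
    """Average number of bases per word occurrence."""
    return sum(len(word) for word in words) / len(words)


if __name__ == "__main__":
    toy = "ACGATTAACGA"  # toy string, not real mitochondrial DNA
    words = split_into_words(toy)
    freqs = rank_frequency(words)
    print(words, freqs, entropy(freqs), mean_length(words))
```

For the toy string, A is the most frequent base, so the words are CG, TT, CG, the rank-frequency list is [2, 1], and the mean length is 2.0. A real analysis would read a full mitochondrial genome instead of the toy string.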
---
PDF link:
https://arxiv.org/pdf/1802.02610