ITP OpenIR  > 理论物理所2016年知识产出
The bulk and the tail of minimal absent words in genome sequences
Aurell, E; Innocenti, N; Zhou, HJ; Innocenti, N (reprint author), Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel.
2016
发表期刊PHYSICAL BIOLOGY
卷号13期号:2页码:26004
文章类型Article
摘要Minimal absent words (MAW) of a genomic sequence are subsequences that are absent themselves but the subwords of which are all present in the sequence. The characteristic distribution of genomic MAWs as a function of their length has been observed to be qualitatively similar for all living organisms, the bulk being rather short, and only relatively few being long. It has been an open issue whether the reason behind this phenomenon is statistical or reflects a biological mechanism, and what biological information is contained in absent words. In this work we demonstrate that the bulk can be described by a probabilistic model of sampling words from random sequences, while the tail of long MAWs is of biological origin. We introduce the concept of a core of a MAW, which are sequences present in the genome and closest to a given MAW. We show that in E. faecalis, E. coli and yeast the cores of the longest MAWs, which exist in two or more copies, are located in highly conserved regions the most prominent example being ribosomal RNAs. We also show that while the distribution of the cores of long MAWs is roughly uniform over these genomes on a coarse-grained level, on a more detailed level it is strongly enhanced in 3' untranslated regions (UTRs) and, to a lesser extent, also in 5' UTRs. This indicates that MAWs and associated MAW cores correspond to fine-tuned evolutionary relationships, and suggest that they can be more widely used as markers for genomic complexity.
关键词Minimal Absent Words Copy-mutation Evolution Model Random Sequence
学科领域Biochemistry & Molecular Biology ; Biophysics
资助者Swedish Science Council [621-2012-2982] ; Swedish Science Council [621-2012-2982] ; Academy of Finland through its Center of Excellence COIN ; Academy of Finland through its Center of Excellence COIN ; Natural Science Foundation of China [11225526] ; Natural Science Foundation of China [11225526] ; Swedish Science Council [621-2012-2982] ; Swedish Science Council [621-2012-2982] ; Academy of Finland through its Center of Excellence COIN ; Academy of Finland through its Center of Excellence COIN ; Natural Science Foundation of China [11225526] ; Natural Science Foundation of China [11225526]
DOIhttp://dx.doi.org/10.1088/1478-3975/13/2/026004
关键词[WOS]COMMUNITY RECONSTRUCTION ; EFFICIENT COMPUTATION ; SPONTANEOUS MUTATION ; BACTERIA ; PHYLOGENY ; MATTER
收录类别SCI
语种英语
资助者Swedish Science Council [621-2012-2982] ; Swedish Science Council [621-2012-2982] ; Academy of Finland through its Center of Excellence COIN ; Academy of Finland through its Center of Excellence COIN ; Natural Science Foundation of China [11225526] ; Natural Science Foundation of China [11225526] ; Swedish Science Council [621-2012-2982] ; Swedish Science Council [621-2012-2982] ; Academy of Finland through its Center of Excellence COIN ; Academy of Finland through its Center of Excellence COIN ; Natural Science Foundation of China [11225526] ; Natural Science Foundation of China [11225526]
WOS类目Biochemistry & Molecular Biology ; Biophysics
引用统计
文献类型期刊论文
条目标识符http://ir.itp.ac.cn/handle/311006/21704
专题理论物理所2016年知识产出
通讯作者Innocenti, N (reprint author), Hebrew Univ Jerusalem, Sch Comp Sci & Engn, IL-91904 Jerusalem, Israel.
推荐引用方式
GB/T 7714
Aurell, E,Innocenti, N,Zhou, HJ,et al. The bulk and the tail of minimal absent words in genome sequences[J]. PHYSICAL BIOLOGY,2016,13(2):26004.
APA Aurell, E,Innocenti, N,Zhou, HJ,&Innocenti, N .(2016).The bulk and the tail of minimal absent words in genome sequences.PHYSICAL BIOLOGY,13(2),26004.
MLA Aurell, E,et al."The bulk and the tail of minimal absent words in genome sequences".PHYSICAL BIOLOGY 13.2(2016):26004.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
The bulk and the tai(1879KB) 开放获取--请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Aurell, E]的文章
[Innocenti, N]的文章
[Zhou, HJ]的文章
百度学术
百度学术中相似的文章
[Aurell, E]的文章
[Innocenti, N]的文章
[Zhou, HJ]的文章
必应学术
必应学术中相似的文章
[Aurell, E]的文章
[Innocenti, N]的文章
[Zhou, HJ]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。