ITP OpenIR  > SCI期刊论文
Gao, CY; Zhou, HJ; Aurell, E3,4,5
Correlation-compressed direct-coupling analysis
发表期刊PHYSICAL REVIEW E
语种英语
关键词STRUCTURE PREDICTION PROTEIN-STRUCTURE CONTACTS IDENTIFICATION MODEL SEQUENCES FAMILIES HUMANS
摘要Learning Ising or Potts models from data has become an important topic in statistical physics and computational biology, with applications to predictions of structural contacts in proteins and other areas of biological data analysis. The corresponding inference problems are challenging since the normalization constant (partition function) of the Ising or Potts distribution cannot be computed efficiently on large instances. Different ways to address this issue have resulted in a substantial amount of methodological literature. In this paper we investigate how these methods could be used on much larger data sets than studied previously. We focus on a central aspect, that in practice these inference problems are almost always severely under-sampled, and the operational result is almost always a small set of leading predictions. We therefore explore an approach where the data are prefiltered based on empirical correlations, which can be computed directly even for very large problems. Inference is only used on the much smaller instance in a subsequent step of the analysis. We show that in several relevant model classes such a combined approach gives results of almost the same quality as inference on the whole data set. It can therefore provide a potentially very large computational speedup at the price of only marginal decrease in prediction quality. We also show that the results on whole-genome epistatic couplings that were obtained in a recent computation-intensive study can be retrieved by our approach. The method of this paper hence opens up the possibility to learn parameters describing pairwise dependences among whole genomes in a computationally feasible and expedient manner.
2018
ISSN2470-0045
卷号98期号:3页码:32407
学科领域Physics
学科门类Physics, Fluids & Plasmas ; Physics, Mathematical
DOI10.1103/PhysRevE.98.032407
收录类别SCIE
引用统计
被引频次:4[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.itp.ac.cn/handle/311006/22820
专题SCI期刊论文
计算平台成果
作者单位1.Chinese Acad Sci, Inst Theoret Phys, Key Lab Theoret Phys, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Sch Phys Sci, Beijing 100049, Peoples R China
3.Hunan Normal Univ, Synerget Innovat Ctr Quantum Effects & Applicat, Changsha 410081, Hunan, Peoples R China
4.KTH Royal Inst Technol, Dept Computat Biol, S-10044 Stockholm, Sweden
5.Aalto Univ, Dept Appl Phys, Aalto 00076, Finland
6.Aalto Univ, Dept Comp Sci, Aalto 00076, Finland
推荐引用方式
GB/T 7714
Gao, CY,Zhou, HJ,Aurell, E. Correlation-compressed direct-coupling analysis[J]. PHYSICAL REVIEW E,2018,98(3):32407.
APA Gao, CY,Zhou, HJ,&Aurell, E.(2018).Correlation-compressed direct-coupling analysis.PHYSICAL REVIEW E,98(3),32407.
MLA Gao, CY,et al."Correlation-compressed direct-coupling analysis".PHYSICAL REVIEW E 98.3(2018):32407.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Correlation-compress(4972KB)期刊论文作者接受稿开放获取CC BY-NC-SA请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Gao, CY]的文章
[Zhou, HJ]的文章
[Aurell, E]的文章
百度学术
百度学术中相似的文章
[Gao, CY]的文章
[Zhou, HJ]的文章
[Aurell, E]的文章
必应学术
必应学术中相似的文章
[Gao, CY]的文章
[Zhou, HJ]的文章
[Aurell, E]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。