中国科学院理论物理研究所机构知识库
Advanced  
ITP OpenIR  > 理论物理所1978-2010年知识产出  > 期刊论文
题名: Minimum entropy approach to word segmentation problems
作者: Wang, B
刊名: PHYSICA A
出版日期: 2001
卷号: 293, 期号:40972, 页码:583-591
关键词: SEQUENCES
学科分类: Physics
通讯作者: Wang, B , Chinese Acad Sci, Inst Theoret Phys, POB 2735, Beijing 100080, Peoples R China.
部门归属: Chinese Acad Sci, Inst Theoret Phys, Beijing 100080, Peoples R China; Inst Computat Math & Sci Engn Comp, State Key Lab Sci & Engn Comp, Beijing 100080, Peoples R China
英文摘要: Given a sequence composed of a limited number of characters, we try to "read" it as a "text", This involves segmenting the sequence into "words". The difficulty is to distinguish good segmentation from enormous numbers of random ones. Aiming at revealing the nonrandomness of the sequence as strongly as possible, by applying maximum likelihood method, we find a quantity called segmentation entropy that can be used to fulfill the aim. Contrary to commonplace where maximum entropy principle was applied to obtain good solution, we chose to minimize the segmentation entropy to obtain good segmentation. The concept developed in this letter carl be used to study the noncoding DNA sequences, e.g,, for regulatory elements prediction, in eukaryote genomes. (C) 2001 Elsevier Science B.V. All rights reserved.
收录类别: SCI
原文出处: 查看原文
WOS记录号: WOS:000168730500023
Citation statistics: 
内容类型: 期刊论文
URI标识: http://ir.itp.ac.cn/handle/311006/12775
Appears in Collections:理论物理所1978-2010年知识产出_期刊论文

Files in This Item:
File Name/ File Size Content Type Version Access License
Minimum entropy approach to word segmentation problems.pdf(133KB)----开放获取View Download

Recommended Citation:
Wang, B. Minimum entropy approach to word segmentation problems[J]. PHYSICA A,2001,293(40972):583-591.
Service
 Recommend this item
 Sava as my favorate item
 Show this item's statistics
 Export Endnote File
Google Scholar
 Similar articles in Google Scholar
 [Wang, B]'s Articles
CSDL cross search
 Similar articles in CSDL Cross Search
 [Wang, B]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
  Add to CiteULike  Add to Connotea  Add to Del.icio.us  Add to Digg  Add to Reddit 
文件名: Minimum entropy approach to word segmentation problems.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2017  中国科学院理论物理研究所 - Feedback
Powered by CSpace