|Table of Contents|

An Multiagent Reinforcement Learning Based on Partition and Integration(PDF)

南京师范大学学报(工程技术版)[ISSN:1006-6977/CN:61-1281/TN]

Issue:
2008年04期
Page:
59-62
Research Field:
Publishing date:

Info

Title:
An Multiagent Reinforcement Learning Based on Partition and Integration
Author(s):
Wang YunHan Wei
Information Science and Engineering College,Nanjing University of Financial and Economics,Nanjing 210046,China
Keywords:
mu ltiagent system re inforcem ent learn ing state- space partition
PACS:
TP301
DOI:
-
Abstract:
To counte r for the prob lem of slow ly convergence of Q leaning w hen com e ing to large state-space, the paper pu ts forw ard an a lgo rithm w hich divide the sta tes space acco rd ing to learn ing e rrors. The basic idea o f our algor ithm is to d iscover the m atch ing re lationship be tw een ag ents and the sub- space o f sta tes space. The sim ulations in g rids w ith b locks ind icate that the algorithm perform s betterw hen com e ing to on- line learning.

References:

[ 1] 刘海涛, 洪炳熔, 朴松昊, 等. 不确定环境下基于进化算法的强化学习[ J]. 电子学报, 2006, 7( 34): 1 356-1 360.
Liu H aitao, H ong B ingrong, Pu Songhao, et a.l Evolutionary a lgor ithm based re inforcem ent learn ing in the uncerta in env ironm ents[ J]. A cta E lectron ica S in ica. 2006( 34) 7: 1 356-1 360. ( in Ch inese)
[ 2] 韩伟, 陈优广, 姜昌华. 基于内省推理的多agent在线学习新方法[ J] . 模式识别与人工智能, 2007, 20( 2) : 254-260.
H anW e,i Chen Youguang, Jiang Changhua. An Interna l- In fe rence BasedM ultiagent Learning M ethod[ J]. Pattern Recognition & Artificia l Inte lligence. 2007, 20( 2): 254-260. ( in Ch inese)
[ 3] 韩伟. 基于情节序列训练的电子市场智能定价算法[ J] . 计算机工程与应用, 2007, 43( 6): 17-19.
H anW e.i Intelligent pr ic ing a lgo rithm based on mu ltiagent lea rning [ J]. Com pute r Eng ineer ing and Applica tions. 2007( 43)6: 17-19. ( in Chinese)
[ 4] 文益民, 杨旸, 吕宝粮. 集成学习算法在增量学习中的应用研究[ J] . 计算机研究与发展, 2005, 42: 222-227.
W en Y im in, Y ang Yang, L?Bao liang. Rescearch o f the app lication ensem ble lea rning algor ithm s to increm enta l lea rn ing [ J].Jou rnal of Com puter Research and Developm ent, 2005, 42: 222-227. ( in Ch inese)

Memo

Memo:
-
Last Update: 2013-04-24