
Research on Cooperation Strategies in Multiagent Reinforcement Learning (多主體強化學習協作策略研究)

Price: ¥48.00

Authors: Sun Ruoying (孫若瑩), Zhao Gang (趙鋼)
Publisher: Tsinghua University Press
Tags: Computers/Networks, Artificial Intelligence

ISBN: 9787302368304
Publication date: 2014-08-01
Binding: Paperback
Format: 16mo
Pages: 164

Description

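Multiagent systems have been an active area of research and application in recent years. Multiagent reinforcement learning theory and methods, together with multiagent cooperation strategies, are important research directions in this area. They have broad theoretical and practical value and draw attention from researchers in computer applications, artificial intelligence, automatic control, and economics and management.

The book introduces the basic concepts and foundations of multiagent systems, reinforcement learning, and multiagent cooperation; describes the development and recent trends of research on multiagent reinforcement learning and cooperation strategies; examines the theory and methods of multiagent reinforcement learning and cooperation strategies in depth; and analyzes how they are applied in related research areas. It is clearly organized, defines its basic concepts clearly, and uses figures and tables for intuitive analysis, with an emphasis on systematic and practical content.

By reading it, readers can learn the theory and methods of multiagent reinforcement learning and cooperation strategies, and also how to apply these research results in practical work. The book can serve as a reference for researchers in computer applications, artificial intelligence, automatic control, and economics and management. Graduate students in related fields and artificial intelligence enthusiasts may also find it useful.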

About the Authors

No author biography is currently available for this book.

Table of Contents

Chapter 1 Introduction
1.1 Reinforcement Learning
1.1.1 Generality of Reinforcement Learning
1.1.2 Reinforcement Learning on Markov Decision Processes
1.1.3 Integrating Reinforcement Learning into Agent Architecture
1.2 Multiagent Reinforcement Learning
1.2.1 Multiagent Systems
1.2.2 Reinforcement Learning in Multiagent Systems
1.2.3 Learning and Coordination in Multiagent Systems
1.3 Ant System for Stochastic Combinatorial Optimization
1.3.1 Ants Forage Behavior
1.3.2 Ant Colony Optimization
1.3.3 MAX-MIN Ant System
1.4 Motivations and Consequences
1.5 Book Summary
Bibliography
Chapter 2 Reinforcement Learning and Its Combination with Ant Colony System
2.1 Introduction
2.2 Investigation into Reinforcement Learning and Swarm Intelligence
2.2.1 Temporal Differences Learning Method
2.2.2 Active Exploration and Experience Replay in Reinforcement Learning
2.2.3 Ant Colony System for Traveling Salesman Problem
2.3 The Q-ACS Multiagent Learning Method
2.3.1 The Q-ACS Learning Algorithm
2.3.2 Some Properties of the Q-ACS Learning Method
2.3.3 Relation with Ant-Q Learning Method
2.4 Simulations and Results
2.5 Conclusions
Bibliography
Chapter 3 Multiagent Learning Methods Based on Indirect Media Information Sharing
3.1 Introduction
3.2 The Multiagent Learning Method Considering Statistics Features
3.2.1 Accelerated K-certainty Exploration
3.2.2 The T-ACS Learning Algorithm
3.3 The Heterogeneous Agents Learning
3.3.1 The D-ACS Learning Algorithm
3.3.2 Some Discussions about the D-ACS Learning Algorithm
3.4 Comparisons with Related State-of-the-arts
3.5 Simulations and Results
3.5.1 Experimental Results on Hunter Game
3.5.2 Experimental Results on Traveling Salesman Problem
3.6 Conclusions
Bibliography
Chapter 4 Action Conversion Mechanism in Multiagent Reinforcement Learning
4.1 Introduction
4.2 Model-Based Reinforcement Learning
4.2.1 Dyna-Q Architecture
4.2.2 Prioritized Sweeping Method
4.2.3 Minimax Search and Reinforcement Learning
4.2.4 RTP-Q Learning
4.3 The Q-ac Multiagent Reinforcement Learning
4.3.1 Task Model
4.3.2 Converting Action
4.3.3 Multiagent Cooperation Methods
4.3.4 Q-value Update
4.3.5 The Q-ac Learning Algorithm
4.3.6 Using Adversarial Action Instead of ε Probability Exploration
……
Chapter 5 Multiagent Learning Approaches Applied to Vehicle Routing Problems
Chapter 6 Multiagent Learning Methods Applied to Multicast Routing Problems
Chapter 7 Multiagent Reinforcement Learning for Supply Chain Management
Chapter 8 Multiagent Learning Applied in Supply Chain Ordering Management
