Proceedings of the 2003 International Conference on Machine Learning and Cybernetics

Abstract

Reinforcement learning agents often acquire incorrect action-values in some states when the environment exhibits problems such as perceptual aliasing. This is especially serious for reinforcement learning methods that use bootstrapping, because bootstrapping propagates the incorrect action-values to other states. To address this problem, we propose DBLA, in which the agent skips aliased states and performs the backup from the first non-aliased state. We demonstrate the effectiveness of DBLA on a grid-world maze example. The results show that this method greatly reduces the influence of the incorrect action-values.
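The core idea, as described in the abstract, can be illustrated with a minimal sketch. This is not the paper's actual algorithm specification; the function name, data layout, and the episodic (trajectory-based) formulation are assumptions made for illustration. The sketch modifies a standard Q-learning backup so that, when the successor state is aliased, the agent accumulates discounted rewards past the aliased states and forms the bootstrap target from the first non-aliased state instead.

```python
def dbla_backup(Q, trajectory, aliased, alpha=0.1, gamma=0.9):
    """Hypothetical sketch of a DBLA-style backup over one episode.

    Q:          dict mapping state -> dict mapping action -> value.
    trajectory: list of (state, action, reward) tuples, in order.
    aliased:    set of states known to suffer perceptual aliasing.
    """
    n = len(trajectory)
    for i in range(n):
        s, a, r = trajectory[i]
        # Accumulate discounted reward while successor states are
        # aliased, so the bootstrap target is taken from the first
        # non-aliased state rather than from an aliased one.
        g, discount, j = r, gamma, i + 1
        while j < n and trajectory[j][0] in aliased:
            g += discount * trajectory[j][2]
            discount *= gamma
            j += 1
        if j < n:
            s_next = trajectory[j][0]
            target = g + discount * max(Q[s_next].values())
        else:
            # Episode ended before reaching a non-aliased state:
            # fall back to the accumulated return with no bootstrap.
            target = g
        Q[s][a] += alpha * (target - Q[s][a])
```

Under this sketch, an aliased state's (possibly wrong) action-values are never used as a bootstrap target, which is the mechanism the abstract credits for limiting the spread of incorrect values.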
