This study uses simulation-optimization and Reinforcement Learning (RL) to analyze the routing behavior of delivery vehicles (DVs). By conceptualizing the system as a stochastic k-armed bandit problem, the RL model helps DVs modify their routes based on delivery strategy selection. The experiments conducted on a simulated network with realistic traffic conditions show that employing an RL-based decision support system for en-route decision-making enhances the overall efficiency of the transport network.
内容由零声教学AI助手提供,问题来源于学员提问
将下面这段话写成2到3句话的综述引用: This study leverages simulation-optimisation with a Reinforcement Learn- ing (RL) model to analyse the routing behaviour of delivery vehicles (DVs). We conceptualise the system as a stochastic k-armed bandit problem, rep- resen...
本站部分文章来源于网络,版权归原作者所有,如有侵权请联系站长删除。
转载请注明出处:https://sdn.0voice.com/?id=4447
发表列表
评论列表
还没有评论,快来说点什么吧~