Reinforcement Learning Based Path Optimization for Multi-Trip Green Vehicles with Soft Time Window
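Cite this article: Yao Lijun, Wang Kejun. Reinforcement Learning Based Path Optimization for Multi-Trip Green Vehicles with Soft Time Window [J]. Computing Technology and Automation (计算技术与自动化), 2024, (4): 153-160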
Author affiliation
Yao Lijun, Wang Kejun (Hunan Changzhutan Tobacco Logistics Co., Ltd., Changsha, Hunan 410004)
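Abstract (translated from Chinese): To help the logistics industry respond to the "carbon peaking" and "carbon neutrality" goals and to speed up the establishment and development of a green logistics industry, a multi-trip green vehicle routing optimization model with soft time window constraints is first constructed, taking into account fuel consumption, carbon emissions, labor, vehicles, and user experience. An Actor-Critic deep reinforcement learning optimization algorithm is then improved by combining the PinSAGE graph network, TRPO, and GAE. Finally, the resulting Actor-Critic algorithm is used to solve the model and produce green multi-trip vehicle routing plans. Experiments show that the proposed method plans green vehicle routes efficiently, which in turn significantly reduces logistics costs and improves both the economic and the environmental performance of logistics enterprises.
Keywords (translated from Chinese): green logistics  soft time window  deep reinforcement learning  Actor-Critic framework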
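For readers unfamiliar with soft time windows, the idea can be sketched as a penalty added to the route cost when a vehicle arrives outside a customer's preferred interval, instead of forbidding the visit outright. The abstract does not state the penalty form used in the paper; the linear form below, the function name `soft_time_window_penalty`, and the rates `early_rate` and `late_rate` are assumptions chosen only for illustration.

```python
def soft_time_window_penalty(arrival, earliest, latest,
                             early_rate=1.0, late_rate=2.0):
    """Illustrative linear soft time-window penalty (assumed form, not the paper's).

    arrival  -- time the vehicle reaches the customer
    earliest -- start of the customer's preferred window
    latest   -- end of the customer's preferred window
    early_rate, late_rate -- hypothetical cost per unit of time early or late
    """
    if arrival < earliest:
        # Arriving early: cost grows with the waiting time.
        return early_rate * (earliest - arrival)
    if arrival > latest:
        # Arriving late: cost grows with the delay.
        return late_rate * (arrival - latest)
    # Inside the window: no penalty.
    return 0.0


if __name__ == "__main__":
    # A customer prefers service between 9.0 and 11.0 (hours).
    for t in (8.0, 10.0, 12.5):
        print(t, soft_time_window_penalty(t, 9.0, 11.0))
```

A hard time window would instead reject any route with an arrival outside the interval; the soft version keeps such routes feasible and lets the optimizer trade lateness against the other cost terms.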
 
Reinforcement Learning Based Path Optimization for Multi-Trip Green Vehicles with Soft Time Window
Abstract: To help the logistics industry meet the goals of peak carbon dioxide emissions and carbon neutrality, and to accelerate the establishment and development of a green logistics industry, a multi-trip green vehicle routing optimization model with soft time window constraints is first constructed by comprehensively considering fuel consumption, carbon emissions, manpower, vehicles, and user experience. Subsequently, the PinSAGE graph network, TRPO, and GAE methods are combined to enhance an Actor-Critic deep reinforcement learning optimization algorithm. Finally, the Actor-Critic algorithm is employed to solve the model and obtain green multi-trip vehicle routing schemes. Experiments indicate that the proposed solution method plans green vehicle routes efficiently, which in turn significantly reduces logistics costs and achieves the dual optimization of economic and environmental benefits for logistics enterprises.
Keywords: green logistics  soft time window  deep reinforcement learning  Actor-Critic framework
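Of the components named in the abstract, GAE (generalized advantage estimation) is the easiest to show in isolation. The sketch below computes the standard GAE recursion, A_t = δ_t + γλ·A_{t+1} with δ_t = r_t + γ·V(s_{t+1}) − V(s_t), for a single finished episode. It is a generic illustration, not the paper's implementation: the function name `gae_advantages`, the default values of `gamma` and `lam`, and the sample numbers are assumptions.

```python
def gae_advantages(rewards, values, gamma=0.99, lam=0.95):
    """Generalized advantage estimation for one trajectory (generic sketch).

    rewards -- list of per-step rewards r_0 .. r_{T-1}
    values  -- list of critic estimates V(s_0) .. V(s_T); one entry longer
               than rewards, the last being the bootstrap value
    gamma   -- discount factor (assumed default)
    lam     -- GAE smoothing parameter lambda (assumed default)
    """
    if len(values) != len(rewards) + 1:
        raise ValueError("values must have exactly one more entry than rewards")
    advantages = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step can reuse the advantage of the next step.
    for t in reversed(range(len(rewards))):
        delta = rewards[t] + gamma * values[t + 1] - values[t]  # TD residual
        running = delta + gamma * lam * running
        advantages[t] = running
    return advantages


if __name__ == "__main__":
    # Hypothetical three-step episode with a zero-valued critic.
    print(gae_advantages([1.0, 1.0, 1.0], [0.0, 0.0, 0.0, 0.0],
                         gamma=1.0, lam=1.0))
```

With `lam=0` the estimate reduces to the one-step TD residual, and with `lam=1` it becomes the discounted return minus the critic's baseline; intermediate values trade bias against variance in the advantages fed to the actor update.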