Return to Article Details Reinforcement Learning with Reward Shaping for Last-Mile Delivery Dispatch Efficiency Download Download PDF