TY - JOUR
T1 - Optimal outbound shipment policy for an inventory system with advance demand information
AU - Ralfs, Jana
AU - Pham, Dai T.
AU - Kiesmüller, Gudrun P.
N1 - Publisher Copyright:
© 2025 The Authors
PY - 2025
Y1 - 2025
N2 - This paper examines a single-echelon inventory system that fulfills stochastic orders from a production facility using a time-based shipment consolidation strategy. In this system, the production facility provides advance demand information to the warehouse, ensuring that all orders are placed with a positive demand lead time. Using value iteration, we identify the optimal outbound shipment quantities while accounting for costs related to early deliveries, late deliveries, and shipments. Additionally, this research highlights the impact of advance demand information on transportation capacity planning and the optimization of load factors The results from value iteration enable us to observe the general structure of the optimal dispatch policy, and we determine that it is a multidimensional threshold policy. Based on this observation, we introduce an approximated three-level threshold policy with acceptable performance. Furthermore, the decision itself is easier to interpret and to explain. To analyze large-scale instances, we compare several heuristic policies. First, we develop a deep reinforcement learning algorithm that approximates the value of the post-decision state instead of the pre-decision state. We compare our approach to value iteration and find that our method works very well; the average optimality gap is 0.08%. Additionally, three simple heuristic policies are proposed that might be justifiable in specific situations. Finally, we find that the value of advance demand information does not follow a linear pattern but decreases as the demand lead time increases. Furthermore, the transportation capacity should be planned in the range of the mean demand between two shipments.
AB - This paper examines a single-echelon inventory system that fulfills stochastic orders from a production facility using a time-based shipment consolidation strategy. In this system, the production facility provides advance demand information to the warehouse, ensuring that all orders are placed with a positive demand lead time. Using value iteration, we identify the optimal outbound shipment quantities while accounting for costs related to early deliveries, late deliveries, and shipments. Additionally, this research highlights the impact of advance demand information on transportation capacity planning and the optimization of load factors The results from value iteration enable us to observe the general structure of the optimal dispatch policy, and we determine that it is a multidimensional threshold policy. Based on this observation, we introduce an approximated three-level threshold policy with acceptable performance. Furthermore, the decision itself is easier to interpret and to explain. To analyze large-scale instances, we compare several heuristic policies. First, we develop a deep reinforcement learning algorithm that approximates the value of the post-decision state instead of the pre-decision state. We compare our approach to value iteration and find that our method works very well; the average optimality gap is 0.08%. Additionally, three simple heuristic policies are proposed that might be justifiable in specific situations. Finally, we find that the value of advance demand information does not follow a linear pattern but decreases as the demand lead time increases. Furthermore, the transportation capacity should be planned in the range of the mean demand between two shipments.
KW - Deep reinforcement learning
KW - Inventory management
KW - Optimal policy
KW - Post-decision state
KW - Shipment consolidation
UR - http://www.scopus.com/inward/record.url?scp=85216897932&partnerID=8YFLogxK
U2 - 10.1016/j.ejor.2025.01.020
DO - 10.1016/j.ejor.2025.01.020
M3 - Article
AN - SCOPUS:85216897932
SN - 0377-2217
JO - European Journal of Operational Research
JF - European Journal of Operational Research
ER -