LAPSE:2025.0379v1
Published Article

LAPSE:2025.0379v1
Handling discrete decisions in bilevel optimization via neural network embeddings
June 27, 2025
Abstract
Bilevel optimization is an active area of research within the operations research community due to its ability to capture the interdependencies between two levels of decisions. This study introduces a metamodeling approach for addressing mixed-integer bilevel optimization problems, exploiting the approximation capabilities of neural networks. The proposed methodology employs neural network embeddings to approximate the optimal follower's response, bypassing the inner optimization problem by parametrizing it with continuous leaders decisions. The use of Rectified Linear Unit (ReLU) activations allows the forward pass of the neural network to be represented as a set of mixed-integer linear constraints. Thereby, the bilevel structure is simplified into a single-level optimization model. A case study based on a two-echelon supply chain demonstrates the effectiveness of the approach, with solutions comparable to traditional bilevel optimization methods. The results suggest that neural network embeddings offer a promising alternative for tackling complex bilevel problems even when discrete decisions are involved and unveil the trade-off between prediction accuracy and computational demands. This methodology provides a versatile and efficient framework for solving mixed-integer bilevel optimization problems across diverse domains.
Bilevel optimization is an active area of research within the operations research community due to its ability to capture the interdependencies between two levels of decisions. This study introduces a metamodeling approach for addressing mixed-integer bilevel optimization problems, exploiting the approximation capabilities of neural networks. The proposed methodology employs neural network embeddings to approximate the optimal follower's response, bypassing the inner optimization problem by parametrizing it with continuous leaders decisions. The use of Rectified Linear Unit (ReLU) activations allows the forward pass of the neural network to be represented as a set of mixed-integer linear constraints. Thereby, the bilevel structure is simplified into a single-level optimization model. A case study based on a two-echelon supply chain demonstrates the effectiveness of the approach, with solutions comparable to traditional bilevel optimization methods. The results suggest that neural network embeddings offer a promising alternative for tackling complex bilevel problems even when discrete decisions are involved and unveil the trade-off between prediction accuracy and computational demands. This methodology provides a versatile and efficient framework for solving mixed-integer bilevel optimization problems across diverse domains.
Record ID
Keywords
Bilevel Optimization, MILP reformulation, Neural Network Embeddings, Supply Chain Planning, Surrogate Modelling
Subject
Suggested Citation
Moreno-Palancas IF, Díaz RS, Femenia RR, Caballero JA. Handling discrete decisions in bilevel optimization via neural network embeddings. Systems and Control Transactions 4:1415-1420 (2025) https://doi.org/10.69997/sct.175350
Author Affiliations
Moreno-Palancas IF: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Díaz RS: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Femenia RR: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Caballero JA: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Díaz RS: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Femenia RR: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Caballero JA: University of Alicante, Department of Chemical Engineering, San Vicente del Raspeig, Alicante, Spain
Journal Name
Systems and Control Transactions
Volume
4
First Page
1415
Last Page
1420
Year
2025
Publication Date
2025-07-01
Version Comments
Original Submission
Other Meta
PII: 1415-1420-1371-SCT-4-2025, Publication Type: Journal Article
Record Map
Published Article

LAPSE:2025.0379v1
This Record
External Link

https://doi.org/10.69997/sct.175350
Article DOI
Download
Meta
Record Statistics
Record Views
716
Version History
[v1] (Original Submission)
Jun 27, 2025
Verified by curator on
Jun 27, 2025
This Version Number
v1
Citations
Most Recent
This Version
URL Here
https://psecommunity.org/LAPSE:2025.0379v1
Record Owner
PSE Press
Links to Related Works
References Cited
- Moore, J. T. and J. F. Bard (1990). The mixed integer linear bilevel programming problem. Oper. Res. 38(5):911-921 https://doi.org/10.1287/opre.38.5.911
- Kleinert T, Labbé M, Ljubic I, Schmidt M. A Survey on Mixed-Integer Programming Techniques in Bilevel Optimization. EURO J. Comput. Optim. 9: 100007 (2021) https://doi.org/10.1016/j.ejco.2021.100007
- Qiang Z, Yefei Y, Fangfang M. A Stackelberg-based deep reinforcement learning approach for dynamic cooperative advertising in a two-echelon supply chain. Comp. Chem. Eng. 196:109048 (2025) https://doi.org/10.1016/j.compchemeng.2025.109048
- Moti M, Uddin RS, Hai MA, Saleh TB, Alam MGR, Hassan MM, Hassan MR, Blockchain Based Smart-Grid Stackelberg Model for Electricity Trading and Price Forecasting Using Reinforcement Learning. Appl. Sci. 12(10):5144 (2022) https://doi.org/10.3390/app12105144
- Jiménez RX. Bilevel optimisation with embedded neural networks: Application to scheduling and control integration. arXiv (2023)
- Vlah D, Sepetanc K, Pandzic H. Solving bilevel optimal bidding problems using deep convolutional neural networks. IEEE Syst. J. 17(2): 2767-2778 (2023) https://doi.org/10.1109/JSYST.2022.3232942
- Dumouchelle J, Julien E, Kurtz J, Khalil E. Neur2BiLO: Neural Bilevel Optimization. arXiv (2024)
- Dumouchelle J, Patel R, Khalil EB, Bodur M. Neur2SP: Neural Two-Stage Stochastic Programming. arXiv (2022)
- Aftabi N, Moradi N, Mahroo F. Feed-forward neural networks as mixed-integer program. arXiv (2024) https://doi.org/10.1007/s00366-025-02114-2
- Serra T, Tjandraatmadja C, Ramalingam S. Bounding and counting linear regions of deep neural networks. arXiv (2017)
- Yue D, You F. Stackelberg-game-based modeling and optimization for supply chain design and operations: A mixed integer bilevel programming framework. Comput. Chem. Eng. 102:81-95 (2017) https://doi.org/10.1016/j.compchemeng.2016.07.026
- Smith LN. Cyclical Learning Rates for Training Neural Networks. IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA. 464-472 (2017) https://doi.org/10.1109/WACV.2017.58

