Treffer: Integrating Q-Learning with Branch and Bound for the Job Shop Scheduling Problem

Title:
Integrating Q-Learning with Branch and Bound for the Job Shop Scheduling Problem
Contributors:
Laboratoire des sciences pour la conception, l'optimisation et la production (G-SCOP), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP), Université Grenoble Alpes (UGA), Design, Engineering and Operation Management of Systems and Services (G-SCOP_DOME2S), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)
Source:
11th IFAC Conference on Manufacturing Modelling. :560-565
Publisher Information:
CCSD, 2025.
Publication Year:
2025
Collection:
collection:UGA
collection:CNRS
collection:INPG
collection:G-SCOP
collection:LORIA2
collection:TDS-MACS
collection:UGA-EPE
collection:G-SCOP_DOME2S
collection:TEST-UGA
Subject Geographic:
Original Identifier:
HAL: hal-05306175
Document Type:
Konferenz conferenceObject<br />Conference papers
Language:
English
Relation:
info:eu-repo/semantics/altIdentifier/doi/10.1016/j.ifacol.2025.09.096
DOI:
10.1016/j.ifacol.2025.09.096
Rights:
info:eu-repo/semantics/OpenAccess
URL: http://creativecommons.org/licenses/by-nc-nd/
Accession Number:
edshal.hal.05306175v1
Database:
HAL

Weitere Informationen

The integration of machine learning (ML), particularly reinforcement learning (RL), techniques with classical optimization methods has shown significant potential in addressing the computational challenges of complex combinatorial problems. This paper presents a novel approach that enhances the Branch and Bound (B&B) algorithm with Q-learning, a type of model-free RL, applied to the Job Shop Scheduling Problem (JSSP). While B&B is a powerful method for finding optimal solutions, its computational demands are often prohibitive for large instances. By incorporating Q-learning into the B&B framework (BBQL), the proposed method learns to prioritize promising branches, thereby improving search efficiency and reducing computational overhead. We evaluate BBQL against the classic B&B algorithm on standard JSSP benchmarks. Experimental results indicate that BBQL achieves better makespan results with significantly lower computational costs, highlighting the potential of reinforcement learning to enhance traditional optimization frameworks in scheduling applications. This research offers insights into how AI-driven guidance can improve exact optimization methods, providing a scalable solution to complex scheduling challenges.