Please use this identifier to cite or link to this item:
http://cmuir.cmu.ac.th/jspui/handle/6653943832/72757
Title: | Integration of the A2C Algorithm for Production Scheduling in a Two-Stage Hybrid Flow Shop Environment |
Authors: | Falk T. Gerpott Sebastian Lang Tobias Reggelin Hartmut Zadek Poti Chaopaisarn Sakgasem Ramingwong |
Authors: | Falk T. Gerpott Sebastian Lang Tobias Reggelin Hartmut Zadek Poti Chaopaisarn Sakgasem Ramingwong |
Keywords: | Computer Science |
Issue Date: | 1-Jan-2022 |
Abstract: | The paper introduces an approach to apply reinforcement learning (RL) for production scheduling in a two-stage hybrid flow shop (THFS) production system. The Advantage-Actor Critic (A2C) method is used to train multiple agents to minimize the total tardiness and makespan of a production program. The two-stage hybrid flow shop scheduling problem is a NP-hard combinatorial optimization problem that describes a production system with two stages, each consisting of a set of parallel machines. Our concept combines a Discrete-Event Simulation with a pre-implemented RL algorithm using Stable Baselines3. Since similar research often lacks concrete implementation information, the configuration of the OpenAI Gym interface and the agent-environment interaction is presented. |
URI: | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85127829514&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/72757 |
ISSN: | 18770509 |
Appears in Collections: | CMUL: Journal Articles |
Files in This Item:
There are no files associated with this item.
Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.