Result: Reinforcement learning for train dispatching : A study on the possibility to use reinforcement learning to optimize train ordering and minimize train delays in disrupted situations, inside the rail simulator OSRD

Title:

Reinforcement learning for train dispatching : A study on the possibility to use reinforcement learning to optimize train ordering and minimize train delays in disrupted situations, inside the rail simulator OSRD

Authors:

Popescu, Teodora

Publisher Information:

KTH, Skolan för elektroteknik och datavetenskap (EECS) 2022

Document Type:

Electronic Resource

Index Terms:

Electrical Engineering, Electronic Engineering, Information Engineering, Elektroteknik och elektronik, Student thesis, info:eu-repo/semantics/bachelorThesis, text

URL:

http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-319948

Availability:

Open access content.
info:eu-repo/semantics/openAccess

Note:

application/pdf
English

Other Numbers:

UPE oai:DiVA.org:kth-319948
1387571496

Contributing Source:

UPPSALA UNIV LIBR
From OAIster®, provided by the OCLC Cooperative.

Accession Number:

edsoai.on1387571496

Database:

OAIster

Further Information

Train dispatching is a complex process, especially when train traffic is disrupted, as the decisions taken by dispatchers can have substantial consequences for train delays. The most frequent dispatching decisions consist in changing the order of trains at convergence points, where two tracks merge into a single track. Choosing the right train order is crucial, as the trains cannot pass each other again while they are on the single track after the convergence point. The OSRD team of SNCF Réseau has designed the rail simulator OSRD (Open Source Railway Designer), which can simulate any traffic situation. The goal of this degree project was to study whether reinforcement learning could be implemented in that simulator to find optimal ordering policies under traffic disruptions. A thorough literature review was carried out to identify which reinforcement learning models have already been used to handle similar problems. None of the models found in the literature could be adapted directly to the OSRD simulator, but the key features that seemed necessary to build an efficient reinforcement learning model in OSRD were determined. Based on those features and on the specificities of OSRD, a custom reinforcement learning model (states, actions, rewards) was created. This model was then implemented in a Python reinforcement learning environment, after an interactive simulation module had been designed to enable communication between that environment and OSRD. After ensuring that the model ran and could interact with an OSRD simulation to retrieve decisions from it and take decisions which modified the train order, the study focused on which reinforcement learning algorithms could be used to learn from the implemented model. Another in-depth literature review was performed on the existing reinforcement learning algorithms
Train dispatching is a complicated process, especially when train traffic is disrupted, since the decisions made by the dispatchers can have significant consequences for train delays. The most common dispatching decisions consist in changing the order of trains at convergence points, where two tracks merge into a single track. It is important to choose the right train order, because the trains cannot pass each other again while they are on the single track after the convergence point. The OSRD team at SNCF Réseau has designed the rail simulator OSRD (Open Source Railway Designer), which can simulate all traffic situations. The aim of this degree project was to investigate whether reinforcement learning can be implemented in that simulator to find optimal ordering policies under traffic disruptions. A thorough literature review was carried out to identify which reinforcement learning models have already been used in the literature to handle similar problems. None of the models in the literature could be adapted directly to the OSRD simulator, but the most important features that seemed necessary to build an effective reinforcement learning model in OSRD were established. On the basis of these features and the particularities of OSRD, a custom reinforcement learning model (states, actions, rewards) was created. This model was then implemented in a Python reinforcement learning environment, after an interactive simulation module had been designed to enable communication between the Python reinforcement learning environment and OSRD. After ensuring that the model was running and allowed interaction with an OSRD simulation to retrieve decisions from it and make decisions that changed the train order, the study focused on which reinforcement learning algorithms could be used to implement an algorithm that learns from the implemented reinforcement learning model. Another in-depth literature review was carried out on the existing
