Result: A new strategy for incorporating Gaussian process dynamic models into stochastic dynamic programming

Title:

A new strategy for incorporating Gaussian process dynamic models into stochastic dynamic programming

Authors:

Mahmoudi Filabadi, Mohammad, Lefebvre, Tom, Crevecoeur, Guillaume

Source:

2025 European Control Conference (ECC) ; ISSN: 2996-8895 ; ISBN: 9783907144121

Publisher Information:

IEEE

Publication Year:

2025

Collection:

Ghent University Academic Bibliography

Subject Terms:

Technology and Engineering

Document Type:

Conference object

File Description:

application/pdf

Language:

English

ISBN:

978-3-907144-12-1
3-907144-12-0

Relation:

https://biblio.ugent.be/publication/01K7HGQGRK40DGEGFDHME7YSE7; https://doi.org/10.23919/ECC65951.2025.11187069; https://biblio.ugent.be/publication/01K7HGQGRK40DGEGFDHME7YSE7/file/01K7HHM9922QZBZCQTFTCANMP0

DOI:

10.23919/ECC65951.2025.11187069

Availability:

https://biblio.ugent.be/publication/01K7HGQGRK40DGEGFDHME7YSE7
https://hdl.handle.net/1854/LU-01K7HGQGRK40DGEGFDHME7YSE7
https://doi.org/10.23919/ECC65951.2025.11187069
https://biblio.ugent.be/publication/01K7HGQGRK40DGEGFDHME7YSE7/file/01K7HHM9922QZBZCQTFTCANMP0

Rights:

info:eu-repo/semantics/restrictedAccess

Accession Number:

edsbas.E536B4BA

Database:

BASE

Further information

This paper proposes a local solution method tailored to stochastic optimal control problems with Gaussian process (GP) representation of the dynamics leaning on the stochastic dynamic programming (DP) approach. We explore two methods - Fourier-Hermite DP (FHDP) and its recent extension, Fourier-Hermite Probabilistic DP (FHPDP) - for incorporating GP-based model learning. Compared to other model learning techniques, GP-based model learning explicitly quantifies model uncertainty and mitigates the effects of structural model errors. These Fourier-Hermite methods provide derivative-free versions of the differential dynamic programming (DDP) method through iterative backward-forward sweeps using sigma-point integration schemes for probabilistic value function approximation. Unlike the deterministic nature of the state-of-the-art GP-based DDP methods, the probabilistic foundation of the Fourier-Hermite methods makes them well-suited for integrating GPs. Therefore, we leverage GP-based forward uncertainty propagation within the Fourier-Hermite methods to propose sample-efficient data-driven methods, called GP-FHDP and GP-FHPDP, that can be applied to both stochastic and risk-sensitive optimal control problems. Furthermore, our methods can actively adjust exploration based on the uncertainty level, leading to accelerated convergence. The capabilities of the proposed algorithms are demonstrated on a simulated nonlinear vehicle system.
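
The abstract refers to sigma-point integration schemes for approximating expectations under Gaussian uncertainty. As a rough illustration of that general idea only, and not the paper's GP-FHDP or GP-FHPDP algorithm, the sketch below approximates E[f(x)] for x ~ N(mean, cov) with a tensor-product Gauss-Hermite rule. The function name `gauss_hermite_expectation`, its parameters, and the choice of Gauss-Hermite over other sigma-point rules are assumptions made here for illustration.

```python
import itertools

import numpy as np
from numpy.polynomial.hermite_e import hermegauss


def gauss_hermite_expectation(f, mean, cov, order=3):
    """Approximate E[f(x)] for x ~ N(mean, cov) with Gauss-Hermite sigma points.

    Hypothetical helper for illustration; not taken from the paper.
    """
    mean = np.atleast_1d(np.asarray(mean, dtype=float))
    cov = np.atleast_2d(np.asarray(cov, dtype=float))
    n = mean.size

    # 1-D nodes and weights for the weight function exp(-x^2 / 2).
    nodes, weights = hermegauss(order)
    # Normalise so the weights sum to 1 (standard normal density).
    weights = weights / np.sqrt(2.0 * np.pi)

    # The Cholesky factor maps unit sigma points to the target Gaussian.
    chol = np.linalg.cholesky(cov)

    total = 0.0
    # Tensor-product grid: order**n sigma points in n dimensions.
    for idx in itertools.product(range(order), repeat=n):
        idx = list(idx)
        unit_point = nodes[idx]
        weight = np.prod(weights[idx])
        total = total + weight * f(mean + chol @ unit_point)
    return total


# Usage: E[x^T x] equals trace(cov) + mean^T mean for a Gaussian.
mean = np.array([1.0, 2.0])
cov = np.array([[2.0, 0.5], [0.5, 1.0]])
approx = gauss_hermite_expectation(lambda x: x @ x, mean, cov)
exact = np.trace(cov) + mean @ mean
```

The tensor-product grid grows exponentially with the state dimension, so this form is only practical for small systems; sparser sigma-point rules trade some accuracy for fewer function evaluations.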
