Shi, C., Zhang, S., Lu, W., & Song, R. (2022). Statistical inference of the value function for reinforcement learning in infinite‐horizon settings. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 84(3), 765-793. https://doi.org/10.1111/rssb.12465
ISO-690 (author-date, English)SHI, Chengchun, ZHANG, Sheng, LU, Wenbin und SONG, Rui, 2022. Statistical inference of the value function for reinforcement learning in infinite‐horizon settings. Journal of the Royal Statistical Society: Series B (Statistical Methodology). 1 Juli 2022. Vol. 84, no. 3, p. 765-793. DOI 10.1111/rssb.12465.
Modern Language Association 9th editionShi, C., S. Zhang, W. Lu, und R. Song. „Statistical Inference of the Value Function for Reinforcement Learning in infinite‐horizon Settings.“. Journal of the Royal Statistical Society: Series B (Statistical Methodology), Bd. 84, Nr. 3, Juli 2022, S. 765-93, https://doi.org/10.1111/rssb.12465.
Mohr Siebeck - Recht (Deutsch - Österreich)Shi, Chengchun/Zhang, Sheng/Lu, Wenbin/Song, Rui: Statistical inference of the value function for reinforcement learning in infinite‐horizon settings., Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2022, 765-793.
Emerald - HarvardShi, C., Zhang, S., Lu, W. und Song, R. (2022), „Statistical inference of the value function for reinforcement learning in infinite‐horizon settings.“, Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 84 No. 3, S. 765-793.