Syed Shahul Hameed, A. S., & Rajagopalan, N. (2023). MABSearch: The Bandit Way of Learning the Learning Rate—A Harmony Between Reinforcement Learning and Gradient Descent. National Academy Science Letters, 1-6. https://doi.org/10.1007/s40009-023-01292-1
ISO-690 (author-date, English)SYED SHAHUL HAMEED, A. S. und RAJAGOPALAN, Narendran, 2023. MABSearch: The Bandit Way of Learning the Learning Rate—A Harmony Between Reinforcement Learning and Gradient Descent. National Academy Science Letters. 4 Juni 2023. P. 1-6. DOI 10.1007/s40009-023-01292-1.
Modern Language Association 9th editionSyed Shahul Hameed, A. S., und N. Rajagopalan. „MABSearch: The Bandit Way of Learning the Learning Rate—A Harmony Between Reinforcement Learning and Gradient Descent“. National Academy Science Letters, Juni 2023, S. 1-6, https://doi.org/10.1007/s40009-023-01292-1.
Mohr Siebeck - Recht (Deutsch - Österreich)Syed Shahul Hameed, A. S./Rajagopalan, Narendran: MABSearch: The Bandit Way of Learning the Learning Rate—A Harmony Between Reinforcement Learning and Gradient Descent, National Academy Science Letters 2023, 1-6.
Emerald - HarvardSyed Shahul Hameed, A.S. und Rajagopalan, N. (2023), „MABSearch: The Bandit Way of Learning the Learning Rate—A Harmony Between Reinforcement Learning and Gradient Descent“, National Academy Science Letters, S. 1-6.