A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Observations
The proposed network contains clustering layers that build on earlier work by Afshar et al. (2020) and Bethi et al. (2022), extended with TD-error modulation and eligibility traces.
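To make the roles of the TD error and the eligibility traces concrete, here is a minimal sketch of a textbook TD(lambda) update with accumulating traces. It is a generic illustration, not the learning rule used in this work: it assumes each observation has already been mapped to the index of a single active cluster, and the function name `td_lambda_update` and the parameters `alpha`, `gamma`, and `lam` are hypothetical.

```python
def td_lambda_update(weights, traces, active, next_active, reward,
                     alpha=0.1, gamma=0.9, lam=0.8, terminal=False):
    """One TD(lambda) step over cluster indices (hypothetical helper).

    weights[i] is the value estimate attached to cluster i and traces[i]
    is its eligibility trace. Both lists are updated in place.
    """
    # Decay every eligibility trace, then mark the currently active cluster.
    for i in range(len(traces)):
        traces[i] *= gamma * lam
    traces[active] += 1.0

    # TD error: reward plus discounted next value, minus the current value.
    next_value = 0.0 if terminal else weights[next_active]
    td_error = reward + gamma * next_value - weights[active]

    # The TD error modulates each weight in proportion to its trace, so
    # recently active clusters receive credit for a delayed reward.
    for i in range(len(weights)):
        weights[i] += alpha * td_error * traces[i]
    return td_error


# Toy episode (assumed for illustration): cluster 0 -> cluster 1 -> terminal,
# with a reward of 1 on the final transition.
weights = [0.0, 0.0, 0.0]
traces = [0.0, 0.0, 0.0]
first_error = td_lambda_update(weights, traces, active=0, next_active=1, reward=0.0)
last_error = td_lambda_update(weights, traces, active=1, next_active=2,
                              reward=1.0, terminal=True)
```

In this toy episode the first transition produces no TD error, and the reward on the second transition updates both cluster 1 and, through its decayed trace, cluster 0.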
BibTeX: