Belief States in POMDPs for Reinforcement Learning (RL)

The earliest work I have found on something like partially observable Markov decision processes (POMDPs) was by Alvin Drake at MIT, who studied decoding outputs from a noisy channel: A. W. Drake, “Observation of a.

In an introductory treatment of reinforcement learning (RL), you probably learned the Markov Decision Process (MDP). There’s just one major problem with this model. In practice.