Learning Agent State Online With Recurrent Generate-And-Test