POC: Full Learned Sokoban experiment (!302) · Merge requests · Awarelab / Alpacka

This is a proof-of-concept MR (not intended to be merged) that replicates a full experiment with online learning of a Sokoban model.
