Reinforcement Learning on FusePop