Text this: Learning decentralized policies with incremental reinforcement learning, reward shaping and self-play learning.