
rl-algo-impls MicroRTS Training

Created on June 10 | Last edited on June 10
Model used in colab_microrts_demo.ipynb
  • PPO training with phased rewards
  • Self-play training: 6 environments play against the latest model, and 12 environments play against models from the last 10 million steps
  • Trained with 6 maps:
    • 16x16/basesWorkers16x16A.xml
    • 16x16/TwoBasesBarracks16x16.xml
    • 8x8/basesWorkers8x8A.xml
    • 8x8/FourBasesWorkers8x8.xml
    • NoWhereToRun9x8.xml
    • 16x16/EightBasesWorkers16x16.xml (not a public competition map)
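
The self-play split described above (6 environments against the latest model, 12 against models from the last 10 million steps) can be illustrated with a minimal sketch. Every name here (`Checkpoint`, `assign_opponents`, the constants) is hypothetical and not taken from rl-algo-impls. Uniform sampling of past models is also an assumption, since the report does not say how past opponents are chosen.

```python
import random
from dataclasses import dataclass

# Hypothetical constants mirroring the numbers in the report.
NUM_LATEST_ENVS = 6        # environments that face the latest model
NUM_PAST_ENVS = 12         # environments that face recent past models
WINDOW_STEPS = 10_000_000  # how far back past opponents may come from


@dataclass(frozen=True)
class Checkpoint:
    step: int  # training step at which the snapshot was saved


def assign_opponents(checkpoints, current_step, rng):
    """Return one opponent checkpoint per environment (18 in total)."""
    latest = max(checkpoints, key=lambda c: c.step)
    # Keep only snapshots saved within the last WINDOW_STEPS steps.
    recent = [c for c in checkpoints if 0 <= current_step - c.step <= WINDOW_STEPS]
    # Assumption: past opponents are drawn uniformly with replacement.
    past = [rng.choice(recent) for _ in range(NUM_PAST_ENVS)]
    return [latest] * NUM_LATEST_ENVS + past


# Example: snapshots every 2M steps, currently at step 30M.
checkpoints = [Checkpoint(step=s) for s in range(0, 30_000_001, 2_000_000)]
opponents = assign_opponents(checkpoints, current_step=30_000_000, rng=random.Random(0))
```

In this sketch the first 6 entries of `opponents` are always the newest snapshot, and the remaining 12 come from snapshots at step 20M or later.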

Run set
1


