MicroRTS 64x64

Created on June 22|Last edited on June 28
Comment
﻿
﻿
eval/microrts_results_win
eval/microrts_results_win
050M100M150M200Mglobal_step00.10.20.30.4
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-23T16:32:29.790708
ppo-Microrts-squnet-map64-64ch-selfplay-S1-2023-06-22T00:54:18.964106
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-21T01:26:44.862949
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T23:30:08.340275
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T22:45:14.778231
eval/results_WinLoss
eval/results_WinLoss
050M100M150M200Mglobal_step-0.6-0.4-0.20
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-23T16:32:29.790708
ppo-Microrts-squnet-map64-64ch-selfplay-S1-2023-06-22T00:54:18.964106
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-21T01:26:44.862949
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T23:30:08.340275
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T22:45:14.778231
Run set5
﻿
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-23T16:32:29.790708
cb507da Minibatch size of 258 means 24 minibatches…
19113cb Microrts-squnet-map64-128ch-selfplay
e6b81cb Add some early map32 and map64 models to sub agent
71d90f9 Microrts-dc-repro-map16-selfplay
566c7d3 Tweaks to kernel size on down and up conv2d
a37b313 Try n_envs 48 and n_steps 512 again given map fix
b5dae04 Another fix to allow observation_space size 8
9d8ba13 A100 specific map8 hyperparams, more envs fewer steps
0f4eba7 map8 reduce n_envs (and batch size) to match map12
2a26838 map8 is now a valid size
30159f5 map8 trains on only 8x8 maps with larger batch size
ppo-Microrts-squnet-map64-64ch-selfplay-S1-2023-06-22T00:54:18.964106
56f3537 Still have to use 512 batch size for 64ch map64
c438cb3 Rename envs with 64ch, deprecate non 64ch
d01c6e3 “Deprecate” _Microrts-squnet-map12-selfplay
eccf7f1 Evaluate map32 every 5M steps, map64 10M steps
c4bed6d map64 uses 64 channels across all levels
9ef4d96 Agent support for multiple models based on map size
37989c2 Fix ACTION_TYPE_TO_ACTION_INDEXES checking
97dd419 Restore prior model structure to be able to run old models
7b56dad Separate combat produce reward between types
74171a4 No val-clip squnet
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-21T01:26:44.862949
fddc017 128 channels across all levels for map32 and 64
c0a4e6c Need to remember to update both references
c1f666d Catch exceptions from non-RAI AIs
c80ad07 Fix unhandled argument crash
51aaf61 Compiled resources observation into jar
4b86d47 Add player and opponent resources to observation
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T23:30:08.340275
477fa93 map64: Make network narrower at the top
﻿ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T22:45:14.778231﻿
bec93ad map64: 16 envs to meet map training invariants
ee41e07 map64: Half envs and rollout steps
607b2e2 48 minibatches for 64x64 maps…
e0eba44 64x64 requires 32 minibatches…
aa45849 Reduce minibatch size for 32 and 64 maps for memory
cbad659 16, 32, and 64 Microrts squnet hyperparameters
855e79b Critic heads use same strides as used by the unet
ab7d32a wide map12 and fix to critic_heads num_channels
8618698 Remove unnecessary env make_kwarg
ef3675c Forgot to set bot alternating player during training
ccc490f Fix squnet-map12 eval env overrides
083c565 Set squnet-map12 to use 8x8 map for eval﻿﻿
6d5785f Microrts-squnet-map12 uses one minibatch
9189e1a min grid size is 12
1d93b6f Use ModuleList to register list of modules with Pytorch
f990525 Accidentally deleted a hyperparam copy reference
d78da94 SqueezeUnet training on 8x8-12x12 maps
b1f0d91 Down-res convolutition should have GELU non-linearity
﻿
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-23T16:32:29.790708
﻿
Run set1
﻿
ppo-Microrts-squnet-map64-64ch-selfplay-S1-2023-06-22T00:54:18.964106
﻿
Run set1
﻿
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-21T01:26:44.862949
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T23:30:08.340275
﻿
﻿
Run set1
﻿
﻿ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T22:45:14.778231﻿
﻿
Run set1
﻿
﻿﻿﻿
﻿
Add a comment