MicroRTS 64x64
Created on June 22|Last edited on June 28
Comment
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-23T16:32:29.790708
cb507da Minibatch size of 258 means 24 minibatches…19113cb Microrts-squnet-map64-128ch-selfplaye6b81cb Add some early map32 and map64 models to sub agent71d90f9 Microrts-dc-repro-map16-selfplay566c7d3 Tweaks to kernel size on down and up conv2da37b313 Try n_envs 48 and n_steps 512 again given map fixb5dae04 Another fix to allow observation_space size 89d8ba13 A100 specific map8 hyperparams, more envs fewer steps0f4eba7 map8 reduce n_envs (and batch size) to match map122a26838 map8 is now a valid size30159f5 map8 trains on only 8x8 maps with larger batch size
ppo-Microrts-squnet-map64-64ch-selfplay-S1-2023-06-22T00:54:18.964106
56f3537 Still have to use 512 batch size for 64ch map64c438cb3 Rename envs with 64ch, deprecate non 64chd01c6e3 “Deprecate” _Microrts-squnet-map12-selfplayeccf7f1 Evaluate map32 every 5M steps, map64 10M stepsc4bed6d map64 uses 64 channels across all levels9ef4d96 Agent support for multiple models based on map size37989c2 Fix ACTION_TYPE_TO_ACTION_INDEXES checking97dd419 Restore prior model structure to be able to run old models7b56dad Separate combat produce reward between types74171a4 No val-clip squnet
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-21T01:26:44.862949
fddc017 128 channels across all levels for map32 and 64c0a4e6c Need to remember to update both referencesc1f666d Catch exceptions from non-RAI AIsc80ad07 Fix unhandled argument crash51aaf61 Compiled resources observation into jar4b86d47 Add player and opponent resources to observation
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T23:30:08.340275
477fa93 map64: Make network narrower at the top
bec93ad map64: 16 envs to meet map training invariantsee41e07 map64: Half envs and rollout steps607b2e2 48 minibatches for 64x64 maps…e0eba44 64x64 requires 32 minibatches…aa45849 Reduce minibatch size for 32 and 64 maps for memorycbad659 16, 32, and 64 Microrts squnet hyperparameters855e79b Critic heads use same strides as used by the unetab7d32a wide map12 and fix to critic_heads num_channels8618698 Remove unnecessary env make_kwargef3675c Forgot to set bot alternating player during trainingccc490f Fix squnet-map12 eval env overrides6d5785f Microrts-squnet-map12 uses one minibatch9189e1a min grid size is 121d93b6f Use ModuleList to register list of modules with Pytorchf990525 Accidentally deleted a hyperparam copy referenced78da94 SqueezeUnet training on 8x8-12x12 mapsb1f0d91 Down-res convolutition should have GELU non-linearity
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-23T16:32:29.790708
ppo-Microrts-squnet-map64-64ch-selfplay-S1-2023-06-22T00:54:18.964106
ppo-Microrts-squnet-map64-128ch-selfplay-S1-2023-06-21T01:26:44.862949
ppo-Microrts-squnet-map64-selfplay-S1-2023-06-19T23:30:08.340275
Add a comment