MicroRTS 16x16
Created on June 21 | Last edited on December 7
ppo-Microrts-dc-repro-map16-selfplay-S1-2023-06-23T00:38:35.882761
71d90f9 Microrts-dc-repro-map16-selfplay
566c7d3 Tweaks to kernel size on down and up conv2d
a37b313 Try n_envs 48 and n_steps 512 again given map fix
b5dae04 Another fix to allow observation_space size 8
9d8ba13 A100 specific map8 hyperparams, more envs fewer steps
0f4eba7 map8 reduce n_envs (and batch size) to match map12
2a26838 map8 is now a valid size
30159f5 map8 trains on only 8x8 maps with larger batch size
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-22T08:14:02.488462
56f3537 Still have to use 512 batch size for 64ch map64
c438cb3 Rename envs with 64ch, deprecate non 64ch
d01c6e3 “Deprecate” _Microrts-squnet-map12-selfplay
eccf7f1 Evaluate map32 every 5M steps, map64 10M steps
c4bed6d map64 uses 64 channels across all levels
9ef4d96 Agent support for multiple models based on map size
37989c2 Fix ACTION_TYPE_TO_ACTION_INDEXES checking
97dd419 Restore prior model structure to be able to run old models
7b56dad Separate combat produce reward between types
74171a4 No val-clip squnet
fddc017 128 channels across all levels for map32 and 64
c0a4e6c Need to remember to update both references
c1f666d Catch exceptions from non-RAI AIs
c80ad07 Fix unhandled argument crash
51aaf61 Compiled resources observation into jar
4b86d47 Add player and opponent resources to observation
477fa93 map64: Make network narrower at the top
bec93ad map64: 16 envs to meet map training invariants
ee41e07 map64: Half envs and rollout steps
607b2e2 48 minibatches for 64x64 maps…
e0eba44 64x64 requires 32 minibatches…
aa45849 Reduce minibatch size for 32 and 64 maps for memory
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-19T22:18:43.032940
cbad659 16, 32, and 64 Microrts squnet hyperparameters
855e79b Critic heads use same strides as used by the unet
ab7d32a wide map12 and fix to critic_heads num_channels
8618698 Remove unnecessary env make_kwarg
ef3675c Forgot to set bot alternating player during training
ccc490f Fix squnet-map12 eval env overrides
6d5785f Microrts-squnet-map12 uses one minibatch
9189e1a min grid size is 12
1d93b6f Use ModuleList to register list of modules with Pytorch
f990525 Accidentally deleted a hyperparam copy reference
d78da94 SqueezeUnet training on 8x8-12x12 maps
b1f0d91 Down-res convolutition should have GELU non-linearity
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-17T03:31:15.998948
51d7269 Extend MicroRTS observation to include action and pending
159b86b Move microrts_space_transform to microrts/vec_env
4d57e0d Make MicroRTS action mask more strict
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-16T20:50:42.240639
ad822de Fix num_old_policies vs map_paths check
50593e8 Fix Microrts-selfplay-dc-phases-A6000
7bfef73 Microrts-selfplay-dc-phases-A6000: 3 minibatches
8b2891b Fix zip filename
a62d761 Renamed zip file used for MicroRTS submission
1e2afc2 (tag: v0.0.18) v0.0.17 -> v0.0.18 because of tagging to wrong commit