MicroRTS 16x16

Created on June 21|Last edited on December 7
Comment
﻿
﻿
eval/microrts_results_win
eval/microrts_results_win
50M100M150M200M250M300Mglobal_step00.20.40.60.81
ppo-Microrts-dc-repro-map16-selfplay-S1-2023-06-23T00:38:35.882761
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-22T08:14:02.488462
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-19T22:18:43.032940
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-17T03:31:15.998948
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-16T20:50:42.240639
eval/results_WinLoss
eval/results_WinLoss
50M100M150M200M250M300Mglobal_step-0.500.51
ppo-Microrts-dc-repro-map16-selfplay-S1-2023-06-23T00:38:35.882761
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-22T08:14:02.488462
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-19T22:18:43.032940
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-17T03:31:15.998948
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-16T20:50:42.240639
Run set5
﻿
ppo-Microrts-dc-repro-map16-selfplay-S1-2023-06-23T00:38:35.882761
71d90f9 Microrts-dc-repro-map16-selfplay
566c7d3 Tweaks to kernel size on down and up conv2d
a37b313 Try n_envs 48 and n_steps 512 again given map fix
b5dae04 Another fix to allow observation_space size 8
9d8ba13 A100 specific map8 hyperparams, more envs fewer steps
0f4eba7 map8 reduce n_envs (and batch size) to match map12
2a26838 map8 is now a valid size
30159f5 map8 trains on only 8x8 maps with larger batch size
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-22T08:14:02.488462
56f3537 Still have to use 512 batch size for 64ch map64
c438cb3 Rename envs with 64ch, deprecate non 64ch
d01c6e3 “Deprecate” _Microrts-squnet-map12-selfplay
eccf7f1 Evaluate map32 every 5M steps, map64 10M steps
c4bed6d map64 uses 64 channels across all levels
9ef4d96 Agent support for multiple models based on map size
37989c2 Fix ACTION_TYPE_TO_ACTION_INDEXES checking
97dd419 Restore prior model structure to be able to run old models
7b56dad Separate combat produce reward between types
74171a4 No val-clip squnet
fddc017 128 channels across all levels for map32 and 64
c0a4e6c Need to remember to update both references
c1f666d Catch exceptions from non-RAI AIs
c80ad07 Fix unhandled argument crash
51aaf61 Compiled resources observation into jar
4b86d47 Add player and opponent resources to observation
477fa93 map64: Make network narrower at the top
bec93ad map64: 16 envs to meet map training invariants
ee41e07 map64: Half envs and rollout steps
607b2e2 48 minibatches for 64x64 maps…
e0eba44 64x64 requires 32 minibatches…
aa45849 Reduce minibatch size for 32 and 64 maps for memory
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-19T22:18:43.032940
cbad659 16, 32, and 64 Microrts squnet hyperparameters
855e79b Critic heads use same strides as used by the unet
ab7d32a wide map12 and fix to critic_heads num_channels
8618698 Remove unnecessary env make_kwarg
ef3675c Forgot to set bot alternating player during training
ccc490f Fix squnet-map12 eval env overrides
083c565 Set squnet-map12 to use 8x8 map for eval﻿﻿
6d5785f Microrts-squnet-map12 uses one minibatch
9189e1a min grid size is 12
1d93b6f Use ModuleList to register list of modules with Pytorch
f990525 Accidentally deleted a hyperparam copy reference
d78da94 SqueezeUnet training on 8x8-12x12 maps
b1f0d91 Down-res convolutition should have GELU non-linearity
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-17T03:31:15.998948
51d7269 Extend MicroRTS observation to include action and pending
159b86b Move microrts_space_transform to microrts/vec_env
4d57e0d Make MicroRTS action mask more strict
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-16T20:50:42.240639
ad822de Fix num_old_policies vs map_paths check
50593e8 Fix Microrts-selfplay-dc-phases-A6000
7bfef73 Microrts-selfplay-dc-phases-A6000: 3 minibatches
8b2891b Fix zip filename
a62d761 Renamed zip file used for MicroRTS submission
1e2afc2 (tag: v0.0.18) v0.0.17 -> v0.0.18 because of tagging to wrong commit
﻿
ppo-Microrts-dc-repro-map16-selfplay-S1-2023-06-23T00:38:35.882761
﻿
Run set1
﻿
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-22T08:14:02.488462
﻿
﻿
Run set1
﻿
ppo-Microrts-squnet-map16-selfplay-S1-2023-06-19T22:18:43.032940
﻿
﻿
Run set1
﻿
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-17T03:31:15.998948
﻿
Run set1
﻿
ppo-Microrts-selfplay-dc-phases-A6000-S1-2023-06-16T20:50:42.240639﻿﻿
﻿
Run set1
﻿
﻿﻿﻿
﻿
Add a comment