Skip to main content
Reports
Created by
Created On
Last edited
0
2024-02-09
0
2024-02-01
0
2024-01-30
ppo-Microrts-selfplay-dc-phases-A10-S1-2023-05-26T00:09:17.258204
Reference self-play only training run used for RAISocketAI in IEEE-CoG microRTS 2023 competition
0
2024-01-29
0
2024-01-11
0
2024-01-01
0
2023-12-27
0
2023-12-21
0
2023-12-18
0
2023-12-18
0
2023-12-11
0
2023-12-06
0
2023-06-21
0
2023-11-30
0
2023-10-23
0
2023-10-19
0
2023-10-16
0
2023-10-14
0
2023-10-14
0
2023-10-11
0
2023-10-10
0
2023-08-09
0
2023-06-04
0
2023-06-26
0
2023-06-30
0
2023-06-16
0
2023-06-22
0
2023-06-22
0
2023-06-10
0
2023-06-21
0
2023-05-31
0
2023-06-10
0
2023-05-31
0
2023-04-08
0
2023-05-09
0
2023-04-26
0
2023-04-25
0
2023-04-21
0
2023-04-25
0
2023-04-11
0
2023-03-28
unet microrts-ai trained against coacAI
Eval is WinLoss and training with progressive reward decay for non-WinLoss rewards
0
2023-04-04
0
2023-03-31
0
2023-03-23
0
2023-03-23
0
2023-03-23
0
2023-03-21
0
2023-02-24
0
2023-03-18
0
2023-03-17
0
2023-03-18
Comparing SB3VecEnv+GymLike Normalization
gymlike and sb3 norm give equivalent results
0
2023-03-17
0
2023-03-08
0
2023-03-08
0
2023-03-02
0
2023-03-01
2/17/2023 starpilot-hard Lambda Benchmark
Playing procgen's starpilot on hard distribution with varying IMPALA network sizes
0
2023-02-17
2/19/2023 ppo-BipedalWalker-v3 M1Max Benchmark
BipedalWalker-v3 trained with PPO
0
2023-02-20
2/17/2023 vpg Lambda Benchmark
benchmark_e8bc541 host_192-9-247-28
0
2023-02-17
2/15/2023 procgen-easy Lambda Benchmark
Fix to not normalize rewards during eval
0
2023-02-15
2/13/2023 procgen-easy Lambda Benchmark
Branch: https://github.com/sgoodfriend/rl-algo-impls/tree/vpg_hard_cap_steps_per_epoch
0
2023-02-14
2/14/2023 DQN IMPALA Atari Lambda Benchmark
Use IMPALA-style network for dqn playing Atari games
0
2023-02-17
2/9/2023 vpg_hard_cap_steps_per_epoch Lambda Benchmark
Branch: https://github.com/sgoodfriend/rl-algo-impls/tree/vpg_hard_cap_steps_per_epoch
0
2023-02-10
2/14/2023 IMPALA Atari+CarRacing Lambda Benchmark
Use IMPALA-style network for Atari games and CarRacing
0
2023-02-15
2/15/2023 ppo_eps_1e-7 Lambda Benchmark
ppo Adam eps to match Andrychowicz, et al. (2021)
0
2023-02-16
2/15/2023 vpg_atari_tweaks Atari Lambda Benchmark
Removed hidden layers since 2/13/2023
0
2023-02-15
2/14/2023 procgen-easy Lambda Benchmark
Now using the gym3 backed ProcgenEnv
0
2023-02-15
2/13/2023 vpg_atari_tweaks Atari Lambda Benchmark
Branch: https://github.com/sgoodfriend/rl-algo-impls/tree/vpg_atari_tweaks
0
2023-02-13
0
2023-02-09
2/8/2023 VPG Lambda Benchmark
Branch: https://github.com/sgoodfriend/rl-algo-impls/tree/vpg_02_08_2023
0
2023-02-09
2/8/2023 DQN Lambda Labs Benchmark
Branch: https://github.com/sgoodfriend/rl-algo-impls/tree/dqn_benchmark Commit: https://github.com/sgoodfriend/rl-algo-impls/commit/1d4094fbcc9082de7f53f4348dd4c7c354152907
0
2023-02-08
0
2023-02-09
2/6/2023 main vs atari_separate_feature_extractor
Commit: https://github.com/sgoodfriend/rl-algo-impls/commit/5598ebc4b03054f16eebe76792486ba7bcacfc5c
0
2023-02-06
2/5/2023 update_rtg_between_epochs Lambda Labs Benchmark
Commit: https://github.com/sgoodfriend/rl-algo-impls/commit/5540e1fc804d609fac4498150d6af086639906e1
0
2023-02-06
0
2023-02-01
0
2023-02-05
0
2023-02-04
0
2023-02-03
0
2023-02-02
0
2023-02-01
0
2023-02-01