Skip to main content
Reports
Created by
Created On
Last edited
0
2023-08-23
0
2023-06-28
0
2023-04-15
0
2023-03-29
0
2023-03-27
0
2023-03-25
0
2023-03-24
0
2023-03-23
0
2023-03-22
Sweep/tuning.
Are our runs reasonable?
0
2023-03-14
[Scale] In-session vs multi-session
Controlled understanding of our ability to scale.
0
2023-03-03
0
2023-03-03
0
2023-03-01
[Tuning] Initial HP ranges + Maze Pilot
Life's too short to sweep everything.
0
2023-02-14
0
2023-02-28
0
2023-02-25
0
2023-02-25
0
2023-02-21
0
2023-02-21
Decoding Pilot
Need to figure out how to get fine-tuning work. Testing on RTT single session to kick off.
0
2023-02-17
[Atomicity] MC_RTT 5ms
- On RTT data alone, factor 1 is the only one that achieves competence. - With Maze data augmented (i.e. 30K base + 3K added, likely not a representational quality increase), RTT Factor 2 becomes feasible.
0
2023-02-10
Multisession, Multisubject Pilot
Goal: Understand scaling when it probably works
0
2023-02-13
[Scale] 100K Proof of concept
Does scaling help us improve 5K Pitt trials?
0
2023-02-11
[Atomicity] Compute-normalized factor size comps
Hm, BPS clouds picture, let's just look at loss.
0
2023-02-10
[Tuning] Pre-norm, initialization
To scale well we should follow good practices.
0
2023-02-12
[Throughput] Mask Ratio effects
Masking more increases throughput (and perf/flop), but at what cost?
0
2023-02-10
Flat vs factorized
Factorized is objectively much more efficient but a flat model has more eventual potential/is more agnostic. Per Kaiming's spatiotemporal paper, this might be better at scaled throughputs.
0
2023-02-10
[Atomicity] Factor RTT Maze
4 appears better for BPS, loss is too noisy.
0
2023-02-10
0
2023-02-05