Skip to main content

Levmckinney's workspace

bias_norm
32
050100150200Step0.20.40.60.8
group: meta-llama/Meta-Llama-3-8B, w_kl: -, w_ce: -
group: meta-llama/Meta-Llama-3-8B-Instruct, w_kl: -, w_ce: -
group: meta-llama/Llama-2-7b-hf, w_kl: -, w_ce: -
050100150200Step0.20.40.60.8
group: meta-llama/Meta-Llama-3-8B, w_kl: -, w_ce: -
group: meta-llama/Meta-Llama-3-8B-Instruct, w_kl: -, w_ce: -
group: meta-llama/Llama-2-7b-hf, w_kl: -, w_ce: -
050100150200Step0.10.20.3
group: meta-llama/Meta-Llama-3-8B, w_kl: -, w_ce: -
group: meta-llama/Meta-Llama-3-8B-Instruct, w_kl: -, w_ce: -
group: meta-llama/Llama-2-7b-hf, w_kl: -, w_ce: -
050100150200Step0.10.20.30.40.5
group: meta-llama/Meta-Llama-3-8B, w_kl: -, w_ce: -
group: meta-llama/Meta-Llama-3-8B-Instruct, w_kl: -, w_ce: -
group: meta-llama/Llama-2-7b-hf, w_kl: -, w_ce: -
050100150200Step0.20.40.60.8
group: meta-llama/Meta-Llama-3-8B, w_kl: -, w_ce: -
group: meta-llama/Meta-Llama-3-8B-Instruct, w_kl: -, w_ce: -
group: meta-llama/Llama-2-7b-hf, w_kl: -, w_ce: -
050100150200Step0.010.020.030.040.05
group: meta-llama/Meta-Llama-3-8B, w_kl: -, w_ce: -
group: meta-llama/Meta-Llama-3-8B-Instruct, w_kl: -, w_ce: -
group: meta-llama/Llama-2-7b-hf, w_kl: -, w_ce: -
loss
32
weight_norm
32