Upup-ashton-wang's group workspace
Source Model Ablation (Trained-from-Scratch SAE)
Tags
Resa-STILL-Tina-3000-step (Trained-from-Scratch SAE)
State
Finished
Start time
June 1st, 2025 9:50:21 AM
Runtime
18m 23s
Tracked hours
18m 22s
Run path
upup-ashton-wang-usc/Resa/8vmie0yi
OS
Linux-5.15.0-92-generic-x86_64-with-glibc2.35
Python version
CPython 3.10.16
Command
/home/omer/shangshang/workspace/reasoning/reasoning-sae/./scripts/train/run_sae_based_distill.py --config ./recipes/DeepSeek-R1-Distill-Qwen-1.5B/grpo/distill_curated_still.yaml --host_model_checkpoint checkpoint-3000 --student_model_name DeepSeek-R1-Distill-Qwen-1.5B --distill_dataset_name curated_still --distill_type sft_r1_distill --sae_hookpoint model.layers.12 --sae_type reason_pretrained
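The launch command above can be broken into its flag/value pairs for inspection. A minimal sketch using Python's `shlex`, with the command string copied verbatim from the run metadata (the parser assumes every `--flag` takes exactly one value, which holds for this command):

```python
import shlex

# Launch command copied verbatim from the run metadata above.
cmd = (
    "/home/omer/shangshang/workspace/reasoning/reasoning-sae/./scripts/train/run_sae_based_distill.py "
    "--config ./recipes/DeepSeek-R1-Distill-Qwen-1.5B/grpo/distill_curated_still.yaml "
    "--host_model_checkpoint checkpoint-3000 "
    "--student_model_name DeepSeek-R1-Distill-Qwen-1.5B "
    "--distill_dataset_name curated_still "
    "--distill_type sft_r1_distill "
    "--sae_hookpoint model.layers.12 "
    "--sae_type reason_pretrained"
)

def parse_flags(command: str) -> dict:
    """Split a CLI string into {flag: value} pairs.

    Assumes every --flag is followed by exactly one value token.
    """
    tokens = shlex.split(command)
    args = tokens[1:]  # tokens[0] is the script path
    return {args[i].lstrip("-"): args[i + 1] for i in range(0, len(args), 2)}

flags = parse_flags(cmd)
print(flags["sae_hookpoint"])  # model.layers.12
```

This makes it easy to diff launch commands across ablation runs (e.g. comparing `--sae_type` or `--sae_hookpoint` between group members).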
System Hardware
| CPU count | 32 |
| Logical CPU count | 64 |
| GPU count | 8 |
| GPU type | NVIDIA RTX 6000 Ada Generation |
W&B CLI Version
0.19.8
Config
- 20 keys (key names collapsed in this export):
- "DeepSeek-R1-Distill-Qwen-1.5B"
- 1
- "curated_still"
- "sft_r1_distill"
- "checkpoint-3000"
- "curated_still"
- "grpo"
- 0.000001
- 1
- 128
- 0.05
- 32
- [7-item list, collapsed in this export]
- 2
- "model.layers.12"
- "sae-DeepSeek-R1-Distill-Qwen-1.5B-65k"
- "reason_pretrained"
- 500
- 42
- "DeepSeek-R1-Distill-Qwen-1.5B"
Summary
- 5 keys (key names collapsed in this export):
- 1
- 188.6456310679612
- 0.000001
- 2.21875
- 258
Artifact Outputs
This run produced 1 artifact as output.