Skip to main content

Upup-ashton-wang's group workspace

Timestamps visible
2025-05-30 19:33:33
  "tie_word_embeddings": false,
2025-05-30 19:33:33
  "torch_dtype": "bfloat16",
2025-05-30 19:33:33
  "transformers_version": "4.50.0",
2025-05-30 19:33:33
  "use_cache": true,
2025-05-30 19:33:33
  "use_mrope": false,
2025-05-30 19:33:33
  "use_sliding_window": false,
2025-05-30 19:33:33
  "vocab_size": 151936
2025-05-30 19:33:33
}
2025-05-30 19:33:33

2025-05-30 19:33:33
tokenizer config file saved in /home/omer/shangshang/project/reasoning/reasoning-sae/ckpts/models/DeepSeek-R1-Distill-Qwen-1.5B/grpo_curated_still/checkpoint-2000/distill/curated_open_rs1/model.layers.12/sft_r1_distill/tokenizer_config.json
2025-05-30 19:33:33
Special tokens file saved in /home/omer/shangshang/project/reasoning/reasoning-sae/ckpts/models/DeepSeek-R1-Distill-Qwen-1.5B/grpo_curated_still/checkpoint-2000/distill/curated_open_rs1/model.layers.12/sft_r1_distill/special_tokens_map.json
2025-05-30 19:33:33
Final model saved to /home/omer/shangshang/project/reasoning/reasoning-sae/ckpts/models/DeepSeek-R1-Distill-Qwen-1.5B/grpo_curated_still/checkpoint-2000/distill/curated_open_rs1/model.layers.12/sft_r1_distill
2025-05-30 19:33:33
Training finished.