Skip to main content

Upup-ashton-wang's group workspace

Tina-STILL-3-1.5B-preview

What makes this group special?
Tags

Tina-STILL-3-1.5B-preview-continue4

Notes
State
Crashed
Start time
April 6th, 2025 11:43:57 PM
Runtime
1d 21h 34m
Tracked hours
1d 21h 33m 48s
Run path
upup-ashton-wang-usc/Tina/3dwvlwyk
OS
Linux-5.15.0-92-generic-x86_64-with-glibc2.35
Python version
CPython 3.10.16
Command
/home/omer/shangshang/workspace/reasoning/reasoning-sae/./resee/post_train_hf/grpo.py --config ./recipes/DeepSeek-R1-Distill-Qwen-1.5B/grpo/model_curated_still.yaml
System Hardware
CPU count32
Logical CPU count 64
GPU count8
GPU typeNVIDIA RTX 6000 Ada Generation
W&B CLI Version
0.19.8
Config

Config parameters are your model's inputs. Learn more

  • {} 225 keys
    • true
    • "/home/omer/shangshang/project/reasoning/reasoning-sae/ckpts/models/DeepSeek-R1-Distill-Qwen-1.5B/base"
    • {} 6 keys
      • false
      • 0.9
      • 0.999
      • 0.00000001
      • false
      • [] 1 item
        • "Qwen2ForCausalLM"
      • 0
      • false
      • false
      • null
      • false
      • null
      • 0.04
      • true
      • false
      • 151,643
      • 0
      • null
      • null
      • false
      • 0
      • false
      • true
      • null
      • null
      • null
      • null
      • null
      • 1,800
      • [] 0 items
        • null
        • null
        • false
        • null
        • 0
        • false
        • false
        • false
        • false
        • true
        • false
        • 0
        • 151,643
        • 46 ... 95
          96 ... 145
          146 ... 195
          196 ... 220
        • 151,936
        • 0.1
        • 0
        • 0
      Summary

      Summary metrics are your model's outputs. Learn more

      • {} 12 keys
        • "table-file"
        • 3,252.09375
        • 0.9987302011628684
        • 3,736
        • 1.278078556060791
        • 0.5712890625
        • 0.00000063102760784477
        • 0.0229
        • 0.0040328651666641235
        • 0.5001259967684746
        • 0.0625
        • -0.12096713669598104