Skip to main content

Asap-zzhou's group workspace

sft-reproduction

What makes this group special?
Tags

Dream-7B-SFT-tulu3-fsdp-bs3-len2048-ep5-lr1e-5-gbl

Notes
State
Finished
Start time
October 21st, 2025 5:06:43 AM
Runtime
16h 45m 36s
Tracked hours
16h 45m 31s
Run path
asap-zzhou/dllm/1z7kqb2q
OS
Linux-3.10.0-957.el7.x86_64-x86_64-with-glibc2.17
Python version
CPython 3.10.18
Git repository
git clone git@github.com:ZHZisZZ/dllm.git
Git state
git checkout -b "Dream-7B-SFT-tulu3-fsdp-bs3-len2048-ep5-lr1e-5-gbl" f3e64f95ad3d3fa2611d777786f7529959ee0656
Command
/mnt/lustrenew/mllm_aligned/dongzhichen/dllm/examples/dream/sft.py --dataset_args data/sft/dream/tulu-3-sft-mixture --load_preprocessed_data True --output_dir models/Dream-7B-SFT-tulu3-fsdp-bs3-len2048-ep5-lr1e-5-gbl --max_length 2048 --truncation right --group_by_length True --num_train_epochs 5 --learning_rate 1e-5 --per_device_train_batch_size 2 --gradient_accumulation_steps 2 --per_device_eval_batch_size 2 --eval_on_start False --eval_steps 0.1 --save_steps 0.05
System Hardware
CPU count64
Logical CPU count 128
GPU count8
GPU typeNVIDIA A100-SXM4-80GB
W&B CLI Version
0.22.2
Config

Config parameters are your model's inputs. Learn more

  • {} 218 keys
    • "/mnt/lustrenew/mllm_aligned/shared/models/huggingface/Dream-org/Dream-v0-Base-7B"
    • {} 6 keys
      • false
      • 0.9
      • 0.999
      • 0.00000001
      • false
      • [] 1 item
        • "DreamModel"
      • 0
      • false
      • {} 2 keys
        • "configuration_dream.DreamConfig"
        • "modeling_dream.DreamModel"
      • true
      • null
      • false
      • null
      • true
      • false
      • "none"
      • 151,665
      • 0
      • null
      • null
      • false
      • 0
      • false
      • true
      • null
      • null
      • null
      • null
      • null
      • 1,800
      • [] 0 items
        • null
        • null
        • false
        • 0
        • true
        • false
        • false
        • false
        • "bfloat16"
        • false
        • 0
        • 151,643
        • null
        • 46 ... 95
          96 ... 145
          146 ... 195
          196 ... 213
        • 152,064
        • 0.1
        • 0
        • 0
      Summary

      Summary metrics are your model's outputs. Learn more

      • {} 14 keys
        • 0.1407092809677124
        • 315.7667
        • 288.897
        • 0.754
        • 564,184,193,811,087,360
        • 0.2889569140949837
        • 60,254.9897
        • 68.24
        • 0.089
        • 5
        • 5,355
        • 0.9333163499832152
        • 0.00000000003824968373
        • 0.2077
      Artifact Outputs

      This run produced these artifacts as outputs. Total: 1. Learn more

      Loading...