Asap-zzhou's group workspace
sft-reproduction
What makes this group special?
Tags
Dream-7B-SFT-tulu3-fsdp-bs3-len2048-ep5-lr1e-5-gbl
Notes
Author
State
Finished
Start time
October 21st, 2025 5:06:43 AM
Runtime
16h 45m 36s
Tracked hours
16h 45m 31s
Run path
asap-zzhou/dllm/1z7kqb2q
OS
Linux-3.10.0-957.el7.x86_64-x86_64-with-glibc2.17
Python version
CPython 3.10.18
Git repository
git clone git@github.com:ZHZisZZ/dllm.git
Git state
git checkout -b "Dream-7B-SFT-tulu3-fsdp-bs3-len2048-ep5-lr1e-5-gbl" f3e64f95ad3d3fa2611d777786f7529959ee0656
Command
/mnt/lustrenew/mllm_aligned/dongzhichen/dllm/examples/dream/sft.py --dataset_args data/sft/dream/tulu-3-sft-mixture --load_preprocessed_data True --output_dir models/Dream-7B-SFT-tulu3-fsdp-bs3-len2048-ep5-lr1e-5-gbl --max_length 2048 --truncation right --group_by_length True --num_train_epochs 5 --learning_rate 1e-5 --per_device_train_batch_size 2 --gradient_accumulation_steps 2 --per_device_eval_batch_size 2 --eval_on_start False --eval_steps 0.1 --save_steps 0.05
System Hardware
| CPU count | 64 |
| Logical CPU count | 128 |
| GPU count | 8 |
| GPU type | NVIDIA A100-SXM4-80GB |
W&B CLI Version
0.22.2
Group
sft-reproductionConfig
Config parameters are your model's inputs. Learn more
- {} 218 keys▶
- "/mnt/lustrenew/mllm_aligned/shared/models/huggingface/Dream-org/Dream-v0-Base-7B"
- {} 6 keys▶
- false
- 0.9
- 0.999
- 0.00000001
- false
- [] 1 item▶
- "DreamModel"
- 0
- false
- {} 2 keys▶
- "configuration_dream.DreamConfig"
- "modeling_dream.DreamModel"
- true
- null
- false
- null
- true
- false
- "none"
- 151,665
- 0
- null
- null
- false
- 0
- false
- true
- null
- null
- null
- null
- null
- 1,800
- [] 0 items
- null
- null
- false
- 0
- true
- false
- false
- false
- "bfloat16"
- false
- 0
- 151,643
- null
- 152,064
- 0.1
- 0
- 0
46 ... 95▶▶96 ... 145▶▶146 ... 195▶▶196 ... 213▶▶
Summary
Summary metrics are your model's outputs. Learn more
- {} 14 keys▶
- 0.1407092809677124
- 315.7667
- 288.897
- 0.754
- 564,184,193,811,087,360
- 0.2889569140949837
- 60,254.9897
- 68.24
- 0.089
- 5
- 5,355
- 0.9333163499832152
- 0.00000000003824968373
- 0.2077
Artifact Outputs
This run produced these artifacts as outputs. Total: 1. Learn more
Type
Name
Consumer count
Loading...