Autometa's group workspace
Mistral-7B-v0.1
What makes this group special?
Tags
Mistral-7B-v0.1_dpo
Notes
Tags
align-dpo
Author
State
Finished
Start time
March 10th, 2024 11:44:30 PM
Runtime
32m 1s
Tracked hours
30m 37s
Run path
llm_surgery/gemma-zephyr/ds5rs06h
OS
Linux-5.15.0-1047-oracle-x86_64-with-glibc2.35
Python version
3.11.7
Git repository
git clone git@github.com:tcapelle/alignment-handbook.git
Git state
git checkout -b "Mistral-7B-v0.1_dpo" 4de29b78c28e96ecb5a1c8144d65e398666f8f12
Command
/workspace/scripts/run_dpo.py recipes/zephyr-7b-beta/dpo/config_full.yaml --model_name_or_path=llm_surgery/gemma-zephyr/Mistral-7B-v0.1:v7
System Hardware
| CPU count | 128 |
| Logical CPU count | 255 |
| GPU count | 8 |
| GPU type | NVIDIA A100-SXM4-80GB |
W&B CLI Version
0.16.3
Group
Mistral-7B-v0.1Job Type
train-dpo
Config
Config parameters are your model's inputs. Learn more
- {} 192 keys▶
- "/workspace/artifacts/Mistral-7B-v0.1:v7"
- {} 4 keys▶
- null
- true
- false
- true
- false
- 0.9
- 0.999
- 0.00000001
- false
- [] 1 item▶
- "MistralForCausalLM"
- 0
- false
- null
- null
- 0.05
- true
- false
- 1
- 0
- null
- null
- false
- 0
- false
- true
- null
- null
- null
- null
- null
- 1,800
- [] 0 items
- null
- null
- false
- null
- 0
- true
- false
- false
- false
- false
- 0
- 2
- null
- 0
- 100
- "steps"
- 32,032
- 0.1
- 0
- 0
46 ... 95▶▶96 ... 145▶▶146 ... 187▶▶
Summary
Summary metrics are your model's outputs. Learn more
- {} 30 keys▶
- -1.5737099647521973
- -1.4398664236068726
- -292.6082458496094
- -318.71435546875
- 0.46746960282325745
- 0.75
- -0.7094362378120422
- 1.2131987810134888
- -1.922635197639465
- 37.0405
- 20.248
- 0.648
- 1.97
- 104
- 28.72763529396233
- 0
- -1.996106505393982
- -1.91734778881073
- -407.35638427734375
- -470.3741760253906
- 0.2314
- 0.9375
- -0.12680268287658691
- 2.807525873184204
- -2.934328556060791
- 0
- 0.3958657637525063
- 1,695.8776
- 7.96
- 0.061
Artifact Inputs
This run consumed these artifacts as inputs. Learn more
Type
Name
Consumer count
Loading...
Artifact Outputs
This run produced these artifacts as outputs. Total: 2. Learn more
Type
Name
Consumer count
Loading...
capecape