Skip to main content

Autometa's group workspace

Mistral-7B-v0.1

What makes this group special?
Tags

Mistral-7B-v0.1_dpo

Notes
Tags
align-dpo
Author
State
Finished
Start time
March 10th, 2024 11:44:30 PM
Runtime
32m 1s
Tracked hours
30m 37s
Run path
llm_surgery/gemma-zephyr/ds5rs06h
OS
Linux-5.15.0-1047-oracle-x86_64-with-glibc2.35
Python version
3.11.7
Git repository
git clone git@github.com:tcapelle/alignment-handbook.git
Git state
git checkout -b "Mistral-7B-v0.1_dpo" 4de29b78c28e96ecb5a1c8144d65e398666f8f12
Command
/workspace/scripts/run_dpo.py recipes/zephyr-7b-beta/dpo/config_full.yaml --model_name_or_path=llm_surgery/gemma-zephyr/Mistral-7B-v0.1:v7
System Hardware
CPU count128
Logical CPU count 255
GPU count8
GPU typeNVIDIA A100-SXM4-80GB
W&B CLI Version
0.16.3
Job Type
train-dpo
Config

Config parameters are your model's inputs. Learn more

  • {} 192 keys
    • "/workspace/artifacts/Mistral-7B-v0.1:v7"
    • {} 4 keys
      • null
      • true
      • false
      • true
    • false
    • 0.9
    • 0.999
    • 0.00000001
    • false
    • [] 1 item
      • "MistralForCausalLM"
    • 0
    • false
    • null
    • null
    • 0.05
    • true
    • false
    • 1
    • 0
    • null
    • null
    • false
    • 0
    • false
    • true
    • null
    • null
    • null
    • null
    • null
    • 1,800
    • [] 0 items
      • null
      • null
      • false
      • null
      • 0
      • true
      • false
      • false
      • false
      • false
      • 0
      • 2
      • null
      • 0
      • 100
      • "steps"
      • 46 ... 95
        96 ... 145
        146 ... 187
      • 32,032
      • 0.1
      • 0
      • 0
    Summary

    Summary metrics are your model's outputs. Learn more

    • {} 30 keys
      • -1.5737099647521973
      • -1.4398664236068726
      • -292.6082458496094
      • -318.71435546875
      • 0.46746960282325745
      • 0.75
      • -0.7094362378120422
      • 1.2131987810134888
      • -1.922635197639465
      • 37.0405
      • 20.248
      • 0.648
      • 1.97
      • 104
      • 28.72763529396233
      • 0
      • -1.996106505393982
      • -1.91734778881073
      • -407.35638427734375
      • -470.3741760253906
      • 0.2314
      • 0.9375
      • -0.12680268287658691
      • 2.807525873184204
      • -2.934328556060791
      • 0
      • 0.3958657637525063
      • 1,695.8776
      • 7.96
      • 0.061
    Artifact Inputs

    This run consumed these artifacts as inputs. Learn more

    Loading...
    Artifact Outputs

    This run produced these artifacts as outputs. Total: 2. Learn more