Skip to main content

Awni00's group workspace

Abstractor - L=2, d=128, h=8

What makes this group special?
Tags

driven-sky-10

Notes
Author
State
Finished
Start time
August 2nd, 2024 4:51:44 PM
Runtime
9h 48m 26s
Tracked hours
-
Run path
dual-attention/dual_attention--math--algebra__sequence_next_term/9z18jxwl
OS
Linux-4.18.0-477.51.1.el8_8.x86_64-x86_64-with-glibc2.28
Python version
3.11.7
Git repository
git clone https://www.github.com/awni00/abstract_transformer
Git state
git checkout -b "driven-sky-10" 9ce16be9a1cd1fba943e52654fc94c965461f2e0
Command
/gpfs/radev/project/lafferty/ma2393/abstract_transformer/experiments/math/train_abstractor_model.py --task algebra__sequence_next_term --n_epochs 100 --batch_size 512 --d_model 128 --dff 256 --symbol_type symbolic_attention --n_layers 2 --n_heads 8
System Hardware
CPU count32
Logical CPU count 32
GPU count1
GPU typeNVIDIA A40
W&B CLI Version
0.17.5
Config

Config parameters are your model's inputs. Learn more

  • {} 12 keys
    • {} 12 keys
      • 128
      • {} 7 keys
        • {} 6 keys
          • "Abstractor - L=2, d=128, h=8"
          • 161
          • {} 2 keys
            • "token"
            • 85
          • 2
          • 2
          • 31
          • 85
          • {} 2 keys
            • "token"
            • 85
        Summary

        Summary metrics are your model's outputs. Learn more

        • {} 10 keys
          • 99
          • 0.03653674200177193
          • 1.1599539518356323
          • 0.9533318281173706
          • 0.03653674200177193
          • 1.1599539518356323
          • 0.9533318281173706
          • 0.0160044115036726
          • 0.9698988795280457
          • 390,599
        Artifact Outputs

        This run produced these artifacts as outputs. Total: 1. Learn more

        Loading...