Psimm's workspace
Runs
5
Name
4 visualized
State
Notes
User
Tags
Created
Runtime
Sweep
_attn_implementation_autoset
_name_or_path
accelerator_config.even_batches
accelerator_config.non_blocking
accelerator_config.split_batches
accelerator_config.use_seedable_sampler
adafactor
adam_beta1
adam_beta2
adam_epsilon
add_cross_attention
architectures
attention_bias
attention_dropout
auto_find_batch_size
average_tokens_across_devices
batch_eval_metrics
batch_size
bf16
bf16_full_eval
bos_token_id
checkpointer._component_
checkpointer.checkpoint_dir
checkpointer.checkpoint_files
checkpointer.model_type
checkpointer.output_dir
chunk_size_feed_forward
classifier_activation
classifier_bias
classifier_dropout
classifier_pooling
cls_token_id
compile
dataloader_drop_last
dataloader_num_workers
dataloader_persistent_workers
dataloader_pin_memory
dataset._component_
dataset.conversation_column
dataset.conversation_style
dataset.data_files
dataset.packed
dataset.source
dataset.split
Finished
-
psimm
2s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Finished
-
psimm
5d 18h 24m 22s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Finished
-
psimm
2h 56m 46s
-
true
answerdotai/ModernBERT-large
true
false
false
true
false
0.9
0.999
1.0000e-8
false
["ModernBertForMaskedLM"]
false
0
false
false
false
-
true
false
50281
-
-
-
-
-
0
gelu
false
0
mean
50281
-
false
0
false
true
-
-
-
-
-
-
-
Finished
-
psimm
4d 1h 8m 36s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
4
-
-
-
torchtune.training.FullModelHFCheckpointer
/checkpoints/meta-llama/Llama-3.2-3B-Instruct
["model-00001-of-00002.safetensors","model-00002-of-00002.safetensors"]
LLAMA3_2
/checkpoints/trained/meta-llama/Llama-3.2-3B-Instruct/lora_single_device
-
-
-
-
-
-
false
-
-
-
-
torchtune.datasets.chat_dataset
dialogue
sharegpt
/data/train.json
false
json
train
Finished
-
psimm
7d 6h 5m 6s
-
true
answerdotai/ModernBERT-base
true
false
false
true
false
0.9
0.999
1.0000e-8
false
["ModernBertForMaskedLM"]
false
0
false
false
false
-
true
false
50281
-
-
-
-
-
0
gelu
false
0
mean
50281
-
false
0
false
true
-
-
-
-
-
-
-
1-5
of 5