Rdoublea's workspace
Runs: 1

Run summary
  State:   Finished
  User:    rdoublea
  Runtime: 1h 5m 22s
  Sweep:   -

Config
  batch_size:                      4
  checkpointer._component_:        torchtune.training.FullModelHFCheckpointer
  checkpointer.checkpoint_dir:     /tmp/Llama-3.2-1B-Instruct/
  checkpointer.checkpoint_files:   ["model.safetensors"]
  checkpointer.model_type:         LLAMA3_2
  checkpointer.output_dir:         /tmp/Llama-3.2-1B-Instruct/
  compile:                         true
  dataset:                         [{"_component_":"dataset.evol_code_alpaca"},
                                    {"_component_":"dataset.flytech_python_codes"},
                                    {"_component_":"dataset.python_code_instructions_alpaca"},
                                    {"_component_":"dataset.code_feedback_filtered_instruction"}]
  dataset._component_:             -
  device:                          cuda
  dtype:                           bf16
  enable_activation_checkpointing: false
  epochs:                          5
  gradient_accumulation_steps:     4
  log_every_n_steps:               1
  log_peak_memory_stats:           true
  loss._component_:                torchtune.modules.loss.CEWithChunkedOutputLoss
  metric_logger._component_:       torchtune.training.metric_logging.WandBLogger
  metric_logger.name:              llama3.2-1B
  metric_logger.project:           coding-agent
  model._component_:               torchtune.models.llama3_2.llama3_2_1b
  optimizer._component_:           torch.optim.AdamW
  optimizer.fused:                 true
  optimizer.lr:                    0.0001
  output_dir:                      /tmp/full-llama3.2-finetune
  resume_from_checkpoint:          false
  shuffle:                         true
  tokenizer._component_:           torchtune.models.llama3.llama3_tokenizer
  tokenizer.max_seq_len:           8192
  tokenizer.path:                  /tmp/Llama-3.2-1B-Instruct/original/tokenizer.model

Summary metrics
  global_step:               620
  loss:                      0.59666
  lr:                        0.0001
  peak_memory_active:        51.18728
  peak_memory_alloc:         51.18728
  peak_memory_reserved:      53.76563
  tokens_per_second_per_gpu: 23914.82511
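The flattened dotted keys above are how W&B displays a nested torchtune config. As a reference, here is a sketch of how those keys would nest in a torchtune-style YAML config; the nesting is assumed from the key names (all values are copied from the table, and the `dataset.*` component paths are custom modules from this workspace, reproduced as-is):

```yaml
# Sketch of the run's config, reconstructed from the flattened W&B keys.
# Nesting is inferred from the dotted key names, not taken from the original file.
model:
  _component_: torchtune.models.llama3_2.llama3_2_1b

checkpointer:
  _component_: torchtune.training.FullModelHFCheckpointer
  checkpoint_dir: /tmp/Llama-3.2-1B-Instruct/
  checkpoint_files: ["model.safetensors"]
  model_type: LLAMA3_2
  output_dir: /tmp/Llama-3.2-1B-Instruct/

tokenizer:
  _component_: torchtune.models.llama3.llama3_tokenizer
  path: /tmp/Llama-3.2-1B-Instruct/original/tokenizer.model
  max_seq_len: 8192

dataset:
  - _component_: dataset.evol_code_alpaca
  - _component_: dataset.flytech_python_codes
  - _component_: dataset.python_code_instructions_alpaca
  - _component_: dataset.code_feedback_filtered_instruction

optimizer:
  _component_: torch.optim.AdamW
  lr: 0.0001
  fused: true

loss:
  _component_: torchtune.modules.loss.CEWithChunkedOutputLoss

metric_logger:
  _component_: torchtune.training.metric_logging.WandBLogger
  project: coding-agent
  name: llama3.2-1B

batch_size: 4
epochs: 5
gradient_accumulation_steps: 4
compile: true
device: cuda
dtype: bf16
enable_activation_checkpointing: false
log_every_n_steps: 1
log_peak_memory_stats: true
output_dir: /tmp/full-llama3.2-finetune
resume_from_checkpoint: false
shuffle: true
```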
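A couple of quantities implied by the logged values can be derived directly. A quick sketch, using only arithmetic on the numbers above (nothing torchtune-specific):

```python
# Values copied from the run's config and summary metrics.
batch_size = 4
gradient_accumulation_steps = 4
epochs = 5
global_step = 620

# Effective batch size per optimizer step: micro-batches are accumulated
# before each optimizer update.
effective_batch_size = batch_size * gradient_accumulation_steps

# Optimizer steps per epoch, assuming global_step counts optimizer steps
# spread evenly across the 5 epochs.
steps_per_epoch = global_step // epochs

print(effective_batch_size)  # 16
print(steps_per_epoch)       # 124
```

So each optimizer update saw 16 samples, with 124 updates per epoch over the 1h 5m run.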