jcummings

Runs

Killed

jcummings

11mo ago

2m 8s

torchtune.training.FullModelHFCheckpointer

/tmp/Meta-Llama-3-70B-Instruct

["model-00001-of-00030.safetensors","model-00002-of-00030.safetensors","model-00003-of-00030.safetensors","model-00004-of-00030.safetensors","model-00005-of-00030.safetensors","model-00006-of-00030.safetensors","model-00007-of-00030.safetensors","model-00008-of-00030.safetensors","model-00009-of-00030.safetensors","model-00010-of-00030.safetensors","model-00011-of-00030.safetensors","model-00012-of-00030.safetensors","model-00013-of-00030.safetensors","model-00014-of-00030.safetensors","model-00015-of-00030.safetensors","model-00016-of-00030.safetensors","model-00017-of-00030.safetensors","model-00018-of-00030.safetensors","model-00019-of-00030.safetensors","model-00020-of-00030.safetensors","model-00021-of-00030.safetensors","model-00022-of-00030.safetensors","model-00023-of-00030.safetensors","model-00024-of-00030.safetensors","model-00025-of-00030.safetensors","model-00026-of-00030.safetensors","model-00027-of-00030.safetensors","model-00028-of-00030.safetensors","model-00029-of-00030.safetensors","model-00030-of-00030.safetensors"]

LLAMA3

/tmp/Meta-Llama-3-70B-Instruct

false

torchtune.datasets.alpaca_dataset

false

cuda

bf16

true

false

true

torchtune.modules.loss.CEWithChunkedOutputLoss

torchtune.training.lr_schedulers.get_cosine_schedule_with_warmup

100

torchtune.training.metric_logging.WandBLogger

llama3-sharded

torchtune.models.llama3.lora_llama3_70b

true

false

["q_proj","v_proj","output_proj"]

torch.optim.AdamW

true

0.0003

0.01

/tmp/lora_finetune_output

torchtune.training.setup_torch_profiler

true

false

/tmp/lora_finetune_output/profiling_outputs

false

true

Crashed

jcummings

11mo ago

torchtune.training.FullModelHFCheckpointer

/tmp/Meta-Llama-3-70B-Instruct

LLAMA3

/tmp/Meta-Llama-3-70B-Instruct

false

["tok_embeddings","output"]

torchtune.datasets.alpaca_dataset

false

cuda

bf16

true

false

true

torchtune.modules.loss.CEWithChunkedOutputLoss

torchtune.training.lr_schedulers.get_cosine_schedule_with_warmup

100

torchtune.training.metric_logging.WandBLogger

llama3-sharded

torchtune.models.llama3.lora_llama3_70b

true

false

["q_proj","v_proj","output_proj"]

torch.optim.AdamW

true

0.0003

0.01

/tmp/lora_finetune_output

torchtune.training.setup_torch_profiler

true

false

/tmp/lora_finetune_output/profiling_outputs

false

true
