2 changes: 1 addition & 1 deletion train/tr10-13B-ml/tr10-13B.slurm
@@ -81,7 +81,7 @@ GPT_ARGS=" \
--micro-batch-size $MICRO_BATCH_SIZE \
--rampup-batch-size 16 16 6_000_000 \
--global-batch-size $GLOBAL_BATCH_SIZE \
- --train-samples $((3000000000 / $SEQ_LEN + 1)) \
+ --train-samples $((300_000_000_000 / $SEQ_LEN + 1)) \
--tokenizer-type PretrainedFromHF \
--tokenizer-name-or-path $TOKENIZER_NAME \
--loss-scale 12 \
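
The change above raises the token budget behind --train-samples from 3000000000 (3B) to 300_000_000_000 (300B), a factor of 100. A minimal sketch of the same arithmetic follows, assuming SEQ_LEN=2048 (the hunk does not show its value) and using the hypothetical variable names TOKEN_BUDGET, TRAIN_SAMPLES, and OLD_TRAIN_SAMPLES. Bash arithmetic expansion does not accept underscore digit separators, so the sketch strips them before evaluating; this is one possible way to keep the readable literal, not necessarily what the script itself does.

```shell
#!/usr/bin/env bash
# Assumption: SEQ_LEN=2048 is illustrative only; the real value is set
# elsewhere in the script and is not visible in this hunk.
SEQ_LEN=2048

# Hypothetical variable: token budget kept readable with underscores.
TOKEN_BUDGET="300_000_000_000"

# Strip the underscores with parameter expansion, then divide by the
# sequence length and add 1, as the --train-samples expression does.
TRAIN_SAMPLES=$(( ${TOKEN_BUDGET//_/} / SEQ_LEN + 1 ))

# The value before the change, for comparison (3B tokens).
OLD_TRAIN_SAMPLES=$(( 3000000000 / SEQ_LEN + 1 ))

echo "new --train-samples: $TRAIN_SAMPLES"      # 146484376
echo "old --train-samples: $OLD_TRAIN_SAMPLES"  # 1464844
```

With these assumed values the old expression would stop training after roughly 1.46M samples instead of roughly 146M.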