wsds
Warmup-stable-decay-stable schedule. Think of it as a WSD (warmup-stable-decay) schedule followed by a finetuning run, all in one schedule.
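To make the four phases concrete, here is a minimal sketch of what such a schedule might compute. The function name `wsds_lr`, its parameters, the default values, and the linear shape of the warmup and decay ramps are all assumptions made for illustration, not the actual signature or behavior of `wsds`.

```python
def wsds_lr(step, peak_lr=1e-3, final_lr=1e-4,
            warmup_steps=100, stable_steps=500, decay_steps=200):
    """Illustrative warmup-stable-decay-stable learning rate.

    Every name and default here is hypothetical; linear ramps are assumed.
    """
    if step < warmup_steps:
        # Phase 1 (warmup): ramp linearly from 0 up to peak_lr.
        return peak_lr * step / warmup_steps
    step -= warmup_steps

    if step < stable_steps:
        # Phase 2 (stable): hold the peak learning rate.
        return peak_lr
    step -= stable_steps

    if step < decay_steps:
        # Phase 3 (decay): interpolate linearly from peak_lr to final_lr.
        frac = step / decay_steps
        return peak_lr + frac * (final_lr - peak_lr)

    # Phase 4 (second stable): hold final_lr for the rest of training,
    # which plays the role of the finetuning run.
    return final_lr
```

With these assumed defaults, the learning rate rises over steps 0 to 100, stays at the peak until step 600, decays until step 800, and then remains constant at `final_lr`.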