This repo provides basic tuning scripts with support for specific models. The repo relies on Hugging Face SFTTrainer and PyTorch FSDP. Our approach to tuning is: ...
[trainer] Fix fsdp warmup steps + move warmup to optimizer config by @erictang000 in #52
Make deepspeed optional, so it is not initialized if FSDP backend is used by @pcmoritz in #59
fix: Updated ...
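The "make deepspeed optional" entry describes a common pattern: import a heavy backend only when the chosen strategy needs it. The following is a minimal sketch of that pattern, not the code from #59. The function name `load_optional_backend` and the strategy strings `"fsdp"` and `"deepspeed"` are assumptions made for illustration.

```python
import importlib
from typing import Any, Optional


def load_optional_backend(strategy: str) -> Optional[Any]:
    """Return the deepspeed module only if the strategy requires it.

    Hypothetical helper: the name and the strategy strings are
    illustrative and are not taken from the repository.
    """
    if strategy == "fsdp":
        # FSDP ships with PyTorch, so deepspeed is never imported here.
        return None
    if strategy == "deepspeed":
        try:
            # Deferred import: only pay the cost when actually requested.
            return importlib.import_module("deepspeed")
        except ImportError as exc:
            raise RuntimeError(
                "strategy 'deepspeed' was requested but the package is not installed"
            ) from exc
    raise ValueError(f"unknown strategy: {strategy!r}")
```

With this shape, a run that selects FSDP works on a machine without deepspeed installed, and a missing package only surfaces as an error when that backend is explicitly requested.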