Instantiating a `Trainer` object from the `transformers` library requires passing a model (or a `model_init` function that returns one). Suppose I instantiate a `Trainer` with a model and then call `trainer.train(resume_from_checkpoint="path/to/checkpoints")`. The docs say that "training will resume from the model/optimizer/scheduler states loaded here." But what happens to the original model I passed to the `Trainer` constructor? Does it even matter what I pass? Or am I misinterpreting the docs: is the model passed to the constructor what actually gets trained, with the checkpoint only determining how many epochs remain, and so on?
Similarly, if the checkpoint was saved with a different set of `TrainingArguments` than those passed to the `Trainer` constructor, which takes precedence? Do I even need to pass `TrainingArguments` to the `Trainer` when resuming from a checkpoint?
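To make the question concrete, here is a minimal sketch of the situation I'm asking about. The model name, `output_dir`, and checkpoint path are all placeholders, and the dataset arguments are omitted for brevity:

```python
def resume_training(checkpoint_dir: str) -> None:
    """Sketch of the resume scenario: which of these inputs actually matter?"""
    # Imports are kept inside the function so the sketch stays self-contained
    # and the heavy setup only runs when the function is called.
    from transformers import (
        AutoModelForSequenceClassification,
        Trainer,
        TrainingArguments,
    )

    # A freshly initialized model is passed to the constructor...
    model = AutoModelForSequenceClassification.from_pretrained(
        "bert-base-uncased"  # placeholder model
    )

    # ...along with TrainingArguments that may differ from the ones
    # that were in effect when the checkpoint was saved.
    args = TrainingArguments(output_dir="out", num_train_epochs=3)

    trainer = Trainer(model=model, args=args)

    # Question: are the weights of `model` overwritten by the checkpoint's
    # weights here, or is `model` what actually gets trained, with the
    # checkpoint only restoring optimizer/scheduler state and progress?
    trainer.train(resume_from_checkpoint=checkpoint_dir)
```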