Skip to content

backbone

backbone

BackbonedTrainer: finetune from a pretrained HuggingFace backbone.

Instead of configuring model architecture from scratch, reads two config keys: - architecture/backbone/implementation: "llama", "qwen", or "gpt_neox" - architecture/backbone/weights: HuggingFace model ID (e.g. "TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T")

The model class and initial weights are loaded via from_pretrained, bypassing the normal configure() path for architecture parameters.

BackbonedTrainer(spec: ExecutionSpec)

Bases: BaseTrainer[BaseTrainerConfig, Module]

Trainer that initializes from a pretrained HuggingFace backbone.