experiments

Hardening(spec: ExecutionSpec)

Bases: BackbonedContrastiveTrainer

Harden a model by training it on cybersecurity contrastive datasets via DPO (Direct Preference Optimization).

... this is secretly just standard contrastive learning.
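
For readers unfamiliar with the objective named in the docstring, the following is a minimal sketch of the standard DPO loss for a single chosen/rejected pair, written in plain Python. It is not taken from `Hardening` or `BackbonedContrastiveTrainer`; the function name `dpo_loss`, its parameters, and the default `beta` are hypothetical and chosen only for illustration.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    All *_logp arguments are sequence log-probabilities (floats).
    NOTE: hypothetical helper for illustration; not part of this package.
    """
    # How much more the policy favors each response than the reference does.
    chosen_shift = policy_chosen_logp - ref_chosen_logp
    rejected_shift = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin between the chosen and rejected responses.
    x = beta * (chosen_shift - rejected_shift)

    # -log(sigmoid(x)) == log(1 + exp(-x)), computed in a numerically
    # stable way for both signs of x.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# Example: the policy prefers the chosen response more than the reference does.
loss = dpo_loss(-2.0, -6.0, -3.0, -4.0, beta=0.5)
```

The loss falls as the policy raises the chosen response's log-probability relative to the rejected one, measured against the reference model. This pairwise form is what makes the objective read as a contrastive one.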