Excited to share new research with Jon Kutasov,
@saprmarks,
@sprice354_: Model Spec Midtraining (MSM)
The Model Spec sets out how AIs should behave and why. MSM trains AIs on documents about the spec. This can improve how AIs generalize from subsequent alignment training.