rssMB to The RegisterEnglish · 2 days agoAnthropic reduces model misbehavior by endorsing cheatinggo.theregister.comexternal-linkmessage-square0linkfedilinkarrow-up12arrow-down10file-text
arrow-up12arrow-down1external-linkAnthropic reduces model misbehavior by endorsing cheatinggo.theregister.comrssMB to The RegisterEnglish · 2 days agomessage-square0linkfedilinkfile-text