rssMB to Science StreamsEnglish · 14 hours agoHow to catch AI sleeper agents with a simple interpretability trickwww.youtube.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1external-linkHow to catch AI sleeper agents with a simple interpretability trickwww.youtube.comrssMB to Science StreamsEnglish · 14 hours agomessage-square0linkfedilinkfile-text