Claude Sonnet 3.7 (often) knows when it’s in alignment evaluations (Apollo Research, www.apolloresearch.ai)
Posted by Espiritdescali@futurology.today to Futurology@futurology.today · English · 1 month ago