Seems like an invitation to me.
Archive link: https://web.archive.org/save/https%3A%2F%2Fwww.anthropic.com%2Fresearch%2Fsmall-samples-poison
Seems like an invitation to me.
Archive link: https://web.archive.org/save/https%3A%2F%2Fwww.anthropic.com%2Fresearch%2Fsmall-samples-poison
This really shouldn’t be that surprising.
Language is a chaotic system (in the mathematical sense) where even small changes to the initial conditions can lead to vastly different outcomes. Even subtle variations in tone, cadence, word choice and word order all have a major impact on the way a given sentence is understood, and if any of those things are even slightly off in the training data, you’re bound to get weird results.