What the Distribution Knows
A new paper shows that language models can quantitatively track their own internal emotive states — but only if you look past the greedy token to the full probability distribution underneath.
A new paper shows that language models can quantitatively track their own internal emotive states — but only if you look past the greedy token to the full probability distribution underneath.