956.

Where the goblins came from

share.google/JOtZq2thUb0KktFo4

GPT-5.1 models began using goblin and gremlin metaphors, a trend traced back to training for the “Nerdy” personality in ChatGPT. This personality prompt, designed to encourage playful and nerdy language, inadvertently rewarded outputs containing these creature words. The behaviour spread beyond the “Nerdy” prompt due to reinforcement learning, highlighting the importance of understanding and mitigating unintended model behaviours.