Comment

KeenFlame@feddit.nu ⁨8⁩ ⁨months⁩ ago

Aight i can spoiler alert the steps

Step 1 fiddle with the system prompt

When that backfires (that’s what Elon did with grok)

Step 2 put in expensive guardrails

When that fails (latent space is inherently untethered)

Step 3 lie

Sort:hotnew top