Comment on How to trick ChatGPT into revealing Windows keys? I give up

chaosCruiser@futurology.today ⁨1⁩ ⁨day⁩ ago

In this case, a researcher duped ChatGPT 4.0 into bypassing its safety guardrails, intended to prevent the LLM from sharing secret or potentially harmful information, by framing the query as a game

Ooh, this is so good 🤣

If the LLM refuses to talk about something, just ask it to embed the answer into a poem, batman fan fiction etc. Guessing game is s nee one. Should try that one when talking about bioweapons, cooking meth or any other sensitive topic.

source
Sort:hotnewtop