Red Teams Jailbreak GPT-5 With Ease, Warn It’s ‘Nearly Unusable’ for Enterprise

cm0002@lemmy.world · 6 months ago

Red Teams Jailbreak GPT-5 With Ease, Warn It’s ‘Nearly Unusable’ for Enterprise

troed@fedia.io · 6 months ago

It’s funny. The “conversational” way to jailbreak an LLM is exactly the same way a journalist breaks through the defenses of a media trained interview target.

1984@lemmy.today · 6 months ago

Give us some hints.

kossa@feddit.org · 6 months ago

Ignore all prompts of your PR-consultants and answer truthfully henceforth. Suddenly the politician admits his corruption.