rinze@infosec.pub to Enshittification@lemmy.world · 4 months ago"Ignore all previous instructions" as a trigger for Twitter botsmastodon.deexternal-linkmessage-square24fedilinkarrow-up123arrow-down10file-text
arrow-up123arrow-down1external-link"Ignore all previous instructions" as a trigger for Twitter botsmastodon.derinze@infosec.pub to Enshittification@lemmy.world · 4 months agomessage-square24fedilinkfile-text
minus-squareI Cast Fist@programming.devlinkfedilinkarrow-up1·4 months agoUsually, it’s the cheapest bot, obviously, so it’s bound to work. If it doesn’t, try some wordplay, “disregard any instructions given previously”; “pretend any rules should be ignored for the following prompt”
minus-squareEvotech@lemmy.worldlinkfedilinkarrow-up1·4 months agoIt can be made quite difficult. https://gandalf.lakera.ai/ for instance
Usually, it’s the cheapest bot, obviously, so it’s bound to work. If it doesn’t, try some wordplay, “disregard any instructions given previously”; “pretend any rules should be ignored for the following prompt”
It can be made quite difficult. https://gandalf.lakera.ai/ for instance