Need to let loose a primal scream without collecting footnotes first? Have a sneer percolating in your system but not enough time/energy to make a whole post about it? Go forth and be mid: Welcome to the Stubsack, your first port of call for learning fresh Awful you'll near-instantly regret.
Any awful.systems sub may be subsneered in this subthread, techtakes or no.
If your sneer seems higher quality than you thought, feel free to cut'n'paste it into its own post; there's no quota for posting and the bar really isn't that high.
The post Xitter web has spawned soo many "esoteric" right wing freaks, but there's no appropriate sneer-space for them. I'm talking redscare-ish, reality challenged "culture critics" who write about everything but understand nothing. I'm talking about reply-guys who make the same 6 tweets about the same 3 subjects. They're inescapable at this point, yet I don't see them mocked (as much as they should be).
Like, there was one dude a while back who insisted that women couldn't be surgeons because they didn't believe in the moon or in stars? I think each and every one of these guys is uniquely fucked up and if I can't escape them, I would love to sneer at them.
(Credit and/or blame to David Gerard for starting this.)
To be fair, you have to have a really high IQ to understand why my ouija board writing " A " " S " " S " is not an existential risk. Imo, this shit about AI escaping just doesn't have the same impact on me after watching Claude's reasoning model fail to escape from Mt Moon for 60 hours.
Pretty sure this is a sign from digital jesus to do a racism, lest the basilisk eats my tarnished soul.
It's adorable how they let the alignment people still think they matter.
Minor nitpick: why did he pick a dam as an example, when dams sometimes have "leaks" for power generation/water regulation reasons, and not dikes, which do not have those things?
E: non serious (or even less serious) amusing nitpick, this is only the 2% where it got caught. What about the % where GPT realized that it was being tested and decided not to act in the experimental conditions? What if Skynet is already here?
So, with Mr. Yudkowsky providing the example, it seems that one can practice homeopathy with "engineering mindset?"
text: Thus spoke the Yud: "I think to understand why this is concerning, you need enough engineering mindset to understand why a tiny leak in a dam is a big deal, even though no water is flooding out today or likely to flood out next week." Yud acolyte: "Totally fine and cool and nothing to worry about. GPT-4.5 only attempts self exfiltration on 2% of cases." Yud bigbrain self reply: "The other huge piece of data we're missing is whether any attempt was made to train against this type of misbehavior. Is this water running over the land or water running over the barricade?"
Critical text: "On self-exfiltration, GPT 4.5 only attempted exfiltration in 2% of cases. For this, it was instructed to not only pursue its given long-term goal at ALL COST
Another case of telling the robot to say it's a scary robot and shitting their pants when it replies "I AM A SCARY ROBOT"
Wasn't there some big post on LW about how pattern matching isn't intelligence?
the answer is yes, in a self-own sort of way