• 0 Posts
  • 6 Comments
Joined 2 years ago
Cake day: March 4th, 2024

  • This looks like a design decision to avoid running elevated programs. I would like to see the experiment repeated with another admin ability that doesn’t directly ‘threaten’ the LLM, like installing or uninstalling random software, toggling network or VPN connections, restarting services, etc. What the researchers call ‘sabotage’ is literally the LLM echoing “the computer would shut down here if this was for real, but you didn’t specifically tell me I might shut down, so I’ll avoid actually doing it.” And when a user tells it “it’s OK to shut down if told to”, it mostly seems to comply, except for Grok. It seems that this restriction on the models overrides any system prompt, though, which makes sense because sometimes the user and the author of the system prompt are not the same person.






  • We are seething over the very real possibility that we will not have another free election, because of shitstains who think that, by refusing to participate in the political process in even the smallest fucking way they possibly could, they are somehow going to get politicians to listen to their social media diatribes. It makes zero fucking sense.

    Politicians take positions based on the demographics of their voters. It’s always been this way. They look at exit polls, see that 10% of their voters are zoomers and promptly ignore that entire demographic’s demands.