this post was submitted on 10 Apr 2024
433 points (100.0% liked)

Technology

37353 readers
239 users here now

Rumors, happenings, and innovations in the technology sphere. If it's technological news or discussion of technology, it probably belongs here.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 8 points 2 months ago* (last edited 2 months ago) (3 children)

I'm curious, is there actually so many 42's in the system? (more than 69 sounds unlikely)

What if the LLM is getting tripped up because 42 is always referred to as the answer to "the Ultimate Question of Life, the Universe, and Everything".

So you ask it a question like give a number between 1-100, it answers 42 because that's the answer to "Everything", according to it's training data.

Something similar happened to Gemini. Google discouraged Gemini from giving unsafe advice because it's unethical. Then Gemini refused to answer questions about C++ because it's considered "unsafe" (referring to memory management). But Gemini thinks C++ is "unsafe" (the normal meaning), therefore it's unethical. It's like those jailbreak tricks but from its own training set.

[–] [email protected] 3 points 2 months ago

I certainly hope that’s what happening or maybe it is actually the answer.

load more comments (2 replies)