this post was submitted on 13 Aug 2023
228 points (100.0% liked)

Technology

37343 readers
480 users here now

Rumors, happenings, and innovations in the technology sphere. If it's technological news or discussion of technology, it probably belongs here.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
 

I asked Google Bard whether it thought Web Environment Integrity was a good or bad idea. Surprisingly, not only did it respond that it was a bad idea, it even went on to urge Google to drop the proposal.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 151 points 10 months ago (21 children)

For the last time: these language models are just regurgitating what people have said. They don't analyze or reason.

[–] [email protected] 46 points 10 months ago* (last edited 10 months ago) (5 children)

That's not entirely true.

LLMs are trained to predict next word given context, yes. But in order to do that, they develop internal model that minimizes error across wide range of contexts - and emergent feature of this process is that the model DOES perform more than pure compression of the training data.

For example, GPT-3 is able to calculate addition and subtraction problems that didn't appear in the training dataset. This would suggest that the model learned how to perform addition and subtraction, likely because it was easier or more efficient than storing all of the examples from the training data separately.

This is a simple to measure example, but it's enough to suggests that LLMs are able to extrapolate from the training data and perform more than just stitch relevant parts of the dataset together.

[–] [email protected] 8 points 10 months ago (2 children)

That's interesting, I'd be curious to read more about that. Do you have any links to get started with? Searching this type of stuff on Google yields less than ideal results.

[–] [email protected] 7 points 10 months ago

In my comment I've been referencing https://arxiv.org/pdf/2005.14165.pdf, specifically section 3.9.1 where they summarize results of the arithmetic tasks.

load more comments (1 replies)
load more comments (3 replies)
load more comments (18 replies)