hypolite, to llm

Yay, I too got my 7-day suspension badge from Stack Overflow for adding a disclaimer back to my four (4) answers after it was first reverted!

That’s how it works, right?

fabio, to llm
@fabio@manganiello.social

A study that confirms what I’ve been suspecting for a while: fine-tuning an #LLM with new knowledge increases its tendency to hallucinate.

If the new knowledge wasn’t in the original training set, then the model has to shift its weights from their previous optimal state to a new state that accommodates both the old and the new knowledge, and that new state may not be optimal for either.

Without a new validation round against the full previous cross-validation and test sets, that’s just likely to increase the chances of the model going off on a tangent.
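To make that concrete, here’s a minimal sketch of the kind of post-fine-tune check I mean: re-score the model on both the original held-out set and the new-knowledge set, and flag any regression on the old data. The `evaluate` helper, the dataset names, and the `model.loss(batch)` call are all hypothetical placeholders, not any particular framework’s API:

```python
def evaluate(model, dataset) -> float:
    """Hypothetical helper: mean per-batch loss of `model` on `dataset`."""
    total, n = 0.0, 0
    for batch in dataset:
        total += model.loss(batch)  # assumed interface: lower is better
        n += 1
    return total / n

def validate_after_finetune(model, old_val_set, new_val_set,
                            baseline_old_loss, tolerance=0.05):
    """Accept the fine-tuned model only if it learned the new data
    without drifting too far from its previous optimum on the old data."""
    old_loss = evaluate(model, old_val_set)
    new_loss = evaluate(model, new_val_set)
    # Regression check: did fine-tuning degrade performance on the
    # original validation data beyond the tolerance?
    regressed = old_loss > baseline_old_loss * (1 + tolerance)
    return {"old_loss": old_loss, "new_loss": new_loss, "regressed": regressed}
```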

#AI #ML @ai

https://arxiv.org/abs/2405.05904
