"Sept" (image) · posted by qaz@lemmy.world to Programmer Humor@programming.dev, English · 5 months ago

MonkeMischief@lemmy.today:
Expertly explained. Thank you! It's pretty rad what you can get out of a quantized model on home hardware, but I still can't understand why people are trying to use it for anything resembling productivity.
It sounds like the typical tech industry:
"Look how amazing this is!" (Full power)
"Uh… uh oh, that's unsustainable. Let's quietly drop it." (Way reduced power)
"People are saying it's not as good; we can offer them LLM+ plus for better accuracy!" (3/4 power, with subscription)

mcv@lemmy.zip:
But if that's how you're going to run it, why not also train it in that mode?

Xylight@lemdro.id:
That is a thing, and it's called quantization-aware training. Some open-weight models like Gemma do it.
The problem is that you need to re-train the whole model for that, and if you also want a full-quality version you need to train a lot more.
It's still less precise, so it'll still be worse quality than full precision, but it does reduce the effect.
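
For anyone curious what that looks like mechanically, here is a minimal sketch of the quantization-aware-training idea in PyTorch. The FakeQuant and QATLinear names and the toy training loop are illustrative, not any real library's API; production QAT pipelines (e.g. torch.ao.quantization) also fake-quantize activations and typically use per-channel scales.

```python
# Minimal QAT sketch: train against fake-quantized weights so the model
# learns parameters that survive rounding to the int8 grid.
import torch
import torch.nn as nn

class FakeQuant(torch.autograd.Function):
    """Snap a tensor to int8 levels on the forward pass; pass gradients
    through unchanged (straight-through estimator) on the backward pass."""
    @staticmethod
    def forward(ctx, w, num_bits=8):
        qmax = 2 ** (num_bits - 1) - 1          # 127 for int8
        scale = w.abs().max() / qmax + 1e-8     # simple per-tensor scale
        return torch.round(w / scale) * scale   # quantize, then dequantize

    @staticmethod
    def backward(ctx, grad_out):
        # Rounding has zero gradient almost everywhere, so pretend it
        # was the identity; second return is for the num_bits argument.
        return grad_out, None

class QATLinear(nn.Module):
    """A linear layer that trains against its own quantized weights."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.1)
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x):
        # The loss "sees" the rounding error the deployed int8 weights
        # will actually have, so the optimizer learns to work around it.
        return x @ FakeQuant.apply(self.weight).T + self.bias

# Toy training loop on random data, just to show the mechanics.
model = nn.Sequential(QATLinear(16, 32), nn.ReLU(), QATLinear(32, 1))
opt = torch.optim.SGD(model.parameters(), lr=0.05)
x, y = torch.randn(256, 16), torch.randn(256, 1)
for step in range(200):
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(x), y)
    loss.backward()
    opt.step()
```

The straight-through estimator is the core trick: the forward pass uses the rounded weights, while the backward pass treats rounding as the identity, so training settles into weights that still work after being snapped to the int8 grid. That is why a QAT checkpoint degrades less at low precision than a model quantized after the fact.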

mudkip@lemdro.id:
Your response reeks of AI slop.

Xylight@lemdro.id:
4/10 bait

mudkip@lemdro.id:
Is it, or is it not, AI slop? Why are you using such heavy markdown formatting? That is a telltale sign of an LLM being involved.

Xylight@lemdro.id:
I am not using an LLM, but holy bait.
Hop off the Reddit voice.

mudkip@lemdro.id:
…You do know what platform you're on? It's a REDDIT alternative.

psud@aussie.zone:
> such heavy markdown formatting
They used one formatting mark, and it's the most common. What are you smoking, and may I have some?