

DeepSeek is an absolutely massive model; it’s not the one people will be running. Rather, look at Qwen/QwQ, Gemma, and a number of other smaller ones
If you’re here, there’s still hope for the internet
Don’t let it fall
Should be enough to hold 60k rows
SQLite can easily handle millions of rows. Don’t sell it short
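If you want to see it for yourself, here’s a rough sketch using Python’s built-in `sqlite3` module. The table name, columns, and the in-memory database are all made up for illustration; 60k rows is just the number from the comment above, and you can bump `n_rows` into the millions to check the claim on your own hardware.

```python
import sqlite3


def build_demo_db(n_rows: int = 60_000) -> sqlite3.Connection:
    """Create an in-memory database and bulk-insert n_rows rows.

    The schema here is hypothetical, chosen only for the demo.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE items (id INTEGER PRIMARY KEY, label TEXT, value INTEGER)"
    )
    # One transaction for the whole batch; committing per row is what
    # usually makes SQLite feel slow.
    with conn:
        conn.executemany(
            "INSERT INTO items (label, value) VALUES (?, ?)",
            ((f"row-{i}", i % 100) for i in range(n_rows)),
        )
    return conn


conn = build_demo_db()
row_count = conn.execute("SELECT COUNT(*) FROM items").fetchone()[0]
matches = conn.execute(
    "SELECT COUNT(*) FROM items WHERE value = 42"
).fetchone()[0]
print(row_count, matches)
```

The main thing that matters for bulk loads is wrapping the inserts in a single transaction; an index on whatever you filter by helps once the table gets big.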
I suppose that’s true. And also deranged to do it on purpose
Ok, but that is actually a nonsensical statement. In no case will using threads and recursion reduce CPU usage
14 events, 5 of them with fatalities, but a single event accounts for the majority of the deaths
Oh? You want composit(ion)? Over inheritance maybe?
Unlikely
Of the US’s own capabilities?
As well as banning research. Absurd overreach of government and it will accomplish the opposite of what it wants.
No, ’cause I was already running regular (non-DeepSeek) Qwen 14B, admittedly a heavily quantized and uncensored version, so I was just curious whether it would be any better
I think you’re confusing the two. I’m talking about the regular Qwen before it was fine-tuned by DeepSeek, not the regular DeepSeek
Have you compared it with the regular Qwen? It was also very good
Universally derided
lol try looking outside Lemmy. 90% of people still just use it and don’t care
Thanks. I don’t really intend to sell it. Mostly out of curiosity
Can’t say without revealing my account :)
Not sure, still waiting for their response, but it’s a pretty tiny sub
Oddly specific number
Idk how to even attach it to a domain
The hell is V3 32B? Are you talking about a distill?