@lily33

lily33@lemm.ee · 12 days ago

I’d be very skeptical of claims that Debian maintainers actually audit the code of each piece of software they package. Perhaps they make some brief reviews, but actually scrutinizing every line for hidden backdoors is just not feasible.

lily33@lemm.ee · 20 days ago

That makes me think, perhaps, you might be able to set it to exec("stuff") or True…

lily33@lemm.ee · 20 days ago

deleted by creator

lily33@lemm.ee · 1 month ago

Is that something new? As in, has WaPo not been willing to go after Meta in a similar manner before?

lily33@lemm.ee · edit-2 1 month ago

So, essentially, they wanted to enter the Chinese market so much that they were even willing to comply with the local rules and regulations!

This is such a big secret, we really needed a whistleblower to tell us that!

lily33@lemm.ee · 2 months ago

Too bad that’s based on macros. A full preprocessor could require that all keywords and names in each scope form a prefix code, and then allow us to freely concatenate them.

lily33@lemm.ee · edit-2 2 months ago

That is why I use just int main(){...} without arguments instead.

lily33@lemm.ee · 2 months ago

I don’t think any kind of “poisoning” actually works. It’s well known by now that data quality is more important than data quantity, so nobody just feeds training data in indiscriminately. At best it would hamper some FOSS AI researchers that don’t have the resources to curate a dataset.

lily33@lemm.ee · edit-2 2 months ago

What makes these consumer-oriented models different is that that rather than being trained on raw data, they are trained on synthetic data from pre-existing models. That’s what the “Qwen” or “Llama” parts mean in the name. The 7B model is trained on synthetic data produced by Qwen, so it is effectively a compressed version of Qen. However, neither Qwen nor Llama can “reason,” they do not have an internal monologue.

You got that backwards. They’re other models - qwen or llama - fine-tuned on synthetic data generated by Deepseek-R1. Specifically, reasoning data, so that they can learn some of its reasoning ability.

But the base model - and so the base capability there - is that of the corresponding qwen or llama model. Calling them “Deepseek-R1-something” doesn’t change what they fundamentally are, it’s just marketing.

lily33@lemm.ee · 2 months ago

There are already other providers like Deepinfra offering DeepSeek. So while the the average person (like me) couldn’t run it themselves, they do have alternative options.

lily33@lemm.ee · 2 months ago

A server grade CPU with a lot of RAM and memory bandwidth would work reasonable well, and cost “only” ~$10k rather than 100k+…

lily33@lemm.ee · 2 months ago

To be fair, most people can’t actually self-host Deepseek, but there already are other providers offering API access to it.

lily33@lemm.ee · 2 months ago

The point of it being open is that people can remove any censorship built into it.

lily33@lemm.ee · 2 months ago

The particular AI model this article is talking about is actually openly published for anyone to freely use or modify (fine-tune). There is a barrier in that it requires several hundred gigs of RAM to run, but it is public.

lily33@lemm.ee · 3 months ago

https://gizmodo.com/to-further-its-mission-of-benefitting-everyone-openai-will-become-fully-for-profit-2000543628

lily33@lemm.ee · 4 months ago

Now, if only the article explained how that killing was related to TikTok. The only relevant thing I saw was,

had its roots in a confrontation on social media.

It’s says “social media”, not “TokTok” though.

lily33@lemm.ee · edit-2 5 months ago

Wary reader, learn from my cautionary tale

I’m not sure what to learn exactly. I don’t get what went wrong or why, just that the files hit deleted somehow…

lily33@lemm.ee · 5 months ago

Yes, almost like they have intentionally waited until Trump’s election.

lily33@lemm.ee · edit-2 5 months ago

And they’re all with different commit message:

“switched arse to bottom to create a more uplifting vibe”

“took arse out and put bottom in to keep my language warm and friendly”

“thought bottom would sound a lot nicer than arse, so I used it”

And so on…

lily33@lemm.ee · edit-2 5 months ago

Type in "Is Kamala Harris a good Democratic candidate

…and any good search engine will find results containing keywords such as “Kamala Harris”, “Democratic”, “candidate”, and “good”.

[…] you might ask if she’s a “bad” Democratic candidate instead

In that case, of course the search engine will find results containing keywords such as “Kamala Harris”, “Democratic”, “candidate”, and “bad”.

So the whole premise that, “Fundamentally, that’s an identical question” is just bullshit when it comes to searching. Obviously, when you put in the keyword “good”, you’ll find articles containing “good”, and if you put in the keyword “bad”, you’ll find articles containing “bad” instead.

Google will find things that match the keywords that you put in. So does DuckDuckGo, Qwant, Yahoo, whatever. That is what a good search engine is supposed to do.

I can assure you, when search engines stop doing that, and instead try to give “balanced” results, according to whatever opaque criteria for “balanced” their company comes up with, that will be the real problem.

I don’t like Google, and only use google when other search engines fail. But this article is BS.