• 1 Post
  • 308 Comments
Joined 2 years ago
Cake day: June 6th, 2023




  • If you are familiar with the concept of an NP-complete problem, the weights are just one possible solution.

    The Traveling Salesman Problem is probably the easiest analogy to make. It’s as though we’re all trying to find the shortest path through a bunch of points (e.g. towns), and when someone says “here is a path that I think is pretty good”, that is analogous to sharing network weights for an AI. We can then all openly test that solution against other solutions and determine which is “best” (a toy sketch of that checking is below).

    What they aren’t telling you is whether they somehow benefit from people traveling that path (maybe they own all the gas stations along it, or maybe they’ve hired highwaymen to rob people on it). And figuring out whether that’s the case in a hyper-dimensional space is non-trivial.
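
    To make the analogy concrete: checking any proposed tour is a quick loop, even though searching for the best tour blows up combinatorially. The sketch below is a made-up illustration (hypothetical towns and coordinates, not anyone’s real benchmark).

    ```python
    # Scoring a proposed tour is one cheap pass, even though *finding* the best
    # tour is the hard part. Openly testing shared weights on benchmarks plays
    # the same role for a model.
    import math

    towns = {"A": (0, 0), "B": (3, 4), "C": (6, 0), "D": (3, -4)}  # made-up coordinates

    def tour_length(tour):
        """Total distance of a closed route visiting each town once."""
        total = 0.0
        for here, there in zip(tour, tour[1:] + tour[:1]):
            (x1, y1), (x2, y2) = towns[here], towns[there]
            total += math.hypot(x2 - x1, y2 - y1)
        return total

    # Two published "solutions"; anyone can check which is better.
    print(tour_length(["A", "B", "C", "D"]))  # 20.0
    print(tour_length(["A", "C", "B", "D"]))  # 24.0 -> worse, reject it
    ```

    The part you can’t see from the published route alone is how the publisher found it, or what they stand to gain from everyone using it.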


  • It’s not sunk cost, dude. We agreed that $120 will get them 5 years of service that meets their needs. Even if they switch to jellyfin after 5 years, they still got their money’s worth.

    It’s only a sunk cost if they end up worse off than if they had switched earlier. I guess if you’re arguing that they would still have that $120 if they switched today, I would argue they should still pay that $120 toward Jellyfin’s development. And that’s assuming they have time to switch to Jellyfin AND that it fits 100% of their use cases, either of which could be untrue.


  • Or Plex currently does everything they need it to, and $120 for 5+ years of keeping that going without any interruption of service is very reasonable (that works out to at most $2 a month). In the meantime, Jellyfin will only get better, and there might even be other options available by then.

    Stop trying to make the issue black and white, one-size-fits-all. There are perfectly legitimate reasons for people to use both Plex and Jellyfin.





  • It seems like the issue here is that users want to be spoken to in colloquial language they understand, but any document a legal entity produces MUST be in unambiguous “legal” language.

    So unless there’s a way to write a separate “unofficial FAQ” with what they want to say, they are limited to what they legally have to say.

    And maybe that’s a good thing. Maybe now they need to create a formal document specifying, in the best legalese, exactly what they mean when they say they “will never sell your data”, because if there’s any ambiguity around it, then customers deserve to have it disambiguated. Unfortunately, it’s probably not going to read as quick and catchy as an ambiguous statement.


  • Afaik the cookie policy on your site is not GDPR compliant, at least as it is currently worded. If all cookies are “technically necessary” for the site to function, then I think all you need to do is say that. (I think for a wiki it’s acceptable to require clients to allow caching of image data, so your server doesn’t have to pay for more bandwidth.)
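
    For the caching part, if the wiki happened to be served by something like Flask, it could be as simple as a long-lived cache header on image responses, with no cookies involved at all. Everything below (route, paths, max-age) is a placeholder sketch, not your actual setup.

    ```python
    # Hypothetical sketch: serve wiki images with a Cache-Control header so
    # browsers cache them and repeat views don't cost the server bandwidth.
    from flask import Flask, send_from_directory

    app = Flask(__name__)

    @app.route("/images/<path:filename>")
    def images(filename):
        resp = send_from_directory("static/images", filename)
        resp.headers["Cache-Control"] = "public, max-age=604800"  # cacheable for a week
        return resp
    ```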


  • My recommendation would be to have two machines: new hw for all your services, and the old hw for your NAS. Each could run whatever OS you’re comfortable with. Most everything on the services machine could live in docker configs, including the network mount points to the NAS (rough compose sketch below). You might be able to get away with using the 1080 Ti in the services box, depending on what all you want to do (AI stuff or newer stream transcoding requirements may require newer hw).

    Moving the data from the old NAS to a new one without new disks will be a challenge, yes.

    I have a TrueNAS box and used jails for services. I recently set up a Debian box separately, and am switching from jails on TrueNAS to Docker on Debian. Wish I had done this from the start.
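
    For the mount-point part, a compose volume backed by NFS might look roughly like this. The address, export path, and service are placeholders for whatever your NAS actually exports, so treat it as a sketch rather than a working config.

    ```yaml
    # Hypothetical example: a service whose media library lives on an NFS share
    # exported by the NAS. Adjust the address, export path, and service to taste.
    services:
      jellyfin:
        image: jellyfin/jellyfin
        volumes:
          - media:/media:ro

    volumes:
      media:
        driver: local
        driver_opts:
          type: nfs
          o: "addr=192.168.1.50,nfsvers=4,ro"
          device: ":/mnt/tank/media"
    ```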




  • I agree that you can’t know if the AI has been deliberately trained to act nefariously given the right circumstances. But I maintain that it’s (currently) impossible to know if any AI has been inadvertently trained to do the same, so the security implications are no different. If you’ve given an AI the ability to exfiltrate data without any oversight, you’ve already messed up, no matter whether you’re using a single AI you trained yourself, a black box full of experts, or DeepSeek directly.

    But all this is about whether merely sharing weights is “open source”, and you’ve convinced me that it’s not. There needs to be a classification, similar to “source available”; this would be like “weights available”.



  • Is there any good LLM that fits this definition of open source, then? I thought the “training data” for good AI was always just: the entire internet, and they were all ethically dubious that way.

    What is the concern with only having weights? It’s not arbitrary code execution, so there’s no security risk or loss of control over your own computing, which are the usual goals of open source in the first place.

    To me the weights are less of a “blob” and more like an approximate solution to an NP-hard problem. Training is traversing the search space, and sharing a model is just saying “hey, this point looks useful, others should check it out” (sketch of that below). But maybe that is a blob, since I don’t know how they got there.
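
    Concretely, “checking it out” can just mean loading the published weights and scoring them on data you trust. The model, weights file, and loader below are all hypothetical, but the shape of it is roughly:

    ```python
    import torch

    def evaluate(model, weights_path, eval_loader):
        """Load someone else's published weights and score them on your own data."""
        model.load_state_dict(torch.load(weights_path, map_location="cpu"))
        model.eval()
        correct = total = 0
        with torch.no_grad():
            for inputs, labels in eval_loader:
                preds = model(inputs).argmax(dim=1)
                correct += (preds == labels).sum().item()
                total += labels.numel()
        return correct / total  # a higher score means "a better point", by your benchmark
    ```

    What you can’t do from the weights alone is reproduce the search that landed on that point, which is the part that still feels blob-like.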



  • Yeah, I agree that in the long term those two sentiments are inconsistent, but in the short term we have to deal with allegedly misguided layoffs and worse user experiences, which I think makes both fair to criticise. Maybe firing everyone and using slop AI will make your company go bankrupt in a few years, and that’s great; in the meantime, employees everywhere can rightfully complain about the slop and the jobs.

    But yeah, I don’t think it’s fair to complain about how “inefficient” an early technology is and also call it “magic beans”.