smoothbrain coldtakes

why would you take anything you see on the internet seriously?

  • 0 Posts
  • 33 Comments
Joined 2 years ago
cake
Cake day: June 26th, 2023

help-circle








  • It’s also only valuable if people keep contributing to it. It’s highly likely the majority of current existing reddit data has been largely incorporated into many LLMs prior to the API access limiting. Google paying them 60 million dollars is a hilarious pittance to keep training their LLMs, given how much money AI services will likely generate off of the training data.

    I don’t actively use reddit anymore, but when I need an answer to something that isn’t programming-related, it’s usually the top source on any given web search. That kind of content is basically the only stuff I would give a shit about. I can’t imagine how much absolute garbage you have to sift through on the platform to get reliable training data. Maybe the ratio is terrible and that’s why Google paid so little.


  • Actually part of their IPO paperwork lists WSB as a potential positive benefit to the stock, in terms of having a clear userbase that will theoretically help sustain the value through shenanigans. That, to me, however, sounds like a securities violation waiting to happen.

    I don’t check reddit anymore. Does WSB actually consider this stock to be, uh, actually valuable? Every corner of the internet I’ve seen discuss this topic have all noted how worthless they think the shares are going to be. My money is on them shorting it.


  • I think it has to do with karma count.

    I had two accounts, one had been scrubbed and was mostly used for commenting, and the other was a porn alt.

    The porn alt has hundreds of thousands of karma and it got multiple IPO messages while the original, older account got nothing due to being sub 5k on posts.

    Edit: Suspicions confirmed!

    Reddit is planning six tiers of early access based on each “participant’s contributions to Reddit,” the company said in its updated SEC filing. Those tiers are based on a user’s “karma” score, ostensibly an aggregate total of up/down votes on posts and comments.

    The first tier of users will be those “who have meaningfully contributed to Reddit community programs,” though what that means isn’t explained more clearly. After that come tier 2 users, who must hold at least 200,000 karma points or have taken at least 5,000 moderator actions. Tier three includes users and moderators who hold at least 100,000 karma points and have taken 2,500 moderator actions. Tiers 4 and 5 are each half of the previous tier’s total, and tier 6 includes everyone else, with a waitlist available if the total number of shares purchased exceeds the original 1.76 million.





  • VLC isn’t equivalent to Jellyfin or Plex in streaming capabilities, and it’s just running on SMB which could be configured with general folder permissions and sharing, allowing access for any media player and not just VLC.

    SMB is a share protocol that’s pretty standard across ecosystems. I am not sure of your use case and why you are wanting to get rid of Jellyfin but VLC doesn’t sound like the tool to do it.

    You can configure the folder permissions with a guide here or by searching “Ubuntu smb sharing” or something similar.



  • My dev env doesn’t really change much over the OSes I use because I tend to stick with VSC which just works everywhere.

    I’ve found that WSL covers more and more of my use cases when it comes to wanting to do something in Linux.

    I have a ThinkPad with Fedora Silverblue on it but I’d never use it for work.

    Most of the time I just stick to Windows because it covers everything I need it to and it works on every single device I own flawlessly. I’m still tinkering with this laptop, and since it’s a T80s, there are no working drivers for the fingerprint sensor that I can find. Windows Hello just works, I don’t have to worry about what I plugged in or what laptop I picked up.


  • Stuff without the guardrails, stuff that’s been designed to produce porn, or totally answer truthfully to queries such as “how do I build a bomb” or “how do I make napalm” which are common tests to see how jailbroken any LLM is. When you feed something the entire internet, or even subsections of the internet, it tends to find both legal and illegal information. Also the ones designed to generate porn have gone beyond that boring shitty AI art style and now people are generating human being deepfakes, and it’s become a common tactic to spam places with artificial CSAM to cause problems with services. It’s been a recent and long-standing issue with Lemmy - people like Exploding Heads or Hexbear will get defederated and then out of retaliation will spam the servers that defederated from them with said artificial CSAM.

    I like copilot but that’s because I’m fine with the guardrails and I’m not trying to make it do anything out of its general scope. I also like how it’s covered by an enterprise privacy agreement which was a huge issue with people using ChatGPT and feeding it all kinds of private info.



  • It’s already easy to self host and we’ve optimized LLMs to run locally on not much serious hardware after we’ve trained them; I have GPT4ALL set up on my machine and it runs everything locally with my processor, no GPU or anything. Some of those datasets are uncensored, and I’ve seen what Stable Diffusion can do for image generation.

    I tend to use the GPT-4 built into Edge with my O365 corporate plan, because it suits my needs better for day-to-day challenges. It can still audit code and summarize things, which is all I really need it to do here and there.