Synapse is the epitome of this

Lena@gregtech.eu · 9 months ago

Synapse is the epitome of this

lime!@feddit.nu · 9 months ago

all programs are single threaded unless otherwise specified.

Ethan@programming.dev · 9 months ago

It’s safe to assume that any non-trivial program written in Go is multithreaded

Scoopta@programming.dev · 9 months ago

But it’s still not a guarantee

Ethan@programming.dev · 9 months ago

Definitely not a guarantee, bad devs will still write bad code (and junior devs might want to let their seniors handle concurrency).

kbotc@lemmy.world · 9 months ago

And yet: You’ll still be limited to two simultaneous calls to your REST API because the default HTTP client was built in the dumbest way possible.

Ethan@programming.dev · 9 months ago

Really? Huh, TIL. I guess I’ve just never run into a situation where that was the bottleneck.

Kairos@lemmy.today · 7 months ago

The client object or the library?

kbotc@lemmy.world · 7 months ago

… Is this a trick question? The object, provided by the library (net/http which is about as default as they come) sets “DefaultMaxIdleConnsPerHost” to 2. This is significant because if you finish a connection and you’ve got more than 2 idles, it slams that connection close. If you have a lot of simultaneous fast lived requests to the same IP (say a load balanced IP), your go programs will exhaust the ephemeral port list quickly. It’s one of the most common “gotchas” I see where Go programs work great in dev and blow themselves apart in prod.

https://dev.to/gkampitakis/http-connection-churn-in-go-34pl is a fairly decent write up.

Opisek@lemmy.world · 9 months ago

I absolutely love how easy multi threading and communication between threads is made in Go. Easily one of the biggest selling points.

Ethan@programming.dev · 9 months ago

Key point: they’re not threads, at least not in the traditional sense. That makes a huge difference under the hood.

Opisek@lemmy.world · 9 months ago

Well, they’re userspace threads. That’s still concurrency just like kernel threads.

Also, it still uses kernel threads, just not for every single goroutine.

Ethan@programming.dev · 9 months ago

What I mean is, from the perspective of performance they are very different. In a language like C where (p)threads are kernel threads, creating a new thread is only marginally less expensive than creating a new process (in Linux, not sure about Windows). In comparison creating a new ‘user thread’ in Go is exceedingly cheap. Creating 10s of thousands of goroutines is feasible. Creating 10s of thousands of threads is a problem.

Also, it still uses kernel threads, just not for every single goroutine.

This touches on the other major difference. There is zero connection between the number of goroutines a program spawns and the number of kernel threads it spawns. A program using kernel threads is relying on the kernel’s scheduler which adds a lot of complexity and non-determinism. But a Go program uses the same number of kernel threads (assuming the same hardware and you don’t mess with GOMAXPROCS) regardless of the number of goroutines it uses, and the goroutines are cooperatively scheduled by the runtime instead of preemptively scheduled by the kernel.

Opisek@lemmy.world · 9 months ago

Great details! I know the difference personally, but this is a really nice explanation for other readers.

About the last point though: I’m not sure Go always uses the maximum amount of kernel threads it is allowed to use. I read it spawns one on blocking syscalls, but I can’t confirm that. I could imagine it would make sense for it to spawn them lazily and then keep around to lessen the overhead of creating it in case it’s needed later again, but that is speculation.

Edit: I dove a bit deeper. It seems that nowadays it spawns as many kernel threads as CPU cores available plus additional ones for blocking syscalls. https://go.dev/doc/go1.5 https://docs.google.com/document/u/0/d/1At2Ls5_fhJQ59kDK2DFVhFu3g5mATSXqqV5QrxinasI/mobilebasic

Successful_Try543@feddit.org · 9 months ago

Does Python have the ability to specify loops that should be executed in parallel, as e.g. Matlab uses parfor instead of for?

lime!@feddit.nu · 9 months ago

python has way too many ways to do that. asyncio, future, thread, multiprocessing…

WolfLink@sh.itjust.works · 9 months ago

Of the ways you listed the only one that will actually take advantage of a multi core CPU is multiprocessing

lime!@feddit.nu · 9 months ago

yup, that’s true. most meaningful tasks are io-bound so “parallel” basically qualifies as “whatever allows multiple threads of execution to keep going”. if you’re doing numbercrunching in pythen without a proper library like pandas, that can parallelize your calculations, you’re doing it wrong.

WolfLink@sh.itjust.works · 9 months ago

I’ve used multiprocessing to squeeze more performance out of numpy and scipy. But yeah, resorting to multiprocessing is a sign that you should be dropping into something like Rust or a C variant.

itslilith@lemmy.blahaj.zone · 9 months ago

Most numpy array functions already utilize multiple cores, because they’re optimized and written in C

danhab99@programming.dev · 9 months ago

I’ve always hated object oriented multi threading. Goroutines (green threads) are just the best way 90% of the time. If I need to control where threads go I’ll write it in rust.

lime!@feddit.nu · 9 months ago

nothing about any of those libraries dictates an OO approach.

Buddahriffic@lemmy.world · 9 months ago

Unless it’s java.

entropicdrift@lemmy.sdf.org · 9 months ago

Meh, even Java has decent FP paradigm support these days. Just because you can do everything in an OO way in Java doesn’t mean you need to.

danhab99@programming.dev · 9 months ago

If I have to put a thread object in a variable and call a method on it to start it then it’s OO multi threading. I don’t want to know when the thread spawns, I don’t want to know what code it’s running, and I don’t want to know when it’s done. I just want shit to happen at the same time (90% of the time)

lime!@feddit.nu · 9 months ago

the thread library is aping the posix thread interface with python semantics.

Midnitte@beehaw.org · 9 months ago

Experimentally, yes

Successful_Try543@feddit.org · 9 months ago

Cool.

enemenemu@lemm.ee · 9 months ago

Are you still using matlab? Why? Seriously

Successful_Try543@feddit.org · 9 months ago

No, I’m not at university anymore.

enemenemu@lemm.ee · 9 months ago

Good for you

Poor prof

Successful_Try543@feddit.org · edit-2 9 months ago

We weren’t doing any ressource extensive computations with Matlab, mainly just for teaching FEM, as we’ve had an extensive collection of scripts for that purpose, and pre- and some post processing.

twice_hatch@midwest.social · 9 months ago

I don’t like that they don’t write their own algorithms in any other language. I was trying to understand low-pass filters a while back and so many web pages were like, “Call this MATLAB function” or “here’s a code generator that puts out bad C for specific filter parameters” Like no, I want the algorithm explained to me…

Panties@lemmy.ca · 9 months ago

I was telling a colleague about how my department started using Rust for some parts of our projects lately. (normally Python was good enough for almost everything but we wanted to try it out)

They asked me why we’re not using MATLAB. They were not joking. So, I can at least tell you their reasoning. It was their first programming language in university, it’s safer and faster than Python, and it’s quite challenging to use.

twice_hatch@midwest.social · 9 months ago

“Just use MATLAB” - Someone with a kind heart who has never deployed anything to anything

AndrasKrigare@beehaw.org · 9 months ago

I think OP is making a joke about python’s GIL, which makes it so even if you are explicitly multi threading, only one thread is ever running at a time, which can defeat the point in some circumstances.

lime!@feddit.nu · edit-2 9 months ago

no, they’re just saying python is slow. even without the GIL python is not multithreaded. the thread library doesn’t use OS threads so even a free-threaded runtime running “parallel” code is limited to one thread.

apparently not!

AndrasKrigare@beehaw.org · 9 months ago

If what you said were true, wouldn’t it make a lot more sense for OP to be making a joke about how even if the source includes multi threading, all his extra cores are wasted? And make your original comment suggesting a coding issue instead of a language issue pretty misleading?

But what you said is not correct. I just did a dumb little test

import threading 
import time

def task(name):
  time.sleep(600)

t1 = threading.Thread(target=task, args=("1",))
t2 = threading.Thread(target=task, args=("2",))
t3 = threading.Thread(target=task, args=("3",))

t1.start()
t2.start()
t3.start()

And then ps -efT | grep python and sure enough that python process has 4 threads. If you want to be even more certain of it you can strace -e clone,clone3 python ./threadtest.py and see that it is making clone3 syscalls.

lime!@feddit.nu · 9 months ago

is this stackless?

anyway, that’s interesting! i was under the impression that they eschewed os threads because of the gil. i’ve learned something.

anton@lemmy.blahaj.zone · edit-2 9 months ago

~~Now do computation in those threads and realize that they all wait on the GIL giving you single core performance on computation and multi threaded performance on io.~~

AndrasKrigare@beehaw.org · 9 months ago

Correct, which is why before I had said

I think OP is making a joke about python’s GIL, which makes it so even if you are explicitly multi threading, only one thread is ever running at a time, which can defeat the point in some circumstances.

anton@lemmy.blahaj.zone · 9 months ago

Ups, my attention got trapped by the code and I didn’t properly read the comment.

thisisnotgoingwell@programming.dev · 9 months ago

Isn’t that what threading is? Concurrency always happens on single core. Parallelism is when separate threads are running on different cores. Either way, while the post is meant to be humorous, understanding the difference is what prevents people from picking up the topic. It’s really not difficult. Most reasons to bypass the GIL are IO bound, meaning using threading is perfectly fine. If things ran on multiple cores by default it would be a nightmare with race conditions.

AndrasKrigare@beehaw.org · 9 months ago

I haven’t heard of that being what threading is, but that threading is about shared resourcing and memory space and not any special relationship with the scheduler.

Per the wiki:

On a multiprocessor or multi-core system, multiple threads can execute in parallel, with every processor or core executing a separate thread simultaneously; on a processor or core with hardware threads, separate software threads can also be executed concurrently by separate hardware threads.

https://en.m.wikipedia.org/wiki/Thread_(computing)

I also think you might be misunderstanding the relationship between concurrency and parallelism; they are not mutually exclusive. Something can be concurrent through parallelism, as the wiki page has (emphasis mine):

Concurrency refers to the ability of a system to execute multiple tasks through simultaneous execution or time-sharing (context switching), sharing resources and managing interactions.

https://en.m.wikipedia.org/wiki/Concurrency_(computer_science)

groknull@programming.dev · 9 months ago

I initially read this as “all programmers are single-threaded” and thought to myself, “yeah, that tracks”

nickwitha_k (he/him)@lemmy.sdf.org · 9 months ago

https://docs.python.org/3/whatsnew/3.13.html#whatsnew313-free-threaded-cpython

Lena@gregtech.eu · 9 months ago

Oooooh this is really cool, thanks for sharing. How could I install it on Linux (Ubuntu)? I assume I would have to compile CPython. Also, would the source of the programs I run need any modifications?

nickwitha_k (he/him)@lemmy.sdf.org · 9 months ago

In this case, it’s a feature of the language that enables developers to implement greater amounts of parallelism. So, the developers of the Python-based application will need to refactor to take advantage of it.

computergeek125@lemmy.world · 9 months ago

From memory I can only answer one of those: The way I understand it (and I could be wrong), your programs theoretically should only need modifications if they have a concurrency related bug. The global interlock is designed to take a sledgehammer at “fixing” a concurrency data race. If you have a bug that the GIL fixed, you’ll need to solve that data race using a different control structure once free threading is enabled.

I know it’s kind of a vague answer, but every program that supports true concurrency will do it slightly differently. Your average script with just a few libraries may not benefit, unless a library itself uses threads. Some libraries that use native compiled components may already be able to utilize the full power of you computer even on standard Python builds because threads spawned directly in the native code are less beholden to the GIL (depending on how often they’d need to communicate with native python code)

Lena@gregtech.eu · 9 months ago

Thanks for the answer, I really hope Synapse will be able to work with concurrency enabled.

SaharaMaleikuhm@feddit.org · 9 months ago

Oh wow, a programming language that is not supposed to be used for every single software in the world. Unlike Javascript for example which should absolutely be used for making everything (horrible). Nodejs was a mistake.

Lena@gregtech.eu · 9 months ago

Nodejs was a mistake.

More choice is always better

_stranger_@lemmy.world · 9 months ago

And some of those choices are mistakes.

Lena@gregtech.eu · 9 months ago

I like Typescript >:3

_stranger_@lemmy.world · 9 months ago

I appreciate Typescript for addressing the sins of its predecessor.

driving_crooner@lemmy.eco.br · 9 months ago

Citations Needed: Episode 95: The Hollow Vanity of Libertarian “Choice” Rhetoric

Episode webpage: https://dts.podtrac.com/redirect.mp3/traffic.libsyn.com/secure/citationsneeded/CN95_20191205_choice_Stites_v2.mp3

Fucking Citations Needed, every time I finish an episode, someone comment something related to it.

twice_hatch@midwest.social · 9 months ago

don’t worry it’ll use all the RAM anyway

SatouKazuma@programming.dev · 9 months ago

I paid for all the memory. I’ll use all the memory.

goodbible@lemm.ee · 9 months ago

JG Memoryworth

Lena@gregtech.eu · 9 months ago

No RAM gets wasted!

kSPvhmTOlwvMd7Y7E@programming.dev · 9 months ago

let’s be honest here, he actually means 0.01 core performance

burlemarx@lemmygrad.ml · 9 months ago

Yes, 0.99 performance being consumed by the interpreter.

dan@upvote.au · 9 months ago

Do you mean Synapse the Matrix server? In my experience, Conduit is much more efficient.

jimmy90@lemmy.world · 9 months ago

i wish they would switch the reference implementation to conduit

there is core components on the client side in rust so maybe that’s the way for the future

Lena@gregtech.eu · 9 months ago

Yep, I mean as in matrix. There is currently no was to migrate to conduit/conduwuit. Btw from what I’ve seen conduwuit is more full-featured.

fmstrat@lemmy.nowsci.com · 9 months ago

I may have something to read up on.

dan@upvote.au · 9 months ago

The documentation is kinda lacking, but if you could figure out how to set up Synapse then you can probably figure out Conduit too. https://conduit.rs/

driving_crooner@lemmy.eco.br · 9 months ago

I tough this was about excel and was like yeah haha!

But is about Python, so I’m officially offended.

tetris11@lemmy.ml · 9 months ago

I prefer this default. Im sick of having to rein in Numba cores or OpenBlas threads or other out of control software that immediately tries to bottleneck my stack.

CGroups (Docker/LXC) is the obvious solution, but it shouldn’t have to be

h4x0r@lemmy.dbzer0.com · 9 months ago

https://docs.python.org/3/library/concurrent.futures.html#concurrent.futures.ProcessPoolExecutor

TropicalDingdong@lemmy.world · 9 months ago

Python

…so… so you made it single threaded?

Gonzako@lemmy.world · 9 months ago

I’ll be honest, this only matters when running single services that are very expensive. it’s fine if your program can’t be pararlelized if the OS does its job and spreads the love around the cpus

alcasa@lemmy.sdf.org · 9 months ago

It only took us how many years?

Fortatech@gregtech.eu · 9 months ago

!lemmySilver