cross-posted from: https://programming.dev/post/51407459
Check what can you use and at what rate of token per seconds would it be… It has examples of many models and quantization levels. Huge resource!
cross-posted from: https://programming.dev/post/51407459
Check what can you use and at what rate of token per seconds would it be… It has examples of many models and quantization levels. Huge resource!
I don’t know why you are spamming this across the Threadiverse but this is a data harvesting site with no respect for privacy loudly announcing how important privacy is. No thanks.
I am regretting having done so much crossposts. It was an impulse, not being sure where to post… I was also interested on seeing what was the take from fellow lemmings. And it’s been enriching to me in that regard. I wonder how much were people really self hosting LLMs… Apparently not that much.
How do you figure it’s a data harvesting site? It’s actually suggested a model that works pretty well for me.
Because I looked at the source code on the site. Being a data harvester doesn’t mean they don’t give real answers. Just that there is no such thing as a free lunch. You got a suggestion and they got your data.